
Conference spezko::cluster

Title:+ OpenVMS Clusters - The best clusters in the world! +
Notice:This conference is COMPANY CONFIDENTIAL. See #1.3
Moderator:PROXY::MOORE
Created:Fri Aug 26 1988
Last Modified:Fri Jun 06 1997
Last Successful Update:Fri Jun 06 1997
Number of topics:5320
Total number of notes:23384

5300.0. "FDD cluster and Cluexit questions..." by VAXRIO::MAURO () Fri Apr 25 1997 20:56

Hi,

I have an FDDI/SCSI cluster with two AlphaServers (ASRVs), one 2100 and one
8400. Both systems are having CLUEXIT crashes at least once a day. I have
already checked for pool expansion failures, full pagefiles, and other
conditions that would cause a VC to be closed. During my analysis, the most
significant things I found were a large number of errors on bus FWA (SCA) and
errors logged in the error log a few seconds before each CLUEXIT. Right now I
have RECNXINTERVAL set to 120 seconds and the cluster is more stable. The
Field Service people are trying to find a hardware error, but they still
suspect a software problem.

I have already applied the patches ALPLVAC01_062 and ALPLAN04_062...

My questions are: Is there any problem with running RECNXINTERVAL set to 120,
given that its default is 20?
Can you tell me whether the NCP line counters, displayed at the end of this
note, indicate a hardware error on the FDDI bus?

Any help will be very welcome, Mauro.

    
SDA> sh port/add=81508A88

Bus Addr  Bus     LAN Address    Error Count Last Error   Time of Last Error
--------  ---  ----------------- ----------- ---------- -----------------------
81441100  LCL  00-00-00-00-00-00           0
8150D980  FWA  00-00-F8-63-1E-5E         413  0000002C  24-APR-1997 13:39:46.83
				   ^^^^^^^^

SDA> sh port/bus=bus_fwa

--- BUS: 8150D980  (FWA)  Device: FW_DEFPA  LAN Address: 00-00-F8-63-1E-5E ---
                                   LAN Hardware Address: 00-00-F8-63-1E-5E
Status: 00000803 run,online,restart
------- Transmit ------  ------- Receive -------  ---- Structure Addresses ---
Msg Xmt         2736592  Msg Rcv         3442083  PORT Address        815093C0
  Mcast Msgs       9745    Mcast Msgs       9341  VCIB Addr           8150DB40
  Mcast Bytes   1315575    Mcast Bytes   1046192  HELLO Message Addr  8150DBE0
Bytes Xmt     559339379  Bytes Rcv     627724257  BYE Message Addr    8150DDA0
Outstand I/Os         0  Buffer Size        4382  Delete BUS Rtn Adr  86ABB5A8
Xmt Errors          413  Rcv Ring Size        31                              
Last Xmt Error 0000002C         Time of Last Xmt Error 24-APR-1997 13:39:46.83
--- Receive Errors ----  ------ BUS Timer ------  ----- Datalink Events ------
TR Mcast Rcv          0  Handshake TMO  86ABC2A8  Last 24-APR-1997 13:39:49.97
Rcv Bad SCSID         0  Listen TMO     86ABC2AC  Last Event          00001200
Rcv Short Msgs        0  HELLO timer           2  Port Usable                3
Fail CH Alloc         0  HELLO Xmt err       411  Port Unusable              2
Fail VC Alloc         0  ^^^^^^^^^^^^^^^^^^^^^^   Address Change             0
Wrong PORT            0                           Port Restart Fail          0


                 --- Virtual Circuit (VC) 81559580 ---
Remote System Name:  SIRIUS (1:ALPHA)   Remote SCSSYSTEMID:  1031               
Local System ID:  222 (DE)              Status: 0005 open,path                  
------ Transmit -------  ------ VC Closures ----  ---- Congestion Control ----
Msg Xmt         2737860  SeqMsg TMO            1  Pipe Quota/Slo/Max   3/ 2/31
  Unsequence          9  CC DFQ Empty          0  Pipe Quota Reached       490
  Sequence      2711613  Topology Change       0  Xmt C/T              130/192
  ReXmt         241/283  NPAGEDYN Low          0  RndTrp uS          2330+1329
  Lone ACK        25997                           UnAcked Msgs               0
Bytes Xmt     470423009                           CMD Queue Len/Max        0/9
------- Receive -------  - Messages  Discarded -  ----- Channel Selection ----
Msg Rcv         3449941  No Xmt Chan           0  Preferred Channel   81557EC0
  Unsequence          9  Rcv Short Msg         0  Delay Time          FF5C71CC
  Sequence      3406578  Illegal Seq Msg       0  Buffer Size             4382
  ReRcv             128  Bad Checksum          0  Channel Count              1
  Lone ACK        43229  TR DFQ Empty          0  Channel Selections       244
  Cache               0  TR MFQ Empty          0  Protocol               1.4.0
  Ill ACK             0  CC MFQ Empty          0  Open 24-APR-1997 13:39:51.11
Bytes Rcv     560397345  Cache Miss            0  Cls  24-APR-1997 13:39:43.12
        -- Channel Summary for Virtual Circuit (SIRIUS) 81559580 --

Address     Type    Xmt Time Size Preferred    Best        Last State Change
--------  --------- -------- ---- ---------  --------   -----------------------
81557EC0  Preferred FF5C437C 4382       246         0   24-APR-1997 13:39:51.11
          ^^^^^^^^
           Dead when Cluexit happens

SYSMAN> do mcr ncp show kn line count                   
%SYSMAN-I-OUTPUT, command execution on node SIRIUS
 
 
Known Line Counters as of 24-APR-1997 15:42:26
 
Line = FPA-0
 
      >65534  Seconds since last zeroed
    42955844  Data blocks received
     4119963  Multicast blocks received
           0  Receive failure
 >4294967294  Bytes received
   471504270  Multicast bytes received
           0  Data overrun
    41813611  Data blocks sent
      212757  Multicast blocks sent
 >4294967294  Bytes sent
    30718424  Multicast bytes sent
           0  Send failure
         121  Unrecognized frame destination
           0  System buffer unavailable
           2  User buffer unavailable
 >4294967294  MAC frame count
           2  MAC error count
           0  MAC lost count
           0  Ring initializations initiated
         255  Ring initializations received
           0  Ring beacons initiated
           0  Duplicate address test failures
           0  Duplicate tokens detected
           0  Ring purge errors
           0  FCI strip errors
           0  Traces initiated
           0  Traces received
           0  Directed beacons received
           0  Elasticity buffer errors
           0  LCT rejects
           0  LEM rejects
           0  Link errors
           7  Connections completed

 
%SYSMAN-I-OUTPUT, command execution on node ORION
 
 
Known Line Counters as of 24-APR-1997 15:42:26
 
Line = FPA-0
 
       21444  Seconds since last zeroed
     4089933  Data blocks received
      276710  Multicast blocks received
           0  Receive failure
   719844596  Bytes received
    33271818  Multicast bytes received
           0  Data overrun
     3071187  Data blocks sent
       12696  Multicast blocks sent
   686156422  Bytes sent
     1778761  Multicast bytes sent
           0  Send failure
          69  Unrecognized frame destination
           0  System buffer unavailable
           0  User buffer unavailable
  1093860756  MAC frame count
           1  MAC error count
          57  MAC lost count
           0  Ring initializations initiated
          12  Ring initializations received
           2  Ring beacons initiated
           0  Duplicate address test failures
           0  Duplicate tokens detected
           0  Ring purge errors
           0  FCI strip errors
           0  Traces initiated
           0  Traces received
           0  Directed beacons received
           2  Elasticity buffer errors
           0  LCT rejects
           0  LEM rejects
          59  Link errors
           9  Connections completed

 
SYSMAN> 
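
One way to compare the NCP line counters above between nodes is to parse them and list only the error-type counters that are nonzero. The following is a minimal sketch in Python, not a DEC tool. The choice of which counter names count as "error" counters (`ERROR_COUNTERS`) is an assumption made for illustration, and a nonzero value does not by itself prove a hardware fault. The sample text is a subset of the ORION counters shown above.

```python
import re

# Assumption: these counter names are treated as error indicators for this
# sketch. The selection is illustrative, not an official classification.
ERROR_COUNTERS = {
    "Receive failure", "Send failure", "Data overrun",
    "MAC error count", "MAC lost count", "Ring beacons initiated",
    "Elasticity buffer errors", "LCT rejects", "LEM rejects", "Link errors",
}

# Subset of the ORION output from the note, copied verbatim.
SAMPLE = """\
Line = FPA-0

       21444  Seconds since last zeroed
           0  Receive failure
           1  MAC error count
          57  MAC lost count
          12  Ring initializations received
           2  Ring beacons initiated
           2  Elasticity buffer errors
          59  Link errors
"""

# A counter line is: optional '>' (counter overflowed), digits, two spaces, name.
COUNTER_RE = re.compile(r"^\s*(>?)(\d+)\s{2}(\S.*?)\s*$")

def parse_counters(text):
    """Map counter name -> (value, overflowed) for each counter line."""
    counters = {}
    for line in text.splitlines():
        match = COUNTER_RE.match(line)
        if match:
            overflowed, value, name = match.groups()
            counters[name] = (int(value), overflowed == ">")
    return counters

def nonzero_errors(counters, names=ERROR_COUNTERS):
    """Return only the selected error counters whose value is above zero."""
    return {name: value for name, (value, _) in counters.items()
            if name in names and value > 0}

if __name__ == "__main__":
    for name, value in nonzero_errors(parse_counters(SAMPLE)).items():
        print(f"{value:>10}  {name}")
```

Running the same function over the output for each node, with the "Seconds since last zeroed" value alongside, makes it easier to see which counters are growing and on which node.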

    
5300.1. "LAN problem indicated so far..." by STAR::BOAEN (LANclusters/VMScluster Tech. Office) Tue Apr 29 1997 14:45 (43 lines)
Hmm,

	The PEdriver BUS structure is telling you that the LAN driver
is returning HELLO packets with transmit fail status:

	HELLO Xmt err       411 

	There have been 411 failures out of 9745 multicast transmits.
Since PEdriver only uses multicast for HELLOs, that's a pretty high
failure rate, especially if the failures occasionally happen in bursts.
This error is indicative of a LAN adapter or driver problem.
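
To put a number on that failure rate, here is a small Python sketch of the arithmetic. It only uses the two counters quoted from the FWA bus display in 5300.0; the function name `failure_rate` is made up for this illustration.

```python
# Counters copied from the SHOW PORT/BUS display for bus FWA in note 5300.0.
hello_xmt_errors = 411   # "HELLO Xmt err"
mcast_msgs_sent = 9745   # transmit-side "Mcast Msgs"

def failure_rate(errors, transmits):
    """Return the fraction of transmits that failed (0.0 if nothing was sent)."""
    if transmits == 0:
        return 0.0
    return errors / transmits

rate = failure_rate(hello_xmt_errors, mcast_msgs_sent)

if __name__ == "__main__":
    print(f"HELLO transmit failure rate: {rate:.2%}")
```

That works out to a bit over 4% of HELLO transmits failing. The same calculation can be repeated on later snapshots of the bus display to see whether the rate is steady or arrives in bursts.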

HELLO transmit failures can result in CHANNELS & VCs on other nodes
being closed due to CHANNEL listen timeouts.


There is nothing that indicates a problem in PEdriver so far.

Try SHOW LAN /COUNT and see what the LAN driver error counters show for
FWA.

I recommend using SHOW PORT /CH/VC=...
then a SHOW /BUS and SHOW LAN/COUNT
to get a set of correlated information.

You mention the error log, but don't say what kind of errors are being
logged and how often. I'd guess you're logging PEdriver VC closures?

PEdriver is retransmitting about 1 in 1000 packets, which is on the high
end of the normal range, but the round-trip times look good. I suspect that
normal LAN burstiness is causing an occasional retransmit; the ReRcv counts
hint at this. The important point is that normal traffic packets are NOT
getting transmit errors. This seems to be a problem that only affects
multicast transmits...

Is this a dedicated FDDI ring between the 2 Alphas, or is it shared with
other network traffic?

'Gards, Verell



5300.2. "Informations..." by VAXRIO::MAURO () Tue Apr 29 1997 17:10 (516 lines)
    Both cluster nodes are connected to a DECcon 900 MX that is part of a
    very large FDDI ring with over 800 stations. A second, direct FDDI
    connection has now been installed between the two nodes: a second DEFPA
    was installed in the ASRV8400 and a DEFEA in the ASRV2100. What I don't
    understand is that, even with this second connection, we can still see
    errors through SHOW PORT on both FDDI buses, yet the connection using
    the DEFEA does not show errors, as you can see at the end of this
    entry.
    
    The errors logged in the error log, just a few seconds before the
    CLUEXIT, were FATAL DATALINK ERRORS pointing to FWAX. I think the
    systems are still up because of the RECNXINTERVAL setting, because VC
    closures still occur but without a crash.
    
    



		  --- Port Descriptor Table (PDT) 8151D488 ---

Type: 03 pe
Characteristics: 0000 

Msg Header Size           32  Flags               0000  Port Map        00000000
Max Xfer Bcnt       FFFFFFFF  Counter CDRP    00000000                          
Poller Sweep              30  Load Vector     81541FCC                          
Fork Block W.Q.     8151D560  Load Class            10                          
UCB Address         8151CF80  Connection W.Q. 81571F94                          
ADP Address         00000000  Yellow Q.       8151D5B8                          
Max VC timeout            16  Red Q.          8151D5C0                          
SCS Version                2  Disabled Q.     81452E90                          

OpenVMS (TM) Alpha Operating System, Version V6.2-1H3 -- System Dump Analysis	29-APR-1997 13:48:21.77		    Page 3
VMScluster data structures




		 --- Port Block 8151F200 ---

Status: 0001 authorize
VC Count: 2
Secs Since Last Zeroed: 173424

SBUF Size             484     LBUF Size         4786     Fork Count    22099108
SBUF Count             10     LBUF Count           1     Refork Count         0
SBUF Max              768     LBUF Max           384     Last Refork   00000000
SBUF Quo               10     LBUF Quo             1     SCS Messages  21281159
SBUF Miss             232     LBUF Miss           18     VC Queue Cnt    422877
SBUF Allocs      15214425     LBUF Allocs      93778     TQE Received   1734244
SBUFs In Use            0     LBUFs In Use         0     Timer Done     1734244
Peak SBUF In Use       12     Peak LBUF In Use     2     RWAITQ Count     92329
SBUF Queue Empty        0     LBUF Queue Empty     0     LDL Buf/Msg      74000
TR SBUF Queue Empty     0     Ticks/Second        10     ACK Delay      1000000
No SBUF for ACK         0     Listen Timeout       8     Hello Interval      30

Bus Addr  Bus     LAN Address    Error Count Last Error   Time of Last Error
--------  ---  ----------------- ----------- ---------- -----------------------
81441100  LCL  00-00-00-00-00-00           0
81523980  FWA  00-00-F8-63-1E-5E        1177  0000002C  28-APR-1997 09:04:48.88
81525980  FRA  08-00-2B-B1-1A-FA           0




		 --- Virtual Circuit (VC) 8152BD40 ---
Remote System Name:  ORION  (1:ALPHA)   Remote SCSSYSTEMID:  1032               
Local System ID:  223 (DF)              Status: 0005 open,path                  
------ Transmit -------  ------ VC Closures ----  ---- Congestion Control ----
Msg Xmt               3  SeqMsg TMO            0  Pipe Quota/Slo/Max   1/ 8/ 8
  Unsequence          3  CC DFQ Empty          0  Pipe Quota Reached         0
  Sequence            0  Topology Change       0  Xmt C/T                  0/1
  ReXmt             0/0  NPAGEDYN Low          0  RndTrp uS          3000000+0
  Lone ACK            0                           UnAcked Msgs               0
Bytes Xmt           198                           CMD Queue Len/Max        0/0
------- Receive -------  - Messages  Discarded -  ----- Channel Selection ----
Msg Rcv               2  No Xmt Chan           0  Preferred Channel   81527F00
  Unsequence          3  Rcv Short Msg         0  Delay Time          00000000
  Sequence            0  Illegal Seq Msg       0  Buffer Size             1412
  ReRcv               0  Bad Checksum          0  Channel Count              1
  Lone ACK            0  TR DFQ Empty          0  Channel Selections         1
  Cache               0  TR MFQ Empty          0  Protocol               1.4.0
  Ill ACK             0  CC MFQ Empty          0  Open 27-APR-1997 13:21:27.76
Bytes Rcv           100  Cache Miss            0  Cls  17-NOV-1858 00:00:00.00




 -- Preferred Channel (CH:81527F00) for Virtual Circuit (VC:8152BD40) ORION  --
State: 0004 open                Status: 0B path,open,rmt_hwa_valid
BUS: 81441100  (LCL)  Lcl Device: loopback  Lcl LAN Address: 00-00-00-00-00-00
Rmt Name: LCL         Rmt Device: Unknown   Rmt LAN Address: 00-00-00-00-00-00
Rmt Seq #: 0001   Open:27-APR-1997 13:21:27.76  Closed:17-NOV-1858 00:00:00.00
------- Transmit ------  ------- Receive -------  ----- Channel Selection ----
Lcl CH Seq #       0001  Msg Rcv           74000  Average Xmt Time    00000000
Msg Xmt               6    Mcast Msgs      73994  Remote Buffer Size      1412
  Ctrl Msgs           3    Mcast Bytes   7251412  Max Buffer Size         1412
  Ctrl Bytes        294    Ctrl Msgs           3  Best Channel               0
Bytes Xmt           492    Ctrl Bytes        294  Preferred Channel          1
Rmt Ring Size         8  Bytes Rcv       7251904  Retransmit Penalty         0
---------------  Channel Errors  ---------------  Xmt Error Penalty          0
Handshake TMO         0  Short CC Msgs         0  ------- Channel Timer ------
Listen TMO            0  Incompat Chan         0  Timer Entry Flink   86ABE824
Bad Authorize         0  No MSCP Srvr          0              Blink   86ABE824
Bad ECO               0  Disk Not Srvd         0  Last Ring Index           07
Bad Multicast         0  Old TR Msgs           0  Protocol               1.4.0
Topology Change       0                           Supported Services  00000000




		 --- Virtual Circuit (VC) 81571DC0 ---
Remote System Name:  SIRIUS (1:ALPHA)   Remote SCSSYSTEMID:  1031               
Local System ID:  222 (DE)              Status: 0005 open,path                  
------ Transmit -------  ------ VC Closures ----  ---- Congestion Control ----
Msg Xmt        15310334  SeqMsg TMO            0  Pipe Quota/Slo/Max  10/ 9/31
  Unsequence          3  CC DFQ Empty          0  Pipe Quota Reached      3722
  Sequence     14980492  Topology Change       0  Xmt C/T              223/640
  ReXmt       2140/2140  NPAGEDYN Low          0  RndTrp uS         9202+30991
  Lone ACK       327699                           UnAcked Msgs               1
Bytes Xmt    2600539668                           CMD Queue Len/Max        0/7
------- Receive -------  - Messages  Discarded -  ----- Channel Selection ----
Msg Rcv        17509675  No Xmt Chan           0  Preferred Channel   81577780
  Unsequence          3  Rcv Short Msg         0  Delay Time          F880A172
  Sequence     17077027  Illegal Seq Msg       0  Buffer Size             4382
  ReRcv            1365  Bad Checksum          0  Channel Count              2
  Lone ACK       431281  TR DFQ Empty          0  Channel Selections     24502
  Cache               0  TR MFQ Empty          0  Protocol               1.4.0
  Ill ACK             0  CC MFQ Empty          0  Open 27-APR-1997 13:21:33.38
Bytes Rcv    2797695253  Cache Miss            0  Cls  17-NOV-1858 00:00:00.00




 -- Preferred Channel (CH:81577780) for Virtual Circuit (VC:81571DC0) SIRIUS --
State: 0004 open                Status: 0B path,open,rmt_hwa_valid
BUS: 81525980  (FRA)  Lcl Device: FR_DEFEA  Lcl LAN Address: 08-00-2B-B1-1A-FA
Rmt Name: FWA         Rmt Device: FW_DEFPA  Rmt LAN Address: 00-00-F8-40-F2-A4
Rmt Seq #: 0002   Open:27-APR-1997 13:21:49.18  Closed:17-NOV-1858 00:00:00.00
------- Transmit ------  ------- Receive -------  ----- Channel Selection ----
Lcl CH Seq #       0001  Msg Rcv        10018413  Average Xmt Time    F880A172
Msg Xmt         7982661    Mcast Msgs      75698  Remote Buffer Size      4382
  Ctrl Msgs           2    Mcast Bytes   7418404  Max Buffer Size         4382
  Ctrl Bytes        196    Ctrl Msgs           1  Best Channel           11417
Bytes Xmt    1355848761    Ctrl Bytes         98  Preferred Channel      12066
Rmt Ring Size        31  Bytes Rcv    1656322638  Retransmit Penalty      1093
---------------  Channel Errors  ---------------  Xmt Error Penalty          0
Handshake TMO         0  Short CC Msgs         0  ------- Channel Timer ------
Listen TMO            0  Incompat Chan         0  Timer Entry Flink   8156FF00
Bad Authorize         0  No MSCP Srvr          0              Blink   86ABE83C
Bad ECO               0  Disk Not Srvd         0  Last Ring Index           0A
Bad Multicast         0  Old TR Msgs           0  Protocol               1.4.0
Topology Change       0                           Supported Services  00000000




 -- Active Channel (CH:8156FF00) for Virtual Circuit (VC:81571DC0) SIRIUS --
State: 0004 open                Status: 0B path,open,rmt_hwa_valid
BUS: 81523980  (FWA)  Lcl Device: FW_DEFPA  Lcl LAN Address: 00-00-F8-63-1E-5E
Rmt Name: FWB         Rmt Device: FW_DEFPA  Rmt LAN Address: 00-00-F8-63-1C-2B
Rmt Seq #: 0004   Open:28-APR-1997 09:05:38.60  Closed:28-APR-1997 09:04:32.26
------- Transmit ------  ------- Receive -------  ----- Channel Selection ----
Lcl CH Seq #       0003  Msg Rcv         7641443  Average Xmt Time    F880B7EC
Msg Xmt         7327678    Mcast Msgs      74475  Remote Buffer Size      4382
  Ctrl Msgs           3    Mcast Bytes   7298550  Max Buffer Size         4382
  Ctrl Bytes        294    Ctrl Msgs           6  Best Channel           10994
Bytes Xmt    1244691397    Ctrl Bytes        588  Preferred Channel      12436
Rmt Ring Size        31  Bytes Rcv    1261148391  Retransmit Penalty      1046
---------------  Channel Errors  ---------------  Xmt Error Penalty          0
Handshake TMO         0  Short CC Msgs         0  ------- Channel Timer ------
Listen TMO            2  Incompat Chan         0  Timer Entry Flink   86ABE83C
Bad Authorize         0  No MSCP Srvr          0              Blink   81577780
Bad ECO               0  Disk Not Srvd         0  Last Ring Index           0A
Bad Multicast         0  Old TR Msgs           0  Protocol               1.4.0
Topology Change       0                           Supported Services  00000000




--- BUS: 81523980  (FWA)  Device: FW_DEFPA  LAN Address: 00-00-F8-63-1E-5E ---
                                   LAN Hardware Address: 00-00-F8-63-1E-5E
Status: 00000803 run,online,restart
------- Transmit ------  ------- Receive -------  ---- Structure Addresses ---
Msg Xmt         7410867  Msg Rcv         7645313  PORT Address        8151F200
  Mcast Msgs      76905    Mcast Msgs      74497  VCIB Addr           81523B40
  Mcast Bytes  10382175    Mcast Bytes   8343664  HELLO Message Addr  81523BE0
Bytes Xmt    1500035855  Bytes Rcv    1368838615  BYE Message Addr    81523DA0
Outstand I/Os         0  Buffer Size        4382  Delete BUS Rtn Adr  86ABDBA8
Xmt Errors         1177  Rcv Ring Size        31                              
Last Xmt Error 0000002C         Time of Last Xmt Error 28-APR-1997 09:04:48.88
--- Receive Errors ----  ------ BUS Timer ------  ----- Datalink Events ------
TR Mcast Rcv          0  Handshake TMO  86ABE8A8  Last 28-APR-1997 09:04:52.02
Rcv Bad SCSID         0  Listen TMO     86ABE8AC  Last Event          00001200
Rcv Short Msgs        0  HELLO timer           6  Port Usable                3
Fail CH Alloc         0  HELLO Xmt err      1177  Port Unusable              2
Fail VC Alloc         0                           Address Change             0
Wrong PORT            0                           Port Restart Fail          0





--- BUS: 81525980  (FRA)  Device: FR_DEFEA  LAN Address: 08-00-2B-B1-1A-FA ---
                                   LAN Hardware Address: 08-00-2B-B1-1A-FA
Status: 00000803 run,online,restart
------- Transmit ------  ------- Receive -------  ---- Structure Addresses ---
Msg Xmt         8064652  Msg Rcv        10044063  PORT Address        8151F200
  Mcast Msgs      74570    Mcast Msgs      75723  VCIB Addr           81525B40
  Mcast Bytes  10066950    Mcast Bytes   8480976  HELLO Message Addr  81525BE0
Bytes Xmt    1635825962  Bytes Rcv    1801295763  BYE Message Addr    81525DA0
Outstand I/Os         0  Buffer Size        4382  Delete BUS Rtn Adr  86ABDBA8
Xmt Errors            0  Rcv Ring Size        31                              

--- Receive Errors ----  ------ BUS Timer ------  ----- Datalink Events ------
TR Mcast Rcv          0  Handshake TMO  86ABE8A8  Last 27-APR-1997 13:21:30.71
Rcv Bad SCSID         0  Listen TMO     86ABE8AC  Last Event          00001200
Rcv Short Msgs        0  HELLO timer          17  Port Usable                1
Fail CH Alloc         0  HELLO Xmt err         0  Port Unusable              0
Fail VC Alloc         0                           Address Change             0
Wrong PORT            0                           Port Restart Fail          0


OpenVMS (TM) Alpha Operating System, Version V6.2-1H3 -- System Dump Analysis	29-APR-1997 13:48:21.77		    Page 11
LAN Data Structures



              -- FWA Counters Information 29-APR-1997 13:49:55 --

Octets received           1718377268    Octets sent               1672414326
PDUs received               10389082    PDUs sent                    8267020
Mcast octets received      211211034    Mcast octets sent           14326715
Mcast PDUs received          1787594    Mcast PDUs sent               103488
Unrec indiv dest PDUs             27    PDUs sent, deferred                0
Unrec mcast dest PDUs        1154517    PDUs sent, one coll                0
Data overruns                      0    PDUs sent, mul coll                0
Unavail station buffs              0    Excessive collisions               0
Unavail user buffers               0    Late collisions                    0
CRC errors                         0    Carrier check failure              0
Alignment errors                   0    Last carrier failure            None
Rcv data length err                0    Coll detect chk fail               0
Frame size errors                  0    Short circuit failure              0
Frames too long                    0    Open circuit failure               0
Seconds since zeroed          174317    Transmits too long                 0
Station failures                   0    Send data length err               0




           -- FWA Counters Information (cont) 29-APR-1997 13:49:55 --

Transmit underrun                  0    Dup tokens detected                0
Transmit failure                   0    Ring purge errors                  0
Frame status error                 0    FCI strip errors                   0
Frame length error                 0    Traces initiated                   0
MAC frame count           8805954605    Traces received                    0
MAC error count                    0    Directed beacons rcvd              0
MAC lost count                    83    Elasticity buffer err              1
Ring inits initiated               0    LCT rejects                        0
Ring inits received               49    LEM rejects                        0
Ring beacon initiated              1    Link errors                       23
DAT test failures                  0    Connections completed             10
Link buffer unavail                0                                        




           -- FWA Counters Information (cont) 29-APR-1997 13:49:55 --

No work transmits              47466    Ring avail transitions             8
Buffer_Addr transmits              0    Ring unavail transitions           5
SVAPTE/BOFF transmits              0    Loopback sent                      0
Global page transmits              0    System ID sent                   580
Bad PTE transmits                  0    ReqCounters sent                   0
Restart pending counter            0    Internal counters size            72
+00 Device interrupts       18213983    +2C Too many segments              0
+04 Command errors                 0    +30 Too few segments               0
+08 Transmits failed            1213    +34 RESETs issued                  3
+0C Receive errors                 0    +38 Fatal errs (soft tmo)          2
+10 Transmit timeouts              2    +3C EEPROM update tmo              0
+14 Command timeouts               0    +40 Last rcv err status     00000000
+18 CSR command timeouts           0    +44 Last cmd err status     00000000
+1C Init timeouts                  0    +48 Generic (or unused)     8150A000
+20 Unalign 1seg xcopies           0    +4C Generic (or unused)     81518000
+24 Unalign 2seg xcopies           0    +50 Generic (or unused)     8150A000
+28 Global page transmits          0    +54 Generic (or unused)     00000008




         -- FWA1 60-07 (SCA) Counters Information 29-APR-1997 13:49:55 --

Octets received           1368858861    Octets sent               1527593619
PDUs received                7645427    PDUs sent                    7410939
Mcast octets received        8344336    Mcast octets sent           10224090
Mcast PDUs received            74503    Mcast PDUs sent                75734
Unavail user buffer                0    Multicast not enabled              0
Last UUB time                   None    User buffer too small              0




        -- FWA2 60-03 (DECNET) Counters Information 29-APR-1997 13:49:55 --

Octets received              8726632    Octets sent                 10009463
PDUs received                 133263    PDUs sent                      86248
Mcast octets received        5099054    Mcast octets sent            3504790
Mcast PDUs received            86637    Mcast PDUs sent                23913
Unavail user buffer                0    Multicast not enabled              0
Last UUB time                   None    User buffer too small              0




         -- FWA3 60-04 (LAT) Counters Information 29-APR-1997 13:49:55 --

Octets received             17978929    Octets sent                 24878502
PDUs received                 330419    PDUs sent                     334226
Mcast octets received        5506185    Mcast octets sent             485903
Mcast PDUs received            59302    Mcast PDUs sent                 2981
Unavail user buffer                0    Multicast not enabled              0
Last UUB time                   None    User buffer too small              0




          -- FWA4 08-00 (IP) Counters Information 29-APR-1997 13:49:55 --

Octets received            134763685    Octets sent                109331281
PDUs received                 974891    PDUs sent                     415632
Mcast octets received       89271585    Mcast octets sent                  0
Mcast PDUs received           280951    Mcast PDUs sent                    0
Unavail user buffer                0    Multicast not enabled              0
Last UUB time                   None    User buffer too small              0




         -- FWA5 08-06 (ARP) Counters Information 29-APR-1997 13:49:55 --

Octets received              6034060    Octets sent                    18944
PDUs received                 131217    PDUs sent                        290
Mcast octets received        6032810    Mcast octets sent                980
Mcast PDUs received           131186    Mcast PDUs sent                   20
Unavail user buffer                0    Multicast not enabled              0
Last UUB time                   None    User buffer too small              0




            -- FWA6 00-00 Counters Information 29-APR-1997 13:49:55 --

Octets received                    0    Octets sent                        0
PDUs received                      0    PDUs sent                          0
Mcast octets received              0    Mcast octets sent                  0
Mcast PDUs received                0    Mcast PDUs sent                    0
Unavail user buffer                0    Multicast not enabled              0
Last UUB time                   None    User buffer too small              0




         -- FWA7 80-41 (LAST) Counters Information 29-APR-1997 13:49:55 --

Octets received                32000    Octets sent                    22440
PDUs received                    500    PDUs sent                        264
Mcast octets received          32000    Mcast octets sent              22440
Mcast PDUs received              500    Mcast PDUs sent                  264
Unavail user buffer                0    Multicast not enabled              0
Last UUB time                   None    User buffer too small              0


OpenVMS (TM) Alpha Operating System, Version V6.2-1H3 -- System Dump Analysis	29-APR-1997 13:48:21.77		    Page 21
LAN Data Structures



              -- FRA Counters Information 29-APR-1997 13:49:59 --

Octets received           1907618640    Octets sent               1843131248
PDUs received               10110090    PDUs sent                    8217279
Mcast octets received       12042582    Mcast octets sent           13718746
Mcast PDUs received            89782    Mcast PDUs sent                99375
Unrec indiv dest PDUs              0    PDUs sent, deferred                0
Unrec mcast dest PDUs              0    PDUs sent, one coll                0
Data overruns                      0    PDUs sent, mul coll                0
Unavail station buffs              0    Excessive collisions               0
Unavail user buffers               0    Late collisions                    0
CRC errors                         0    Carrier check failure              0
Alignment errors                   0    Last carrier failure            None
Rcv data length err                0    Coll detect chk fail               0
Frame size errors                  0    Short circuit failure              0
Frames too long                    0    Open circuit failure               0
Seconds since zeroed          174327    Transmits too long                 0
Station failures                   0    Send data length err               0

OpenVMS (TM) Alpha Operating System, Version V6.2-1H3 -- System Dump Analysis	29-APR-1997 13:48:21.77		    Page 22
LAN Data Structures



           -- FRA Counters Information (cont) 29-APR-1997 13:49:59 --

Transmit underrun                  0    Dup tokens detected                1
Transmit failure                   0    Ring purge errors                  0
Frame status error                 0    FCI strip errors                   0
Frame length error                 0    Traces initiated                   0
MAC frame count             12062467    Traces received                    0
MAC error count                    0    Directed beacons rcvd              0
MAC lost count                     0    Elasticity buffer err              0
Ring inits initiated               0    LCT rejects                        0
Ring inits received                6    LEM rejects                        0
Ring beacon initiated              0    Link errors                        0
DAT test failures                  0    Connections completed              4
Link buffer unavail                0                                        

OpenVMS (TM) Alpha Operating System, Version V6.2-1H3 -- System Dump Analysis	29-APR-1997 13:48:21.77		    Page 23
LAN Data Structures



           -- FRA Counters Information (cont) 29-APR-1997 13:49:59 --

No work transmits              46256    Ring avail transitions             2
Buffer_Addr transmits              0    Ring unavail transitions           1
SVAPTE/BOFF transmits          82480    Loopback sent                      0
Global page transmits              0    System ID sent                   584
Bad PTE transmits                  0    ReqCounters sent                   0
Restart pending counter            0    Internal counters size            72
+00 Device interrupts       17930755    +2C Too many segments              0
+04 Command errors                 0    +30 Too few segments               0
+08 Transmits failed               0    +34 RESETs issued                  1
+0C Receive errors                 0    +38 Fatal errs (soft tmo)          0
+10 Transmit timeouts              0    +3C EEPROM update tmo              0
+14 Command timeouts               0    +40 Last rcv err status     00000000
+18 CSR command timeouts           0    +44 Last cmd err status     00000000
+1C Init timeouts                  0    +48 Generic (or unused)     8151E000
+20 Unalign 1seg xcopies     8170109    +4C Generic (or unused)     8152C000
+24 Unalign 2seg xcopies       46256    +50 Generic (or unused)     81528000
+28 Global page transmits          0    +54 Generic (or unused)     00000008

OpenVMS (TM) Alpha Operating System, Version V6.2-1H3 -- System Dump Analysis	29-APR-1997 13:48:21.77		    Page 24
LAN Data Structures



         -- FRA1 60-07 (SCA) Counters Information 29-APR-1997 13:49:59 --

Octets received           1801505773    Octets sent               1662928377
PDUs received               10045212    PDUs sent                    8065243
Mcast octets received        8481648    Mcast octets sent           10067625
Mcast PDUs received            75729    Mcast PDUs sent                74575
Unavail user buffer                0    Multicast not enabled              0
Last UUB time                   None    User buffer too small              0

OpenVMS (TM) Alpha Operating System, Version V6.2-1H3 -- System Dump Analysis	29-APR-1997 13:48:21.77		    Page 25
LAN Data Structures



        -- FRA2 60-03 (DECNET) Counters Information 29-APR-1997 13:49:59 --

Octets received              4995599    Octets sent                180088677
PDUs received                  64614    PDUs sent                     151186
Mcast octets received        2646746    Mcast octets sent            3536993
Mcast PDUs received            13789    Mcast PDUs sent                23952
Unavail user buffer                0    Multicast not enabled              0
Last UUB time                   None    User buffer too small              0

OpenVMS (TM) Alpha Operating System, Version V6.2-1H3 -- System Dump Analysis	29-APR-1997 13:48:21.77		    Page 26
LAN Data Structures



         -- FRA3 80-41 (LAST) Counters Information 29-APR-1997 13:49:59 --

Octets received                16896    Octets sent                    22440
PDUs received                    264    PDUs sent                        264
Mcast octets received          16896    Mcast octets sent              22440
Mcast PDUs received              264    Mcast PDUs sent                  264
Unavail user buffer                0    Multicast not enabled              0
Last UUB time                   None    User buffer too small              0

5300.3. "Replace hardware till problem goes away" by STAR::STOCKDALE Wed Apr 30 1997 15:16 (12 lines)
>>MAC lost count                    83    Elasticity buffer err              1
>>Ring inits received               49    LEM rejects                        0
>>Ring beacon initiated              1    Link errors                       23

>>+08 Transmits failed            1213    +34 RESETs issued                  3
>>+0C Receive errors                 0    +38 Fatal errs (soft tmo)          2
>>+10 Transmit timeouts              2    +3C EEPROM update tmo              0

It sounds like you have a network hardware problem: a broken adapter, a broken
concentrator, or some other faulty hardware on the FDDI ring.

- Dick
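
The diagnosis above rests on nonzero error counters in the quoted SDA output. As a rough illustration only, the Python sketch below parses two-column counter lines like those quoted and lists the ones that are nonzero. The set of names treated as error indicators, and the helper names parse_counters and nonzero_errors, are assumptions made for this example; they are not part of SDA or any OpenVMS tool.

```python
import re

# Lines quoted in the reply above (values come from the FWA adapter).
SAMPLE = """\
>>MAC lost count                    83    Elasticity buffer err              1
>>Ring inits received               49    LEM rejects                        0
>>Ring beacon initiated              1    Link errors                       23
>>+08 Transmits failed            1213    +34 RESETs issued                  3
>>+0C Receive errors                 0    +38 Fatal errs (soft tmo)          2
>>+10 Transmit timeouts              2    +3C EEPROM update tmo              0
"""

# Assumption: which counters count as "error indicators" is a choice made
# for this sketch, based only on the names quoted in the reply.
ERROR_COUNTERS = {
    "MAC lost count", "Elasticity buffer err", "Ring inits received",
    "LEM rejects", "Ring beacon initiated", "Link errors",
    "Transmits failed", "RESETs issued", "Receive errors",
    "Fatal errs (soft tmo)", "Transmit timeouts", "EEPROM update tmo",
}

# One cell: optional "+NN " hex offset, a name whose words are separated by
# single spaces, at least two spaces, then a decimal value.
CELL = re.compile(r"(?:\+[0-9A-F]{2} )?(\S(?:\S| (?=\S))*?) {2,}(\d+)(?=\s|$)")


def parse_counters(text):
    """Return {counter name: integer value} for every decimal counter found."""
    counters = {}
    for line in text.splitlines():
        line = line.lstrip("> ")  # drop quote markers and leading spaces
        for name, value in CELL.findall(line):
            counters[name] = int(value)
    return counters


def nonzero_errors(counters):
    """Keep only the assumed error counters whose value is above zero."""
    return {n: v for n, v in counters.items() if n in ERROR_COUNTERS and v > 0}


if __name__ == "__main__":
    for name, value in nonzero_errors(parse_counters(SAMPLE)).items():
        print(f"{name:<24}{value:>8}")
```

Running the same two functions over the counters of both adapters (FWA and FRA) would give a side-by-side view of which interface is actually accumulating errors.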

5300.4. "Why DEFEA doesn't log errors?" by VAXRIO::MAURO Mon May 19 1997 20:13 (19 lines)
    Don't get upset with me, but the hardware people ask me about this
    problem every day, mainly because they installed a second FDDI
    interface in both CPUs, making a direct connection between them. They
    cannot understand how PEA0 (DATALINK ERROR) errors can occur on this
    new connection, since it doesn't go through the FDDI ring.
    
    The point is that the EISA FDDI interface (FRA = SYS$FRDRIVER) doesn't
    log errors, while the PCI FDDI interface (FWA = SYS$FWDRIVER) in the
    other node still logs errors (SDA> SHOW PORT/ADDR=PDT_ADDRESS), and
    they are connected directly. Is there anything I can check or suggest
    that they check? Does this justify opening an IPMT?
    
    The SYS$FWDRIVER is from 31-JAN-1997 18:08 and was installed with
    ALPLAN04_062 (most recent patch).
    
    One more question: in this environment with two FDDI connections, how
    does PEdriver select the preferred path?
    
    Thanks, Mauro.