
Conference wonder::turbolaser

Title:TurboLaser Notesfile - AlphaServer 8200 and 8400 systems
Notice:Welcome to WONDER::TURBOLASER in its new home shortly
Moderator:LANDO::DROBNER
Created:Tue Dec 20 1994
Last Modified:Fri Jun 06 1997
Last Successful Update:Fri Jun 06 1997
Number of topics:1218
Total number of notes:4645

1072.0. "TURBOLASER HANG " by DAIVC::ENGKOS () Mon Jan 27 1997 09:58

Hello All,

I have read notes 295.0 and 359.0 about the "hang" problem on TurboLaser,
and I have followed all the information and suggestions in both notes to
try to solve the problem, such as DECevent and O.S. patches. The system was
still being monitored when I posted this problem.

Our "TL" has the following configuration :

     o D-UNIX 3.2G with patches for V3.2G
     o Dual CPU
     o 1 GB Memory
     o KFTHA consisting of :
       
 PCI-PIU#1 (hose#0) :
       
    - 3 KZPSA (A10) units connected to the internal SBB, the TZ875 Autoloader,
      and the HSZ50 in the SW800.
  
 PCI-PIU#2 (hose#1) :

    - 2 KZPSA (A10) units and 4 KZPAA units connected to the internal SBB, the
      HSZ50 in the SW800, 3 TZ87 units and the CD-ROM.


All disks in the internal SBB and the SW800 are configured with LSM and mirrored.
        

The problem occurs intermittently, and it is rather difficult for me to predict when.
Every time the problem occurred, we saw that the hose-error LED was on and the KFTHA LED was off.
At that point we suspected that the problem was caused by the KFTHA module.

But after we installed DECevent, the output of its diagnostic tool really confused me,
because it reported a problem with the system configuration
and said I have to replace all the modules.

I really need your suggestions!


rgrds

doni
SSE DIGITAL-INDONESIA


Here is some of the report from DECevent :


DECevent V2.3


******************************** ENTRY    1 ******************************** 


Logging OS                        2. Digital UNIX 
System Architecture               2. Alpha 
Event sequence number             0. 
Timestamp of occurrence              26-JAN-1997 19:32:20   
Host name                            utpci1 

System type register      x0000000C  AlphaServer 8x00 
Number of CPUs (mpnum)    x00000001 
CPU logging event (mperr) x00000000 

Event validity                    1. O/S claims event is valid 
Event severity                    5. Low Priority 
Entry type                      110. Generalized Machine State Type 

SWI Minor class                   3. System configuration 

--CONFIGURATION SUBPKT--               

FRU CLASS                     x0001  ** TLSB FRU Subpkt ** 

  Device Type                 x8014  Turbo-Laser Dual CPU, 4meg Bcache 

  TLSB Node #                     0. 
  FRU Name                           KN7CE-AB 
  Serial Number                        

************************               

FRU CLASS                     x0001  ** TLSB FRU Subpkt ** 

  Device Type                 x5000  Turbo-Laser Memory Module 

  TLSB Node #                     1. 
  FRU Name                           MS7CC 
  Serial Number                      ZG64401927 

************************               

FRU CLASS                     x0001  ** TLSB FRU Subpkt ** 

  Device Type                 x5000  Turbo-Laser Memory Module 

  TLSB Node #                     7. 
  FRU Name                           MS7CC 
  Serial Number                      ZG64401911 

************************               

FRU CLASS                     x0001  ** TLSB FRU Subpkt ** 

  Device Type                 x2000  Turbo-Laser I/O Module 

  TLSB Node #                     8. 
  FRU Name                           KFTHA 
  Serial Number                      AY62121481 

************************               

FRU CLASS                     x0002  * Hose to IO Bus Adptr * 

  Device Type                 xEF00  PCIA 

  Tiop                            8. 
  Hose                            0. 
  Slot                            0. 
  FRU Name                           DWLPA 
  Serial Number                      AY63810750 

************************               

FRU CLASS                     x0005  * PCI FRU Subpkt * 

  Device Type             x00091011  DEC_FASTNI 

  Tiop                            8. 
  Hose                            0. 
  Slot                            0. 
  FRU Name                           TULIP 
  PCI Ident Field (LO)    x000000C3 
  PCI Ident Field (HIGH)  x00001000 
  Bar Length                  x0048 
  Base Address 0          x0000000004333000 
  Size 0                  x00000100 
  Base Address 1          x0000000000183000 
  Size 1                  x00000100 
  Base Address 2          x0000000000000000 
  Size 2                  x00000000 
  Base Address 3          x00000000FFFFFFFF 
  Size 3                  xFFFFFFFF 
  Base Address 4          x00000000FFFFFFFF 
  Size 4                  xFFFFFFFF 
  Base Address 5          x00000000FFFFFFFF 
  Size 5                  xFFFFFFFF 

************************               

FRU CLASS                     x0005  * PCI FRU Subpkt * 

  Device Type             x00081011  DEC_KZPSA 

  Tiop                            8. 
  Hose                            0. 
  Slot                            0. 
  FRU Name                           KZPSA 
  PCI Ident Field (LO)    x000000C3 
  PCI Ident Field (HIGH)  x00002800 
  Bar Length                  x0048 
  Base Address 0          x0000000004320000 
  Size 0                  x00010000 
  Base Address 1          x0000000004200000 
  Size 1                  x00100000 
  Base Address 2          x0000000000182000 
  Size 2                  x00001000 
  Base Address 3          x0000000004332000 
  Size 3                  x00001000 
  Base Address 4          x0000000000000000 
  Size 4                  x00000000 
  Base Address 5          x00000000FFFFFFFF 
  Size 5                  xFFFFFFFF 

************************               

FRU CLASS                     x0005  * PCI FRU Subpkt * 

  Device Type             x00081011  DEC_KZPSA 

  Tiop                            8. 
  Hose                            0. 
  Slot                            0. 
  FRU Name                           KZPSA 
  PCI Ident Field (LO)    x000000C3 
  PCI Ident Field (HIGH)  x00003800 
  Bar Length                  x0048 
  Base Address 0          x0000000004310000 
  Size 0                  x00010000 
  Base Address 1          x0000000004100000 
  Size 1                  x00100000 
  Base Address 2          x0000000000181000 
  Size 2                  x00001000 
  Base Address 3          x0000000004331000 
  Size 3                  x00001000 
  Base Address 4          x0000000000000000 
  Size 4                  x00000000 
  Base Address 5          x00000000FFFFFFFF 
  Size 5                  xFFFFFFFF 

************************               

FRU CLASS                     x0005  * PCI FRU Subpkt * 

  Device Type             x00081011  DEC_KZPSA 

  Tiop                            8. 
  Hose                            0. 
  Slot                            0. 
  FRU Name                           KZPSA 
  PCI Ident Field (LO)    x000000C3 
  PCI Ident Field (HIGH)  x00004800 
  Bar Length                  x0048 
  Base Address 0          x0000000004300000 
  Size 0                  x00010000 
  Base Address 1          x0000000004000000 
  Size 1                  x00100000 
  Base Address 2          x0000000000180000 
  Size 2                  x00001000 
  Base Address 3          x0000000004330000 
  Size 3                  x00001000 
  Base Address 4          x0000000000000000 
  Size 4                  x00000000 
  Base Address 5          x00000000FFFFFFFF 
  Size 5                  xFFFFFFFF 

************************               

FRU CLASS                     x0002  * Hose to IO Bus Adptr * 

  Device Type                 xEF00  PCIA 

  Tiop                            8. 
  Hose                            1. 
  Slot                            0. 
  FRU Name                           DWLPA 
  Serial Number                      AY64616950 

************************               

FRU CLASS                     x0005  * PCI FRU Subpkt * 

  Device Type             x00081011  DEC_KZPSA 

  Tiop                            8. 
  Hose                            1. 
  Slot                            0. 
  FRU Name                           KZPSA 
  PCI Ident Field (LO)    x000000C7 
  PCI Ident Field (HIGH)  x00000800 
  Bar Length                  x0048 
  Base Address 0          x0000000004210000 
  Size 0                  x00010000 
  Base Address 1          x0000000004100000 
  Size 1                  x00100000 
  Base Address 2          x0000000000181000 
  Size 2                  x00001000 
  Base Address 3          x0000000004221000 
  Size 3                  x00001000 
  Base Address 4          x0000000000000000 
  Size 4                  x00000000 
  Base Address 5          x00000000FFFFFFFF 
  Size 5                  xFFFFFFFF 

************************               

FRU CLASS                     x0005  * PCI FRU Subpkt * 

  Device Type             x00011000  NCR_810 

  Tiop                            8. 
  Hose                            1. 
  Slot                            0. 
  FRU Name                           KZPAA 
  PCI Ident Field (LO)    x000000C7 
  PCI Ident Field (HIGH)  x00001800 
  Bar Length                  x0048 
  Base Address 0          x0000000004222300 
  Size 0                  x00000100 
  Base Address 1          x0000000000182300 
  Size 1                  x00000100 
  Base Address 2          x0000000000000000 
  Size 2                  x00000000 
  Base Address 3          x00000000FFFFFFFF 
  Size 3                  xFFFFFFFF 
  Base Address 4          x00000000FFFFFFFF 
  Size 4                  xFFFFFFFF 
  Base Address 5          x00000000FFFFFFFF 
  Size 5                  xFFFFFFFF 

************************               

FRU CLASS                     x0005  * PCI FRU Subpkt * 

  Device Type             x00081011  DEC_KZPSA 

  Tiop                            8. 
  Hose                            1. 
  Slot                            0. 
  FRU Name                           KZPSA 
  PCI Ident Field (LO)    x000000C7 
  PCI Ident Field (HIGH)  x00002800 
  Bar Length                  x0048 
  Base Address 0          x0000000004200000 
  Size 0                  x00010000 
  Base Address 1          x0000000004000000 
  Size 1                  x00100000 
  Base Address 2          x0000000000180000 
  Size 2                  x00001000 
  Base Address 3          x0000000004220000 
  Size 3                  x00001000 
  Base Address 4          x0000000000000000 
  Size 4                  x00000000 
  Base Address 5          x00000000FFFFFFFF 
  Size 5                  xFFFFFFFF 

************************               

FRU CLASS                     x0005  * PCI FRU Subpkt * 

  Device Type             x00011000  NCR_810 

  Tiop                            8. 
  Hose                            1. 
  Slot                            0. 
  FRU Name                           KZPAA 
  PCI Ident Field (LO)    x000000C7 
  PCI Ident Field (HIGH)  x00003800 
  Bar Length                  x0048 
  Base Address 0          x0000000004222200 
  Size 0                  x00000100 
  Base Address 1          x0000000000182200 
  Size 1                  x00000100 
  Base Address 2          x0000000000000000 
  Size 2                  x00000000 
  Base Address 3          x00000000FFFFFFFF 
  Size 3                  xFFFFFFFF 
  Base Address 4          x00000000FFFFFFFF 
  Size 4                  xFFFFFFFF 
  Base Address 5          x00000000FFFFFFFF 
  Size 5                  xFFFFFFFF 

************************               

FRU CLASS                     x0005  * PCI FRU Subpkt * 

  Device Type             x00011000  NCR_810 

  Tiop                            8. 
  Hose                            1. 
  Slot                            0. 
  FRU Name                           KZPAA 
  PCI Ident Field (LO)    x000000C7 
  PCI Ident Field (HIGH)  x00004800 
  Bar Length                  x0048 
  Base Address 0          x0000000004222100 
  Size 0                  x00000100 
  Base Address 1          x0000000000182100 
  Size 1                  x00000100 
  Base Address 2          x0000000000000000 
  Size 2                  x00000000 
  Base Address 3          x00000000FFFFFFFF 
  Size 3                  xFFFFFFFF 
  Base Address 4          x00000000FFFFFFFF 
  Size 4                  xFFFFFFFF 
  Base Address 5          x00000000FFFFFFFF 
  Size 5                  xFFFFFFFF 

************************               

FRU CLASS                     x0005  * PCI FRU Subpkt * 

  Device Type             x00011000  NCR_810 

  Tiop                            8. 
  Hose                            1. 
  Slot                            0. 
  FRU Name                           KZPAA 
  PCI Ident Field (LO)    x000000C7 
  PCI Ident Field (HIGH)  x00005800 
  Bar Length                  x0048 
  Base Address 0          x0000000004222000 
  Size 0                  x00000100 
  Base Address 1          x0000000000182000 
  Size 1                  x00000100 
  Base Address 2          x0000000000000000 
  Size 2                  x00000000 
  Base Address 3          x00000000FFFFFFFF 
  Size 3                  xFFFFFFFF 
  Base Address 4          x00000000FFFFFFFF 
  Size 4                  xFFFFFFFF 
  Base Address 5          x00000000FFFFFFFF 
  Size 5                  xFFFFFFFF 

************************               
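
One way to cross-check the configuration subpacket above against the configuration described at the top of this note is to count the PCI FRU names per hose. The following is only a minimal sketch: it assumes the report is available as plain text in exactly the layout pasted here, and the function name `tally_frus` and the shortened `SAMPLE` excerpt are made up for illustration.

```python
import re
from collections import Counter

# Shortened excerpt in the same layout as the DECevent output in this note.
SAMPLE = """
FRU CLASS                     x0005  * PCI FRU Subpkt *

  Device Type             x00081011  DEC_KZPSA

  Tiop                            8.
  Hose                            1.
  Slot                            0.
  FRU Name                           KZPSA

************************

FRU CLASS                     x0005  * PCI FRU Subpkt *

  Device Type             x00011000  NCR_810

  Tiop                            8.
  Hose                            1.
  Slot                            0.
  FRU Name                           KZPAA

************************
"""

def tally_frus(report: str) -> dict:
    """Count PCI FRU names per hose number in DECevent configuration text."""
    counts = {}
    # Subpackets are separated by rows of asterisks.
    for block in re.split(r"\*{10,}", report):
        if "PCI FRU Subpkt" not in block:
            continue  # skip TLSB and hose-adapter subpackets
        hose = re.search(r"Hose\s+(\d+)\.", block)
        name = re.search(r"FRU Name\s+(\S+)", block)
        if hose and name:
            per_hose = counts.setdefault(int(hose.group(1)), Counter())
            per_hose[name.group(1)] += 1
    return counts

for hose, frus in sorted(tally_frus(SAMPLE).items()):
    print(f"hose {hose}: {dict(frus)}")
```

Counting by hand, ENTRY 1 above lists 1 TULIP and 3 KZPSA on hose 0, and 2 KZPSA and 4 KZPAA on hose 1, which agrees with the stated configuration.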


******************************** ENTRY    2 ******************************** 


Logging OS                        2. Digital UNIX 
System Architecture               2. Alpha 
Event sequence number             1. 
Timestamp of occurrence              26-JAN-1997 19:32:20   
Host name                            utpci1 

System type register      x0000000C  AlphaServer 8x00 
Number of CPUs (mpnum)    x00000001 
CPU logging event (mperr) x00000000 

Event validity                    1. O/S claims event is valid 
Event severity                    5. Low Priority 
Entry type                      300. Start-Up ASCII Message Type 

SWI Minor class                   9. ASCII Message 
SWI Minor sub class               3. Startup 

ASCII Message 
    Alpha boot: available memory from 0x1800000 to 0x3ffbe000 
    Digital UNIX V3.2G (Rev. 62); Sun Jan 26 19:29:17 GMT+0700 1997  
    physical memory = 1024.00 megabytes. 
    available memory = 999.75 megabytes. 
    using 3923 buffers containing 30.64 megabytes of memory 
    Firmware revision: 4.1 
    PALcode: OSF version 1.21 
    AlphaServer 8400 Model EV56/440 
    Master cpu at slot 0. 
    Created FRU table configuration errorlog packet 
    tiop0 at tlsb0 node 8 
    tiop0: cpu interrupt mask being set as 1. 
    pci0 at tiop0 slot 0 
    tu0: DECchip 21140-AA: Revision: 1.2 
    tu0 at pci0 slot 2 
    tu0: DEC Fast Ethernet Interface, hardware address: 00-00-F8-1E-25-6E 
    tu0: console mode: selecting 10BaseT (UTP) port: half duplex: no link 
    pza0 at pci0 slot 5 
    pza0 firmware version: DEC  P01  A10    
    scsi0 at pza0 slot 0 
    rz1 at scsi0 bus 0 target 1 lun 0 (DEC     RZ28M    (C) DEC 0616) 
    rz2 at scsi0 bus 0 target 2 lun 0 (DEC     RZ28M    (C) DEC 0616) 
    rz3 at scsi0 bus 0 target 3 lun 0 (DEC     RZ28M    (C) DEC 0616) 
    pza1 at pci0 slot 7 
    pza1 firmware version: DEC  P01  A10    
    scsi1 at pza1 slot 0 
    rz9 at scsi1 bus 1 target 1 lun 0 (DEC     HSZ50-AX         V50Z) 
    rz10 at scsi1 bus 1 target 2 lun 0 (DEC     HSZ50-AX         V50Z) 
    rz11 at scsi1 bus 1 target 3 lun 0 (DEC     HSZ50-AX         V50Z) 
    rz12 at scsi1 bus 1 target 4 lun 0 (DEC     HSZ50-AX         V50Z) 
    pza2 at pci0 slot 9 
    pza2 firmware version: DEC  P01  A10    
    scsi2 at pza2 slot 0 
    tz21 at scsi2 bus 2 target 5 lun 0 (DEC     TZ875    (C) DEC 9B3C) 
    pci1 at tiop0 slot 1 
    pza3 at pci1 slot 1 
    pza3 firmware version: DEC  P01  A10    
    scsi3 at pza3 slot 0 
    rz25 at scsi3 bus 3 target 1 lun 0 (DEC     HSZ50-AX         V50Z) 
    rz26 at scsi3 bus 3 target 2 lun 0 (DEC     HSZ50-AX         V50Z) 
    rz27 at scsi3 bus 3 target 3 lun 0 (DEC     HSZ50-AX         V50Z) 
    rz28 at scsi3 bus 3 target 4 lun 0 (DEC     HSZ50-AX         V50Z) 
    psiop0 at pci1 slot 3 
    Loading SIOP: script c0001900, reg 4222300, data 406e38a0 
    scsi4 at psiop0 slot 0 
    rz37 at scsi4 bus 4 target 5 lun 0 (DEC     RRD45   (C) DEC  0436) 
    pza4 at pci1 slot 5 
    pza4 firmware version: DEC  P01  A10    
    scsi5 at pza4 slot 0 
    rz41 at scsi5 bus 5 target 1 lun 0 (DEC     RZ28M    (C) DEC 0616) 
    rz42 at scsi5 bus 5 target 2 lun 0 (DEC     RZ28M    (C) DEC 0568) 
    rz43 at scsi5 bus 5 target 3 lun 0 (DEC     RZ28M    (C) DEC 0568) 
    rz44 at scsi5 bus 5 target 4 lun 0 (DEC     RZ28D    (C) DEC 0010) 
    psiop1 at pci1 slot 7 
    Loading SIOP: script c000d900, reg 4222200, data c0019ca0 
    scsi6 at psiop1 slot 0 
    tz53 at scsi6 bus 6 target 5 lun 0 (DEC     TZ87     (C) DEC 9B3C) 
    psiop2 at pci1 slot 9 
    Loading SIOP: script c001f900, reg 4222100, data 406e40a0 
    scsi7 at psiop2 slot 0 
    tz58 at scsi7 bus 7 target 2 lun 0 (DEC     TZ87     (C) DEC 9B3C) 
    psiop3 at pci1 slot 11 
    Loading SIOP: script c002b900, reg 4222000, data 406e44a0 
    scsi8 at psiop3 slot 0 
    tz66 at scsi8 bus 8 target 2 lun 0 (DEC     TZ87     (C) DEC 9B3C) 
    TLMEM at node 7 
    TLMEM at node 1 
    Dual TLEP at node 0 
    lvm0: configured. 
    lvm1: configured. 
    dli: configured 
    SuperLAT. Copyright 1993 Meridian Technology Corp. All rights reserved. 
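
The startup message above also shows which adapter sits in which PCI slot. As a rough sketch only, assuming the autoconfiguration lines keep the "<adapter> at pciN slot M" form seen in this log, they can be grouped per PCI bus as follows. The name `adapters_by_pci` and the shortened `SAMPLE_BOOT` text are illustrative, not part of any Digital UNIX tool.

```python
import re

# A few lines copied from the startup message in this note.
SAMPLE_BOOT = """\
    pci0 at tiop0 slot 0
    tu0 at pci0 slot 2
    pza0 at pci0 slot 5
    scsi0 at pza0 slot 0
    pci1 at tiop0 slot 1
    psiop0 at pci1 slot 3
"""

def adapters_by_pci(boot_log: str) -> dict:
    """Map each PCI bus name to the (adapter, slot) pairs attached to it."""
    buses = {}
    # Only lines of the form "<adapter> at pciN slot M" are of interest;
    # "scsi0 at pza0 ..." and "pci0 at tiop0 ..." do not match.
    pattern = re.compile(r"^\s*(\w+) at (pci\d+) slot (\d+)\s*$", re.MULTILINE)
    for adapter, bus, slot in pattern.findall(boot_log):
        buses.setdefault(bus, []).append((adapter, int(slot)))
    return buses

for bus, adapters in adapters_by_pci(SAMPLE_BOOT).items():
    print(bus, adapters)
```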
      

1072.1. "Why 4 KZPAAs...?????" by WONDER::MUZZI () Mon Jan 27 1997 13:33
    
    
    Why do you have 4 KZPAAs on this system...? Only one is supported...and
    that's only supported as a connection to the CD-ROM.
    
    
1072.2. "3 units of KZPAA for TZ87 Tape drive" by DAIVC::ENGKOS () Wed Jan 29 1997 04:25
    Thanks for your quick reply.
    Do you mean the TL supports only one KZPAA?
    Actually, we use those KZPAAs for the TZ87 tape drives, because our
    customer needs them for parallel backup of their application.
    
    What if we'd like to add another device that needs a KZPAA
    as its interface?
    
    Rgrds
    engkos 
    daivc::engkos
    
1072.3. "Only one KZPAA supported.." by WONDER::MUZZI () Wed Jan 29 1997 12:25
    
    
    Only ONE KZPAA is supported...and only as a connection to a CD-ROM.
    It's in the SOC. The supported connection is thru KZPSA/KFTIA-differential
    to DWZZA-VAs. Additional single-ended devices need to be connected thru
    KZPSA/DWZZA. It's a more costly connection...but that's the way it is.
    The problem with the KZPAA is with the SCSI chip it uses (53c710..?).
    It runs on scripts that live in main memory. So every time it wants/has
    to do something it has to go to main memory to get the scripts.
    
    
    	-Mark-
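
    On the chip question: the DECevent report earlier in this topic labels
    the KZPAA entries NCR_810, with Device Type x00011000. The sketch below
    assumes that this 32-bit field is laid out like the first dword of PCI
    configuration space (device ID in the high 16 bits, vendor ID in the low
    16 bits). That layout is an assumption about DECevent's output, not
    something stated in the report; the vendor names are the standard PCI
    vendor ID assignments for the two values seen here.

```python
# PCI vendor IDs for the two values that appear in the report.
VENDORS = {
    0x1011: "Digital Equipment Corp.",
    0x1000: "NCR / Symbios Logic",
}

def split_device_type(value: int) -> tuple:
    """Split a 32-bit Device Type into (vendor_id, device_id).

    Assumes device ID in bits 31..16 and vendor ID in bits 15..0.
    """
    return value & 0xFFFF, (value >> 16) & 0xFFFF

# Labels and values copied from the DECevent report in 1072.0.
REPORTED = [
    ("DEC_FASTNI", 0x00091011),
    ("DEC_KZPSA", 0x00081011),
    ("NCR_810", 0x00011000),
]

for label, raw in REPORTED:
    vendor, device = split_device_type(raw)
    name = VENDORS.get(vendor, "unknown")
    print(f"{label}: vendor {vendor:#06x} ({name}), device {device:#06x}")
```

    If that reading is right, the KZPAA entries carry an NCR vendor ID, which
    fits the NCR_810 label in the report and would point to an 810-family
    part rather than a 53c710.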
    
    
1072.4. "What is the root cause ?" by DAIVC::AGUSSUSANTO () Thu Jan 30 1997 07:12
    I am just curious: my understanding is that if more than one KZPAA
    is installed, it will consume memory resources rather than make the
    system hang or crash. Anyway, I heard that the 3 KZPAAs were removed
    and the problem still exists.
    
    Do you have any ideas? It is very difficult to find the root cause,
    since there is nothing we can do but power-cycle every time the system
    hangs. That means there is no way to get the latest information resident
    in memory, because it is refreshed every time the system initializes.
                                                             
    rgds,
    as

1072.5. "Please check Power Regulator EPU value" by LANDO::DROBNER (TurboLaser Engineering - 8200/8400) Thu Jan 30 1997 12:44
    I am going to put a reply here similar to the one I posted in an
    earlier note stream.
    
    Please give us the complete system configuration. What we would like
    to see is: 1) 8200 or 8400 style cabinet. 2) Part number and quantity
    of power regulators in the cabinet. 3) The modules and where they
    are: system bus, PCI bus (DWLPA/B, quantity).
    
    Reading these notes, I would guess you have an 8400 style cabinet and
    one power regulator (H7263-AA/AB or H7263-AC/AD) in this cabinet.  If
    this is the case - please look in the 8400 SOC article and calculate
    the "EPU" value that the system is using (JAN-97 update, page 2.191).
    If you are close to the EPU value of 80, but not above, and you have
    only one power regulator in the system - I would recommend adding a
    second power regulator or replacing the original.
    
    /Howard

1072.6. "Total EPU value = 68" by DAIVC::AGUSSUSANTO () Fri Jan 31 1997 00:39
    Three power regulators (H7263-AB) are in the system (DA-292FD-BB),
    which means it is configured for N+1 redundant power. The table below
    shows the complete system configuration.
    
    OPTION        EPU   QTY   TOTAL EPU

    Base Server    30     1          30
    KFTHA-AA        3     1           3
    MS7CC-DA        5     2          10
    DWLPB-BA        1     2           2
    KZPSA-BB        1     7           7
    DE500-XA        1     1           1
    DWZZB-VW        0     2           0
    DWZZA-VA        0     2           0
    RZ28M-VW        1     6           6
    TZ87-VA         3     3           9

    TOTAL                            68
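
    The total in the table above can be re-checked with a few lines of
    Python. This is only a sketch: the per-option EPU values and quantities
    are copied from the table, and the figure of 80 is the value quoted in
    1072.5 for a system with one power regulator. The names `CONFIG`,
    `EPU_LIMIT` and `total_epu` are made up for illustration.

```python
# (option, EPU per unit, quantity), copied from the table in this note.
CONFIG = [
    ("Base Server", 30, 1),
    ("KFTHA-AA", 3, 1),
    ("MS7CC-DA", 5, 2),
    ("DWLPB-BA", 1, 2),
    ("KZPSA-BB", 1, 7),
    ("DE500-XA", 1, 1),
    ("DWZZB-VW", 0, 2),
    ("DWZZA-VA", 0, 2),
    ("RZ28M-VW", 1, 6),
    ("TZ87-VA", 3, 3),
]

EPU_LIMIT = 80  # figure quoted in 1072.5 for a single power regulator

def total_epu(config) -> int:
    """Sum EPU-per-unit times quantity over all options."""
    return sum(epu * qty for _option, epu, qty in config)

total = total_epu(CONFIG)
print(f"total EPU = {total}, headroom to {EPU_LIMIT} = {EPU_LIMIT - total}")
```

    The sum comes to 68, the same as the TOTAL line, which leaves 12 EPU of
    headroom below the figure of 80.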
    
    Any ideas are welcome
    
    /AS

1072.7. "check for unix patches...?" by WONDER::MUZZI () Fri Jan 31 1997 12:34
    
    
    You might want to check to see if there are any patches for unix/tape
    problems. It wouldn't be the first time that I've seen unix hang the
    system and it be a software issue.
    
    
    	-Mark-
    
    
1072.8. "Already applied" by DAIVC::AGUSSUSANTO () Mon Feb 03 1997 04:58
    I have the complete patch kit for V3.2G, and it was already applied to
    the system at installation time. FYI, the location of the patch kit is:
    
    ftp://oskits.zk3.dec.com/patches/osf/v3.2g/v3.2g_bpatch.tar

1072.9. "I HAVE THE SAME PROBLEMS" by NETRIX::"cesarato@jpo.mts.dec.com" (Cesarato) Wed Feb 19 1997 15:16
 Hi,
 I have the same problem: random system hangs. When it happens, all you
 can do is restart. I have had two crashes where DIA reported
 two different memory SIMMs with ECC errors, but that was a different problem.
 The system is an 8400/440 with 4 GB memory (2 boards of 2 GB at nodes 2 and 6),
 2 twin CPUs, 2 PCI buses with 8 KZPSA A10, 1 Memory Channel, 1 DE500, 1 DE435,
 1 DEFPA.
 Connected to the KZPSAs:
 2 KZPSA for 1 TL826
 4 KZPSA for 4 HSZ40

 Software configuration:

 OSF/1 3.2G
 ADVFS
 LSM
 ORACLE 7.2
 POLYCENTER NSR 4.2B
 EBU
 Some parameters have been changed for Oracle, such as:
 shared memory  at 2GB
 Shared memory seg 32
 MAXVAS = MACHINE_PHYSYCAL_MEMORY
 maxprc =1024


 LSM is configured with a mirrorset on the internal disks connected to the TIA
 and a 60GB mirrorset on the HSZ40s. The volumes are used by Oracle 7.2
 as raw devices.
 I have checked the firmware and installed the patches for OSF/1 3.2G, but
 the problem is still present.

 Any ideas?


[Posted by WWW Notes gateway]