[Search for users] [Overall Top Noters] [List of all Conferences] [Download this site]

Conference netcad::hub_mgnt

Title:DEChub/HUBwatch/PROBEwatch CONFERENCE
Notice:Firmware -2, Doc -3, Power -4, HW kits -5, firm load -6&7
Moderator:NETCAD::COLELLADT
Created:Wed Nov 13 1991
Last Modified:Fri Jun 06 1997
Last Successful Update:Fri Jun 06 1997
Number of topics:4455
Total number of notes:16761

2695.0. "DECHUB 900 resets: errlog decoding ?" by MLNCSC::CAREMISE (and then they were ...four !) Thu Aug 31 1995 10:07

	A customer of mine is having a problem with one of the DECHub 900
	( HW : 1.1-6  FW: 3.1-0 ) installed on his netwk.
	The hw installed in the Hub is composed of 1 DECbridge 900 MX 
	(slot 8) and 6 DECrepeater 900TM (slots 1 to 6)
	
	What happened last night, caused the Hub to reset the entire confi-
	guration to scratch ( no more virtual LAN's, IP address ecc...)	
	with the disappointment of more than 150 users !

	No power off's were logged , but in the errorlog dump on the Hub
	we find two entries like :
	
		Entry = 89
		Time stamp = 0 2614
		Reset count = 20
		TRAP @411 in bufpool.c
	-----------------------------------
		Entry = 88
		Time stamp = 0 2735
		Reset count = 19
		TRAP @411 in bufpool.c


	How the hell can I decode this encrypted infos ??

	Is there some documentation where can I find trobleshooting hints	
	on this problem ?

	Any help will be appreciated .

	Sergio.
    
T.RTitleUserPersonal
Name
DateLines
2695.1what the trap meansTOOK::MILLBRANDTanswer mamThu Aug 31 1995 13:3518
Hi Sergio -

The common code base for the V3.1 DEChub 900 is no longer
on disk, so I can't readily look up the exact line of code
that was trapped.  However, the trap was probably related
to a buffer being returned to pool twice - two separate
code routines thought they owned it.

A number of bugs of this nature, including a tendency
to run out of buffers, have been fixed in the latest
release of code for the DEChub 900 and the DECbridge900.
A new code release for the DECrepeater 900TM is still
being prepared.

You should read the notes in this conference about
upgrading, and prepare to upgrade your site.

	Dotsie
2695.2will upg fix the problem ?MLNCSC::CAREMISEand then they were ...four !Thu Aug 31 1995 14:494
    
    Do you mean , upgrading to v. 4.02 will fix the problem ?
    
    
2695.3TOOK::MILLBRANDTanswer mamThu Aug 31 1995 16:3121
>    Do you mean , upgrading to v. 4.02 will fix the problem ?

What I mean is that many, many fixes were made between V3.1 
and V4.0.2, and V4.0.2 is the one we currently support.  I
can't promise it will solve your problem, but the odds of
someone looking at it will be much higher.

A trap on an occurrence such as a buffer returning twice
is a bit of a catch-all.  There is no indication of what
code functions or routines were both using the same buffer.
Thus, your log entry alone can't tell us what caused the
mishandled buffers, and we can't tell if that matches a
known fix.  Perhaps the error logs on the modules in the
hub show some activity at about the same time as the crash.

There will be an upgrade of V4.0.2 available in N weeks,
where N = "contact the product manager".  If this crash
is infrequent and upgrading is not easy at your site,
you could wait.  Otherwise, just do it!

	Dotsie
2695.4..troubleshooting procedures ?MLNCSC::CAREMISEand then they were ...four !Fri Sep 01 1995 07:5513
    
    	Thanks a lot for the explanation .
    	
    	The last isssue remain unsolved : How can a FS engineer decode such
    	a entries , or better where are the troubleshooting procedures
    	documented ?
    
    	Just in case other problems arises .....8-)
    
    	Ciao e grazie.
    
    	Sergio.
    
2695.5NETCAD::DOODYMichael DoodyFri Sep 01 1995 12:5929
     	Here's your errorlog entry as an example:

		Entry = 89
		Time stamp = 0 2614
		Reset count = 20
		TRAP @411 in bufpool.c


		Entry = 89 
	This is the 89th entry written to the errorlog for this Hub. You
	can only see the last 6 entries; older ones are overwritten.


		Time stamp = 0 2614
	This is how long the Hub was operational at the time the errorlog
	entry was written. This Hub was up 26.14 seconds.


		Reset count = 20
	The hub has been reset 20 times at the time the entry was written.


		TRAP @411 in bufpool.c
	The Hub crashed at line 411 in the source code module bufpool.c.
    
    
    	This is the format of the MAM's errorlog entries. Other products' 
    	will be more or less similar depending on the particular
    	requirements of the product.