[Search for users] [Overall Top Noters] [List of all Conferences] [Download this site]

Conference bulova::decw_jan-89_to_nov-90

Title:DECWINDOWS 26-JAN-89 to 29-NOV-90
Notice:See 1639.0 for VMS V5.3 kit; 2043.0 for 5.4 IFT kit
Moderator:STAR::VATNE
Created:Mon Oct 30 1989
Last Modified:Mon Dec 31 1990
Last Successful Update:Fri Jun 06 1997
Number of topics:3726
Total number of notes:19516

197.0. "FYI-changing server write retry control parameters" by STAR::TAN () Tue Feb 14 1989 15:12

Problem Description:

  Occasionally when the server has replies and events to write to a client 
  and network output buffers are unavailable to perform the write operation, 
  the current server would attempt the same write for a number of times
  before disconnecting the non-responsive client.

  In the duration of the retries, the server would not serve any other
  client, and to the user, it would appear that the server is hung.

Here is how to change the default behavior:

  The retry count and retry interval are set with some internal default
  values, but can be modified by defining a couple of logical names:

	DECW$SERVER_RETRY_WRITE_MIN
        DECW$SERVER_RETRY_WRITE_MAX

  DECW$SERVER_RETRY_WRITE_MIN defines the interval between retries in the unit
  of 1/10 of a ms; for example, the default is 15000, which is .15 second.
  DECW$SERVER_RETRY_WRITE_MAX defines the amount of time the server will
  spend in retry before forcing the misbehaved client to be disconnected.
  The default values for this is currently set to be 300000, which is about
  20 retries, in 3 seconds.

  PLEASE NOTE:
    When server changes are allowed the timer unit will be upgraded to be in ms.

T.RTitleUserPersonal
Name
DateLines
197.1any hints ?VISA::BIJAOUITomorrow Never KnowsTue Feb 14 1989 16:217
    Could you please give us some typical values that would make the server
    more reliable, without causing any big pain to the whole system (I
    mean, so the server doesn't have a pathologic behaviour).
    
    
    Pierre.

197.2Kids! Have fun! Experiment!DECWIN::FISHERBurns Fisher 381-1466, ZKO3-4/W23Tue Feb 14 1989 23:3412
    Well, the default values were supposed to be reasonable, but as Edith
    implied, the timing came out to be 10ths of milliseconds instead of
    milliseconds.  Thus, if you multiply the retry interval default values
    will see more what we intended.
    
    Actually,  you might try leaving the retry interval the same and
    multiplying only the number of retries.  That will give it more chances
    to succeed before it finally gives up.
    
    Burns
    

197.3QuestionsGOBBLR::MULHERENKelly Mulheren, GObE & NetEdWed Feb 15 1989 15:5417
A few questions about defining these logical names:

1. When can they be defined? On a running server, strictly at server startup,
   between successive "session quit/session start" pairs?

2. What logical name table should they be defined in? System, DECwindows?

I'm very anxious to experiment with these. The default values make it difficult
to do interactive transformations on moderately complex GObE and NetEd objects
from remote clients without having connections aborted. Several products that
use GObE and NetEd have reported this problem to us and I'd love to have a
solution, or at least a temporary workaround.

Thanks,

Kelly

197.4DECWIN::FISHERBurns Fisher 381-1466, ZKO3-4/W23Wed Feb 15 1989 16:4610
They should get read every time the server resets (that means at startup and
everytime you log out...when the cursor moves itself to the middle).

As to the table, the server looks in LNM_FILE_DEV, but it has previously
modified LNM_FILE_DEV to have DECW$SERVER0_TABLE at the beginning (assuming
server 0, which is the only one we support).  Thus you should be able to
put it in the SYSTEM table, as long as it does not appear in DECW$SERVER0_TABLE.

Burns

197.5re: .1 & .3STAR::TANWed Feb 15 1989 17:0314
    Re:.3
    
       I would just define them in the DECW$SERVER0_TABLE to play it safe.
    
    Re:.1
    
       You can define them as what Burns has suggested, or just make them
       10 times the default values. That should give you a longer, yet
       tolerable retry period, so you can get your work done.  This is
       strictly for V51 server.
    
    
    	Edith

197.6VISA::BIJAOUITomorrow Never KnowsThu Feb 16 1989 05:188
197.7re: .6STAR::TANThu Feb 16 1989 13:5410
    Sorry, I meant to say for version 1 server.  There is no change
    on this for v52 server either.  But on version 2 server(no idea which
    version of vms it would tie in), I plan to change the default retry 
    interval unit to ms, so that the retry interval will be every 1.5 cpu sec.,
    and a total of 20 retries.
    
    
    	Edith
    

197.8retry logicals for v52 serverSTAR::TANWed May 31 1989 16:028
    For the v52 decwindows server, these two logicals are pre-defined
    to different values in the decw$startserver.com.  After some 
    experiment, we found a smaller interval and more retries to be
    a reasonable setting.  So they are set to be .5 second interval and
    60 retries.
    
    		Edith

197.9What about ...VISA::BIJAOUITomorrow Never KnowsWed May 31 1989 16:537
    For DECnet connections, what about making the retry period as long as
    DECnet time-out for links (is it 240sec time-out by default). That
    would make the whole lot consistent, wouldn't it ?
    
    
    Pierre.

197.10We'll stay with our defaultsSTAR::TANThu Jun 01 1989 16:437
    We are interested only to keep the server alive, and a reasonably short 
    delay, approixmately 30 seconds, is about as much as we can tolerate.
    But you can always experiment with setting the max to the decnet
    timeout value to see whether it helps in your environment. 
    
    	Edith

197.11Come AgainCSC32::JJONESJeff JonesMon Jun 12 1989 21:5722
    Please, oh, please - Am running VMS 5.2-42W and for two main remote
    applications (DECwrite and BOOKreader) I can only maintain a remote
    link for, on good days, 5 to 10 minutes without getting XLIB IO ERROR.
    
    My default settings are as follows:
    
    	DECW$SERVER_RETRY_WRITE_MAX = 30000
    	DECW$SERVER_RETRY_WRITE_MIN = 500
    
    Have tried setting them to 1000000 and 25000, respectively. If anything
    it got worse. But the problem is real sporadic.
    What should I have them set to?
    
    [extra]
    	Have not been able to test this out yet but have others which
    are plauged with the same problem, they are on THINWIRE (like me).
    Got a person on THICKWIRE, and he swears that he never sees the
    problem. Could this somehow be related to the problem?
    
    Please respond,
    jjjones/Colorado CSC DECwindows Support

197.12What's your configuration and network load?EPIK::BUEHLERLet's get out there and kick some bitts.Tue Jun 13 1989 12:578
    What is the network load on the machine you are running DECwrite from?
    Some users have suggested that the Xlib IO Error problems stem from
    network loading.  [This may be obvious to those in the know]  One user
    disabled recognition of boot requests on their boot member and he says
    that the error rate dropped considerably.
    
John

197.13Try this....VINO::WITHROWRobert WithrowTue Jun 13 1989 13:4819
> Have not been able to test this out yet but have others which
> are plauged with the same problem, they are on THINWIRE (like me).
> Got a person on THICKWIRE, and he swears that he never sees the
> problem. Could this somehow be related to the problem?

I had similar problems with a VS2000 once when another person connected
improperly to the same thinwire port.  I experienced *very* slow decwindows
operation and droped links!  If this is your situation, I suggest you take
the following actions to see if it helps:

  1) Disconnect *everything* from the thinwire port (on the wall).
  2) Connect *only* your workstation.  Perform the connection *exactly*
     as described the the hardware users guide for your workstation.  Use
     *exactly* the same hardware/terminator/cable as described!
  3) Use the *as installed* defaults for the parameters you are twiddling.

I'll think you will then have a stable decwindows link situation.  You can then
try adding other hardware to the thinwire and see what happens.

197.14VWSENG::KLEINSORGEToys 'R' UsTue Jun 13 1989 13:5411
    
    I occasionally go through the office looking for improperly terminated
    thinwire connections.  Using the standard DECconnect stuff in our
    offices you can tell a bad connection almost immediately - it usually
    is a "Tee" connector hooked to  the DECconnect box.  This usually means
    2 terminators in the office.  There can only be ONE terminator on this
    type configuration.  The standard problem is people with 2, 3 or more
    devices in their office with thinwire connections.
    
    

197.15is this new math?NEURON::NICHOLSONA belly as big as an oil spillSun Jun 18 1989 00:0817
    I have been confused by some of the information in this note.
    
    In .0, it was stated that the unit for retry is 1/10 of a millisecond
    and that the default setting is 15000 or .15 seconds.  I don't think
    that math works, i.e. 15000 times 1/10 of a millisecond is 1.5.
    
    In .7 it was stated that the unit didn't change for v52 server.
    
    In .8 it states that the logicals are set in v5.2 startup files
    to have a retry of .5 second and 60 retries.  The values that I
    see in my server table are 500 for MIN and 30000 for MAX.  That
    would only be true if the unit were milliseconds.
    
    What's going on?
    Thanks,
    Mark

197.16No new math, just human errorSTAR::TANMon Jun 19 1989 15:168
    No, this is not new math. It was a programming error internal to the
    server, which caused some confusion of the unit used in the retry 
    count for the vms V52 decwindows server.  For the vms V2 decwindows
    server, the default value for the write-retry logicals will be those
    stated in 197.8.  I apologize for the confusion caused.
    
    	Edith

197.17Let's get all the info in one place!BOOTIS::WESTONFish shaped hysteriaThu Jul 20 1989 15:2421
Let's get all the info in one place.  Can someone in the know confirm or 
correct the following.

For the V52 server:

    The units are in mS.

    The default value for DECW$WSERVER_RETRY_MIN is 500 (i.e., 0.5 seconds).

    The default value for DECW$WSERVER_RETRY_MAX is 30000 (i.e., 30 seconds).

    The logicals should be defined /TABLE=DECW$SERVER0_TABLE in 
	SYS$MANAGER:DECW$STARTSERVER.COM

    To make the new values take effect, I must log out of DECwindows and back in
	again.

Please confirm or correct this information.

-Les.

197.18One correctionDECWIN::FISHERBurns Fisher 381-1466, ZKO3-4/W23Fri Jul 21 1989 19:4729
Let's get all the info in one place.  Can someone in the know confirm or 
correct the following.

For the V52 server:

    The units are in mS.
***Correct

    The default value for DECW$WSERVER_RETRY_MIN is 500 (i.e., 0.5 seconds).
***Correct, except for the "w" in the name

    The default value for DECW$WSERVER_RETRY_MAX is 30000 (i.e., 30 seconds).
***Correct (again, except for the "W")

    The logicals should be defined /TABLE=DECW$SERVER0_TABLE in 
	SYS$MANAGER:DECW$STARTSERVER.COM

***No.  This will work, but the correct place to put them is in your
***SYS$MANAGER:DECW$PRIVATE_SERVER_SETUP.COM file, assuming you want something
***different from the default.  If you define these things as global symbols,
***the right thing will go into the right table.

    To make the new values take effect, I must log out of DECwindows and back in
	again.

***Correct.

Burns

197.19Happening on a new 3100 SPX machine...LAIDBK::ELLISONThat is truly a wetbrain concept.Wed May 02 1990 20:5514
    
    I'm running into this error on a VAXstation 3100 SPX with local
    connections. The server dies when it happens. (At least I that's whats
    causinag the server to die.)
    
    I know that there are some logicals that can be defined to get more
    logging info for the server, but I can't find them here in the notes
    file...
    
    Could some one point me in the right direction or provide an assist?
    
    Thanks -
    	Jan
    	dtn 533-7787
197.20Get the server logDECWIN::FISHERPrune Juice: A Warrior's Drink!Wed May 02 1990 21:451
    You should always get a log DECW$SERVER_0_ERROR.LOG in SYS$MANAGER.
197.21I'm looking for more detail thoughLAIDBK::ELLISONThat is truly a wetbrain concept.Wed May 02 1990 22:149
    
    Yeh- I know that, the problem is that there doesn't seem to be enough 
    (sometimes any) info to make a firm determination.
    
    I've seen reference to a logical name which would turn on a more
    detailed level of logging, but I just can't seem to find it...
    
    	Jan
    	dtn 533-7787
197.22No variations for server loggingSTAR::VATNEPeter Vatne, VMS DevelopmentWed May 02 1990 23:104
>    I've seen reference to a logical name which would turn on a more
>    detailed level of logging, but I just can't seem to find it...

There is no logical name to make the server log any additional information.
197.23RatsLAIDBK::ELLISONThat is truly a wetbrain concept.Wed May 02 1990 23:243
    
    Rats- Oh well, thanks for the help
    
197.24STAR::KLEINSORGEFred Kleinsorge, VMS DevelopmentThu May 03 1990 13:405
    
    How about - DECW$SERVER_DUMP
    
    This should get a process dump on the server...
    
197.25DECWIN::FISHERPrune Juice: A Warrior's Drink!Thu May 03 1990 15:4610
That is only if the server dies, and besides, I have never been able to do
anything useful with a server dump.

I think you'll find that if the server log does not say anything about a
process being thrown off, (with more than just "connection closed") that
it was not the server which initiated the shutdown (in particular it was not
because the event queue filled up.  I'm pretty sure there is a log message for
that.)

Burns
197.26More on thisLAIDBK::ELLISONThat is truly a wetbrain concept.Thu May 03 1990 20:2616
    
    I'm still trying to get solid information on these crashes. 
    
    The symtoms are that the user 'quits' via the session manager and the
    server goes away, no logo comes up, no login dialog box. In fact it's
    like being at an LA100 glass teletype...
    
    The server log file is either empty or has the error in the .-?
    note indicated...
    
    	Jan
    
    I was sure that some kind of logical name existed for increasing
    logging information. Maybe it was for the Session Manager?
    
    
197.27DECWIN::JMSYNGEJames M Synge, VMS DevelopmentMon May 14 1990 16:058
    Jan,
    	Have you looked at the accounting data to see what the final
    	status for the server process was?  If you don't have accounting
    	turned on, please do so, as this data is useful.
    
    	Which version of VMS are you running?
    
    James
197.28Patch availableDECWIN::ROSENBLUMTue Jun 05 1990 14:314
This looks like the same problem described in note 2665

There is now a patch available thru the CSC, more info
in 2665