[Search for users] [Overall Top Noters] [List of all Conferences] [Download this site]

Conference pamsrc::decmessageq

Title:	NAS Message Queuing Bus
Notice:	KITS/DOC, see 4.*; Entering QARs, see 9.1; Register in 10
Moderator:	PAMSRC::MARCUSEN

Created:	Wed Feb 27 1991
Last Modified:	Fri Jun 06 1997
Last Successful Update:	Fri Jun 06 1997
Number of topics:	2898
Total number of notes:	12363

2774.0. "killed app poisons queue and qe (v3.2A/Solaris)?" by WHOS01::ELKIND (Steve Elkind, Digital SI @WHO) Tue Feb 18 1997 01:56

    One of my users is having a problem using DmQ for HP-UX v 3.2A (no
    ECO).  If he kills (-15) a process attached to a single-reader
    permanent queue, the queue can no longer be attached to, and the QE can
    no longer be stopped with dmqmonc.

    This is happening on a Solaris (2.5?) system.  I have asked him to try
    to repeat this behavior on an HP-UX system (where I've never seen the
    behavior), and also to set PAMS_TRACE and DMQ_IPI_TRACE variables on
    the application process.  When I get back to the site on Wednesday, I
    will install the ECO and see if it makes a difference, although judging
    by the readme file onthe dowload page it should not.  In the meantime
    - does this sound familiar to anyone?

    > I am having a problem with DMQ32A.  If a server attaches to a permanent
    > queue (also permanent active), and the server process is killed, the 
    > queue is not released.  When I try to restart the server, I get a
    > [attach] error, with error code -106 (PAMS__DECLARED).
    > 
    > Even worse, when I try to bring down DmQ to clear the problem, using
    > dmqmonc, the dmqqe just doesn't die, even if I shut down bus and group.
    > If I manually kill dmqqe, I still have some ipc resource hanging, so
    > I have to clear the resource manually.
    > 
    > I know my server didn't do [detach] when killed.  But, shouldn't
    > DmQ still take care of this situation any way?  This didn't happen with 
    > DMQ30B or earlier.
    > 
    > Could you take a look at this and report it as a DmQ problem, if it is?
    > If it's some error on my side, please let me know what I did wrong.

T.R	Title	User	Personal Name	Date	Lines
2774.1	NR	XHOST::SJZ	Rocking the Messaging Desktop !	`Tue Feb 18 1997 02:50`	11
	not reproducible here on any platform. one thing to check is if their server process is still out there after the SIGTERM. for instance, if the process has mostly exited, but the process is a child of anoher process that has not handled the SIGCHLD signal, then the process may be out there as a zombie process. if the process is still out there in any form, then we leave it alone (don't detach it's queues). _sjz.
2774.2	thanks	WHOS01::ELKIND	Steve Elkind, Digital SI @WHO	`Tue Feb 18 1997 22:39`	2
	Thanks, zach, I'll look into it. This problem is on the same host on which we are having the dmqgcp signal 11 problems, naturally.
2774.3		XHOST::SJZ	Rocking the Messaging Desktop !	`Tue Feb 18 1997 23:10`	5
	well, if your gcp is dead then there is no one to clean up after processes that abnormally exit. _sjz.
2774.4	sorry for the confusion	WHOS01::ELKIND	Steve Elkind, Digital SI @WHO	`Wed Feb 19 1997 00:43`	3
	sorry to confuse you. The gcp problem was another user, bus, and group from this problem - not related at all other than being on the same machine.
2774.5		XHOST::SJZ	Rocking the Messaging Desktop !	`Thu Feb 20 1997 12:35`	6
	next timet his happens, please check the state of your dmqgcp process. you can use ps, and/or top if you have it. what i am looking for is CPU consumption. _sjz.