[Search for users] [Overall Top Noters] [List of all Conferences] [Download this site]

Conference smurf::ase

Title:ase
Moderator:SMURF::GROSSO
Created:Thu Jul 29 1993
Last Modified:Fri Jun 06 1997
Last Successful Update:Fri Jun 06 1997
Number of topics:2114
Total number of notes:7347

2047.0. "can't relocate service.. timeout error" by WRKSYS::ALONGI () Tue May 06 1997 18:32

Hi,

I am running DUNIX V4.0B and ASE V1.4 on 2 turbolasers.  I can't fail 
over a service from 1 to the other.  Here is the error message when I try
to fail over from bigbird to zoe.   Any ideas what I am doing wrong.  This 
use to work and I don't think I changed anything. 

May  6 14:06:31 bigbird ASE: bigbird Agent Notice: stopping service asearchive
May  6 14:06:34 bigbird ASE: zoe Director Notice: stopped asearchive on bigbird
May  6 14:06:34 bigbird ASE: zoe Agent Notice: starting service asearchive
May  6 14:07:37 bigbird ASE: zoe Agent Error: timeout waiting on /var/ase/sbin/n
fs_mountd action completion
May  6 14:08:39 bigbird ASE: zoe Agent Error: timeout waiting on /var/ase/sbin/n
fs_mountd action completion
May  6 14:08:40 bigbird ASE: zoe Director ***ALERT: Unable to start service asea
rchive on zoe.
May  6 14:08:40 bigbird ASE: zoe Agent Notice: starting service asearchive
May  6 14:09:42 bigbird ASE: zoe Agent Error: timeout waiting on /var/ase/sbin/n
fs_mountd action completion
May  6 14:10:44 bigbird ASE: zoe Agent Error: timeout waiting on /var/ase/sbin/n
fs_mountd action completion
May  6 14:10:45 bigbird ASE: bigbird Agent Notice: starting service asearchive
May  6 14:10:49 bigbird ASE: zoe Director Notice: started asearchive on bigbird
May  6 14:10:49 bigbird ASE: bigbird AseMgr Error: Service is on bigbird.eng.pko
.dec.com, not zoe - Relocation not successful.
May  6 14:10:49 bigbird ASE: bigbird AseMgr Error: Unable to relocate `asearchiv
e` to `zoe`.

T.RTitleUserPersonal
Name
DateLines
2047.1BACHUS::DEVOSManu Devos NSIS Brussels 856-7539Tue May 06 1997 20:397
    Hi,
    
    You got a timeout  on the nfs_mountd completion. Are you sure that the
    system you tried to relocate to is NFS configured, that /etc/hosts is
    still identical to the first member?
    
    Manu.
2047.2WRKSYS::ALONGIWed May 07 1997 12:167
yup.. the hosts files are identical.  And the system is definitely nfs
configured exactly the same way as the other one

Anything else I can check?

Thanks,
Doreen
2047.3HELP !!WRKSYS::ALONGIWed May 07 1997 17:45101
HELP!!

I made things much worse now.  I tried to delete the node that I couldn't
relocate to and it won't delete.  I have been sitting here for about 15 minutes.

Member to delete: zoe

Is this correct (y/n) [y]:
Deleting member 'zoe'...
................................................................................
...................................................


The log file says

May  7 13:20:33 bigbird ASE: zoe Director Warning: Director exiting...
May  7 13:20:33 bigbird ASE: bigbird Agent Notice: starting a new director
May  7 13:21:50 bigbird ASE: zoe Agent Error: timeout waiting on /var/ase/sbin/n
fs_mountd action completion
May  7 13:21:51 bigbird ASE: zoe Agent Notice: /var/ase/sbin/ase_mount_action: /
var/ase/mnt/aseusers/lmnt/graphx-users: not currently mounted
May  7 13:21:51 bigbird ASE: zoe Agent Notice: /var/ase/sbin/ase_mount_action: /
var/ase/mnt/aseusers/lmnt/graphx-users already unmounted
May  7 13:22:54 bigbird ASE: zoe Agent Error: timeout waiting on /var/ase/sbin/n
fs_mountd action completion
May  7 13:22:54 bigbird ASE: zoe Agent Notice: /var/ase/sbin/ase_mount_action: /
var/ase/mnt/asearchive/lmnt/archive: not currently mounted
May  7 13:22:54 bigbird ASE: zoe Agent Notice: /var/ase/sbin/ase_mount_action: /
var/ase/mnt/asearchive/lmnt/archive already unmounted
May  7 13:23:48 bigbird ASE: bigbird AseMgr Warning: timeout waiting on Reply to
 ASE_DELETE_MEMBER
May  7 13:23:48 bigbird ASE: bigbird AseMgr Notice: director request timed out,
retrying...
May  7 13:23:56 bigbird ASE: zoe Agent Error: timeout waiting on /var/ase/sbin/n
fs_mountd action completion
May  7 13:23:57 bigbird ASE: zoe Agent Notice: /var/ase/sbin/ase_mount_action: /
var/ase/mnt/asebigbird/lmnt/bigbird: not currently mounted
May  7 13:23:57 bigbird ASE: zoe Agent Notice: /var/ase/sbin/ase_mount_action: /
var/ase/mnt/asebigbird/lmnt/bigbird already unmounted
May  7 13:24:59 bigbird ASE: zoe Agent Error: timeout waiting on /var/ase/sbin/n
fs_mountd action completion
May  7 13:24:59 bigbird ASE: zoe Agent Notice: /var/ase/sbin/ase_mount_action: /
var/ase/mnt/aseneon/lmnt/neon: not currently mounted
May  7 13:24:59 bigbird ASE: zoe Agent Notice: /var/ase/sbin/ase_mount_action: /
var/ase/mnt/aseneon/lmnt/neon already unmounted
May  7 13:25:51 bigbird ASE: zoe Director Warning: timeout waiting on Reply to A
SE_DELETE_MEMBER
May  7 13:25:51 bigbird ASE: zoe Director Notice: deleted member zoe
May  7 13:25:53 bigbird ASE: bigbird AseMgr Warning: msgsvc: discarding unclaime
d reply. seq: 11
May  7 13:26:00 bigbird ASE: zoe Agent Error: timeout waiting on /var/ase/sbin/n
fs_mountd action completion
May  7 13:27:01 bigbird ASE: zoe Agent Error: timeout waiting on /var/ase/sbin/n
fs_mountd action completion
May  7 13:28:02 bigbird ASE: zoe Agent Error: timeout waiting on /var/ase/sbin/n
fs_mountd action completion
May  7 13:28:53 bigbird ASE: bigbird AseMgr Warning: timeout waiting on Reply to
 ASE_DELETE_MEMBER
May  7 13:28:54 bigbird ASE: bigbird AseMgr Notice: director request timed out,
retrying...
May  7 13:29:03 bigbird ASE: zoe Agent Error: timeout waiting on /var/ase/sbin/n
fs_mountd action completion
May  7 13:29:04 bigbird ASE: zoe Agent Notice: restarting Agent!
May  7 13:29:04 bigbird ASE: zoe Director Warning: msgsvc: discarding unclaimed
reply. seq: 12
May  7 13:30:57 bigbird ASE: zoe Director Notice: msgSvc: unclaimed timeout


When I try to run asemgr and it hangs on the node I am already running the
delete on

# asemgr

....


And if I try it on the node that I tried to delete I get

# asemgr

Enter a comma-separated list of all hostnames you want as ASE servers.
Enter Members:
# tail -f daemon.log
May  7 13:29:03 zoe ASE: zoe Agent Error: timeout waiting on /var/ase/sbin/nfs_m
ountd action completion
May  7 13:29:04 zoe ASE: zoe Agent Notice: restarting Agent!
May  7 13:29:04 zoe ASE: zoe Director Warning: msgsvc: discarding unclaimed repl
y. seq: 12
May  7 13:29:04 zoe ASE: local AseMgr Notice:
May  7 13:29:08 zoe ASE: zoe Agent Notice: in install state
May  7 13:29:09 zoe ASE: local HSM Notice: Network interface fta0 16.122.176.221
 UP
May  7 13:29:10 zoe ASE: local HSM ***ALERT: HSM_NI_STATUS:16.122.176.221:UP
May  7 13:29:10 zoe ASE: local Simulator Notice: snd: exiting...
May  7 13:30:57 zoe ASE: zoe Director Notice: msgSvc: unclaimed timeout
May  7 13:32:22 zoe ASE: local AseMgr Notice: Agent is in INSTALL STATE


HELP.. what should I do.  Fortunately the services are still available.

Doreen
2047.4a little more infoWRKSYS::ALONGIWed May 07 1997 18:1126
I finally got

May  7 13:55:59 zoe ASE: bigbird AseMgr Error: Unable to get DB.
May  7 14:00:12 zoe ASE: zoe AseMgr Error: Member change failed
May  7 14:00:12 zoe ASE: zoe AseMgr Error: Delete member failed.  Check syslog's
 daemon log for reason.


but then I tried to do a status of the services and it is not giving me back
anything but dots.

Enter your choice: s

                Obtaining ASE Status

    m)  Display the status of the members
    s)  Display the status of a service
    l)  Display the location of logger(s)
    v)  Display the level of logging

    x)  Exit to the Main Menu            ?)  Help

Enter your choice [x]: s

........................................................

2047.5BACHUS::DEVOSManu Devos NSIS Brussels 856-7539Wed May 07 1997 21:2611
    Doreen,
    
    You definitely have a problem with nfs_mountd. I am at home and have
    not access to the script, but maybe you can check the contents of
    /etc/exports file for a .INCLUDE line and the subsequent files. The
    /var/ase/sbin/nfs_mountd is a script, so maybe you can place a "set -x"
    at the beginning to find what is happening by looking in the
    daemon.log?
    
    Manu.
    
2047.6corrupt databaseWRKSYS::ALONGIThu May 08 1997 11:068
panic over

The database files had different sizes and  dates so I copied the good one
over to the bad one and rebooted and everything is working now.  I can 
relocate services too.

thanks
doreen