Netboot Mailing List (by thread)

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Hanging mountd



Hello,

I have recently started to experience a strange problem with rpc.mountd in
redhat 5.1. We are running a number of sites using a Linux server and a
number of diskless workstations with etherboot. Workstations are used as
character-based terminals, to telnet to the server, plus they have a
printer connected, accesible via lpd. These sites are remote. Each of
the servers got two network cards, one to connect to the WAN backbone,
while the hub with the workstations is connected to the second card. 

Unfortunately we started running into major problems as rpc.mountd
intermittently hangs, and though it still appeares in ps table it would
not serve any mount requests. If I do showmount localhost I am getting rpc
timed out.

Sometimes this happens when a server just came up, sometimes after it has
been working for a while, after restarting mountd everything works again.
If I send a signal (HUP or USR1) in this hanging state, mountd dies
instead of rereading /etc/exports. And I am getting no syslog messages. 

The problem is intermittent, I can not reproduce it at will, it
happens now and then at random. I guess if I only had one site I would not
have noticed this, but we are running about 140 sites set up like this, so
I am getting about 10 sites that are hanging every day.  I tried to
upgrade to nfs 2.2beta37 but that did not make any difference, same story
as with redhat's 2.2beta29. It does not seem to be happening with redhat
4.2 based systems though. I will configure a couple of sites to run with
-d call option, but I will have to wait for the problem to reoccur. 

My /etc/exports

/tftpboot/rootfs.2     ws01.ucs(rw,no_root_squash)
/tftpboot/rootfs.3     ws02.ucs(rw,no_root_squash)
/tftpboot/rootfs.4     ws03.ucs(rw,no_root_squash)
/tftpboot/rootfs.5     ws04.ucs(rw,no_root_squash)
/tftpboot/rootfs.6     ws05.ucs(rw,no_root_squash)
/tftpboot/rootfs.7     ws06.ucs(rw,no_root_squash)
/tftpboot/rootfs.8     ws07.ucs(rw,no_root_squash)
/tftpboot/rootfs.9     ws08.ucs(rw,no_root_squash)
/tftpboot/rootfs.10     ws09.ucs(rw,no_root_squash)
/tftpboot/rootfs.11     ws10.ucs(rw,no_root_squash)
/tftpboot/rootfs.12     ws11.ucs(rw,no_root_squash)
/tftpboot/rootfs.13     ws12.ucs(rw,no_root_squash)
/tftpboot/rootfs.14     ws13.ucs(rw,no_root_squash)
/tftpboot/rootfs.15     ws14.ucs(rw,no_root_squash)
/tftpboot/rootfs.16     ws15.ucs(rw,no_root_squash)
/tftpboot/rootfs.17     ws16.ucs(rw,no_root_squash)
/tftpboot/rootfs.18     ws17.ucs(rw,no_root_squash)
/tftpboot/rootfs.19     ws18.ucs(rw,no_root_squash)
/tftpboot/rootfs.20     ws19.ucs(rw,no_root_squash)
/tftpboot/rootfs.21     ws20.ucs(rw,no_root_squash)
/tftpboot/rootfs.22     ws21.ucs(rw,no_root_squash)
/tftpboot/rootfs.23     ws22.ucs(rw,no_root_squash)
/tftpboot/rootfs.24     ws23.ucs(rw,no_root_squash)
/tftpboot/rootfs.25     ws24.ucs(rw,no_root_squash)
/tftpboot/rootfs.26     ws25.ucs(rw,no_root_squash)
/tftpboot/rootfs.27     ws26.ucs(rw,no_root_squash)
/tftpboot/rootfs.28     ws27.ucs(rw,no_root_squash)
/tftpboot/rootfs.29     ws28.ucs(rw,no_root_squash)
/tftpboot/rootfs.30     ws29.ucs(rw,no_root_squash)
/tftpboot/rootfs.31     ws30.ucs(rw,no_root_squash)
/tftpboot/rootfs.32     ws31.ucs(rw,no_root_squash)
/tftpboot/rootfs.33     ws32.ucs(rw,no_root_squash)
/tftpboot/rootfs.34     ws33.ucs(rw,no_root_squash)
/tftpboot/rootfs.35     ws34.ucs(rw,no_root_squash)
/tftpboot/rootfs.36     ws35.ucs(rw,no_root_squash)
/tftpboot/rootfs.37     ws36.ucs(rw,no_root_squash)
/tftpboot/rootfs.38     ws37.ucs(rw,no_root_squash)
/tftpboot/rootfs.39     ws38.ucs(rw,no_root_squash)
/tftpboot/rootfs.40     ws39.ucs(rw,no_root_squash)
/tftpboot/rootfs.41     ws40.ucs(rw,no_root_squash)
/tftpboot/rootfs.42     ws41.ucs(rw,no_root_squash)
/tftpboot/rootfs.43     ws42.ucs(rw,no_root_squash)
/tftpboot/rootfs.44     ws43.ucs(rw,no_root_squash)
/tftpboot/rootfs.45     ws44.ucs(rw,no_root_squash)
/tftpboot/rootfs.46     ws45.ucs(rw,no_root_squash)
/tftpboot/rootfs.47     ws46.ucs(rw,no_root_squash)
/tftpboot/rootfs.48     ws47.ucs(rw,no_root_squash)
/tftpboot/rootfs.49     ws48.ucs(rw,no_root_squash)
/tftpboot/rootfs.50     ws49.ucs(rw,no_root_squash)
/tftpboot/rootfs.51     ws50.ucs(rw,no_root_squash)
/tftpboot/rootfs.52     ws51.ucs(rw,no_root_squash)
/tftpboot/rootfs.53     ws52.ucs(rw,no_root_squash)
/tftpboot/rootfs.54     ws53.ucs(rw,no_root_squash)
/tftpboot/rootfs.55     ws54.ucs(rw,no_root_squash)
/tftpboot/rootfs.56     ws55.ucs(rw,no_root_squash)
/tftpboot/rootfs.57     ws56.ucs(rw,no_root_squash)
/tftpboot/rootfs.58     ws57.ucs(rw,no_root_squash)
/tftpboot/rootfs.59     ws58.ucs(rw,no_root_squash)
/tftpboot/rootfs.60     ws59.ucs(rw,no_root_squash)
/tftpboot/rootfs.61     ws60.ucs(rw,no_root_squash)
/tftpboot/rootfs.62     ws61.ucs(rw,no_root_squash)
/tftpboot/rootfs.63     ws62.ucs(rw,no_root_squash)
/tftpboot/rootfs.64     ws63.ucs(rw,no_root_squash)
/tftpboot/rootfs.65     ws64.ucs(rw,no_root_squash)
/tftpboot/rootfs.66     ws65.ucs(rw,no_root_squash)
/tftpboot/rootfs.67     ws66.ucs(rw,no_root_squash)
/tftpboot/rootfs.68     ws67.ucs(rw,no_root_squash)
/tftpboot/rootfs.69     ws68.ucs(rw,no_root_squash)
/tftpboot/rootfs.70     ws69.ucs(rw,no_root_squash)
/tftpboot/rootfs.71     ws70.ucs(rw,no_root_squash)
/tftpboot/rootfs.72     ws71.ucs(rw,no_root_squash)
/tftpboot/rootfs.73     ws72.ucs(rw,no_root_squash)
/tftpboot/rootfs.74     ws73.ucs(rw,no_root_squash)
/tftpboot/rootfs.75     ws74.ucs(rw,no_root_squash)
/tftpboot/rootfs.76     ws75.ucs(rw,no_root_squash)
/tftpboot/rootfs.77     ws76.ucs(rw,no_root_squash)
/tftpboot/rootfs.78     ws77.ucs(rw,no_root_squash)
/tftpboot/rootfs.79     ws78.ucs(rw,no_root_squash)
/tftpboot/rootfs.80     ws79.ucs(rw,no_root_squash)
/tftpboot/rootfs.81     ws80.ucs(rw,no_root_squash)
/tftpboot/rootfs.82     ws81.ucs(rw,no_root_squash)
/tftpboot/rootfs.83     ws82.ucs(rw,no_root_squash)
/tftpboot/rootfs.84     ws83.ucs(rw,no_root_squash)
/tftpboot/rootfs.85     ws84.ucs(rw,no_root_squash)
/tftpboot/rootfs.86     ws85.ucs(rw,no_root_squash)
/tftpboot/rootfs.87     ws86.ucs(rw,no_root_squash)
/tftpboot/rootfs.88     ws87.ucs(rw,no_root_squash)
/tftpboot/rootfs.89     ws88.ucs(rw,no_root_squash)
/tftpboot/rootfs.90     ws89.ucs(rw,no_root_squash)
/tftpboot/rootfs.91     ws90.ucs(rw,no_root_squash)
/tftpboot/rootfs.92     ws91.ucs(rw,no_root_squash)
/tftpboot/rootfs.93     ws92.ucs(rw,no_root_squash)
/tftpboot/rootfs.94     ws93.ucs(rw,no_root_squash)
/tftpboot/rootfs.95     ws94.ucs(rw,no_root_squash)
/tftpboot/rootfs.96     ws95.ucs(rw,no_root_squash)
/tftpboot/rootfs.97     ws96.ucs(rw,no_root_squash)
/tftpboot/rootfs.98     ws97.ucs(rw,no_root_squash)
/tftpboot/rootfs.99     ws98.ucs(rw,no_root_squash)
/tftpboot/rootfs.100     ws99.ucs(rw,no_root_squash)
/tftpboot/bin          *.ucs(ro,no_root_squash)
/tftpboot/sbin         *.ucs(ro,no_root_squash)
/tftpboot/lib          *.ucs(ro,no_root_squash)
/tftpboot/boot         *.ucs(ro,no_root_squash)
/tftpboot/usr          *.ucs(ro,no_root_squash)
/tftpboot/etc          *.ucs(ro,no_root_squash)
/tftpboot/var          *.ucs(ro,no_root_squash)
/tftpboot/home         *.ucs(rw,no_root_squash)
/tftpboot/be0          *.ucs(rw,no_root_squash)

The process table on a workstation:

  PID TTY STAT TIME COMMAND
    1  ?  S    0:02 init [3] 
    2  ?  SW   0:00 (kflushd)
    3  ?  SW<  0:00 (kswapd)
    4  ?  SW   0:00 (nfsiod)
    5  ?  SW   0:00 (nfsiod)
    6  ?  SW   0:00 (nfsiod)
    7  ?  SW   0:00 (nfsiod)
   91  ?  S    0:56 syslogd 
   93  ?  S    0:00 klogd -c 1 
  101  ?  S    0:00 lpd -l 
  104  ?  S    0:00 inetd 
  118  ?  S    0:00 update (bdflush) 
18505  ?  S    0:00 ypbind (master) 
18506  ?  S    0:00 ypbind (slave) 
18530   1 S    0:01 telnet -l su01 172.30.160.1 
18531  ?  S    0:00 in.rshd 
18532  ?  S    0:00 sh -c ps ax 
18533  ?  R    0:00 ps ax 
   95  ?  S    0:17 portmap 

Any help will be greatly appreciated,

Regards,

Alex



---------------------------------------------------------------------------
  Alex Harin, PhD           Universal Computer Services
  Tel  +27 11 339-6111      PO Box 31266 Braamfontein 2017, South Africa
  Cell +27 82 900-4779      20th Floor, Total House, Smit St, Braamfontein 
  Fax  +27 11 339-3421      Johannesburg, South Africa
  alex@ucs.co.za ********** http://www.ucs.co.za ****************************



===========================================================================
This Mail was sent to netboot mailing list by:
Alex Harin <aoh@ucs.co.za>
To get help about this list, send a mail with 'help' as the only string in
it's body to majordomo@baghira.han.de. If you have problems with this list,
send a mail to netboot-owner@baghira.han.de.



For requests or suggestions regarding this mailing list archive please write to netboot@gkminix.han.de.