Bug 105241 - [nfs] problem with Linux NFS server up/down combined with zombied umount process and a manual mount request
Summary: [nfs] problem with Linux NFS server up/down combined with zombied umount proc...
Status: Closed Overcome By Events
Alias: None
Product: Base System
Classification: Unclassified
Component: kern (show other bugs)
Version: 6.1-RELEASE
Hardware: Any Any
: Normal Affects Only Me
Assignee: freebsd-bugs (Nobody)
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2006-11-07 12:30 UTC by Roger Jørgensen
Modified: 2017-08-27 01:37 UTC (History)
1 user (show)

See Also:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Roger Jørgensen 2006-11-07 12:30:23 UTC
Not sure how to describe this or document it but we've seen it several 
times here and always related to the NFS server going down/up again, AND maybe 
most important. A zombie umount command, and then someone issuing mount, 
not on the same filesystem but on another NFS mounts

This just cause the entire system to freeze up, it respond on all ports, 
ping and everything, and as long as you are not try to access any disk 
devices, or commands that access any disks you're fine. If you access any 
disks the command just freeze like the rest of the system. And there are 
nothing in the logs that give any hints that there are troubles somewhere, 
guess the disk-layer freeze before it can be logged :/

You can not log onto the system on the console, not through ssh, nothing 
work basically and only way out are a hard poweroff/on.

Fix: 

reboot the box :}
How-To-Repeat: Have no clear idea howto recreate it but can give some idea on howto trigger it...

* linux nfs server exporting any directory, freebsd box mounting it.
* Take down the linux NFS server hard or whatever way you wish
* issue a umount request or try to access a nfs share that aren't available
  anymore, that command MUST be zombied, not possible to kill it
* get the NFS server up and try to remount all shares manual, umount and
  mount etc. 

And with some luck you might be able to trigger this. Might be just a
simple error somewhere that aren't catched and handled correctly.