| Summary: | deadlocking when using vnode disk file and quotas | ||
|---|---|---|---|
| Product: | Base System | Reporter: | chrisx77 <chrisx77> |
| Component: | kern | Assignee: | freebsd-bugs (Nobody) <bugs> |
| Status: | Closed FIXED | ||
| Severity: | Affects Only Me | ||
| Priority: | Normal | ||
| Version: | Unspecified | ||
| Hardware: | Any | ||
| OS: | Any | ||
|
Description
chrisx77
2000-08-23 17:40:01 UTC
On Wed, 23 Aug 2000 09:32:12 MST, chrisx77@hotmail.com wrote: > >Number: 20804 > >Category: kern > >Synopsis: deadlocking when using vnode disk file and quotas [...] > This is being submitted at the request of Robert Watson. Robert, do you want this PR for yourself, or was it your intention to have it assigned to phk? Ciao, Sheldon. State Changed From-To: open->closed Duplicate of PR 20787. On Thu, 24 Aug 2000, Sheldon Hearn wrote: > On Wed, 23 Aug 2000 09:32:12 MST, chrisx77@hotmail.com wrote: > > > >Number: 20804 > > >Category: kern > > >Synopsis: deadlocking when using vnode disk file and quotas > [...] > > > This is being submitted at the request of Robert Watson. > > Robert, do you want this PR for yourself, or was it your intention to > have it assigned to phk? If phk will accept it, that's probably best, as I believe md is his baby, and it's a recursive locking issue relating to UFS and md. If not, I'll take it but it may take me a bit to solve the problem as I'll be moving to Massachusetts next week, and will be dealing with packing, furniture, etc. :-) I did request that a PR be filed as I think this is an important bug for us to work through. Robert N M Watson robert@fledge.watson.org http://www.watson.org/~robert/ PGP key fingerprint: AF B5 5F FF A6 4A 79 37 ED 5F 55 E9 58 04 6A B1 TIS Labs at Network Associates, Safeport Network Services Poul-Henning, Do you have a chance to look at this bug, which was raised on -fs recently? It has to do, I believe, with a recursive lock/deadlock issue relating to the md device and the UFS quota implementation, presumably a lock ordering issue relating to the md device vnode lock and the vnode lock for the quota data file on that file system. I haven't had a chance to look much further, and apparently it mostly manifests in jail (I'm not sure I understand why that might be the case -- possibly having to do with the starting point (root) for recursive name lookups with lock requests). Thanks, Robert N M Watson robert@fledge.watson.org http://www.watson.org/~robert/ PGP key fingerprint: AF B5 5F FF A6 4A 79 37 ED 5F 55 E9 58 04 6A B1 TIS Labs at Network Associates, Safeport Network Services ---------- Forwarded message ---------- Date: Thu, 24 Aug 2000 11:19:33 +0200 From: Sheldon Hearn <sheldonh@uunet.co.za> To: rwatson@freebsd.org Cc: freebsd-gnats-submit@freebsd.org Subject: Re: kern/20804: deadlocking when using vnode disk file and quotas On Wed, 23 Aug 2000 09:32:12 MST, chrisx77@hotmail.com wrote: > >Number: 20804 > >Category: kern > >Synopsis: deadlocking when using vnode disk file and quotas [...] > This is being submitted at the request of Robert Watson. Robert, do you want this PR for yourself, or was it your intention to have it assigned to phk? Ciao, Sheldon. In message <Pine.NEB.3.96L.1000824093709.31571B-100000@fledge.watson.org>, Robe rt Watson writes: > >Poul-Henning, > >Do you have a chance to look at this bug, which was raised on -fs >recently? It has to do, I believe, with a recursive lock/deadlock issue >relating to the md device and the UFS quota implementation, presumably a >lock ordering issue relating to the md device vnode lock and the vnode >lock for the quota data file on that file system. I haven't had a chance >to look much further, and apparently it mostly manifests in jail (I'm not >sure I understand why that might be the case -- possibly having to do with >the starting point (root) for recursive name lookups with lock requests). > >Thanks, > > Robert N M Watson Robert, No I haven't even looked at it. The jail stuff doesn't even know what a lock is, much less touch one, so the jail involvement was not enough to make me put it on my busy schedule... -- Poul-Henning Kamp | UNIX since Zilog Zeus 3.20 phk@FreeBSD.ORG | TCP/IP since RFC 956 FreeBSD coreteam member | BSD since 4.3-tahoe Never attribute to malice what can adequately be explained by incompetence. State Changed From-To: closed->open PR grunt on drugs. On Thu, 24 Aug 2000 15:43:02 +0200, Poul-Henning Kamp wrote:
> No I haven't even looked at it. The jail stuff doesn't even know
> what a lock is, much less touch one, so the jail involvement was
> not enough to make me put it on my busy schedule...
From Robert's earlier comments, it sounds more like an MD thing than a
jail thing?
Ciao,
Sheldon.
On Thu, 24 Aug 2000, Poul-Henning Kamp wrote: > In message <Pine.NEB.3.96L.1000824093709.31571B-100000@fledge.watson.org>, Robe > rt Watson writes: > > > >Poul-Henning, > > > >Do you have a chance to look at this bug, which was raised on -fs > >recently? It has to do, I believe, with a recursive lock/deadlock issue > >relating to the md device and the UFS quota implementation, presumably a > >lock ordering issue relating to the md device vnode lock and the vnode > >lock for the quota data file on that file system. I haven't had a chance > >to look much further, and apparently it mostly manifests in jail (I'm not > >sure I understand why that might be the case -- possibly having to do with > >the starting point (root) for recursive name lookups with lock requests). > > No I haven't even looked at it. The jail stuff doesn't even know > what a lock is, much less touch one, so the jail involvement was > not enough to make me put it on my busy schedule... My impression was that the bug had more to do with "md" -- is md yours, or someone else's? The reason I'm looking at the jail issue is that that's how the reporter of the bug described it. The interaction with jail() has to do with chroot() and recursive name lookup and locking: when a process is jailed, the name lookup, and hence tree walking and lock handling, starts at a different point in the tree, probably with a root in the md file system, which could affect the locking issues. Robert N M Watson robert@fledge.watson.org http://www.watson.org/~robert/ PGP key fingerprint: AF B5 5F FF A6 4A 79 37 ED 5F 55 E9 58 04 6A B1 TIS Labs at Network Associates, Safeport Network Services In message <84319.967125031@axl.fw.uunet.co.za>, Sheldon Hearn writes: > > >On Thu, 24 Aug 2000 15:43:02 +0200, Poul-Henning Kamp wrote: > >> No I haven't even looked at it. The jail stuff doesn't even know >> what a lock is, much less touch one, so the jail involvement was >> not enough to make me put it on my busy schedule... > >>From Robert's earlier comments, it sounds more like an MD thing than a >jail thing? The only thing magic about MD is it's access time... -- Poul-Henning Kamp | UNIX since Zilog Zeus 3.20 phk@FreeBSD.ORG | TCP/IP since RFC 956 FreeBSD coreteam member | BSD since 4.3-tahoe Never attribute to malice what can adequately be explained by incompetence. On Thu, 24 Aug 2000, Poul-Henning Kamp wrote: > In message <84319.967125031@axl.fw.uunet.co.za>, Sheldon Hearn writes: > > > >> No I haven't even looked at it. The jail stuff doesn't even know > >> what a lock is, much less touch one, so the jail involvement was > >> not enough to make me put it on my busy schedule... > > > >>From Robert's earlier comments, it sounds more like an MD thing than a > >jail thing? > > The only thing magic about MD is it's access time... I guess the magic question for the bug reporter would be, ``if you run edquota from within a jail on a non-md disk, does the same lock problem occur''. He may have listed this in the original PR, but I don't recall. Robert N M Watson robert@fledge.watson.org http://www.watson.org/~robert/ PGP key fingerprint: AF B5 5F FF A6 4A 79 37 ED 5F 55 E9 58 04 6A B1 TIS Labs at Network Associates, Safeport Network Services In message <Pine.NEB.3.96L.1000824100047.31571D-100000@fledge.watson.org>, Robe rt Watson writes: >On Thu, 24 Aug 2000, Poul-Henning Kamp wrote: > >> In message <84319.967125031@axl.fw.uunet.co.za>, Sheldon Hearn writes: >> > >> >> No I haven't even looked at it. The jail stuff doesn't even know >> >> what a lock is, much less touch one, so the jail involvement was >> >> not enough to make me put it on my busy schedule... >> > >> >>From Robert's earlier comments, it sounds more like an MD thing than a >> >jail thing? >> >> The only thing magic about MD is it's access time... > >I guess the magic question for the bug reporter would be, ``if you run >edquota from within a jail on a non-md disk, does the same lock problem >occur''. He may have listed this in the original PR, but I don't recall. It think this is a preexisting bug which jail/chroot + MD exposes. -- Poul-Henning Kamp | UNIX since Zilog Zeus 3.20 phk@FreeBSD.ORG | TCP/IP since RFC 956 FreeBSD coreteam member | BSD since 4.3-tahoe Never attribute to malice what can adequately be explained by incompetence. I've experienced this problem in FreeBSD 5.3-RELEASE as well, and I believe it's tied to the quota.user file. Placing the quota.user file outside the md filesystem by mounting the md device using the userquota=/anywhere/outside/the/loopback/filesystem does *not* result in a deadlock and, in fact, seems to work perfectly. Tony Arcieri State Changed From-To: open->feedback may be fixed State Changed From-To: feedback->closed Feedback timeout (> 3 months). |