Summary: | more and more processes get stuck waiting for ufs and zfs until system is rendered inaccessible | |
---|---|---|---
Product: | Base System | Reporter: | Robert Clausecker <fuz>
Component: | kern | Assignee: | freebsd-bugs (Nobody) <bugs>
Status: | Open --- | |
Severity: | Affects Some People | CC: | chris, koobs, ota
Priority: | --- | Keywords: | needs-qa
Version: | 12.0-RELEASE | |
Hardware: | amd64 | |
OS: | Any | |
Description
Robert Clausecker
2019-08-22 21:41:27 UTC
Thank you for the report, Robert.

Can you provide the exact system information (uname -a), and include:

- pkg version -v output (as an attachment)
- /var/run/dmesg.boot output (as an attachment)

And when the symptoms are observable:

- ps (-aux at least) output (as an attachment)
- vmstat -z output (as an attachment)

Created attachment 206815 [details]
/var/run/dmesg.boot
Created attachment 206816 [details]
pkg version -v
Created attachment 206819 [details]
ps -auxc
Created attachment 206820 [details]
vmstat -z
The situation has reappeared with a bunch of ftpd instances being stuck in ufs and zfs wait channels. I'll leave the stuck box up for an hour or so in case you need further information.

Once processes start to lock up, the machine kicks me out of my SSH session when I run "pkg update". I can't log in again (not via FTP, SSH, or the console), but existing connections continue to work. I then have to hard reboot the machine. Weird.

Today my backup script hung with a chflags process (I chflags a lot of files recursively) in the ufs state. I use FreeBSD 12-STABLE (kernel/userland from 9 August) and UFS SU+J. SSH was unresponsive but the server was still pingable. The only way out was to hard reset the server.

(In reply to Robert Clausecker from comment #7) By the way, did all the problems happen during the disk replacement?

(In reply to ota from comment #9) Yes, but I recall that I had another lockup after the disk replacement was done. What finally mitigated the problem was locking down the number of simultaneous FTP connections to an unreasonably low number (200) and disabling Apache 2.4. I think this reduced the load sufficiently to avoid the issue.
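For anyone trying to spot the same pattern, here is a rough Python sketch that tallies which commands are sitting in the ufs and zfs wait channels mentioned in this report. It assumes the input is the text of `ps -axo mwchan,comm` (wait channel first, command name second, one header line); that invocation, the function name `stuck_on`, and the sample output below are illustrative assumptions and were not taken from the attachments.

```python
from collections import Counter


def stuck_on(ps_output, channels=("ufs", "zfs")):
    """Count commands whose wait channel is one of `channels`.

    Assumes two whitespace-separated columns (wait channel, command)
    preceded by a single header line, as `ps -axo mwchan,comm` is
    expected to print. Lines that do not split into two fields are skipped.
    """
    counts = Counter()
    for line in ps_output.splitlines()[1:]:  # skip the header line
        parts = line.split(None, 1)
        if len(parts) != 2:
            continue
        wchan, comm = parts[0], parts[1].strip()
        if wchan in channels:
            counts[comm] += 1
    return counts


# Made-up sample input, only to show the expected shape of the data.
SAMPLE = """MWCHAN COMMAND
ufs    ftpd
zfs    ftpd
ufs    ftpd
ufs    chflags
select sshd
-      sh
"""

if __name__ == "__main__":
    for comm, n in stuck_on(SAMPLE).most_common():
        print(f"{comm}: {n}")
```

Running this periodically while the machine is still reachable would show whether the number of processes in those wait channels keeps growing, which is the behaviour the summary describes.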