Bug 198228 - zfs loader failed to load kernel from degraded mirror (disk with SMART errors)
Summary: zfs loader failed to load kernel from degraded mirror (disk with SMART errors)
Status: New
Alias: None
Product: Base System
Classification: Unclassified
Component: kern (show other bugs)
Version: 10.1-RELEASE
Hardware: Any Any
: --- Affects Only Me
Assignee: freebsd-fs (Nobody)
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2015-03-03 15:20 UTC by Bjoern A. Zeeb
Modified: 2015-03-07 15:15 UTC (History)
0 users

See Also:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Bjoern A. Zeeb freebsd_committer freebsd_triage 2015-03-03 15:20:09 UTC
After a power glitch one of the zfsroot machines in the lab did not come up anymore.

The machine in questions had
1 SSD with freebsd-boot on it + l2arc cache
2 HDD in a ZFS mirror setup.

The first of the two HDDs (according to BIOS sort order at least), had SMART event errors (as notified by BIOS upon boot).

While loading the loader still worked fine, loading the kernel was not possible anymore and the twiddle went very slowly on the text segment.  We aborted after about 8 hours.

Removing the disk with SMART error notification and the system booted back up just fine with a degraded mirror.


Further diagnosis is no longer possible as the machine in question was needed and the disk is going to RMA.  However ZFS loader should be able to deal with this situation (better).  Someone with enough loader + zfs knowledge could go and have a look and either report read errors (checksum mismatches, ...) or if detecting a mirror trying the other source instead or providing a possibly to more easily select which source to use (abort, print message, let user set the variable and retry).


PS: There is also the point that ^c does not work during the time we are trying to load entities so breaking into the loader shell was not possible, which is somewhat unhelpful.