ZFS Recovery

From Lolly's Wiki
Jump to navigationJump to search


Panic at boot time

Siehe SunAlert 233602 : Solaris 10 Assertion Failure in ZFS May Cause a System Panic:

The best recovery for this is to do the following:

    1. Set the following in /etc/system:
set zfs:zfs_recover=1
set aok=1
    2. Import the pool using 'zpool import'
    3. Run a full scrub on the pool using 'zpool scrub'
    4. Use 'zdb -d' and make sure that there is no ondisk corruption reported
    5. Once the pool comes to a clean state, comment / remove the added entries in /etc/system.

Zurückgehen auf einen früheren Uberblock

# zpool import defect_pool
cannot import 'defect_pool': I/O error
        Destroy and re-create the pool from
        a backup source.

Unter /etc/zfs:

# cd /etc/zfs
# strings zpool.cache | nawk '/c[0-9]+t/'
...
/dev/dsk/c7t0d0s0
...
# zdb -l /dev/dsk/c7t0d0s0 | nawk '$1=="name:"{print;exit;}'
    name: 'defect_pool'

Für einen ZPool im Solaris Cluster:

# cd /var/cluster/run/HAStoragePlus/zfs/
# strings defect_pool.cachefile | nawk '/c[0-9]+t/'
0/dev/dsk/c8t600A0B80006E103C000008164E51CDD2d0s0
0/dev/dsk/c8t600A0B80006E10E40000D47D4E51CF9Ed0s0

oder

# zpool import -o readonly=on -c defect_pool.cachefile
# zdb -lu /dev/dsk/c8t600A0B80006E103C000008164E51CDD2d0s0 | nawk '/txg =/{txg=$NF}/timestamp =/{printf "txg %d\t%s\n",txg,$0}' | sort -n -k 2n,2n | uniq | tail -10
txg 40353851            timestamp = 1352184849 UTC = Tue Nov  6 07:54:09 2012
txg 40353852            timestamp = 1352184849 UTC = Tue Nov  6 07:54:09 2012
txg 40353853            timestamp = 1352184849 UTC = Tue Nov  6 07:54:09 2012
txg 40353870            timestamp = 1352185334 UTC = Tue Nov  6 08:02:14 2012
txg 40353871            timestamp = 1352185334 UTC = Tue Nov  6 08:02:14 2012
txg 40353872            timestamp = 1352185334 UTC = Tue Nov  6 08:02:14 2012
txg 40353873            timestamp = 1352185334 UTC = Tue Nov  6 08:02:14 2012
txg 40353874            timestamp = 1352185334 UTC = Tue Nov  6 08:02:14 2012
txg 40353875            timestamp = 1352185334 UTC = Tue Nov  6 08:02:14 2012
txg 40353879            timestamp = 1352185334 UTC = Tue Nov  6 08:02:14 2012

# zpool import -o readonly=on -T <txg> defect_pool

Also z.B. auf Tue Nov 6 07:54:09 2012 -> txg 40353853

# zpool import -T 40353853 defect_pool
Pool defect_pool returned to its state as of Tue Nov 06 07:32:33 2012.
Discarded approximately 22 minutes of transactions.

PANIC, NOTICE: spa_import_rootpool: error 19

Die Lösung ist, den Pool und das Device explizit anzugeben. Wenn beim booten also kommt:

NOTICE: spa_import_rootpool: error 19
Cannot mount root on /pci@0,0/pci8086,340a@3/pci1000,3150@0/sd@1,0:a

panic[cpu0]/thread=fffffffffbc28820: vfs_mountroot: cannot mount root

Hilft ein Boot in den Failsafe mode und editieren der /a/rpool/boot/grub/menu.lst, oder Eingabe der Parameter in der Grub-Commandline:

title s10x_u8wos_08a
findroot (s10x_u8wos_08a,0,a)
bootfs rpool/ROOT/s10x_u8wos_08a
kernel$ /platform/i86pc/multiboot -B zfs-bootfs=rpool/ROOT/s10x_u8wos_08a,bootpath="/pci@0,0/pci8086,340a@3/pci1000,3150@0/sd@1,0:a"
module /platform/i86pc/boot_archive