From: andre.boegelsack on
Usually this means you have a bad disk and you should replace it. Does
zpool status -v report any error or any bad disk?
From: Richard B. Gilbert on
Michael Laajanen wrote:
> Hi,
>
> I am running Solaris 10 on a number of x86 nodes and have root and a
> second filesystem mounted over SATA to two harddisk with ZFS.
>
> I am currently testing this SATA link to the harddisks using a simple
> script which
>
> - creates a 40GB file1
> - copy from file1 to file2
> - removes file1
> - copy file2 to file1
>
> This is repeated over and over, this one of the nodes reportes error as
> below, anyone know what it actually means?
>
> Sep 5 09:39:13 siu5 gda: [ID 107833 kern.warning] WARNING:
> /pci@0,0/pci-ide(a)1f,2/ide@1/cmdk@0,0 (Disk1):
> Sep 5 09:39:13 siu5 Error for command 'read sector' Error Level: Fatal
> Sep 5 09:39:13 siu5 gda: [ID 107833 kern.notice] Requested Block
> 123893504, Error Block: 123893541
> Sep 5 09:39:13 siu5 gda: [ID 107833 kern.notice] Sense Key:
> uncorrectable data error
> Sep 5 09:39:13 siu5 gda: [ID 107833 kern.notice] Vendor 'Gen-ATA '
> error code: 0x7
> Sep 5 09:39:17 siu5 gda: [ID 107833 kern.warning] WARNING:
> /pci@0,0/pci-ide(a)1f,2/ide@1/cmdk@0,0 (Disk1):
> Sep 5 09:39:17 siu5 Error for command 'read sector' Error Level: Fatal
> Sep 5 09:39:17 siu5 gda: [ID 107833 kern.notice] Requested Block
> 123893504, Error Block: 123893541
> Sep 5 09:39:17 siu5 gda: [ID 107833 kern.notice] Sense Key:
> uncorrectable data error
> Sep 5 09:39:17 siu5 gda: [ID 107833 kern.notice] Vendor 'Gen-ATA '
> error code: 0x7
> Sep 5 09:39:21 siu5 gda: [ID 107833 kern.warning] WARNING:
> /pci@0,0/pci-ide(a)1f,2/ide@1/cmdk@0,0 (Disk1):
> Sep 5 09:39:21 siu5 Error for command 'read sector' Error Level: Fatal
> Sep 5 09:39:21 siu5 gda: [ID 107833 kern.notice] Requested Block
> 123893504, Error Block: 123893541
> Sep 5 09:39:21 siu5 gda: [ID 107833 kern.notice] Sense Key:
> uncorrectable data error
> Sep 5 09:39:21 siu5 gda: [ID 107833 kern.notice] Vendor 'Gen-ATA '
> error code: 0x7
> Sep 5 09:39:25 siu5 gda: [ID 107833 kern.warning] WARNING:
> /pci@0,0/pci-ide(a)1f,2/ide@1/cmdk@0,0 (Disk1):
> Sep 5 09:39:25 siu5 Error for command 'read sector' Error Level: Fatal
> S
>
> /michael

It suggests, to me, that you have a bad block on your disk. Some
drive, controller, and device driver combinations will mark the block
bad and "revector" to a spare block.

This is not a cause for concern UNLESS it happens again. A series of
such events suggests that you should make a full backup if you still can
and replace the drive.
From: Michael Laajanen on
Hi,

I am running Solaris 10 on a number of x86 nodes and have root and a
second filesystem mounted over SATA to two harddisk with ZFS.

I am currently testing this SATA link to the harddisks using a simple
script which

- creates a 40GB file1
- copy from file1 to file2
- removes file1
- copy file2 to file1

This is repeated over and over, this one of the nodes reportes error as
below, anyone know what it actually means?

Sep 5 09:39:13 siu5 gda: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci-ide(a)1f,2/ide@1/cmdk@0,0 (Disk1):
Sep 5 09:39:13 siu5 Error for command 'read sector' Error Level: Fatal
Sep 5 09:39:13 siu5 gda: [ID 107833 kern.notice] Requested Block
123893504, Error Block: 123893541
Sep 5 09:39:13 siu5 gda: [ID 107833 kern.notice] Sense Key:
uncorrectable data error
Sep 5 09:39:13 siu5 gda: [ID 107833 kern.notice] Vendor 'Gen-ATA '
error code: 0x7
Sep 5 09:39:17 siu5 gda: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci-ide(a)1f,2/ide@1/cmdk@0,0 (Disk1):
Sep 5 09:39:17 siu5 Error for command 'read sector' Error Level: Fatal
Sep 5 09:39:17 siu5 gda: [ID 107833 kern.notice] Requested Block
123893504, Error Block: 123893541
Sep 5 09:39:17 siu5 gda: [ID 107833 kern.notice] Sense Key:
uncorrectable data error
Sep 5 09:39:17 siu5 gda: [ID 107833 kern.notice] Vendor 'Gen-ATA '
error code: 0x7
Sep 5 09:39:21 siu5 gda: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci-ide(a)1f,2/ide@1/cmdk@0,0 (Disk1):
Sep 5 09:39:21 siu5 Error for command 'read sector' Error Level: Fatal
Sep 5 09:39:21 siu5 gda: [ID 107833 kern.notice] Requested Block
123893504, Error Block: 123893541
Sep 5 09:39:21 siu5 gda: [ID 107833 kern.notice] Sense Key:
uncorrectable data error
Sep 5 09:39:21 siu5 gda: [ID 107833 kern.notice] Vendor 'Gen-ATA '
error code: 0x7
Sep 5 09:39:25 siu5 gda: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci-ide(a)1f,2/ide@1/cmdk@0,0 (Disk1):
Sep 5 09:39:25 siu5 Error for command 'read sector' Error Level: Fatal
S

/michael
From: Zfs.. on
On Sep 7, 5:15 pm, Michael Laajanen <michael_laaja...(a)yahoo.com>
wrote:
> Hi,
>
> I am running Solaris 10 on a number of x86 nodes and have root and a
> second filesystem mounted over SATA to two harddisk with ZFS.
>
> I am currently testing this SATA link to the harddisks using a simple
> script which
>
> - creates a 40GB file1
> - copy from file1 to file2
> - removes file1
> - copy file2 to file1
>
> This is repeated over and over, this one of the nodes reportes error as
> below, anyone know what it actually means?
>
> Sep  5 09:39:13 siu5 gda: [ID 107833 kern.warning] WARNING:
> /pci@0,0/pci-ide(a)1f,2/ide@1/cmdk@0,0 (Disk1):
> Sep  5 09:39:13 siu5    Error for command 'read sector'  Error Level: Fatal
> Sep  5 09:39:13 siu5 gda: [ID 107833 kern.notice]  Requested Block
> 123893504, Error Block: 123893541
> Sep  5 09:39:13 siu5 gda: [ID 107833 kern.notice]  Sense Key:
> uncorrectable data error
> Sep  5 09:39:13 siu5 gda: [ID 107833 kern.notice]  Vendor 'Gen-ATA '
> error code: 0x7
> Sep  5 09:39:17 siu5 gda: [ID 107833 kern.warning] WARNING:
> /pci@0,0/pci-ide(a)1f,2/ide@1/cmdk@0,0 (Disk1):
> Sep  5 09:39:17 siu5    Error for command 'read sector'  Error Level: Fatal
> Sep  5 09:39:17 siu5 gda: [ID 107833 kern.notice]  Requested Block
> 123893504, Error Block: 123893541
> Sep  5 09:39:17 siu5 gda: [ID 107833 kern.notice]  Sense Key:
> uncorrectable data error
> Sep  5 09:39:17 siu5 gda: [ID 107833 kern.notice]  Vendor 'Gen-ATA '
> error code: 0x7
> Sep  5 09:39:21 siu5 gda: [ID 107833 kern.warning] WARNING:
> /pci@0,0/pci-ide(a)1f,2/ide@1/cmdk@0,0 (Disk1):
> Sep  5 09:39:21 siu5    Error for command 'read sector'  Error Level: Fatal
> Sep  5 09:39:21 siu5 gda: [ID 107833 kern.notice]  Requested Block
> 123893504, Error Block: 123893541
> Sep  5 09:39:21 siu5 gda: [ID 107833 kern.notice]  Sense Key:
> uncorrectable data error
> Sep  5 09:39:21 siu5 gda: [ID 107833 kern.notice]  Vendor 'Gen-ATA '
> error code: 0x7
> Sep  5 09:39:25 siu5 gda: [ID 107833 kern.warning] WARNING:
> /pci@0,0/pci-ide(a)1f,2/ide@1/cmdk@0,0 (Disk1):
> Sep  5 09:39:25 siu5    Error for command 'read sector'  Error Level: Fatal
> S
>
> /michael

Scrub your zpool

zpool scrub mypool

And see if it reports any dodgy data on one of the disks. If it does,
replace the disk.
From: Michael Laajanen on
Hi all,

Zfs.. wrote:
> On Sep 7, 5:15 pm, Michael Laajanen <michael_laaja...(a)yahoo.com>
> wrote:
>> Hi,
>>
>> I am running Solaris 10 on a number of x86 nodes and have root and a
>> second filesystem mounted over SATA to two harddisk with ZFS.
>>
>> I am currently testing this SATA link to the harddisks using a simple
>> script which
>>
>> - creates a 40GB file1
>> - copy from file1 to file2
>> - removes file1
>> - copy file2 to file1
>>
>> This is repeated over and over, this one of the nodes reportes error as
>> below, anyone know what it actually means?
>>
>> Sep 5 09:39:13 siu5 gda: [ID 107833 kern.warning] WARNING:
>> /pci@0,0/pci-ide(a)1f,2/ide@1/cmdk@0,0 (Disk1):
>> Sep 5 09:39:13 siu5 Error for command 'read sector' Error Level: Fatal
>> Sep 5 09:39:13 siu5 gda: [ID 107833 kern.notice] Requested Block
>> 123893504, Error Block: 123893541
>> Sep 5 09:39:13 siu5 gda: [ID 107833 kern.notice] Sense Key:
>> uncorrectable data error
>> Sep 5 09:39:13 siu5 gda: [ID 107833 kern.notice] Vendor 'Gen-ATA '
>> error code: 0x7
>> Sep 5 09:39:17 siu5 gda: [ID 107833 kern.warning] WARNING:
>> /pci@0,0/pci-ide(a)1f,2/ide@1/cmdk@0,0 (Disk1):
>> Sep 5 09:39:17 siu5 Error for command 'read sector' Error Level: Fatal
>> Sep 5 09:39:17 siu5 gda: [ID 107833 kern.notice] Requested Block
>> 123893504, Error Block: 123893541
>> Sep 5 09:39:17 siu5 gda: [ID 107833 kern.notice] Sense Key:
>> uncorrectable data error
>> Sep 5 09:39:17 siu5 gda: [ID 107833 kern.notice] Vendor 'Gen-ATA '
>> error code: 0x7
>> Sep 5 09:39:21 siu5 gda: [ID 107833 kern.warning] WARNING:
>> /pci@0,0/pci-ide(a)1f,2/ide@1/cmdk@0,0 (Disk1):
>> Sep 5 09:39:21 siu5 Error for command 'read sector' Error Level: Fatal
>> Sep 5 09:39:21 siu5 gda: [ID 107833 kern.notice] Requested Block
>> 123893504, Error Block: 123893541
>> Sep 5 09:39:21 siu5 gda: [ID 107833 kern.notice] Sense Key:
>> uncorrectable data error
>> Sep 5 09:39:21 siu5 gda: [ID 107833 kern.notice] Vendor 'Gen-ATA '
>> error code: 0x7
>> Sep 5 09:39:25 siu5 gda: [ID 107833 kern.warning] WARNING:
>> /pci@0,0/pci-ide(a)1f,2/ide@1/cmdk@0,0 (Disk1):
>> Sep 5 09:39:25 siu5 Error for command 'read sector' Error Level: Fatal
>> S
>>
>> /michael
>
> Scrub your zpool
>
> zpool scrub mypool
>
> And see if it reports any dodgy data on one of the disks. If it does,
> replace the disk.
Could it also mean that I have a bad connection to the drives, like bad
cables and so? Why I am asking this is all 11 nodes/hosts in the system
are connected to a inhouse designed SATA chassis backplane, one or two
drives per node driven via standard SAS cables from Sun(LSI) HBA's.

Does anyone know of some way to enhance the error reports on these SATA
interfaces by means of setting some "debug variables" to the drivers in
order to get more detailed info?

/michael