Skip to content

Exos X18 I/O error when waking up from Standby_z (HBA 9500-8i) #256

@domotiquefacile

Description

@domotiquefacile

Hello,

I've got 2 Seagate Exos X18 16GB in a RAID1 (BTRFS) configuration connected to a HBA 9500-8i, and I regularly get I/O errors when they are waking up from Standby_z state.
If I deactivate Standby_z, I have absolutely no issue with idle_a, idle_b and idle_c states, so the issue doesn't seem to be related to cable or PSU.
I think I managed to reproduce the issue manually by forcing the Standby_z state using openSeaChest_PowerControl -d /dev/sda --transitionPower standby_z but the issue seems to appear much less often if I transition to the state manually compared to letting the hard drive goes to Standby_z after some idle time.

Output from dmesg:

[mer. nov. 19 07:39:01 2025] BTRFS info (device dm-1): device stats zeroed by btrfs (878417)
[mer. nov. 19 18:17:08 2025] sd 0:0:1:0: attempting task abort!scmd(0x000000000fcc91dd), outstanding for 1044 ms & timeout 1000 ms
[mer. nov. 19 18:17:08 2025] sd 0:0:1:0: [sda] tag#6734 CDB: ATA command pass through(16) 85 08 0e 00 d5 00 01 00 e0 00 4f 00 c2 00 b0 00
[mer. nov. 19 18:17:08 2025] scsi target0:0:1: handle(0x000d), sas_address(0x300062b208f03540), phy(0)
[mer. nov. 19 18:17:08 2025] scsi target0:0:1: enclosure logical id(0x300062b208093540), slot(0) 
[mer. nov. 19 18:17:08 2025] scsi target0:0:1: enclosure level(0x0000), connector name( C0.0)
[mer. nov. 19 18:17:12 2025] sd 0:0:1:0: task abort: SUCCESS scmd(0x000000000fcc91dd)
[mer. nov. 19 18:17:12 2025] sd 0:0:1:0: [sda] tag#6751 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=10s
[mer. nov. 19 18:17:12 2025] sd 0:0:1:0: [sda] tag#6751 Sense Key : Not Ready [current] 
[mer. nov. 19 18:17:12 2025] sd 0:0:1:0: [sda] tag#6751 Add. Sense: Logical unit not ready, cause not reportable
[mer. nov. 19 18:17:12 2025] sd 0:0:1:0: [sda] tag#6751 CDB: Read(16) 88 00 00 00 00 00 7e 2d 3a 30 00 00 00 10 00 00
[mer. nov. 19 18:17:12 2025] I/O error, dev sda, sector 2116893232 op 0x0:(READ) flags 0x1000 phys_seg 2 prio class 2
[mer. nov. 19 18:17:12 2025] BTRFS error (device dm-1): bdev /dev/dm-1 errs: wr 0, rd 1, flush 0, corrupt 0, gen 0
[mer. nov. 19 18:17:20 2025] sd 0:0:2:0: attempting task abort!scmd(0x00000000784af5b4), outstanding for 2304 ms & timeout 1000 ms
[mer. nov. 19 18:17:20 2025] sd 0:0:2:0: [sdb] tag#6755 CDB: ATA command pass through(16) 85 08 0e 00 d5 00 01 00 e0 00 4f 00 c2 00 b0 00
[mer. nov. 19 18:17:20 2025] scsi target0:0:2: handle(0x000e), sas_address(0x300062b208f03544), phy(4)
[mer. nov. 19 18:17:20 2025] scsi target0:0:2: enclosure logical id(0x300062b208093540), slot(4) 
[mer. nov. 19 18:17:20 2025] scsi target0:0:2: enclosure level(0x0000), connector name( C0.1)
[mer. nov. 19 18:17:23 2025] sd 0:0:2:0: task abort: SUCCESS scmd(0x00000000784af5b4)
[mer. nov. 19 18:17:24 2025] sd 0:0:2:0: Power-on or device reset occurred
[mer. nov. 19 18:17:24 2025] sd 0:0:1:0: Power-on or device reset occurred
[mer. nov. 19 18:17:24 2025] BTRFS info (device dm-1): read error corrected: ino 0 off 5708692340736 (dev /dev/dm-1 sector 2116860448)
[mer. nov. 19 18:17:24 2025] BTRFS info (device dm-1): read error corrected: ino 0 off 5708692344832 (dev /dev/dm-1 sector 2116860456)
[mer. nov. 19 18:17:24 2025] BTRFS info (device dm-1): read error corrected: ino 0 off 5708692348928 (dev /dev/dm-1 sector 2116860464)
[mer. nov. 19 18:17:24 2025] BTRFS info (device dm-1): read error corrected: ino 0 off 5708692353024 (dev /dev/dm-1 sector 2116860472)

Mainboard: Supermicro X12SCZ-F

  - Firmware Version = 01.05.23
  - Firmware Build Time = 07/14/2025
  - Redfish Version = 1.21.0
  - BIOS Firmware Version = BIOS Date: 07/11/2025 Ver 3.4

HBA: Broadcom HBA 9500-8i

  - Firmware Package Build = 36.00.00.00
  - Firmware Version = 36.00.00.00
  - Bios Version = 09.71.00.00_36.00.00.00
  - NVDATA Version = 36.00.00.11
  - PSOC FW Version = 0x006E
  - PSOC Part Number = 06021
  - Driver Name = mpt3sas
  - Driver Version = 43.100.00.00

HDDs: 2x Seagate Exos X18 16GB (ST16000NM000J-2TW103)

  - FwRev = SN04
  - EPC settings =
Name       Current Timer Default Timer Saved Timer   Recovery Time C S
Idle A     *1            *1            *1            1             Y Y
Idle B     *1200         *1200         *1200         4             Y Y
Idle C     *12000         6000         *12000        20            Y Y
Standby Z  *18000         9000         *18000        110           Y Y

OS: Debian 12.12 (OpenMediaVault) running in a Proxmox VM (with HBA passthrough)

output of openSeaChest_Info -d /dev/sda --echoCommandLine -i -v 4 | tee verboseInfo.txt
verboseInfo.txt

Is this a known issue with Standby_z ? It seems most people are not using this state, but for me I am almost never using those drives (only once every 2 weeks on average), so it makes sense to put them to sleep.
Perhaps this is an interaction problem with the HBA ?
Any help would be greatly appreciated. The only thing I haven't tried so far is to update the hard drive's firmware. But I am a bit afraid to lose the data on the hard drives.

Note: sorry in advance if this is not the correct project to report this issue

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions