
S3610 SSDs have failed "READ/WRITE FPDMA QUEUED" ATA commands, frozen, then link reset

ASmit32
New Contributor II

Hi,

I have a new Linux machine with two DC S3610 1.6TB SSDs. It's running Debian jessie, so the kernel is 3.16. About a month after installation, these errors started appearing:

Jul 30 16:30:59 snaps kernel: [186914.249429] ata1.00: exception Emask 0x0 SAct 0x3 SErr 0x0 action 0x6 frozen

Jul 30 16:30:59 snaps kernel: [186914.250465] ata1.00: failed command: WRITE FPDMA QUEUED

Jul 30 16:30:59 snaps kernel: [186914.251505] ata1.00: cmd 61/08:00:39:db:8e/00:00:09:00:00/40 tag 0 ncq 4096 out

Jul 30 16:30:59 snaps kernel: [186914.251505] res 40/00:01:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)

Jul 30 16:30:59 snaps kernel: [186914.253613] ata1.00: status: { DRDY }

Jul 30 16:30:59 snaps kernel: [186914.254781] ata1.00: failed command: WRITE FPDMA QUEUED

Jul 30 16:30:59 snaps kernel: [186914.255810] ata1.00: cmd 61/08:08:71:fc:4e/00:00:66:00:00/40 tag 1 ncq 4096 out

Jul 30 16:30:59 snaps kernel: [186914.255810] res 40/00:01:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)

Jul 30 16:30:59 snaps kernel: [186914.257940] ata1.00: status: { DRDY }

Jul 30 16:30:59 snaps kernel: [186914.259086] ata1: hard resetting link

Jul 30 16:31:00 snaps kernel: [186914.577366] ata1: SATA link up 6.0 Gbps (SStatus 133 SControl 300)

Jul 30 16:31:00 snaps kernel: [186914.578307] ata1.00: configured for UDMA/133

Jul 30 16:31:00 snaps kernel: [186914.578310] ata1.00: device reported invalid CHS sector 0

Jul 30 16:31:00 snaps kernel: [186914.578311] ata1.00: device reported invalid CHS sector 0

Jul 30 16:31:00 snaps kernel: [186914.578316] ata1: EH complete

The error is always the same, and the only thing on ata1.00 is one of the SSDs. I switched the two SSDs around and the problem followed the same SSD.
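For readers trying to interpret the `cmd` lines in the log above, here is a minimal sketch that decodes the taskfile bytes of an NCQ command. It assumes the field order used by libata's error handler (command/feature:nsect:lbal:lbam:lbah/hob_feature:hob_nsect:hob_lbal:hob_lbam:hob_lbah/device) and the NCQ convention that the feature fields carry the sector count and the tag sits in bits 7:3 of the count field. The function name `decode_ncq_cmd` is hypothetical.

```python
import re

# Assumed layout of libata's "cmd" line:
# command/feature:nsect:lbal:lbam:lbah/hob_feature:hob_nsect:hob_lbal:hob_lbam:hob_lbah/device
CMD_RE = re.compile(
    r"cmd ([0-9a-f]{2})"
    r"/((?:[0-9a-f]{2}:){4}[0-9a-f]{2})"
    r"/((?:[0-9a-f]{2}:){4}[0-9a-f]{2})"
    r"/([0-9a-f]{2})",
    re.IGNORECASE,
)

NCQ_NAMES = {0x60: "READ FPDMA QUEUED", 0x61: "WRITE FPDMA QUEUED"}


def decode_ncq_cmd(line):
    """Decode the taskfile of a READ/WRITE FPDMA QUEUED command from a log line."""
    m = CMD_RE.search(line)
    if m is None:
        raise ValueError("no libata cmd taskfile found in line")
    command = int(m.group(1), 16)
    feature, nsect, lbal, lbam, lbah = (int(x, 16) for x in m.group(2).split(":"))
    hob_feature, _hob_nsect, hob_lbal, hob_lbam, hob_lbah = (
        int(x, 16) for x in m.group(3).split(":")
    )
    # 48-bit LBA: the "hob" (high order byte) fields hold bits 47:24.
    lba = (
        (hob_lbah << 40) | (hob_lbam << 32) | (hob_lbal << 24)
        | (lbah << 16) | (lbam << 8) | lbal
    )
    # For NCQ commands the sector count travels in the feature fields;
    # a count of 0 means 65536 sectors.
    sectors = ((hob_feature << 8) | feature) or 65536
    return {
        "command": NCQ_NAMES.get(command, hex(command)),
        "lba": lba,
        "sectors": sectors,
        "tag": nsect >> 3,  # NCQ tag is stored in bits 7:3 of the count field
    }


if __name__ == "__main__":
    print(decode_ncq_cmd("ata1.00: cmd 61/08:00:39:db:8e/00:00:09:00:00/40 tag 0 ncq 4096 out"))
```

Under those assumptions, the first failed command above decodes to an 8-sector (4096-byte) write at LBA 160357177 with tag 0, which agrees with the `tag 0 ncq 4096 out` suffix the kernel prints.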

I can't force the error to happen on demand. It seems to happen every other day or so, though not at the same time of day. All IO is held up briefly while the link is reset. The drive passes a SMART long self-test.

So is this drive faulty? If not, what can I try to fix this? If so, is there an easy way to prove it for RMA purposes?
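One way to document how often this happens, for example when opening a support case, is to tally the failed commands per ATA device from the kernel log. The following is only a sketch: it assumes syslog-formatted lines like the ones quoted above, and the log path `/var/log/kern.log` in the usage part is an assumption about a default Debian setup. The helper name `tally_failures` is hypothetical.

```python
import re
from collections import Counter

# Matches lines such as:
# "Jul 30 16:30:59 snaps kernel: [186914.250465] ata1.00: failed command: WRITE FPDMA QUEUED"
FAILED_RE = re.compile(
    r"kernel: \[\s*[\d.]+\] (ata\d+\.\d+): failed command: (.+)$"
)


def tally_failures(lines):
    """Count failed ATA commands, keyed by (device, command name)."""
    counts = Counter()
    for line in lines:
        m = FAILED_RE.search(line.rstrip())
        if m:
            counts[(m.group(1), m.group(2))] += 1
    return counts


if __name__ == "__main__":
    # Assumed log location; adjust for the system's syslog configuration.
    with open("/var/log/kern.log", errors="replace") as fh:
        for (device, command), n in sorted(tally_failures(fh).items()):
            print(f"{device}: {n} x {command}")
```

A count per device makes it easy to see whether the failures stay with one port or, after swapping drives, follow one particular SSD.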

Jul 27 05:59:30 snaps kernel: [ 33.054376] ata1.00: ATA-9: INTEL SSDSC2BX016T4, G2010110, max UDMA/133

Jul 27 05:59:30 snaps kernel: [ 33.054474] ata1.00: 3125627568 sectors, multi 1: LBA48 NCQ (depth 31/32)

Jul 27 05:59:30 snaps kernel: [ 33.054567] ata2.00: ATA-9: INTEL SSDSC2BX016T4, G2010110, max UDMA/133

Jul 27 05:59:30 snaps kernel: [ 33.054657] ata2.00: 3125627568 sectors, multi 1: LBA48 NCQ (depth 31/32)

$ sudo smartctl -i /dev/sda

smartctl 6.4 2014-10-07 r4002 [x86_64-linux-3.16.0-4-amd64] (local build)

Copyright (C) 2002-14, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===

Device Model: INTEL SSDSC2BX016T4

Serial Number: BTHC511604V41P6PGN

LU WWN Device Id: 5 5cd2e4 04b7b1bfa

Firmware Version: G2010110

User Capacity: 1,600,321,314,816 bytes [1.60 TB]

Sector Sizes: 512 bytes logical, 4096 bytes physical

Rotation Rate: Solid State Device

Form Factor: 2.5 inches

Device is: Not in smartctl database [for details use: -P showall]

ATA Version is: ACS-2 T13/2015-D revision 3

SATA Version is: SATA 2.6, 6.0 Gb/s (current: 6.0 Gb/s)

Local Time is: Fri Jul 31 11:04:09 2015 UTC

SMART support is: Available - device has SMART capability.

SMART support is: Enabled

$ sudo smartctl -i /dev/sdb

smartctl 6.4 2014-10-07 r4002 [x86_64-linux-3.16.0-4-amd64] (local build)

Copyright (C) 2002-14, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===

Device Model: INTEL SSDSC2BX016T4

Serial Number: BTHC511604SD1P6PGN

LU WWN Device Id: 5 5cd2e4 04b7b1ba2

Firmware Version: G2010110

User Capacity: 1,600,321,314,816 bytes [1.60 TB]

Sector Sizes: 512 bytes logical, 4096 bytes physical

Rotation Rate: Solid State Device

Form Factor: 2.5 inches

Device is: Not in smartctl database [for details use: -P showall]

ATA Version is: ACS-2 T13/2015-D revision 3

SATA Version is: SATA 2.6, 6.0 Gb/s (current: 6.0 Gb/s)

Local Time is: Fri Jul 31 11:04:35 2015 UTC

SMART support is: Available - device has SMART capability.

SMART support is: Enabled
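Since the discussion turns on firmware versions, a small sketch for pulling the model, serial number and firmware version out of `smartctl -i` output may be useful when checking several drives. It assumes the "Key: value" layout shown in the output above; the function name `parse_smartctl_info` is hypothetical, and the device paths in the usage part are the ones from this post.

```python
import subprocess


def parse_smartctl_info(text):
    """Parse the information section of `smartctl -i` output into a dict."""
    info = {}
    in_section = False
    for raw in text.splitlines():
        line = raw.strip()
        if line.startswith("==="):
            # Only collect fields from the information section.
            in_section = "START OF INFORMATION SECTION" in line
            continue
        if not in_section or ":" not in line:
            continue
        # Split on the first colon only, so values containing colons survive.
        key, value = line.split(":", 1)
        info[key.strip()] = value.strip()
    return info


if __name__ == "__main__":
    for dev in ("/dev/sda", "/dev/sdb"):
        # Needs root privileges, like the sudo invocations above.
        out = subprocess.run(
            ["smartctl", "-i", dev], capture_output=True, text=True
        ).stdout
        info = parse_smartctl_info(out)
        print(dev, info.get("Device Model"), info.get("Serial Number"),
              info.get("Firmware Version"))
```

Repeated keys such as "SMART support is" overwrite each other in this sketch, which is acceptable when only the identification fields are of interest.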

Message was edited by: Andy Smith

Now seeing the same problems with the other SSD, so this is not restricted to a single drive.

45 REPLIES

ASmit32
New Contributor II

According to:

https://downloadmirror.intel.com/18455/eng/Intel_SSD_Toolbox_3_3_1_Release_Notes_325993-020US.pdf

"This release of the Intel® SSD Toolbox includes firmware updates for the Intel® SSD Pro 2500 and Intel® SSD 535 Series and the Intel® SSD DC S3710, DC S3610, DC S3510, DC S3500 M.2 and DC S3500 HD Series products"

This appears to be a new firmware update released this week.

However, the SSD Toolbox is Windows-only software. The Intel SSD Data Center Tool (ISDCT), which I would normally use on Linux, does not yet show an update.

1) Is an update coming for this?

2) If not, is there some way to download the firmware update and place it somewhere that ISDCT will find it?

3) Is there any known fix in this firmware update for the problem we are discussing in this thread?

Thanks,

Andy

jbenavides
Valued Contributor II

Hello,

Intel® Solid-State Drive Toolbox version 3.3.1 (https://downloadcenter.intel.com/download/18455/Intel-Solid-State-Drive-Toolbox) was recently released. It contains firmware updates to prevent the behavior mentioned in this thread.

Currently, you need a computer with the Intel® SSD Toolbox installed to perform the update. We expect future versions of the Intel® Solid-State Drive Data Center Tool and the Intel® Firmware Update Tool to contain the firmware updates as well. However, we do not have a specific date for this yet.

ASmit32
New Contributor II

So can you give me a solution for a firmware upgrade that works in Linux please?

Thanks,

Andy

DNeri
New Contributor II

We need a solution that does not depend on Windows.

Thank you.

Regards,

Daniel

JRice4
New Contributor

Hi Jonathan,

"It contains firmware updates to prevent the behavior mentioned in this thread."

I see nothing in the release notes at https://downloadmirror.intel.com/18455/eng/Intel_SSD_Toolbox_3_3_1_Release_Notes_325993-020US.pdf to support this statement. Can you clarify whether you're referring to a change that isn't mentioned in the release notes or, if it is in there, which change in the release notes indicates a fix?

Thanks

James