Server obecně drhne v náhodných intervalech při I/O operacích. Nejvíc to jde vidět na DB dotazech, kdy i triviální select občas vyskočí až na 10s nebo 15s.
Ve chvílích, kdy to laguje jede aktivita systémovýho disku na maximum.
* Server jede 320dní bez restartu (tim pádem i bez kontroly disku)
* Badblocks hlásí 0 vadnej sektorů na disku.
* smartctl hlásí testresults jako passed.
Code:
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
Please note the following marginal Attributes:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
190 Airflow_Temperature_Cel 0x0022 061 044 045 Old_age Always In_the_past 39 (Min/Max 16/43)
Code:
=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x000f 111 084 006 Pre-fail Always - 0
3 Spin_Up_Time 0x0003 096 095 000 Pre-fail Always - 0
4 Start_Stop_Count 0x0032 100 100 020 Old_age Always - 149
5 Reallocated_Sector_Ct 0x0033 100 100 036 Pre-fail Always - 0
7 Seek_Error_Rate 0x000f 088 061 030 Pre-fail Always - 691666502
9 Power_On_Hours 0x0032 050 050 000 Old_age Always - 44263
10 Spin_Retry_Count 0x0013 100 100 097 Pre-fail Always - 0
12 Power_Cycle_Count 0x0032 100 100 020 Old_age Always - 172
187 Reported_Uncorrect 0x0032 098 098 000 Old_age Always - 2
189 High_Fly_Writes 0x003a 100 100 000 Old_age Always - 0
190 Airflow_Temperature_Cel 0x0022 061 044 045 Old_age Always In_the_past 39 (Min/Max 16/43)
194 Temperature_Celsius 0x0022 039 056 000 Old_age Always - 39 (0 16 0 0 0)
195 Hardware_ECC_Recovered 0x001a 066 047 000 Old_age Always - 575606
197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0010 100 100 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x003e 200 200 000 Old_age Always - 0
200 Multi_Zone_Error_Rate 0x0000 100 253 000 Old_age Offline - 0
202 Data_Address_Mark_Errs 0x0032 100 253 000 Old_age Always - 0
* Nicméně dnešní dmesg vypadá takhle:
Code:
[28342526.232020] ata4.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0
[28342526.232026] ata4.00: BMDMA stat 0x64
[28342526.232029] ata4.00: failed command: READ DMA
[28342526.232033] ata4.00: cmd c8/00:80:3f:5e:9c/00:00:00:00:00/e8 tag 0 dma 65536 in
res 51/40:00:8b:5e:9c/00:00:00:00:00/e8 Emask 0x9 (media error)
[28342526.232037] ata4.00: status: { DRDY ERR }
[28342526.232039] ata4.00: error: { UNC }
[28342526.367719] ata4.00: configured for UDMA/133
[28342526.380312] ata4.01: configured for UDMA/133
[28342526.380319] sd 3:0:0:0: [sda] Unhandled sense code
[28342526.380320] sd 3:0:0:0: [sda]
[28342526.380322] Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
[28342526.380323] sd 3:0:0:0: [sda]
[28342526.380324] Sense Key : Medium Error [current] [descriptor]
[28342526.380327] Descriptor sense data with sense descriptors (in hex):
[28342526.380328] 72 03 11 04 00 00 00 0c 00 0a 80 00 00 00 00 00
[28342526.380333] 08 9c 5e 8b
[28342526.380335] sd 3:0:0:0: [sda]
[28342526.380337] Add. Sense: Unrecovered read error - auto reallocate failed
[28342526.380338] sd 3:0:0:0: [sda] CDB:
[28342526.380340] Read(10): 28 00 08 9c 5e 3f 00 00 80 00
[28342526.380344] end_request: I/O error, dev sda, sector 144465547
[28342526.380352] ata4: EH complete
[28342529.820024] ata4.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0
[28342529.820031] ata4.00: BMDMA stat 0x64
[28342529.820034] ata4.00: failed command: READ DMA
[28342529.820038] ata4.00: cmd c8/00:40:67:5e:9c/00:00:00:00:00/e8 tag 0 dma 32768 in
res 51/40:00:8b:5e:9c/00:00:00:00:00/e8 Emask 0x9 (media error)
[28342529.820042] ata4.00: status: { DRDY ERR }
[28342529.820044] ata4.00: error: { UNC }
[28342529.957837] ata4.00: configured for UDMA/133
[28342529.972323] ata4.01: configured for UDMA/133
[28342529.972337] sd 3:0:0:0: [sda] Unhandled sense code
[28342529.972339] sd 3:0:0:0: [sda]
[28342529.972340] Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
[28342529.972342] sd 3:0:0:0: [sda]
[28342529.972343] Sense Key : Medium Error [current] [descriptor]
[28342529.972345] Descriptor sense data with sense descriptors (in hex):
[28342529.972346] 72 03 11 04 00 00 00 0c 00 0a 80 00 00 00 00 00
[28342529.972352] 08 9c 5e 8b
[28342529.972354] sd 3:0:0:0: [sda]
[28342529.972356] Add. Sense: Unrecovered read error - auto reallocate failed
[28342529.972357] sd 3:0:0:0: [sda] CDB:
[28342529.972359] Read(10): 28 00 08 9c 5e 67 00 00 40 00
[28342529.972363] end_request: I/O error, dev sda, sector 144465547
[28342529.972367] Buffer I/O error on device sda1, logical block 18058185
[28342529.972370] Buffer I/O error on device sda1, logical block 18058186
[28342529.972372] Buffer I/O error on device sda1, logical block 18058187
[28342529.972375] Buffer I/O error on device sda1, logical block 18058188
[28342529.972381] ata4: EH complete
* Přemountovat disk s jinejma parametrama nemůžu, aniž by ho někdo v serverovně fyzicky znovu nahodil.
* Disk je tam tenhle
http://www.hdsentinel.com/storageinfo_d ... TM380211AS* Deska:
http://www.newegg.com/Product/Product.a ... 6813131200Nápady?
Umírá disk, nebo to může bejt něco jinýho?
Dát tam nějaký nový SSDčko? Kdo ho koupí a nainstaluje?
edit: Tak podle názorů na netu tyhle data znamenají, disk opravdu brzy odejde do křemíkového nebe.
Už můžete panikařit.