Gość <account_deleted> Opublikowano 24 Listopada 2010 Zgłoś Opublikowano 24 Listopada 2010 To prawie niemożliwe - zepsuł mi się dysk WD... ;) Objawy: zaczął napidalać głowicami (aktuatorem) wydając nieprzyjemne dźwięki, tuż po tym jak dostałem maila, że macierz Raid1 do której z racji wieku został przydzielony została zdegradowana... Spojrzenie do logów: 11:16:58 [16588.320072] ata6: SRST failed (errno=-16) 11:16:58 [16588.320085] ata6: hard resetting link 11:16:58 [16588.320092] ata6: nv: skipping hardreset on occupied port 11:17:03 [16593.850039] ata6: link is slow to respond, please be patient (ready=0) 11:17:06 [16596.730064] ata6: SATA link up 1.5 Gbps (SStatus 113 SControl 300) 11:17:11 [16601.730048] ata6.00: qc timeout (cmd 0xec) 11:17:11 [16601.730059] ata6.00: failed to IDENTIFY (I/O error, err_mask=0x5) 11:17:11 [16601.730066] ata6.00: revaluateidation failed (errno=-5) 11:17:11 [16601.730081] ata6: hard resetting link 11:17:11 [16601.730086] ata6: nv: skipping hardreset on occupied port 11:17:17 [16607.260172] ata6: link is slow to respond, please be patient (ready=0) 11:17:20 [16610.560070] ata6: SATA link up 1.5 Gbps (SStatus 113 SControl 300) 11:17:28 [16618.340356] ata6.00: failed to set xfermode (err_mask=0x1) 11:17:28 [16618.340367] ata6.00: disabled 11:17:28 [16618.340403] ata6: EH complete 11:17:28 [16618.340439] sd 5:0:0:0: [sdd] Unhandled error code 11:17:28 [16618.340443] sd 5:0:0:0: [sdd] Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK 11:17:28 [16618.340451] sd 5:0:0:0: [sdd] CDB: Read(10): 28 00 01 68 19 7d 00 00 08 00 11:17:28 [16618.340468] end_request: I/O error, dev sdd, sector 23599485 11:17:28 [16618.340478] raid1: sdd4: rescheduling sector 0 11:17:28 [16618.340642] sd 5:0:0:0: [sdd] Unhandled error code 11:17:28 [16618.340650] sd 5:0:0:0: [sdd] Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK 11:17:28 [16618.340658] sd 5:0:0:0: [sdd] CDB: Read(10): 28 00 01 68 19 7d 00 00 08 00 11:17:28 [16618.340676] end_request: I/O error, dev sdd, sector 23599485 11:17:28 [16618.340872] sd 5:0:0:0: [sdd] READ CAPACITY(16) failed 11:17:28 [16618.340905] sd 5:0:0:0: [sdd] Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK 11:17:28 [16618.340912] sd 5:0:0:0: [sdd] Sense not available. 11:17:28 [16618.341078] sd 5:0:0:0: [sdd] Unhandled error code 11:17:28 [16618.341082] sd 5:0:0:0: [sdd] Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK 11:17:28 [16618.341088] sd 5:0:0:0: [sdd] CDB: Write(10): 2a 00 01 68 19 7d 00 00 08 00 11:17:28 [16618.341103] end_request: I/O error, dev sdd, sector 23599485 11:17:28 [16618.341139] sd 5:0:0:0: [sdd] READ CAPACITY failed 11:17:28 [16618.341144] sd 5:0:0:0: [sdd] Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK 11:17:28 [16618.341150] sd 5:0:0:0: [sdd] Sense not available. 11:17:28 [16618.341180] raid1: Disk failure on sdd4, disabling device. 11:17:28 [16618.341182] raid1: Operation continuing on 1 devices. 11:17:28 [16618.341206] raid1: sdc4: redirecting sector 0 to another mirror 11:17:28 [16618.341591] sd 5:0:0:0: [sdd] Asking for cache data failed 11:17:28 [16618.341596] sd 5:0:0:0: [sdd] Assuming drive cache: write through 11:17:28 [16618.341608] sdd: detected capacity change from 80000000000 to 0 11:17:28 [16618.451808] RAID1 conf printout: 11:17:28 [16618.451819] --- wd:1 rd:2 11:17:28 [16618.451825] disk 0, wo:0, o:1, dev:sdc4 11:17:28 [16618.451830] disk 1, wo:1, o:0, dev:sdd4 11:17:28 [16618.472551] RAID1 conf printout: 11:17:28 [16618.472561] --- wd:1 rd:2 11:17:28 [16618.472568] disk 0, wo:0, o:1, dev:sdc4 Aby więc zakończyć jego cierpienia, odłączyłem go "na żywca": 11:19:22 [16732.332710] ata6: exception Emask 0x10 SAct 0x0 SErr 0x1810000 action 0xe frozen 11:19:22 [16732.332722] ata6: SError: { PHYRdyChg LinkSeq TrStaTrns } 11:19:22 [16732.332739] ata6: hard resetting link 11:19:23 [16733.080053] ata6: SATA link down (SStatus 0 SControl 300) 11:19:23 [16733.080077] ata6: EH complete 11:19:23 [16733.080105] ata6.00: detaching (SCSI 5:0:0:0) 11:19:23 [16733.093163] sd 5:0:0:0: [sdd] Stopping disk 11:19:23 [16733.093240] sd 5:0:0:0: [sdd] START_STOP FAILED 11:19:23 [16733.093245] sd 5:0:0:0: [sdd] Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK 2 minuty na wyjęcie dysku to niezły wynik, bo P182 obudową serwerową raczej nie jest, a poukładane kabeli zadania nie ułatwiają... W tym momencie straciłem trochę czasu na zrobienie kawy i skrócenie sobie życia o 5 minut poprzez wyjaranie szluga :) Pacjent: Przyczyną zapaści były zanieczyszczone drogi oddechowe aktuatora, oznaczone na fotce. Styki konektora podoginane, PCB oczyszczone z rakotwórczych naleciałości - test: 12:47:40 [22030.880061] ata6: SATA link up 1.5 Gbps (SStatus 113 SControl 300) 12:47:40 [22030.900391] ata6.00: ATA-6: WDC WD800JD-75JNA0, 05.01C05, max UDMA/100 12:47:40 [22030.900398] ata6.00: 156250000 sectors, multi 0: LBA 12:47:40 [22030.920389] ata6.00: configured for UDMA/100 12:47:40 [22030.920404] ata6: EH complete 12:47:40 [22030.920613] scsi 5:0:0:0: Direct-Access ATA WDC WD800JD-75JN 05.0 PQ: 0 ANSI: 5 12:47:40 [22030.920943] sd 5:0:0:0: Attached scsi generic sg5 type 0 12:47:40 [22030.921177] sd 5:0:0:0: [sde] 156250000 512-byte logical blocks: (80.0 GB/74.5 GiB) 12:47:40 [22030.921268] sd 5:0:0:0: [sde] Write Protect is off 12:47:40 [22030.921321] sd 5:0:0:0: [sde] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA 12:47:40 [22030.922815] sde: sde1 sde2 sde3 sde4 12:47:40 [22030.931297] sd 5:0:0:0: [sde] Attached SCSI disk Żyje! SMART: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x000f 200 200 051 Pre-fail Always - 0 3 Spin_Up_Time 0x0003 165 162 021 Pre-fail Always - 2741 4 Start_Stop_Count 0x0032 097 097 000 Old_age Always - 3804 5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0 7 Seek_Error_Rate 0x000f 200 200 051 Pre-fail Always - 0 9 Power_On_Hours 0x0032 078 078 000 Old_age Always - 16749 10 Spin_Retry_Count 0x0013 100 100 051 Pre-fail Always - 0 11 Calibration_Retry_Count 0x0012 100 100 051 Old_age Always - 0 12 Power_Cycle_Count 0x0032 097 097 000 Old_age Always - 3785 194 Temperature_Celsius 0x0022 112 082 000 Old_age Always - 31 196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0 197 Current_Pending_Sector 0x0012 200 200 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0010 200 200 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x003e 200 200 000 Old_age Always - 4761 200 Multi_Zone_Error_Rate 0x0009 200 200 051 Pre-fail Offline - 0 Odbudowa macierzy: 13:09:00 [23310.879332] md: bind<sde4> 13:09:00 [23310.923189] RAID1 conf printout: 13:09:00 [23310.923195] --- wd:1 rd:2 13:09:00 [23310.923199] disk 0, wo:0, o:1, dev:sdc4 13:09:00 [23310.923201] disk 1, wo:1, o:1, dev:sde4 13:09:00 [23310.924582] md: recovery of RAID array md1 13:09:00 [23310.924586] md: minimum _guaranteed_ speed: 1000 KB/sec/disk. 13:09:00 [23310.924588] md: using maximum available idle IO bandwidth (but not more than 200000 KB/sec) for recovery. 13:09:00 [23310.924593] md: using 128k window, over a total of 66324288 blocks. 13:09:04 [23314.458777] md: md1: recovery done. 13:09:04 [23314.518117] RAID1 conf printout: 13:09:04 [23314.518122] --- wd:2 rd:2 13:09:04 [23314.518129] disk 0, wo:0, o:1, dev:sdc4 13:09:04 [23314.518134] disk 1, wo:0, o:1, dev:sde4 ...bez wyłączania kompa :lol: Dlaczego założyłem ten temat: Ostatnio coraz częściej trafiają do mnie dyski, w których jedyną usterką jest zły styk na którymś z konektorów. Jeśli macie "uszkodzony" hdd po gwarancji warto spróbować wyleczyć go poprzez oczyszczenie styków - to nic nie kosztuje ;) Cytuj Udostępnij tę odpowiedź Odnośnik do odpowiedzi Udostępnij na innych stronach Więcej opcji udostępniania...