Subject: RE: OFFTOPIC: 3ware controller fejl? Date: Thu, 20 Oct 2005 07:42:38 +0200 From: "Albers, Tony" <none@tony.albers--hp.com.lh.bsd-dk.dk> To: <none@bsd-dk--bsd-dk.dk.lh.bsd-dk.dk>
Hej Jeppe,
Jeg ville starte med at undersøge om der evt er kommet ny formware til den -det kan jo godt være at fjelen du ser er rettet.
/tony
-----Original Message-----
From: owner-bsd-dk@hobbes.bsd-dk.dk [mailto:owner-bsd-dk@hobbes.bsd-dk.dk] On Behalf Of Jeppe Liisberg
Sent: 19. oktober 2005 20:51
To: bsd-dk@bsd-dk.dk
Subject: OFFTOPIC: 3ware controller fejl?
Hej Alle,
Det er måske lidt offtopic på en freebsd liste, men i plejer at være så gode til at hjælpe, så jeg prøver alligevel ;-)
Jeg har nu for 2. gang på 2 måneder tilsyneladende mistet en disk på min 3ware 8006-2lp raid controller. Begge gange efter samme mønster:
Ctl Date Severity Alarm Message
------------------------------------------------------------------------------
c0 - - WARNING: ATA port timeout: Port #0
Unit UnitType Status %Cmpl Stripe Size(GB) Cache AVerify IgnECC
------------------------------------------------------------------------------
u0 RAID-1 DEGRADED - - 233.761 ON -
-
Port Status Unit Size Blocks Serial
---------------------------------------------------------------
p0 OK u0 233.76 GB 490234752 B606KH5H
p1 DEGRADED u0 233.76 GB 490234752 L5040VHH
Begge gange er det lykkedes at rebuilde arrayet een gang, og det holder en lille uges tid, og så fejler det med samme warning (ATA port
timeout: Port #0) og kan ikke rebuildes.
Jeg er begyndt at mistænke controlleren for at være fejlbehæftet, og vil gerne høre om nogle af jer derude har en mening om det?
Det undrer mig blandt andet at den siger "ATA port timeout: Port #0"
og "p1 DEGRADED" - burde det ikke være p0=Port #0 ???
nedenfor en lidt mere fyldig diag.
På forhånd tak for input!
- Jeppe
3ware DiskSwitch 2/4/8/12
FE8S 1.05.00.068 19-May-04
Model No. : 8006-2LP
Bios BE7X 1.08.00.048
(c) 1997 - 2003 3ware
INV 0200
Achip version # 03.20
Achip version #
Achip version #
Checking Pchip Version
(Will Hang if Incorrect)...
Pchip version # 01.30-66
Alloc rnd :
bkgrnd tasks stopped
waiting for disks ready...
Spinup check:
Aport 00
Aport 01
disks ready.
Drive 00: UDMA100 Maxtor 7B250S0
Drive 01: UDMA100 Maxtor 7L250S0
READY
Unit 00: TwinStor[0:1] of a CBOD[0] and a CBOD[1]
<< SOFT reset: count = 0001 >>
Time: 0000DE1E msec
3ware DiskSwitch 2/4/8/12
FE8S 1.05.00.068 19-May-04
Model No. : 8006-2LP
Pchip version # 01.30-66
Alloc rnd :
bkgrnd tasks stopped
waiting for disks ready...
Spinup check:
Aport 00
Aport 01
disks ready.
Drive 00: UDMA100 Maxtor 7B250S0
Drive 01: UDMA100 Maxtor 7L250S0
READY
Unit 00: TwinStor[0:1] of a CBOD[0] and a CBOD[1]
AEN sent to host: 0001
bkgrnd tasks stopped
--------- SMART Info for last 24 hrs, Day 0001 ---------
0001 soft resets were received.
No timeouts occured on any Aport.
---------
Sbuf memory test...
.5MB Sbuf
OK
--------- SMART Info for last 24 hrs, Day 0002 --------- 0000 soft resets were received.
No timeouts occured on any Aport.
---------
Sbuf memory test...
.5MB Sbuf
OK
TFR Out 00 20 DF 3B 1B E0 25
Aport timeout 01 0EB8B0A0 619F
TFR In 00 20 DF 3B 1B E0 D0
Reset link ...
TFR In 00 00 00 00 00 00 80
TFR In 00 00 00 00 00 00 80
TFR In 00 00 00 00 00 00 80
TFR In 00 00 00 00 00 00 80
TFR In 00 00 00 00 00 00 80
TFR In 00 00 00 00 00 00 80
TFR In 00 00 00 00 00 00 80
TFR In 00 00 00 00 00 00 80
TFR In 00 00 00 00 00 00 80
AEN sent to host: 0109
Degrade unit : Unit 00, Drive 01
AEN sent to host: 0002
Unit 00: Degraded TwinStor[0:1x] of a CBOD[0] and a CBOD[1]
--------- SMART Info for last 24 hrs, Day 0003 --------- 0000 soft resets were received.
Aport 01 had 0001 timeout reading.
---------
Drive 00: UDMA100 Maxtor 7B250S0
Unit 00: Incomplete Degraded TwinStor[0:Fx] of a CBOD[0] and a CBOD[1]
Drive 00: UDMA100 Maxtor 7B250S0
Drive 01: UDMA100 Maxtor 7L250S0
---------Error---------
Status: 00C4
Code: 0031
Time: 13668C4A msec
C
Unit 00: Incomplete Degraded TwinStor[0:Fx] of a CBOD[0] and a CBOD[1]
Unit 01: CBOD[1]
AEN sent to host: 000B
Rebuilding Unit 00
00
TFR Out 00 40 00 A3 02 E0 35
Aport timeout 01 1380CFA0 7FCE
TFR In 00 40 00 A3 02 E0 D0
Reset link ...
TFR In 00 00 00 00 00 00 80
TFR In 00 00 00 00 00 00 80
TFR In 00 00 00 00 00 00 80
TFR In 00 00 00 00 00 00 80
TFR In 00 00 00 00 00 00 80
TFR In 00 00 00 00 00 00 80
TFR In 00 00 00 00 00 00 80
TFR In 00 00 00 00 00 00 80
TFR In 00 00 00 00 00 00 80
AEN sent to host: 010A
AEN sent to host: 0004
bkgrnd tasks stopped
Unit 00: Degraded TwinStor[0:1x] of a CBOD[0] and a CBOD[1]
--------- SMART Info for last 24 hrs, Day 0004 --------- 0000 soft resets were received.
Aport 01 had 0001 timeout reading.
---------
Drive 00: UDMA100 Maxtor 7B250S0
Unit 00: Incomplete Degraded TwinStor[0:Fx] of a CBOD[0] and a CBOD[1]
Drive 00: UDMA100 Maxtor 7B250S0
Drive 01: UDMA100 Maxtor 7L250S0
---------Error---------
Status: 00C4
Code: 0031
Time: 15759182 msec
C
Unit 00: Incomplete Degraded TwinStor[0:Fx] of a CBOD[0] and a CBOD[1]
Unit 01: CBOD[1]
AEN sent to host: 000B
Rebuilding Unit 00
00
TFR Out 00 40 00 77 00 E0 35
Aport timeout 01 15787D08 2CF5
TFR In 00 40 00 77 00 E0 D0
Reset link ...
TFR In 00 00 00 00 00 00 80
TFR In 00 00 00 00 00 00 80
TFR In 00 00 00 00 00 00 80
TFR In 00 00 00 00 00 00 80
TFR In 00 00 00 00 00 00 80
TFR In 00 00 00 00 00 00 80
TFR In 00 00 00 00 00 00 80
TFR In 00 00 00 00 00 00 80
TFR In 00 00 00 00 00 00 80
AEN sent to host: 010A
AEN sent to host: 0004
bkgrnd tasks stopped
Unit 00: Degraded TwinStor[0:1x] of a CBOD[0] and a CBOD[1]
This archive was generated by hypermail 2b30 : Wed 15 Nov 2006 - 18:24:53 CET