Date: Wed, 19 Oct 2005 20:50:54 +0200
From: Jeppe Liisberg <none@jeppe.liisberg--gmail.com.lh.bsd-dk.dk>
To: bsd-dk@bsd-dk.dk
Subject: OFFTOPIC: 3ware controller fault?
Hi all,
This may be a bit off-topic for a FreeBSD list, but you are usually
so good at helping that I'll try anyway ;-)
For the second time in two months I appear to have lost a disk on
my 3ware 8006-2LP RAID controller. Both times followed the same pattern:
Ctl  Date                        Severity  Alarm Message
------------------------------------------------------------------------------
c0   -                           -         WARNING: ATA port timeout: Port #0
Unit  UnitType  Status         %Cmpl  Stripe  Size(GB)  Cache  AVerify  IgnECC
------------------------------------------------------------------------------
u0    RAID-1    DEGRADED       -      -       233.761   ON     -        -
Port   Status           Unit   Size        Blocks        Serial
---------------------------------------------------------------
p0     OK               u0     233.76 GB   490234752     B606KH5H
p1     DEGRADED         u0     233.76 GB   490234752     L5040VHH
Both times I managed to rebuild the array once; it then holds for
just under a week, after which it fails with the same warning (ATA port
timeout: Port #0) and cannot be rebuilt.
I have started to suspect that the controller is faulty, and would
like to hear whether any of you out there have an opinion on that.
Among other things, it puzzles me that it reports "ATA port timeout: Port #0"
and yet "p1     DEGRADED" - shouldn't p0 be Port #0?
A somewhat fuller diag dump follows below.
Thanks in advance for any input!
- Jeppe
       3ware DiskSwitch 2/4/8/12
             FE8S 1.05.00.068  19-May-04
         Model No. : 8006-2LP
             Bios BE7X 1.08.00.048
       (c) 1997 - 2003  3ware
INV 0200
             Achip version # 03.20
             Achip version #
             Achip version #
             Checking Pchip Version
             (Will Hang if Incorrect)...
             Pchip version # 01.30-66
Alloc rnd :
        bkgrnd tasks stopped
        waiting for disks ready...
Spinup check:
    Aport 00
    Aport 01
disks ready.
        Drive 00: UDMA100 Maxtor 7B250S0
        Drive 01: UDMA100 Maxtor 7L250S0
READY
         Unit 00: TwinStor[0:1] of a CBOD[0] and a CBOD[1]
<< SOFT reset: count = 0001 >>
    Time: 0000DE1E msec
       3ware DiskSwitch 2/4/8/12
             FE8S 1.05.00.068  19-May-04
         Model No. : 8006-2LP
             Pchip version # 01.30-66
Alloc rnd :
        bkgrnd tasks stopped
        waiting for disks ready...
Spinup check:
    Aport 00
    Aport 01
disks ready.
        Drive 00: UDMA100 Maxtor 7B250S0
        Drive 01: UDMA100 Maxtor 7L250S0
READY
         Unit 00: TwinStor[0:1] of a CBOD[0] and a CBOD[1]
AEN sent to host: 0001
        bkgrnd tasks stopped
--------- SMART Info for last 24 hrs, Day 0001 ---------
0001 soft resets were received.
No timeouts occured on any Aport.
---------
Sbuf memory test...
.5MB Sbuf
OK
--------- SMART Info for last 24 hrs, Day 0002 ---------
0000 soft resets were received.
No timeouts occured on any Aport.
---------
Sbuf memory test...
.5MB Sbuf
OK
 TFR Out 00 20 DF 3B 1B E0 25
 Aport timeout 01 0EB8B0A0 619F
 TFR In  00 20 DF 3B 1B E0 D0
 Reset link ...
 TFR In  00 00 00 00 00 00 80
 TFR In  00 00 00 00 00 00 80
 TFR In  00 00 00 00 00 00 80
 TFR In  00 00 00 00 00 00 80
 TFR In  00 00 00 00 00 00 80
 TFR In  00 00 00 00 00 00 80
 TFR In  00 00 00 00 00 00 80
 TFR In  00 00 00 00 00 00 80
 TFR In  00 00 00 00 00 00 80
AEN sent to host: 0109
Degrade unit :         Unit 00,        Drive 01
AEN sent to host: 0002
         Unit 00: Degraded TwinStor[0:1x] of a CBOD[0] and a CBOD[1]
--------- SMART Info for last 24 hrs, Day 0003 ---------
0000 soft resets were received.
Aport 01 had 0001 timeout reading.
---------
        Drive 00: UDMA100 Maxtor 7B250S0
         Unit 00: Incomplete Degraded TwinStor[0:Fx] of a CBOD[0] and a CBOD[1]
        Drive 00: UDMA100 Maxtor 7B250S0
        Drive 01: UDMA100 Maxtor 7L250S0
---------Error---------
  Status: 00C4
    Code: 0031
    Time: 13668C4A msec
C
         Unit 00: Incomplete Degraded TwinStor[0:Fx] of a CBOD[0] and a CBOD[1]
         Unit 01: CBOD[1]
AEN sent to host: 000B
Rebuilding Unit 00
 00
 TFR Out 00 40 00 A3 02 E0 35
 Aport timeout 01 1380CFA0 7FCE
 TFR In  00 40 00 A3 02 E0 D0
 Reset link ...
 TFR In  00 00 00 00 00 00 80
 TFR In  00 00 00 00 00 00 80
 TFR In  00 00 00 00 00 00 80
 TFR In  00 00 00 00 00 00 80
 TFR In  00 00 00 00 00 00 80
 TFR In  00 00 00 00 00 00 80
 TFR In  00 00 00 00 00 00 80
 TFR In  00 00 00 00 00 00 80
 TFR In  00 00 00 00 00 00 80
AEN sent to host: 010A
AEN sent to host: 0004
        bkgrnd tasks stopped
         Unit 00: Degraded TwinStor[0:1x] of a CBOD[0] and a CBOD[1]
--------- SMART Info for last 24 hrs, Day 0004 ---------
0000 soft resets were received.
Aport 01 had 0001 timeout reading.
---------
        Drive 00: UDMA100 Maxtor 7B250S0
         Unit 00: Incomplete Degraded TwinStor[0:Fx] of a CBOD[0] and a CBOD[1]
        Drive 00: UDMA100 Maxtor 7B250S0
        Drive 01: UDMA100 Maxtor 7L250S0
---------Error---------
  Status: 00C4
    Code: 0031
    Time: 15759182 msec
C
         Unit 00: Incomplete Degraded TwinStor[0:Fx] of a CBOD[0] and a CBOD[1]
         Unit 01: CBOD[1]
AEN sent to host: 000B
Rebuilding Unit 00
 00
 TFR Out 00 40 00 77 00 E0 35
 Aport timeout 01 15787D08 2CF5
 TFR In  00 40 00 77 00 E0 D0
 Reset link ...
 TFR In  00 00 00 00 00 00 80
 TFR In  00 00 00 00 00 00 80
 TFR In  00 00 00 00 00 00 80
 TFR In  00 00 00 00 00 00 80
 TFR In  00 00 00 00 00 00 80
 TFR In  00 00 00 00 00 00 80
 TFR In  00 00 00 00 00 00 80
 TFR In  00 00 00 00 00 00 80
 TFR In  00 00 00 00 00 00 80
AEN sent to host: 010A
AEN sent to host: 0004
        bkgrnd tasks stopped
         Unit 00: Degraded TwinStor[0:1x] of a CBOD[0] and a CBOD[1]
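
To cross-check which port is actually timing out, the "Aport timeout" lines in the dump above can be tallied mechanically. The following is only a minimal sketch in Python, not a 3ware tool. It assumes that the two-digit field after "Aport timeout" is the port index and that the next eight hex digits are a millisecond uptime counter written in hexadecimal. The second assumption is a guess based on the "Time: ... msec" lines in the same dump and is not confirmed by any documentation quoted here. The names `DIAG`, `TIMEOUT_RE` and `tally_timeouts` are made up for this example.

```python
import re
from collections import Counter

# The three timeout lines from the diag dump above, copied verbatim.
DIAG = """\
 Aport timeout 01 0EB8B0A0 619F
 Aport timeout 01 1380CFA0 7FCE
 Aport timeout 01 15787D08 2CF5
"""

# ASSUMPTION: field 1 = port index, field 2 = hex milliseconds since boot.
# The meaning of the last 4-digit field is unknown, so it is matched but ignored.
TIMEOUT_RE = re.compile(r"Aport timeout (\d{2}) ([0-9A-F]{8}) [0-9A-F]{4}")

MS_PER_DAY = 24 * 60 * 60 * 1000


def tally_timeouts(text):
    """Return (timeouts per port, list of (port, uptime in days))."""
    per_port = Counter()
    events = []
    for match in TIMEOUT_RE.finditer(text):
        port = int(match.group(1))
        uptime_ms = int(match.group(2), 16)  # hex string -> integer msec
        per_port[port] += 1
        events.append((port, uptime_ms / MS_PER_DAY))
    return per_port, events


per_port, events = tally_timeouts(DIAG)
for port, days in events:
    print(f"Aport {port:02d}: timeout at roughly {days:.2f} days of uptime")
print("timeouts per port:", dict(per_port))
```

Run against this dump, every timeout is counted on Aport 01, which agrees with "p1 DEGRADED" and "Degrade unit : Unit 00, Drive 01". The dump itself does not explain why the alarm message names Port #0.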