[tech] maraena downtime

James Andrewartha trs80 at ucc.gu.uwa.edu.au
Fri Apr 23 20:02:30 WST 2010


Hi all,

The downtime was caused by running 'shutdown -h now' at the wrong prompt 
(maraena's instead of mylah's), abetted by maraena's lack of molly-guard. 
The aim was to restart mylah so it would come up with the tape library's 
SCSI controller attached via Xen PCI passthrough, so that we could run 
amanda on mylah instead of maraena.
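
For anyone unfamiliar with molly-guard: it wraps shutdown/halt/reboot and, 
when the command is run from an SSH session, asks for the machine's 
hostname before going ahead. The check amounts to roughly the following. 
This is a minimal Python sketch for illustration only, not molly-guard's 
actual code; the function names are made up, and using SSH_CONNECTION to 
detect an SSH login is an assumption about how such a guard would work.

```python
import socket


def is_ssh_session(env):
    """Treat the presence of SSH_CONNECTION as 'logged in over SSH' (assumption)."""
    return bool(env.get("SSH_CONNECTION"))


def hostname_matches(typed, actual=None):
    """Compare what the admin typed against this machine's short hostname."""
    if actual is None:
        actual = socket.gethostname()
    short = actual.split(".")[0]
    return typed.strip().lower() == short.lower()


def may_shut_down(env, typed, actual=None):
    """Local sessions pass; SSH sessions must name the host correctly."""
    if not is_ssh_session(env):
        return True
    return hostname_matches(typed, actual)
```

With a check like this in place, answering "mylah" at maraena's prompt 
would have been refused instead of halting the machine.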

Unfortunately it hasn't really worked. Firstly, the aic7xxx module is 
loaded in maraena before the pciback module, so it claims the controller 
despite 'pciback.permissive pciback.hide=(00:06.0)' having been added to 
the kernel command line. Secondly, after manually detaching the PCI device 
via sysfs[1] and starting mylah, the controller never properly attaches 
and just loops forever with the following. Trying rmmod aic7xxx doesn't 
work either, it just hangs.

[  908.400062] Timer Expired
[  908.400085] Recovery code awake
[  908.400095] aic7xxx_abort returns 0x2003
[  908.400118] scsi 0:0:15:0: Attempting to queue a TARGET RESET message
[  908.400129] CDB: 0x12 0x0 0x0 0x0 0x24 0x0
[  908.400176] aic7xxx_dev_reset returns 0x2003
[  908.400242] Recovery SCB completes
[  928.404085] scsi 0:0:15:0: Attempting to queue an ABORT message
[  928.404109] CDB: 0x0 0x0 0x0 0x0 0x0 0x0
[  928.404161] scsi0: At time of recovery, card was not paused
[  928.404177] >>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<<
[  928.404179] scsi0: Dumping Card State in Command phase, at SEQADDR 0xa7
[  928.404196] Card was paused
[  928.404208] ACCUM = 0x80, SINDEX = 0xa0, DINDEX = 0xe4, ARG_2 = 0x0
[  928.404221] HCNT = 0x0 SCBPTR = 0x0
[  928.404231] SCSISIGI[0x86]:(REQI|BSYI|CDI) ERROR[0x0] 
[  928.404269] SCSIBUSL[0x80] LASTPHASE[0x80]:(CDI) 
[  928.404300] SCSISEQ[0x12]:(ENAUTOATNP|ENRSELI) 
[  928.404324] SBLKCTL[0xa]:(SELWIDE|SELBUSB) SCSIRATE[0x0] 
[  928.404357] SEQCTL[0x10]:(FASTMODE) SEQ_FLAGS[0x0] 
[  928.404387] SSTAT0[0x7]:(DMADONE|SPIORDY|SDONE) 
[  928.404415] SSTAT1[0x3]:(REQINIT|PHASECHG) SSTAT2[0x0] 
[  928.404448] SSTAT3[0x0] SIMODE0[0x8]:(ENSWRAP) 
[  928.404477] SIMODE1[0xac]:(ENSCSIPERR|ENBUSFREE|ENSCSIRST|ENSELTIMO) 
[  928.404510] SXFRCTL0[0x88]:(SPIOEN|DFON) DFCNTRL[0x4]:(DIRECTION) 
[  928.404547] DFSTATUS[0x88]:(HDONE|PRELOAD_AVAIL) 
[  928.404570] STACK: 0x35 0x0 0x177 0x35
[  928.404601] SCB count = 4
[  928.404610] Kernel NEXTQSCB = 3
[  928.404619] Card NEXTQSCB = 0
[  928.404627] QINFIFO entries:
[  928.404639] Waiting Queue entries:
[  928.404652] Disconnected Queue entries:
[  928.404665] QOUTFIFO entries:
[  928.404677] Sequencer Free SCB List: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31
[  928.404842] Sequencer SCB Info:
[  928.404850]   0 SCB_CONTROL[0x0] SCB_SCSIID[0x0] SCB_LUN[0x0]
[  928.404891] SCB_TAG[0x0]
[  928.404901]   1 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
[  928.404946] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
[  928.404974]   2 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
[  928.405019] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
[  928.405048]   3 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
[  928.405164] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
[  928.405199]   4 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
[  928.405248] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
[  928.405279]   5 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
[  928.405326] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
[  928.405357]   6 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
[  928.405404] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
[  928.405435]   7 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
[  928.405482] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
[  928.405513]   8 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
[  928.405561] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
[  928.405592]   9 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
[  928.405638] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
[  928.405668]  10 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
[  928.405712] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
[  928.405740]  11 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
[  928.405784] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
[  928.405813]  12 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
[  928.405857] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
[  928.405886]  13 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
[  928.405930] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
[  928.405958]  14 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
[  928.406002] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 
[  928.406031]  15 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) 
[  928.406075] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 
[  928.406104]  16 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) 
[  928.406147] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 
[  928.406176]  17 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) 
[  928.406220] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 
[  928.406249]  18 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) 
[  928.406293] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 
[  928.406322]  19 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) 
[  928.406366] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 
[  928.406394]  20 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) 
[  928.406439] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 
[  928.406468]  21 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) 
[  928.406513] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 
[  928.406542]  22 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) 
[  928.406586] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 
[  928.406615]  23 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) 
[  928.408028] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 
[  928.408028]  24 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) 
[  928.408028] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 
[  928.408028]  25 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) 
[  928.408028] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 
[  928.408028]  26 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) 
[  928.408028] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 
[  928.408028]  27 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) 
[  928.408028] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 
[  928.408028]  28 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) 
[  928.408028] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 
[  928.408028]  29 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) 
[  928.408028] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 
[  928.408028]  30 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) 
[  928.408028] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 
[  928.408028]  31 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) 
[  928.408028] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 
[  928.408028] Pending list: 
[  928.408028]   2 SCB_CONTROL[0x40]:(DISCENB) SCB_SCSIID[0xf7]:(TWIN_CHNLB|TWIN_TID) 
[  928.408028] SCB_LUN[0x0] 
[  928.408028] Kernel Free SCB list: 1 0 
[  928.408028] Untagged Q(15): 2 
[  928.408028] 
[  928.408028] <<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>>
[  928.408028] (scsi0:A:15:0): Device is disconnected, re-queuing SCB
[  928.408028] Recovery code sleeping
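
For reference, the manual detach via sysfs can be sketched as below. This 
is a dry-run helper in Python that only lists the writes; it is the 
commonly documented pciback sequence (unbind from the current driver, then 
new_slot and bind under pciback) and may differ in detail from the steps 
in [1]. The 0000: PCI domain prefix on the address and the function names 
are assumptions for illustration.

```python
import os


def bound_driver(bdf, sysfs_root="/sys"):
    """Return the name of the driver currently bound to the device, or None."""
    link = os.path.join(sysfs_root, "bus", "pci", "devices", bdf, "driver")
    if not os.path.islink(link):
        return None
    return os.path.basename(os.readlink(link))


def rebind_steps(bdf, sysfs_root="/sys"):
    """List the (path, value) sysfs writes that would move bdf to pciback."""
    steps = []
    current = bound_driver(bdf, sysfs_root)
    if current == "pciback":
        return steps  # already hidden from dom0 drivers, nothing to do
    drivers = os.path.join(sysfs_root, "bus", "pci", "drivers")
    if current is not None:
        # Release the device from whichever driver grabbed it (e.g. aic7xxx).
        steps.append((os.path.join(drivers, current, "unbind"), bdf))
    # Tell pciback about the slot, then bind the device to it.
    steps.append((os.path.join(drivers, "pciback", "new_slot"), bdf))
    steps.append((os.path.join(drivers, "pciback", "bind"), bdf))
    return steps


if __name__ == "__main__":
    # Dry run: print the writes instead of performing them.
    for path, value in rebind_steps("0000:00:06.0"):
        print("echo %s > %s" % (value, path))
```

bound_driver() is also a quick way to confirm the load-order problem: if 
it reports aic7xxx rather than pciback after boot, the hide= option did 
not get to the device first.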

[1] http://lists.xensource.com/archives/html/xen-devel/2010-03/msg00448.html 

-- 
# TRS-80              trs80(a)ucc.gu.uwa.edu.au #/ "Otherwise Bub here will do \
# UCC Wheel Member     http://trs80.ucc.asn.au/ #|  what squirrels do best     |
[ "There's nobody getting rich writing          ]|  -- Collect and hide your   |
[  software that I know of" -- Bill Gates, 1980 ]\  nuts." -- Acid Reflux #231 /

