[tech] maraena downtime

Adrian Chadd adrian at ucc.gu.uwa.edu.au
Fri Apr 23 20:49:06 WST 2010


Is there an ioapic on that box? Is it working?

If not, is the aic driver xen-aware?



Adrian

On Fri, Apr 23, 2010, James Andrewartha wrote:
> Hi all,
> 
> The downtime was caused by shutdown -h now at the wrong prompt, abetted by 
> maraena's lack of molly-guard. The aim was to reboot mylah so it'd come up 
> with the tape library's SCSI controller thanks to Xen PCI passthrough. 
> This is so we could run amanda on mylah instead of maraena.
> 
> Unfortunately it hasn't really worked. Firstly, the aic7xxx module is 
> loaded in maraena before the pciback module and so grabs the controller, 
> despite 'pciback.permissive pciback.hide=(00:06.0)' being added to the 
> kernel command line. Secondly, after manually detaching the PCI device 
> with sysfs[1] and starting mylah, the device never properly attaches. 
> Trying rmmod aic7xxx doesn't work either; it just hangs. The driver just 
> loops forever with the following:
> 
> [  908.400062] Timer Expired
> [  908.400085] Recovery code awake
> [  908.400095] aic7xxx_abort returns 0x2003
> [  908.400118] scsi 0:0:15:0: Attempting to queue a TARGET RESET message
> [  908.400129] CDB: 0x12 0x0 0x0 0x0 0x24 0x0
> [  908.400176] aic7xxx_dev_reset returns 0x2003
> [  908.400242] Recovery SCB completes
> [  928.404085] scsi 0:0:15:0: Attempting to queue an ABORT message
> [  928.404109] CDB: 0x0 0x0 0x0 0x0 0x0 0x0
> [  928.404161] scsi0: At time of recovery, card was not paused
> [  928.404177] >>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<<
> [  928.404179] scsi0: Dumping Card State in Command phase, at SEQADDR 0xa7
> [  928.404196] Card was paused
> [  928.404208] ACCUM = 0x80, SINDEX = 0xa0, DINDEX = 0xe4, ARG_2 = 0x0
> [  928.404221] HCNT = 0x0 SCBPTR = 0x0
> [  928.404231] SCSISIGI[0x86]:(REQI|BSYI|CDI) ERROR[0x0] 
> [  928.404269] SCSIBUSL[0x80] LASTPHASE[0x80]:(CDI) 
> [  928.404300] SCSISEQ[0x12]:(ENAUTOATNP|ENRSELI) 
> [  928.404324] SBLKCTL[0xa]:(SELWIDE|SELBUSB) SCSIRATE[0x0] 
> [  928.404357] SEQCTL[0x10]:(FASTMODE) SEQ_FLAGS[0x0] 
> [  928.404387] SSTAT0[0x7]:(DMADONE|SPIORDY|SDONE) 
> [  928.404415] SSTAT1[0x3]:(REQINIT|PHASECHG) SSTAT2[0x0] 
> [  928.404448] SSTAT3[0x0] SIMODE0[0x8]:(ENSWRAP) 
> [  928.404477] SIMODE1[0xac]:(ENSCSIPERR|ENBUSFREE|ENSCSIRST|ENSELTIMO) 
> [  928.404510] SXFRCTL0[0x88]:(SPIOEN|DFON) DFCNTRL[0x4]:(DIRECTION) 
> [  928.404547] DFSTATUS[0x88]:(HDONE|PRELOAD_AVAIL) 
> [  928.404570] STACK: 0x35 0x0 0x177 0x35
> [  928.404601] SCB count = 4
> [  928.404610] Kernel NEXTQSCB = 3
> [  928.404619] Card NEXTQSCB = 0
> [  928.404627] QINFIFO entries:
> [  928.404639] Waiting Queue entries:
> [  928.404652] Disconnected Queue entries:
> [  928.404665] QOUTFIFO entries:
> [  928.404677] Sequencer Free SCB List: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31
> [  928.404842] Sequencer SCB Info:
> [  928.404850]   0 SCB_CONTROL[0x0] SCB_SCSIID[0x0] SCB_LUN[0x0]
> [  928.404891] SCB_TAG[0x0]
> [  928.404901]   1 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
> [  928.404946] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
> [  928.404974]   2 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
> [  928.405019] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
> [  928.405048]   3 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
> [  928.405164] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
> [  928.405199]   4 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
> [  928.405248] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
> [  928.405279]   5 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
> [  928.405326] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
> [  928.405357]   6 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
> [  928.405404] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
> [  928.405435]   7 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
> [  928.405482] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
> [  928.405513]   8 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
> [  928.405561] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
> [  928.405592]   9 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
> [  928.405638] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
> [  928.405668]  10 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
> [  928.405712] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
> [  928.405740]  11 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
> [  928.405784] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
> [  928.405813]  12 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
> [  928.405857] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
> [  928.405886]  13 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
> [  928.405930] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
> [  928.405958]  14 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
> [  928.406002] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 
> [  928.406031]  15 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) 
> [  928.406075] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 
> [  928.406104]  16 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) 
> [  928.406147] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 
> [  928.406176]  17 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) 
> [  928.406220] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 
> [  928.406249]  18 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) 
> [  928.406293] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 
> [  928.406322]  19 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) 
> [  928.406366] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 
> [  928.406394]  20 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) 
> [  928.406439] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 
> [  928.406468]  21 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) 
> [  928.406513] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 
> [  928.406542]  22 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) 
> [  928.406586] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 
> [  928.406615]  23 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) 
> [  928.408028] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 
> [  928.408028]  24 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) 
> [  928.408028] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 
> [  928.408028]  25 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) 
> [  928.408028] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 
> [  928.408028]  26 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) 
> [  928.408028] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 
> [  928.408028]  27 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) 
> [  928.408028] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 
> [  928.408028]  28 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) 
> [  928.408028] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 
> [  928.408028]  29 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) 
> [  928.408028] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 
> [  928.408028]  30 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) 
> [  928.408028] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 
> [  928.408028]  31 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) 
> [  928.408028] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 
> [  928.408028] Pending list: 
> [  928.408028]   2 SCB_CONTROL[0x40]:(DISCENB) SCB_SCSIID[0xf7]:(TWIN_CHNLB|TWIN_TID) 
> [  928.408028] SCB_LUN[0x0] 
> [  928.408028] Kernel Free SCB list: 1 0 
> [  928.408028] Untagged Q(15): 2 
> [  928.408028] 
> [  928.408028] <<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>>
> [  928.408028] (scsi0:A:15:0): Device is disconnected, re-queuing SCB
> [  928.408028] Recovery code sleeping
> 
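
For anyone comparing dumps like the one above across attempts, here is a
rough sketch of pulling the NAME[0x..]:(FLAG|FLAG) fields out of a dump
line. The regex and the parse_registers name are my own assumptions about
the format as it appears in this log, not anything taken from the aic7xxx
driver itself.

```python
import re

# Matches fields such as SCSISIGI[0x86]:(REQI|BSYI|CDI) or ERROR[0x0].
# The flag list in parentheses is optional.
FIELD_RE = re.compile(
    r"([A-Z][A-Z0-9_]*)\[(0x[0-9a-fA-F]+)\](?::\(([A-Z0-9_|]+)\))?"
)

def parse_registers(line):
    """Return {register: (value, [flags])} for one dump line (hypothetical helper)."""
    regs = {}
    for name, value, flags in FIELD_RE.findall(line):
        regs[name] = (int(value, 16), flags.split("|") if flags else [])
    return regs

if __name__ == "__main__":
    sample = "[  928.404231] SCSISIGI[0x86]:(REQI|BSYI|CDI) ERROR[0x0] "
    for reg, (value, flags) in parse_registers(sample).items():
        print(f"{reg} = {value:#04x} {flags}")
```

The kernel timestamp in square brackets is skipped because it has no
register name in front of it.
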
> [1] http://lists.xensource.com/archives/html/xen-devel/2010-03/msg00448.html 
> 
> -- 
> # TRS-80              trs80(a)ucc.gu.uwa.edu.au #/ "Otherwise Bub here will do \
> # UCC Wheel Member     http://trs80.ucc.asn.au/ #|  what squirrels do best     |
> [ "There's nobody getting rich writing          ]|  -- Collect and hide your   |
> [  software that I know of" -- Bill Gates, 1980 ]\  nuts." -- Acid Reflux #231 /
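
The manual sysfs detach described in the quoted message can be sketched
roughly as below. This is an assumption about the usual sequence (unbind
from the owning driver, then hand the slot to pciback via new_slot and
bind), not a copy of what was actually run or of the post linked as [1].
The helper names are hypothetical, and the driver directory is assumed to
be "pciback" as in the kernel command line above. It defaults to a dry run
that only prints the equivalent echo commands.

```python
from pathlib import Path

SYSFS_PCI_DRIVERS = Path("/sys/bus/pci/drivers")

def normalise_bdf(bdf):
    """Add the PCI domain prefix if only bus:device.function was given."""
    return bdf if bdf.count(":") == 2 else "0000:" + bdf

def handover_steps(bdf, old_driver="aic7xxx", new_driver="pciback"):
    """List the (sysfs file, value) writes that move a device between drivers."""
    dev = normalise_bdf(bdf)
    old = SYSFS_PCI_DRIVERS / old_driver
    new = SYSFS_PCI_DRIVERS / new_driver
    return [
        (old / "unbind", dev),    # release the device from its current driver
        (new / "new_slot", dev),  # tell the new driver it may claim this slot
        (new / "bind", dev),      # bind the device to the new driver
    ]

def run_steps(steps, dry_run=True):
    """Print the writes, or perform them (needs root) when dry_run is False."""
    for path, value in steps:
        if dry_run:
            print(f"echo {value} > {path}")
        else:
            path.write_text(value)

if __name__ == "__main__":
    run_steps(handover_steps("00:06.0"))
```

Given that rmmod aic7xxx hangs on this box, the unbind write may well hang
the same way, so the dry run is the safer thing to look at first.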

