[tech] Motsugo Downtime (Don't worry, it already happened)
Andrew Adamson
bob at ucc.gu.uwa.edu.au
Fri Aug 23 21:55:32 WST 2013
I had a quick look at the motsugo IPMI event logs and there's nothing in
there about any ECC errors or SMART errors. It did log that the case cover
was taken off at 16:26 though, so the log is definitely working.
A software bug perhaps?
On a side note, I'm going to upgrade the IPMI firmware on motsugo, but
that shouldn't break anything. I hope.
Andrew Adamson
bob at ucc.asn.au
|"If you can't beat them, join them, and then beat them." |
| ---Peter's Laws |
On Fri, 23 Aug 2013, Sam Moore wrote:
> Hi,
>
> Motsugo was giving I/O errors around 3:00pm today, so someone ([MSH] ?)
> rebooted it. It hung before the BIOS on detecting PCI devices. Due to the
> committee meeting that was happening at the same time, it stayed hung
> until [BG3] and I power cycled it (just rebooting it again had no effect).
>
> I think mussel was rebooted as well for some reason.
> Because motsugo was down, mussel couldn't mount /home, so that might have
> caused some issues as well.
>
> Due to motsugo being rebooted without any warning, some of the
> meeting minutes might be lost.
>
> [SZM]
> Unsubscribe here: http://lists.ucc.gu.uwa.edu.au/mailman/options/tech/bob%40ucc.gu.uwa.edu.au
>
More information about the tech
mailing list