[tech] [committee] Temperature Monitoring in Server Room [repost]
Andrew Williams
andrew at ucc.gu.uwa.edu.au
Tue Mar 19 15:55:24 AWST 2019
On 2019-03-19 3:36 PM, Melissa Star wrote:
> As for what UCC does, are you saying that the club should choose externally written code over the code written by a club member?
>
> I've had a look at Icigna2, and it seems really good, to be fair. But it would not give me the experience of interacting more directly with the hardware and writing some interesting code.
>
> And it would not encourage others to make the type of home-grown solutions that encourage learning.
It depends what people want to learn - having experience with a
widely-used server monitoring tool would be good experience for anyone
wanting to get a sysadmin job in a large organisation. Working with
hardware and rolling your own code is good experience for anyone wanting
to be a developer.
> One thought - running such a thing on a VM is a bad idea given it should run on every single machine to report its own internal state, not all machines can run VMs, and that will waste a hell of a lot of CPU and resources.
You _can_ run an instance of icinga on every machine, and set them up as
a cluster, but you wouldn't run each instance in a VM - that way it
could only report the state of its own VM, not the host machine.
In practice, I found it easier to run one instance of the icinga process
and web server (which we have a VM for, but you don't need one). That
process then uses SSH to run the plugins on each machine - essentially,
to monitor disk space, for example, it's doing the equivalent of 'ssh
foo at bar "df"' once every hour, or day, or whatever interval you specify.
You need to set up a monitoring account with the icinga public SSH key
on each monitored machine, to accept the SSH connections without a
password.
Andrew
More information about the tech
mailing list