[oclug] rebooting a hung linux box

james terris shinden at sympatico.ca
Mon Aug 27 11:19:34 EDT 2001


Hey, I was wondering if anyone has every seen or worked
on something where one linux box could detect if another
one had locked up or crashed and would reboot it?

The second linux box could ping the first and if it doesn't
get a response for some amount of time (or as soon as it
doesn't get a response) it could hardware reset the
first linus box then wait 10 minutes or so (so it doesn't
keep reseting it as it's starting up) and then start
pinging it again.

Any ideas on how I could do this?

thx,
james



More information about the OCLUG mailing list