Uptime Baby!

der.hans PLUGd@LuftHans.com
Wed, 18 Oct 2000 01:58:48 -0700 (MST)


Am 18. Oct, 2000 schwäzte Jason so:

> sinck@ugive.com wrote:
> > 
> > \_ If I had a business machine, I would try and have a cron job reboot
> > \_ the sucker on sunday night, at a random time between 3 and 3:30 AM
> > \_ when the load averages have been under .0? (whatever, depends on the
> > \_ CPU, etc etc) for at least 15 minutes. Why random?
> > Well, how about just "Why?"
> 
> If the machine is running unattended, it cleans out any processes that
> for one reason or another fail to terminate.
> 
> Netscape is perhaps the most heinous example of this (though that
> wouldnt be running unattended...) .. a crashed netscape can persist
> even after you exit from X!

There are better ways to hunt down spinning processes. A simple method is
to note that it's eating time. Prob is that Netscape also does this when
being used :(.

Doesn't it lose assocication with a tty when this happens?

sh-2.01$ uptime
  1:50am  up 259 days,  2:23, 34 users,  load average: 0.00, 0.01, 0.00
sh-2.01$ ps aux | tr : ' ' | sort +10 -n | tail -5
root       435  0.0  0.3  3004   816  p2 S    02 03  22 01 xterm 
lufthans   347  0.0  1.1  7572  2928  ?  S    01 59  23 44 kwm 
root      2683  0.0  0.2  2884   612  ?  S    01 50  90 31 xterm 
root     30178  1.2  0.1  1668   452  p3 S   Oct  7 182 47 ssh spliff 
root       176 99.9  9.6 36824 24988  ?  S    01 55 967 01 /usr/bin/X11/X

X has been up since I booted, so almost 300 days. It still hasn't broken
1000 minutes.

If it's a desktop you can probably trigger on anything hitting more than
100 minutes. If it's a server, maybe try 100, but be prepared to move to
500 or 1000 for the services the machine is providing.

If you've got spinning processes, check them by hand. You can probably
just kill Netscapes. If it's a service like nfs or apache you probably
want to investigate why it's spinning, not just ignore the prob.

ciao,

der.hans
-- 
#  der.hans@LuftHans.com   home.pages.de/~lufthans/   www.Opnix.com
#  You can't handle the source! - der.hans