System Crashed when using Mondo Backup

Top Page
Attachments:
Message as email
+ (text/plain)
+ (text/html)
Delete this message
Reply to this message
Author: Bill Wesson
Date:  
To: plug-discuss
Subject: System Crashed when using Mondo Backup
I run a nightly backup at 2AM of our Fedora Core 1 server using Mondo. About
once every two weeks the system will freeze up. It won't process email,
present web pages, and I can't log into it using SSH. The console screen is
blank. We have to cold start server.

The easiest suggestion is to stop using Mondo, but I like the fast recovery
Mondo offers.

So we're trying to figure out how to diagnose this problem and where to
start. Memory, Hard Drive (is new), IDE controller, etc?

Does anyone have any ideas where to start?

Thanks,
--Bill Wesson

This is my script to run Mondo daily at 2AM:

mkdir -p /home/mondo/`date +%A`
mondoarchive -Oi -d /home/mondo/`date +%A` -E "/home/mondo"


Log files don't provide any clues.

CRON log for Sunday-2AM
Oct 24 01:50:00 payson CROND[24793]: (root) CMD (/usr/local/bin/weblogs)
Oct 24 02:00:00 payson CROND[25052]: (root) CMD (run-parts
/etc/cron.daily-2am)
Oct 24 02:00:00 payson CROND[25056]: (root) CMD (nice --adjustment=15
/usr/local/sbin/update_site_summary_cache)
Oct 24 02:00:00 payson CROND[25054]: (root) CMD (/usr/local/bin/weblogs)
Oct 24 02:01:00 payson CROND[25961]: (root) CMD (run-parts /etc/cron.hourly)
Oct 24 02:10:00 payson CROND[7866]: (root) CMD (/usr/local/bin/weblogs)

CRON log for Monday-2AM
Oct 25 01:50:00 payson CROND[29959]: (root) CMD (/usr/local/bin/weblogs)
Oct 25 02:00:00 payson CROND[30059]: (root) CMD (run-parts
/etc/cron.daily-2am)
Oct 25 02:00:00 payson CROND[30063]: (root) CMD (nice --adjustment=15
/usr/local/sbin/update_site_summary_cache)
Oct 25 02:00:00 payson CROND[30061]: (root) CMD (/usr/local/bin/weblogs)
Oct 25 02:01:00 payson CROND[30953]: (root) CMD (run-parts /etc/cron.hourly)
Oct 25 07:28:59 payson crond[2982]: (CRON) STARTUP (fork ok)

MESSAGES log for Monday-2AM
Oct 25 01:50:08 payson logger: weblogs: (29959) done.
Oct 25 02:00:00 payson logger: weblogs: (30061) starting.
Oct 25 02:00:01 payson autofs: automount shutdown succeeded
Oct 25 02:00:09 payson logger: weblogs: (30061) done.
Oct 25 02:04:03 payson named[1736]: lame server resolving 'dogg.com' (in
'dogg.c
om'?): 68.157.222.155#53
Oct 25 02:04:03 payson named[1736]: lame server resolving 'dogg.com' (in
'dogg.c
om'?): 207.65.0.25#53
Oct 25 02:04:03 payson named[1736]: lame server resolving 'dogg.com' (in
'dogg.c
om'?): 68.157.222.155#53
Oct 25 02:04:03 payson named[1736]: lame server resolving 'dogg.com' (in
'dogg.c
om'?): 207.65.0.25#53
Oct 25 02:04:03 payson named[1736]: lame server resolving 'dogg.com' (in
'dogg.c
om'?): 68.157.222.155#53
Oct 25 02:04:03 payson named[1736]: lame server resolving 'dogg.com' (in
'dogg.c
om'?): 207.65.0.25#53
Oct 25 02:04:03 payson named[1736]: lame server resolving 'dogg.com' (in
'dogg.c
om'?): 68.157.222.155#53
Oct 25 02:04:03 payson named[1736]: lame server resolving 'dogg.com' (in
'dogg.c
om'?): 207.65.0.25#53
Oct 25 07:27:44 payson syslogd 1.4.1: restart.

1AM - Monday
             total       used       free     shared    buffers     cached
Mem:        773208     713100      60108          0      66540     522432
-/+ buffers/cache:     124128     649080
Swap:      2048248      90048    1958200



2AM - Monday
             total       used       free     shared    buffers     cached
Mem:        773208     768768       4440          0     102908     491276
-/+ buffers/cache:     174584     598624
Swap:      2048248      90048    1958200


Thanks,
Bill Wesson, Network Administrator
Vision Engraving Systems
<http://www.visionengravers.com> http://www.visionengravers.com
17621 N. Black Canyon Hwy
Phoenix, AZ 85023
602-439-0600