Re: Frustrated - Weird problem

Author: Brian Cluff
Date:  
To: Main PLUG discussion list
Subject: Re: Frustrated - Weird problem
When I've seen this in the past it's been caused by a loop in the
network, and it takes a while for the broadcast traffic, mostly DHCP in
my case, to build up to the point that there is nothing left of the
network. In my case it was usually caused by someone bringing in a
rogue wireless access point and plugging multiple cables into it.

Try loading something like iptraf and see if you see a ton of odd
traffic from all over the place.
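
If iptraf isn't installed on the box, a rough substitute is to sample the
kernel's interface counters twice and compare. The sketch below is only
an illustration, not a replacement for a real capture tool: it assumes a
Linux /proc/net/dev in the usual layout, the function names are made up
for this example, and it can only show that the receive packet rate has
jumped, not where the packets come from or whether they are broadcasts.

```python
import time


def parse_proc_net_dev(text):
    """Return {interface: (rx_packets, rx_multicast)} from /proc/net/dev text."""
    counters = {}
    for line in text.splitlines()[2:]:  # the first two lines are column headers
        if ":" not in line:
            continue
        name, data = line.split(":", 1)
        fields = data.split()
        # Receive columns, in order:
        # bytes packets errs drop fifo frame compressed multicast
        counters[name.strip()] = (int(fields[1]), int(fields[7]))
    return counters


def rx_rates(before, after, seconds):
    """Return {interface: (packets_per_sec, multicast_per_sec)} between two samples."""
    rates = {}
    for iface, (pkts, mcast) in after.items():
        if iface in before:
            old_pkts, old_mcast = before[iface]
            rates[iface] = ((pkts - old_pkts) / seconds,
                            (mcast - old_mcast) / seconds)
    return rates


def read_counters(path="/proc/net/dev"):
    """Read and parse the live counters (Linux only)."""
    with open(path) as f:
        return parse_proc_net_dev(f.read())


def sample_rates(interval=5.0):
    """Take two samples `interval` seconds apart and return the receive rates."""
    first = read_counters()
    time.sleep(interval)
    return rx_rates(first, read_counters(), interval)
```

Calling sample_rates() on a machine that should be idle and seeing
thousands of received packets per second would fit the loop theory.
Whether the multicast column also counts broadcast frames depends on the
NIC driver, so treat that number as a hint at best.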

Brian Cluff

On 09/02/2010 08:03 PM, Simon Chatfield wrote:
> Ok, I've got a doozy of an issue which has happened twice this week and
> is absolutely crushing to my clients who are in busy season right about
> now. Here's the issue...
>
> I have a beefy linux database server which runs both postgres and mysql.
> We just recently loaded mysql and are putting it under significant load.
>
> Apparently at random, twice this week (Monday and this evening) it
> appears to take the network down save for a single machine which we are
> still able to ssh into. There are 6 other boxes which we cannot ssh into
> when this occurs. Link light activity does appear to still be active on
> the network. The method for solving the problem has been to hard reboot
> this specific server and as soon as it goes down, we can access the
> other boxes via ssh and they start working again. When the box comes
> back up, we can then ssh into that machine and everything is good (until
> it happens again that is). After the reboot, there isn't much in the
> logs, but I see the log entry for the tech unplugging and plugging in
> the computer from the switch PRIOR to the reboot so the network link was
> detected and logged even though it was not responding to ssh.
>
> These machines are hosted down at i/o so a hard reboot costs us
> significant time while we get a tech to handle it.
>
> Has anyone ever heard of a single linux box bringing down 'most' of a
> network? then reboot and the other boxes are then accessible?
>
> My client is at his wits' end, and I don't blame him. However, I'm not
> even sure what kind of problem this is. hardware on that box? system
> configuration? a bad switch?
>
> Looking for ideas at least, and if someone has time and ability, I'd
> love to have someone on-site to help debug and fix this issue...
>
> Thanks everyone!
>


---------------------------------------------------
PLUG-discuss mailing list -
To subscribe, unsubscribe, or to change your mail settings:
http://lists.PLUG.phoenix.az.us/mailman/listinfo/plug-discuss