Tracebacks In Router Logs

Unanswered Question
Oct 7th, 2009
User Badges:

All,

I have a 6509 that is giving me some traceback errors. I am not sure what service is hanging can anyone give me some insight? Here is a snippet from the logs:

Oct 7 08:12:28.714 EDT: %SYS-SPSTBY-3-CPUHOG: Task is running for (20000)msecs, more than (2000)msecs (12/5),process = CHKPT rcv MSG process.

-Traceback= 40259A54 40F8F00C 40F3A710 410B493C 40801348 40F62278 40F62594 40F62B9C 407C8CD4 40F6250C 407C8CD4 40F635C8 40F5E85C 40F5EA20 40F53058 40F45C84


This is all through the logs any help would be most appreciated. Thanks.


Mario

  • 1
  • 2
  • 3
  • 4
  • 5
Overall Rating: 4 (1 ratings)
Loading.
Giuseppe Larosa Wed, 10/07/2009 - 07:33
User Badges:
  • Super Silver, 17500 points or more
  • Hall of Fame,

    Founding Member

Hello Mario,

this message comes from the standby supervisor

SYS-SPSTBY-3-CPUHOG


and tells a process has used more then 2000 msecs of standby sup main cpu.

That process is CHKPT that is checking a message.

Standby processor should just process packets sent by active supervisor to keep its CEF and other tables updated.


This point to problems in communication between supervisors.

The traceback is a low level extract from memory about the error described in the first message.


This may be a sign of an HW problem or you may be hitting a sw bug in the redundancy feature.

To make a search about possible bugs you should provide the IOS image in use in your chassis.


From a practical point of view if you see several messages like this it can be a real issue that can come before an HW failure of standby supervisor for example.

If it happens only few times it is less critical.



Hope to help

Giuseppe



mrashby Wed, 10/07/2009 - 09:13
User Badges:

Giuseppe,

Thanks this helps a lot now I know where to start troubleshooting. If it is a known bug do I still have to be worried about a hardware issue?


Mario

Giuseppe Larosa Wed, 10/07/2009 - 10:21
User Badges:
  • Super Silver, 17500 points or more
  • Hall of Fame,

    Founding Member

Hello Mario,


>> If it is a known bug do I still have to be worried about a hardware issue?


if you find a sw bug that applies to your scenario this would lead to an IOS upgrade and not to an HW replacement.


Generally speaking, the problem is that it may be difficult to find an exact match.


Hope to help

Giuseppe


johnnylingo Tue, 09/25/2012 - 09:37
User Badges:
  • Bronze, 100 points or more

Started seeing this a few days ago on a 6509 running 12.2(33)SXH1  At first I thought it was software, but then the Supervisors failed over and now it looks like a hardware issue. 

mattleayr Mon, 10/15/2012 - 11:12
User Badges:

johnnylingo - did youresolve this problem - was it a hardware issue?


I'm seeing the same on a standby supervisor 32 running a similar version of code to you - 12.2(33)SXH2a

Actions

This Discussion