Cat6500 error mesage %SYSTEM_CONTROLLER-SP-3-EXCESSIVE_RESET

Unanswered Question
Aug 5th, 2008

Hi all!!

We have a new Cat6500 Chassis.

All Modules: Sup720, Line Cards, and Service Modules aren't new (they don't come with the Chassis).

Some days ago, the switch was turned-on, but not in Production yet.

The following messages appeared:

Jul 27 07:03:29 cst: %SYSTEM_CONTROLLER-SP-3-EXCESSIVE_RESET: System Controller is getting reset so frequently

Jul 28 12:45:57 cst: %SYSTEM_CONTROLLER-SP-3-EXCESSIVE_RESET: System Controller is getting reset so frequently

Jul 31 16:17:08 cst: %SYSTEM_CONTROLLER-SP-3-EXCESSIVE_RESET: System Controller is getting reset so frequently

Aug 2 00:13:03 cst: %SYSTEM_CONTROLLER-SP-3-EXCESSIVE_RESET: System Controller is getting reset so frequently

Aug 3 00:37:00 cst: %SYSTEM_CONTROLLER-SP-3-EXCESSIVE_RESET: System Controller is getting reset so frequently

Aug 3 14:18:24 cst: %SYSTEM_CONTROLLER-SP-3-EXCESSIVE_RESET: System Controller is getting reset so frequently

Aug 4 10:49:47 cst: %SYSTEM_CONTROLLER-SP-3-EXCESSIVE_RESET: System Controller is getting reset so frequently

Aug 5 01:13:24 cst: %SYSTEM_CONTROLLER-SP-3-EXCESSIVE_RESET: System Controller is getting reset so frequently

Any idea with that? It is a HW Error or a Software BUG? What do you recommend to solve this??

#sh module

Mod Ports Card Type Model Serial No.

--- ----- -------------------------------------- ------------------ -----------

1 48 CEF720 48 port 10/100/1000mb Ethernet WS-X6748-GE-TX SAL1121PJ1H

5 2 Supervisor Engine 720 (Active) WS-SUP720-BASE SAD08300A08

7 24 CEF720 24 port 1000mb SFP WS-X6724-SFP SAL11413DC2

8 4 SLB Application Processor Complex WS-X6066-SLB-APC SAD0804088N

9 1 Application Control Engine Module ACE10-6500-K9 SAD111700WL

Mod MAC addresses Hw Fw Sw Status

--- ---------------------------------- ------ ------------ ------------ -------

1 001b.d40e.c0b0 to 001b.d40e.c0df 2.5 12.2(14r)S5 12.2(18)SXF1 Ok

5 000f.f7be.c8e0 to 000f.f7be.c8e3 3.1 8.1(3) 12.2(18)SXF1 Ok

7 001d.a246.266c to 001d.a246.2683 2.6 12.2(14r)S5 12.2(18)SXF1 Ok

8 000f.2320.6f2e to 000f.2320.6f35 1.7 3.1(6) Ok

9 001b.53bc.b58c to 001b.53bc.b593 1.4 8.7(0.22)ACE 3.0(0)A1(6.3 Ok

Mod Sub-Module Model Serial Hw Status

---- --------------------------- ------------------ ----------- ------- -------

1 Centralized Forwarding Card WS-F6700-CFC SAD112104ZP 3.1 Ok

5 Policy Feature Card 3 WS-F6K-PFC3A SAD083406YZ 2.4 Ok

5 MSFC3 Daughterboard WS-SUP720 SAD083405TK 2.3 Ok

7 Centralized Forwarding Card WS-F6700-CFC SAL1209HL5V 4.0 Ok

Mod Online Diag Status

---- -------------------

1 Pass

5 Pass

7 Pass

8 Pass

9 Pass

The IOS version is:s72033-ipservicesk9-mz.122-18.SXF14.bin"

cisco WS-C6509-E (R7000) processor (revision 1.4) with 458720K/65536K bytes of memory.

Thanks in advance...

Pedro

I have this problem too.
0 votes
  • 1
  • 2
  • 3
  • 4
  • 5
Overall Rating: 4 (2 ratings)
Loading.
Mark Yeates Tue, 08/05/2008 - 07:14

Pedro,

Is your chassis properly grounded? I have TAC case currently open on the exact same issue. From the engineer on the case that this error is caused of random static discharge or other external factors. I don't have a final resolution, but when it comes available I will let you know.

Mark

PEDRO AGUIRRE Tue, 08/05/2008 - 08:01

Hello Mark,

Thanks for your info and your time!

Please let me know your progress and solution for your TAC case...

By now, we will put in practice the recommendation on "Output interpreter" (verify the correct Module insertion and power-off the Slot/Chassis).

Best Regards!!

Pedro

Ryan Carretta Wed, 08/06/2008 - 01:00

Two things.

First, the message you're seeing is misleading. The error message is printed every time the system controller is reset - so the 'excessively' is incorrect in that sense. Each message corresponds to one inband channel resync. This is tracked by CSCsg69605. In future releases the message will read:

%SYSTEM_CONTROLLER-3-RESET: System Controller is reset. Normal Operation continues.

Second,

This is often the case with hardware that is improperly seated. You aren't seeing the message often (a good thing - usually in bad hardware cases we see it many many times per minute), but it could still be some kind of HW issue. Try reseating the supervisor with complete diagnostics set ('diagnostic level bootup complete'). If that doesn't reveal anything, try a spare if possible and/or replace the HW.

PEDRO AGUIRRE Wed, 08/06/2008 - 06:42

Ryan, thanks for de answer and the information...

We will do what you say... we hope that isn't a issue with the HW, because this SUP720 was installed and operative on another Cat6500 chassis last month!!.

Regards,

Pedro

Actions

This Discussion