Router 7600 continiously rebooting

Unanswered Question
May 5th, 2010

Hi

I am getting the followoing loggs and my router is rebooting;

"


*** System received a Software forced crash ***

signal= 0x17, code= 0x1500, context= 0xce08d8c

PC = 0x827412c, Vector = 0x1500, SP = 0x10f23308

System Bootstrap, Version 12.2(33r)SRD5, RELEASE SOFTWARE (fc1)
Technical Support: http://www.cisco.com/techsupport
Copyright (c) 2009 by cisco Systems, Inc.
C7600-RSP720/SP platform with 1048576 Kbytes of main memory


Autoboot executing command: "boot bootdisk:"


Initializing ATA monitor library...

string is bootdisk:c7600rsp72043-advipservicesk9-mz.122-33.SRB5.bin


Initializing ATA monitor library...

Self extracting the image... [OK]

Self decompressing the image : ############################################################################################################################################################################################## [OK]


              Restricted Rights Legend

Use, duplication, or disclosure by the Government is
subject to restrictions as set forth in subparagraph
(c) of the Commercial Computer Software - Restricted
Rights clause at FAR sec. 52.227-19 and subparagraph
(c) (1) (ii) of the Rights in Technical Data and Computer
Software clause at DFARS sec. 252.227-7013.

           cisco Systems, Inc.
           170 West Tasman Drive
           San Jose, California 95134-1706

Cisco IOS Software, c7600rsp72043_sp Software (c7600rsp72043_sp-ADVIPSERVICESK9-M), Version 12.2(33)SRB5, RELEASE SOFTWARE (fc2)
Technical Support: http://www.cisco.com/techsupport
Copyright (c) 1986-2008 by Cisco Systems, Inc.
Compiled Wed 05-Nov-08 13:38 by prod_rel_team
Image text-base: 0x080000BC, data-base: 0x0C000000


Buffered messages:
Queued messages:
*May  3 07:38:21.667: %SYS-3-LOGGER_FLUSHING: System pausing to ensure console debugging output.

*May  3 07:38:21.523: %PFREDUN-6-ACTIVE: Initializing as ACTIVE processor

*May  3 07:38:21.623: %C7600_PLATFORM-0-UNKNOWN_CHASSIS: The chassis type is not known.(0xFFFF)

%Software-forced reload


07:38:22 UTC Mon May 3 2010: Unexpected exception to CPU Vector 1500, PC = 0x0827412C, LR = 0x082740CC

-Traceback= 827412C 82740CC 8550B40 8552500 88EEB98 892255C 88EF4CC 88EE6BC 82769B4 826C80C

CPU Register Context:
MSR = 0x00029200  CR  = 0x20000008  CTR = 0x08A3BEA4  XER   = 0x00000000
R0  = 0x082740CC  R1  = 0x10F23308  R2  = 0xFFFCFFFC  R3    = 0x0F519C5C
R4  = 0x00000008  R5  = 0x095E2DE8  R6  = 0x0826993C  R7    = 0x00029200
R8  = 0x00029200  R9  = 0x00000000  R10 = 0x0C050000  R11   = 0x0CDF0000
R12 = 0x000011C8  R13 = 0x04044000  R14 = 0x088EE688  R15   = 0x00000000
R16 = 0x00000000  R17 = 0x00000000  R18 = 0x00000000  R19   = 0x00000000
R20 = 0x00000000  R21 = 0x00000000  R22 = 0x00000000  R23   = 0x00000000
R24 = 0x0CDF0000  R25 = 0x0CDF0000  R26 = 0x00000000  R27   = 0x10F23570
R28 = 0x00000000  R29 = 0x00000004  R30 = 0x00000000  R31   = 0x00000000

------------------ show redundancy states ------------------


File _20100503-073822-UTC Device Error :No such device

1082 Unused bytes of context save space
*May  3 07:38:23.207: %SYS-SP-3-LOGGER_FLUSHING: System pausing to ensure console debugging output.

*May  3 07:38:21.983: %SYS-3-LOGGER_FLUSHED: System was paused for 00:00:00 to ensure console debugging output.

*May  3 07:38:22.023: scp assert failure: queue != NULL: ../const/native/scp_const.c: 889
*May  3 07:38:22.023: -Traceback= 843ED38 843F40C 88286D4 8277318 82774D8 82568A0 82699B0 8257F6C 8828758 82769B4 826C80C
*May  3 07:38:22.023: %SCHED-7-WATCH: Attempt to monitor uninitialized watched queue (address 0). -Process= "slcp process", ipl= 0, pid= 55
-Traceback= 811E7D4 811EEE0 82593F0 843F41C 88286D4 8277318 82774D8 82568A0 82699B0 8257F6C 8828758 82769B4 826C80C
*May  3 07:38:23.207: %OIR-SP-6-CONSOLE: Changing console ownership to switch processor


*** System received a Software forced crash ***

signal= 0x17, code= 0x1500, context= 0xce08d8c

PC = 0x827412c, Vector = 0x1500, SP = 0x10f23308

System Bootstrap, Version 12.2(33r)SRD5, RELEASE SOFTWARE (fc1)
Technical Support: http://www.cisco.com/techsupport
Copyright (c) 2009 by cisco Systems, Inc.
C7600-RSP720/SP platform with 1048576 Kbytes of main memory


Autoboot executing command: "boot bootdisk:"


Initializing ATA monitor library...

string is bootdisk:c7600rsp72043-advipservicesk9-mz.122-33.SRB5.bin


Initializing ATA monitor library...

"

I have a same router with same hardware configuraton in production and the same image and it s working fine, now here can the problem be is it in the RSP or in the chassis backplane.

the RAm but

tried to reseat the RAM but still problem is there.

Any help is appreciated.

Thanks

I have this problem too.
0 votes
  • 1
  • 2
  • 3
  • 4
  • 5
Overall Rating: 5 (2 ratings)
Loading.
Leo Laohoo Wed, 05/05/2010 - 02:06

Power off the router for about 10-15 seconds and power it back up.  Make sure your flash has available free space or the crashinfo directory will fill it all up.

When you have successfully boot up your router, upgrade your IOS.  For obvious reasons, you've hit a bug and it's got nothing to do with your config.

rupam_chakra1983 Wed, 05/05/2010 - 02:13

Hi

Actually this is a new router that just been shipped , and from day one it is not booting and condition is same.

there is no crash info file generated in the flash:............

do u expect a bug , but the same hardware with image is running well in production.........

How to analyze the PC in the traceback generated.

spremkumar Wed, 05/05/2010 - 02:58

hi rupam

Have you tried downgrading or upgrading to the next version of IOS ? also have checked the compact flash card installed in the router?

From the error message its evident that its caused by the software.

regds

rupam_chakra1983 Wed, 05/05/2010 - 03:46

Hi

Downgrading i have not tried because ., same image is running on other rouyer and have copied the same image from that router which i am uploaded in the faulty router via tftpd.

The falsh is ok and image is fine.

Can you plz let me know how to analyze the rpogram counter to find out the cause of the problem.

spremkumar Wed, 05/05/2010 - 03:59

Hi Rupam

Are you sure that both the chassis are of same model/series?

May  3 07:38:21.623: %C7600_PLATFORM-0-UNKNOWN_CHASSIS: The chassis type is not known.(0xFFFF)

one of the messages after the ios is loaded couldnt ascertain the chassis type itself.
Any differences between the 2 devices in comparison at present there in you environment.
regds
rupam_chakra1983 Wed, 05/05/2010 - 04:19

yes , both the router are of same model

And what does tis means:

rommon 9 > k

Stack trace:

Current PC = 0x08273dcc

Frame 00: FP = 0x10f17b18    PC = 0x08273dcc

Frame 01: FP = 0x10f17b28    PC = 0x0846cfbc

Frame 02: FP = 0x10f17b48    PC = 0x0854f754

Frame 03: FP = 0x10f17d78    PC = 0x08551114

Frame 04: FP = 0x10f17d98    PC = 0x088ed49c

Frame 05: FP = 0x10f17db0    PC = 0x08920e7c

Frame 06: FP = 0x10f17dd8    PC = 0x088eddd0

Frame 07: FP = 0x10f17de0    PC = 0x088ecfc0

Frame 08: FP = 0x10f17de8    PC = 0x08276654

Frame 09: FP = 0x10f17df0    PC = 0x0826c4ac

Suspect bogus FP = 0x00000000, aborting

rommon 10 > context

CPU context of the most recent exception:

R0    = 0x08273d6c R1    = 0x10f17b18 R2     = 0xfffcfffc R3     = 0x0f50e5bc

R4    = 0x00000008 R5    = 0x095dee60 R6     = 0x082695dc R7     = 0x00029200

R8    = 0x00029200 R9    = 0x00000000 R10    = 0x0c050000 R11    = 0x0cdf0000

R12   = 0x000011c8 R13   = 0x04044000 R14    = 0x088ecf8c R15    = 0x00000000

R16   = 0x00000000 R17   = 0x00000000 R18    = 0x00000000 R19    = 0x00000000

R20   = 0x00000000 R21   = 0x00000000 R22    = 0x00000000 R23    = 0x00000000

R24   = 0x0cdf0000 R25   = 0x0cdf0000 R26    = 0x00000000 R27    = 0x10f17d80

R28   = 0x00000000 R29   = 0x00000004 R30    = 0x00000000 R31    = 0x00000000

CR    = 0x20000008 LR    = 0x08273d6c CTR    = 0x08a39cdc XER    = 0x00000000

TBU   = 0x00000000 TBL   = 0x0dac576a DEAR   = 0x00000000 IVPR   = 0x00000000

PVR   = 0x80210020 DBCR0 = 0x41000000 DBCR1  = 0x00000000 DBCR2  = 0x00000000

IAC1  = 0x00000000 IAC2  = 0x00000000 DAC1   = 0x00000000 DAC2   = 0x00000000

CSRR0 = 0x00000000 CSRR1 = 0x00000000 MCSRR0 = 0x00000000 MCSRR1 = 0x00000000

PC    = 0x08273dcc MSR   = 0x00029200

Leo Laohoo Wed, 05/05/2010 - 05:25

12.2(33)SRB5 is a very suspicious IOS.  I believe you can't boot this unless you use the minimum 12.2(33)SRC or later.

rupam_chakra1983 Wed, 05/05/2010 - 05:37

Hi

Infact tried to install SRB4 which is running successfully on some other router, same is not booting with current router..........

burleyman Wed, 05/05/2010 - 05:35

This issue is a little similar to an issue I had with 6 new Catalyst 4506 switches we got for an office upgrade. They were all shipped with the same IOS on all 6, all modules and supervisors were the same. I was able to get 3 of the 6 going with no problem, I configured them and had them in place and passing traffic, no errors or issues. I got to the 4th switch and it kept rebooting on me and saying....Unsupported chassis type 65535, system can not boot..... So I thought bad Supervisor so we had used another Supervisor from the other switch that I had not deployed yet and same thing. We opened a TAC case and they looked at it and said they shipped those with the wrong code version and we needed to change it. I responded, I have three already deployed and had no problems, they really did not have an answer but told me I should upgrade those as well. I did the IOS upgrade and all was good and has been flawless for over two years.

What I think is, that while the switches have all the same parts, the parts themselves may have a chip or something from and newer lot of components and while some may work with one IOS another may not. I would try and update the IOS on one of the problem routers and see if that helps.

Mike

rupam_chakra1983 Wed, 05/05/2010 - 06:09

Hi

Thanks for your input.

What i ahve already deed ,i copied image from running router and put the same image in the problematic one but still it doen't works.

What else can be done , i dongraded it from SB5 to SB4.

Jerry Ye Wed, 05/05/2010 - 07:06

Which flash are you booting from? sup-bootdisk: (internal) or disk0: (external)? Can you find a known good compact flash (CF) and format it with a 7600 and boot the problem router with that external CF?

You can force it to boot from disk0: while you are in the rommon.

Regards,

jerry

Jerry Ye Wed, 05/05/2010 - 07:14

Have your re-format the CF on disk0: and boot it off there?

Regards,

jerry

burleyman Wed, 05/05/2010 - 07:33

I would try and update the IOS to the newest and see if that work for you .... 12.2.33-SRE1

That is what I had to do when I had a simular issue. Nothing to lose.

Mike

rupam_chakra1983 Wed, 05/05/2010 - 21:46

Hi

Yes i have reformatted the flash: and also downgraded to SRB4 image which is running on some othere same model/harware router.

So image is ok, but the problem i m getting is the same.

burleyman Thu, 05/06/2010 - 04:58

I would not down grade but try to upgrade to the very newest and see if that helps. Just because the IOS is running on other hardware does not mean that it will run on the problem hardware as it could have hit a bug. I had switches all the same hardware and shipped to me with the same IOS on them all. when I fired them up only some worked and the others kept rebooting and saying unknown chassis type. I open a TAC case and they told me that they shipped with the wrong IOS and I needed to upgrade to the latest IOS and that solved my problem. I did not see by your posts that you tried the latest IOS so I am not sure if you have already.

Mike

Leo Laohoo Thu, 05/06/2010 - 15:31

I agree with Mike.  You have a Traceback.  Experience and sleepless nights have taught me that this could go two ways:  You either have a hardware fault or ran over an IOS bug.  The lesser-of-two-evils dictate that you tackle the possibility of an IOS bug.  "Majority wins does not always mean they are correct" means that just because one (or more) machine successfully running this version means there's no problem with the firmware.

Like Mike said, you have nothing to loose with upgrading the firmware to the latest-n-greatest.  Or you can spend hours/days/weeks waiting for an answer you may like.

vvasisth Thu, 05/06/2010 - 05:20

you are seying traceback and that needs to be decoded.

i would recommend you to open a TAC case

rupam_chakra1983 Mon, 05/10/2010 - 02:27

Hi

I have replaced the RSP but still same problem is there tried to change the slot from 5 to 6 but still  problem not resolved.

can it be a backplane issue..

Also i tried to upload to image above 12.2 SRC ......Same problem

Actions

This Discussion