cat 4006 modules loss of conactivity

Unanswered Question
Sep 17th, 2003
User Badges:

i have big problem with switch catalyst 4006 ....with ( ws-4013 supervisor -3 modules ws-x4148-rj)

without any reasons when i put any of the modules in slot 2 or slot 3 or slot 5 they give faulty status!!!! when i put modules in slot 4 or slot 6 they

give ok status (ready)!!!


actions i made:


1-reset the cat 4006

2-erase all configurations

3-replace modules places

4-upgrade software image from cat4000.5-4-2.bin to cat4000-k8.8-1-2.bin


i can't disable or enable any of this module .

please help

  • 1
  • 2
  • 3
  • 4
  • 5
Overall Rating: 0 (0 ratings)
Loading.
mabadie Wed, 09/17/2003 - 22:16
User Badges:

thank you for your help ..

switch have two power supply in power supply 1 and power supply 2 ..

note : i replace the places of two power supplies

tsingletary Thu, 09/18/2003 - 06:52
User Badges:

I have several 4006s in my network. In each i keep three power supplies. Everytime a power supply goes bad i start having issues with the switch. long story short try to get another power supply = 3 total in your 4006.

sounds to simple i know. I hope this helps.

Bobby Thekkekandam Thu, 09/18/2003 - 07:28
User Badges:
  • Cisco Employee,

This probably isn't a power supply issue as you have two active power supplies, which is the minimum required.


But to test whether you are exceeding the power budget of the switch, try the switch with just the supervisor and a module in slot 2 or 3. (Or if you do have a spare PSU, you can insert it into the switch and plug it in. Does it come up? If not, then we can elminate power as the issue here.


Do the modules and supervisor pass POST? You can check this with the command 'show test.' I would recommend setting the diagnostic level to do a complete POST test. Use the command "set test diaglevel complete" to enable this. Then reset the switch and look at the output of show test. Do you see any F's, indicating failures.


I would check this on the modules when they are in slot 2,3, or 5 and again with the modules in slot 4 or 6. Also note the POST output for module 1. If you could post that output here as well that will be helpful.

mabadie Sat, 09/20/2003 - 23:50
User Badges:

thanks alot for your assistance .

i tried all your suggested ideas but nothing changed.

this is the output of sh test for module 5

Eeprom: .

CX1000 Regs:

Ports 1-8 : F Ports 9-16 : F Ports 17-24 : F

Ports 25-32 : F Ports 33-40 : F Ports 41-48 : F

CX1000 Sram:

Ports 1-8 : F Ports 9-16 : F Ports 17-24 : F

Ports 25-32 : F Ports 33-40 : F Ports 41-48 : F


10/100Base-TX Loopback Status:

Ports 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24


-----------------------------------------------------------------------


F F F F F F F F F F F F F F F F F F F F F F F F



25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48


-----------------------------------------------------------------------


F F F F F F F F F F F F F F F F F F F F F F F

thanks

mabadie Sun, 09/21/2003 - 03:29
User Badges:

i would like to show you syslogging messages:

2003 Sep 20 21:02:19 %SYS-5-MOD_OK:Module 1 is online


2003 Sep 20 21:02:31 %SYS-5-MOD_OK:Module 4 is online


Console> 2003 Sep 20 21:02:58 %SYS-4-P2_WARN: 1/Astro 5-2 - management request t

imed out.

2003 Sep 20 21:02:58 %SYS-4-P2_WARN: 1/Astro 5-3 - management request timed out.


2003 Sep 20 21:02:58 %SYS-4-P2_WARN: 1/Astro 5-1 - management request timed out.


2003 Sep 20 21:02:58 %SYS-4-P2_WARN: 1/Astro 5-6 - management request timed out.


2003 Sep 20 21:02:58 %SYS-4-P2_WARN: 1/Astro 5-4 - management request timed out.


2003 Sep 20 21:02:58 %SYS-4-P2_WARN: 1/Astro 5-5 - management request timed out.



2003 Sep 20 21:03:27 %SYS-3-MOD_FAILREASON:Module 5 failed due to diagnostics

2003 Sep 20 21:03:27 %SYS-3-MOD_FAIL:Module 5 failed to come online

Bobby Thekkekandam Sun, 09/21/2003 - 04:11
User Badges:
  • Cisco Employee,

I strongly suspect a faulty supervisor. Since you see these messages/POST failure dependant on the slot, that points to a possible SERDES or oscillator failure on the superivsor. I would recommend opening a TAC case for further troubleshooting and possible RMA.

mabadie Sun, 09/21/2003 - 05:12
User Badges:

i have found this docoument (Common CatOS Error Messages on Catalyst 4000 Series Switches):-


%SYS−4−P2_WARN: 1/Astro(mod/port)

Problem: The switch is generating %SYS−4−P2_WARN: 1/Astro(6/6).... messages.

Description: This error message indicates that the Supervisor has lost communication to a component on a

10/100 line card. The Supervisor keeps track of any time−outs associated with this communication. Since this

communication between the components and the Supervisor happens on the same path that normal data

travels, it is possible that a spanning tree loop or a broadcast storm of some sort is blocking this

communication, and causing the errors to display. Clear the counters on these particular ports and check to see

if there are any errors or any high traffic levels. If there are not, swap the line card to a different slot number,

and determine if the errors continue with the new slot number. If so, you may have a bad module. If you do

not see the error messages, move a good working module to the previous slot and see if the error messages

appears for the new module as well. Also, you may see messages appearing for more than one module or slot.

If so, you may have a bad supervisor module. Contact the TAC to troubleshoot this hardware further.

Bobby Thekkekandam Sun, 09/21/2003 - 06:09
User Badges:
  • Cisco Employee,

Yes, these management timeouts have several possible causes, including an STP loop, broadcast storm, or other data plane congestion. However in your case, The messages and module POST failures are specific to certain slots, which is more indicatve of a hardware problem than one that is software or traffic related.

Actions

This Discussion