cancel
Showing results for 
Search instead for 
Did you mean: 
cancel
879
Views
5
Helpful
5
Replies

ZX Link between WS-C2960L-48TS-LL and WS-C2960L-8TS-LL won't come up

Recently got two 2960s to replace in service late model 3750s directly connected in remote locations via direct fiber. Have not been able to get the link to stay UP - they flap and go err-disable immediately. If i use the same fiber and connect the 2960 to the remote end 3750 it's fine. Swapped to different new optics and still no dice. Both 2960 are running latest Version 15.2(6)E. Seems like an IOS bug and I got a support ticket working through. Never seen a problem like this, anyone come across this problem? 

1 Accepted Solution

Accepted Solutions

Thank you for this information Stephan,

 

Usually, error-disable conditions due to "link-flap" mean that either the local switch port or remote switch port bounced a number of times, in a very short period of time. As a protection mechanism, in order to prevent network instability (in terms of STP, for example), the switch sends the port into an err-disable state to prevent any further flapping.

 

The majority of the times this points to a L1 issue, so the troubleshooting methodology would happen as follows:

- Make sure the fiber is clean and if possible clean it/replace it.

- Make sure any media distributor/patch panel in between is clean and healthy too.

- Move the connection from the local site to a different port. See if it comes up.

- Move the connection from the remote site to a different port. See if it comes up.

- Use different transceivers (same model), in the local site and then in the remote site. 

 

Once the issue is isolated to a switch (which, looks like you have done all that shown above already) per the case notes), further troubleshooting can happen at that switch.

 

Going forward, the case notes show that the issue seems to be related to bug CSCvg04687. Although I am aware the switch was upgraded to 15.2(6)E, it looks like the bug is not fully fixed there. It looks like the current TAC engineer is already in talks with our internal teams for confirmation. I would let the current TAC engineer continue to work so they can let you know if there is some other OS with the fix.

 

As an interim workaround, you may want to change the err-disable link-flap timers to be less aggressive, at least until there is feedback from that TAC engineer.

 

I apologize about not being able to help further.

 

Hope this helps.

 

Eduardo.

View solution in original post

5 Replies 5

educruz
Cisco Employee
Cisco Employee

Good day,

 

When the link goes into an err-disable state, can you provide the output of "show interface status err-disable", please?

 

Also, it would be great to have the output of "show interface Gi x/y/z" of the respective involved ports.

 

Thank you,

 

Eduardo.

 

I've only got access to one site today which is listed below. The remote site is the same, as I've gone through all the steps to show output while screen sharing with support on the phone - SR 683713560
Port Name Status Reason Err-disabled Vlans
Gi0/51 LINK-TO-WESTGATE err-disabled link-flap
NOC-access1#sh int g0/51
GigabitEthernet0/51 is down, line protocol is down (err-disabled)
Hardware is Gigabit Ethernet, address is 701f.538b.b8b3 (bia 701f.538b.b8b3)
Description: LINK-TO-WESTGATE
MTU 1522 bytes, BW 1000000 Kbit/sec, DLY 10 usec,
reliability 255/255, txload 1/255, rxload 1/255
Encapsulation ARPA, loopback not set
Keepalive not set
Auto-duplex, Auto-speed, link type is auto, media type is 1000BaseZX SFP
input flow-control is off, output flow-control is unsupported
ARP type: ARPA, ARP Timeout 04:00:00
Last input 00:01:27, output 00:01:26, output hang never
Last clearing of "show interface" counters never
Input queue: 0/75/0/0 (size/max/drops/flushes); Total output drops: 0
Queueing strategy: fifo
Output queue: 0/40 (size/max)
5 minute input rate 0 bits/sec, 0 packets/sec
5 minute output rate 0 bits/sec, 0 packets/sec
593 packets input, 40714 bytes, 0 no buffer
Received 593 broadcasts (593 multicasts)
0 runts, 0 giants, 0 throttles
0 input errors, 0 CRC, 0 frame, 0 overrun, 0 ignored
0 watchdog, 593 multicast, 0 pause input
0 input packets with dribble condition detected
11031 packets output, 8058470 bytes, 0 underruns
0 output errors, 0 collisions, 8 interface resets
0 unknown protocol drops
0 babbles, 0 late collision, 0 deferred
0 lost carrier, 0 no carrier, 0 pause output
0 output buffer failures, 0 output buffers swapped out



Thank you for this information Stephan,

 

Usually, error-disable conditions due to "link-flap" mean that either the local switch port or remote switch port bounced a number of times, in a very short period of time. As a protection mechanism, in order to prevent network instability (in terms of STP, for example), the switch sends the port into an err-disable state to prevent any further flapping.

 

The majority of the times this points to a L1 issue, so the troubleshooting methodology would happen as follows:

- Make sure the fiber is clean and if possible clean it/replace it.

- Make sure any media distributor/patch panel in between is clean and healthy too.

- Move the connection from the local site to a different port. See if it comes up.

- Move the connection from the remote site to a different port. See if it comes up.

- Use different transceivers (same model), in the local site and then in the remote site. 

 

Once the issue is isolated to a switch (which, looks like you have done all that shown above already) per the case notes), further troubleshooting can happen at that switch.

 

Going forward, the case notes show that the issue seems to be related to bug CSCvg04687. Although I am aware the switch was upgraded to 15.2(6)E, it looks like the bug is not fully fixed there. It looks like the current TAC engineer is already in talks with our internal teams for confirmation. I would let the current TAC engineer continue to work so they can let you know if there is some other OS with the fix.

 

As an interim workaround, you may want to change the err-disable link-flap timers to be less aggressive, at least until there is feedback from that TAC engineer.

 

I apologize about not being able to help further.

 

Hope this helps.

 

Eduardo.

Post the complete output to the command "sh controll e Gi0/51". 

What is the end-to-end distance of the fibre? 

If the error-disable is caused by Layer 1 issue, I don't see this evident in the output of the "sh interface Gi0/51".

TAC pointed to a documented bug behavior: CSCvg04687. My solution was to replace on of the new 2960L units with a 11 year old 3750. Basically just subscribe to the bug tracker for CSCvg04687 and wait for a fix in 15.2(6)E2. ¯\_(ツ)_/¯

Review Cisco Networking products for a $25 gift card