Besides the occasional s/w bugs we are challenged with from time to time the biggest downtime we encounter are failed line cards especially ws-x6248-rj45 and ws-x6348-rj45 (not always full HW fail, sometimes soft reset or reinsertion will recover)
There are a multitude of alerts we can receive once a card goes into a failed state (reactive) but I was wondering if anyone has developed a means of proactively checking a line cards health. The aim being to have advance warning and perform a scheduled swap out thus mitigating the impact.
I was thinking along the lines of checking the SCP counters and monitoring the number of retries. An increasing amount of SCP retries is indicative of a pending problem. This check would need to be scripted etc. (sh scp module <#>)
Others may be measuring the asicreg counters but a lot of the time these are engineering commands that only the Cisco TAC can interpret.
I was wondering if anyone has BKM's we could apply to monitor a line cards health?
We are pleased to announce availability of Beta software for 16.6.3.
16.6.3 will be the second rebuild on the 16.6 release train targeted
towards Catalyst 9500/9400/9300/3850/3650 switching platforms. We are
looking for early feedback from customers befor...
Introduction Featured Speakers Luis Espejel is the Telecommunications
Manager of IENova, an Oil & Gas company. Currently he works with Cisco
IOS® and Cisco IOS XE platforms, and NX to some extent. He has also
worked as a Senior Engineer with the Routing P...
In this session you can learn more about Layer 3 multicast and the best
practices to identify possible threats and take security measures. It
provides an overview of basic multicast, the best security practices for
use of this technology, and recommendati...