We have a failover pair of loadbalancers (non-Cisco) which are connected to each other via Catalyst 6509Es with Sup720 supervisor cards. Failover is achieved by the newly active loadbalancer GARPing all its service IP addresses with the relevant MAC address in order to update nearby ARP tables (failover GARPs are fired out by the loadbalancers at a rate of 200 per second). Failing over services between these loadbalancers has been found to be problematic, with numerous services not failing over in a timely manner.
Some of the loadbalanced networks involved are routed on the Sup720s, the rest are routed on FWSM modules in the same chassis. Problems occur only with VLANs routed on the Sup720s; all VLANs routed on the FWSMs failover without issue.
Investigation has shown that this is due to a proportion of the ARP table entries not being updated in the Sup720 and (with the default 4 hour ARP table timeout) subsequently requiring a manual flush of the "wrong" ARP entries.
Testing by throwing GARPs at both the FWSM and the Sup720 has revealed the following:
we can quite happily throw ~200 GARPs per second at the FWSM and all the relevant ARP table entries are updated with the correct MAC address. This fits with the sucessful failovers for any FWSM routed networks.
however, repeating the same test with a VLAN routed on the Sup720 results in GARPs being dropped by the Sup720. The Sup720 keeps discarding a proportion of GARPs until we drop the rate of GARP to <~75 per second.
We're not rate-limiting ARP anywhere in the Sup720 - hardware or otherwise - and the FWSM handles the rate of GARP without issue. Is there any built-in restriction on the Sup720 we're not aware of that would cause the observed behaviour, is it configurable, or can anyone suggest what else could be causing the observed behaviour?
Question We run asr9001 with XR 6.1.3, and we have a very long delay to
login w/ SSH 1 or 2 to the device compare to IOS device. After
investigation, the there is 1s delay between the client KEXDH_INIT and
the server (XR) KEXDH_REPLY. After debug ssh serv...
Introduction The purpose of this document is to demonstrate the Open
Shortest Path First (OSPF) behavior when the V-bit (Virtual-link bit) is
present in a non-backbone area. The V-bit is signaled in Type-1 LSA only
if the router is the endpoint of one or ...
Hi, I am seeing quite a few issues with patch install and wanted to
share my experience and workaround to this. Login to admin via CLI, then
access root with the “shell” command Issue “df –h” and you’ll probably
see the following directory full or nearly ...