I have a stack of 7 3750s, all on the same IOS, 12.2.25 IP base. Today the entire stack stopped passing traffic and there was one switch cycling power and not booting up all the way when it got to detecting the stackwise portion of the POST tests. We could not console into any switch either.
We bypassed the suspect problem switch and then could ping and get into the stack via telnet. We saw the following entries once a minute in the log.
Mar 11 09:32:52.389 CDT: %PLATFORM_RPC-3-MSG_THROTTLED: RPC Msg Dropped by throt
tle mechanism: type 3, class 21, max_msg 8, total throttled 221
-Traceback= 8A4200 4B12F0 230BA0 230CE4 231740 231954 7599A8 753BEC
Mar 11 09:33:19.350 CDT: %PLATFORM_RPC-3-MSG_THROTTLED: RPC Msg Dropped by throt
tle mechanism: type 37, class 14, max_msg 32, total throttled 1032
-Traceback= 8A4200 4B12F0 5E1064 5E5168 5D2C38 7599A8 753BEC
I found a few bugs matching above, so am thinking of upgrading to current 12.2.50 IOS version. I was able to get the entire stack back up by unplugging power to all switches and bringing them all up fresh. Then the one suspect switch came back fine also. So it looks like something in the IOS Software with the stack management was not functioning properly anymore? Anyone seen this behavior before? We had power cycled 2 of the bottom switches in stack connected to problem switch to try to see if that helped clear things up, and the 'show ver' output still showed those switches were up for months after they came back online which also points to something gone astray with the stack part of the IOS software.