If you have been experiencing slowness, this is why...

Proxy · Aug 13, 2014

Apparently Level 3 along with a lot of other providers have been feeling the pain of the default ipv4 TCAM values.

Link 1: http://downdetector.com/status/level3/map/

Link 2: http://www.nux.ro/archive/2014/08/512k_routes_ought_to_be_enough_for_everyone.html

Link 3: http://www.ipv4depletion.com/?p=672

mwarps · Aug 13, 2014

Linode looks like it dropped Level3 entirely. Amusing.

MrGuvernment · Aug 13, 2014

Ya started yesterday around 4am it seems. A lot of Cisco gear hitting the 512k route defaults and just dropping things.

http://www.bgpmon.net/what-caused-todays-internet-hiccup/

What caused the instability

Folks quickly started to speculate that this might be related to a known default limitation in older Cisco routers. These routers have a default limit of 512K routing entries in their TCAM memory. The BGP routing table size depends on a number of variables. The primary contributors are Internal, or local routes (more specific and VPN routes used only in the local AS) and External routes. These External routes are all the routes of everyone on the Internet, we sometimes refer to this as the global routing table of Default Free Zone (DFZ). For most networks on the Internet today the global routing table is the major contributor, while local routes vary from a few dozen to a few hundred. Only a small percentage of the networks have a few thousand Internal routes.

The size of the global routing table today (August 12 2014) is about 500k, this number marginally varies per provider. Let’s compare a few major ISP’s. The table below shows the number of IPv4 prefixes received per provider on a full feed BGP session.

The table above shows that depending on the provider and location the numbers differ slightly, but are indeed very close. It also shows that right now the number of prefixes is still several thousands under the 512,000 limit so it shouldn’t be an issue. However when we take a closer look at our BGP telemetry we see that starting at 07:48 UTC about 15,000 new prefixes were introduced into the global routing table.

Ultima99 · Aug 13, 2014

We can't even keep such important routers upgraded to handle traffic. And we thought our physical infrastructure in this country was the main victim of neglect.

Romale23 · Aug 13, 2014

it doesn't even require an upgrade you can just change the setting

Ultima99 · Aug 13, 2014

Romale23 said:
it doesn't even require an upgrade you can just change the setting

Laziness triumphs again then.

dave99 · Aug 13, 2014

Romale23 said:
it doesn't even require an upgrade you can just change the setting

although that's just a quick fix for now, it will come up again as the number of routes continues to grow.

Romale23 · Aug 13, 2014

dave99 said:
although that's just a quick fix for now, it will come up again as the number of routes continues to grow.

ya but literally talking doubling the numbers it willl go up to 1,000 right now its 512

Ehren8879 · Aug 13, 2014

The TCAM recarve requires rebooting your router, at least Cisco. That's why providers have been reluctant. I know it's why I've been putting it off.

I still have a little room tbough

polonyc2 · Aug 13, 2014

I'm on FiOS and was wondering why my home connection has been so spotty the last 2 days...YouTube videos wouldn't load fully...general slowness all around...hopefully it'll get fixed in the next few days...

http://www.zdnet.com/internet-hiccups-today-youre-not-alone-heres-why-7000032566/

Karandras · Aug 14, 2014

Hey,

We are running into this as well with our 6500/sup720-3bxl. We attempted to do a change last night, adjust the TCAM to ipv4 - 800k or 600k. The router didn't take the change and it put itself into a reboot loop until we removed the offending config.

http://www.cisco.com/c/en/us/suppor...-switches/116132-problem-catalyst6500-00.html

That article shows moving all the tables to ipv4, would that have fixed this issue or would it still run into a reboot issue? This is the error:

Last reload reason: FIB Protocol Allocation mismatch

The above article shows that when you run "mls cef maximum-routes ip 800" it will auto-allocate the remaining memory to ipv6, why would it have an allocation mismatch then?

Any help is much appreciated as we don't want to run into this in the middle of the day since our tables are 96%, trying to get a fix for this sooner rather than later...

Ehren8879 · Aug 14, 2014

What if you try something like 640 or 768?

If you have been experiencing slowness, this is why...

Proxy

Pumpkin Ghost

mwarps

Supreme [H]ardness

MrGuvernment

Fully [H]

Ultima99

Supreme [H]ardness

Romale23

Gawd

Ultima99

Supreme [H]ardness

dave99

2[H]4U

Romale23

Gawd

Ehren8879

Supreme [H]ardness

polonyc2

Fully [H]

Karandras

[H]ard|Gawd

Ehren8879

Supreme [H]ardness