Purpose: TGW route capacity limit is reached which results in failure.
Impact: New route programing will fail which will have network connectivity issue on new networks.
Environment
VMware NSX-T Data Center
Cause
When number of routes in the route table reaches max limit, no new routes can be added.
Resolution
Steps to resolve
For 4.1.2.0 and higher
Recommended Actions:
Login to NSX manager UI in 'Networking & Security' tab. Then navigate to 'transit connect' tab.
Check if 'learned routes' page contains a route with 'Failure' status. Check if the failure is due to reaching route table limits.
Login to ESX host and run 'vmc-cli -s'. Note down the onprem table id and egress route table id.
If failure is due to route table limits, then do the following:
Check the number of route failures due to route table limits in 'learned routes' tab. This is the minimum number to which the route table limits should be increased. Lets call this threshold.
Create AWS support request to increase the VPC route table limit to a minimum of threshold value. The VPC route table can only be increased upto 1000 entries(AWS hard limit). Recommendation is to increase the limits to more than the threshold.
Once the AWS route table limits are increased, check 'learned routes' tab to find out if all the failures related to route table limits are eliminated.