Network connectivity issues when accessing a container on from Tanzu worker
book
Article ID: 317767
calendar_today
Updated On:
Products
VMware NSXVMware Container Networking with Antrea
Issue/Introduction
NSX-T release earlier than 3.2.1.
Container cluster with Tanzu.
Container Network Interface (CNI) is Antrea.
Significant latency is introduced on specific TCP traffic sent by the worker node, this may cause timeout of TCP transaction.
There will be a high number of TCP retransmissions in the impacted TCP stream.
ESXi host's vmkernel.log logging may show logs similar to below:
2022-01-18T14:29:15.352Z cpu49:104388484)NSX_EncapDoInnerOffload:391:[nsx@6876 comp="nsx-esx" errorCode="ESX34"]Failed to checksum segment: Read only
Environment
VMware NSX-T Data Center
Cause
This issue occurs due to a blocked checksum calculation for the TCP segment, due to a read only packet buffer descriptor, which is preventing checksum offload for the inner packet.
Resolution
This issue is resolved in NSX-T Data Center 3.2.1.
Workaround:
If Antrea is to be used, you can disable Tx segmentation on the worker:
ethtool -K eth0 tx-udp_tnl-segmentation off && ethtool -K eth0 tx-udp_tnl-csum-segmentation off
If Antrea is not required, you can use Calico as the CNI.
Additional Information
Impact/Risks: Latency introduced on TCP traffic, or TCP stream may time out.