NSX Edge detects pool members are DOWN due to Layer 4 connection problem
Article ID: 303197
Products
VMware NSX
Issue/Introduction
Symptoms:
- NSX Edge detects pool members are DOWN due to Layer 4 connection problem.
- In the NSX Edge log file, you see entries similar to:
2017-02-23T10:19:05+00:00 NSX-Edge-0 loadbalancer[15780]: [htvs]: [local0.alert] Server LOADBALANCER/POOLMEMBER-01 is DOWN, reason: Layer4 connection problem, info: "General socket error (No buffer space available) at initial connection step of tcp-check", check duration: 0ms. 7 active and 0 backup servers left. 617 sessions active, 0 requeued, 0 remaining in queue.
2017-02-23T10:19:06+00:00 NSX-Edge-0 loadbalancer[15780]: [htvs]: [local0.alert] Server LOADBALANCER/POOLMEMBER-02 is DOWN, reason: Layer4 connection problem, info: "General socket error (No buffer space available) at initial connection step of tcp-check", check duration: 0ms. 6 active and 0 backup servers left. 603 sessions active, 0 requeued, 0 remaining in queue.
2017-02-23T10:19:25+00:00 NSX-Edge-0 kernel[]: [htvs]: [kern.info] TCP: out of memory -- consider tuning tcp_mem
2017-02-23T10:19:27+00:00 NSX-Edge-0 loadbalancer[15780]: [htvs]: [local0.emerg] backend LOADBALANCER has no server available!
2017-02-23T10:19:40+00:00 NSX-Edge-0 kernel[]: [htvs]: [kern.info] TCP: out of memory -- consider tuning tcp_mem
2017-02-23T10:19:41+00:00 NSX-Edge-0 loadbalancer[15780]: [htvs]: [local0.notice] Server LOADBALANCER/POOLMEMBER-01 is UP, reason: Layer7 check passed, code: 0, info: "(tcp-check)", check duration: 0ms. 1 active and 0 backup servers online. 0 sessions requeued, 0 total in queue.
2017-02-23T10:19:55+00:00 NSX-Edge-0 loadbalancer[15780]: [htvs]: [local0.notice] Server LOADBALANCER/POOLMEMBER-02 is UP, reason: Layer7 check passed, code: 0, info: "(tcp-check)", check duration: 0ms. 2 active and 0 backup servers online. 0 sessions requeued, 0 total in queue.
Note: The preceding log excerpts are only examples. Date, time, and environmental variables may vary depending on your environment.
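To check whether an exported Edge log shows this pattern, a short script can search for the two messages above. This is a minimal sketch: the function name `scan_edge_log` is hypothetical, and the patterns are derived only from the example excerpts, so they may need adjusting for other log formats.

```python
import re

# Pattern derived from the example log excerpts in this article (an assumption:
# other NSX Edge versions may word these messages differently).
DOWN_RE = re.compile(
    r"Server (?P<backend>[^/\s]+)/(?P<member>\S+) is DOWN, "
    r"reason: Layer4 connection problem, "
    r'info: "General socket error \(No buffer space available\)'
)
TCP_OOM = "TCP: out of memory"


def scan_edge_log(lines):
    """Return (pool members marked DOWN with the buffer-space error,
    number of kernel 'TCP: out of memory' messages)."""
    down_members = []
    oom_count = 0
    for line in lines:
        match = DOWN_RE.search(line)
        if match:
            down_members.append(match.group("member"))
        elif TCP_OOM in line:
            oom_count += 1
    return down_members, oom_count
```

If the script reports both DOWN members and kernel out-of-memory messages in the same time window, the log is consistent with the symptoms described here.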
Cause
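This issue occurs when the NSX Edge kernel runs out of TCP socket memory, which is limited by the "net.ipv4.tcp_mem" kernel parameter. When the limit is reached, the load balancer health checks cannot open new sockets ("No buffer space available") and the pool members are marked DOWN.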
Resolution
To work around this issue:
- In NSX for vSphere 6.2.7 and later versions:
Increase the "net.ipv4.tcp_mem" kernel parameter by sending a PUT request to the REST API for the Edge sysctl parameters.
Body:
<systemControl>
<property>sysctl.net.ipv4.tcp_mem=65536 63355 62333</property>
</systemControl>
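The request can be prepared with a short script. This is a minimal sketch, not a definitive procedure: the function name `build_sysctl_request` and the host name and Edge ID in the usage note are hypothetical, and the endpoint path `/api/4.0/edges/{edgeId}/systemcontrol/config` is an assumption that should be verified against the NSX for vSphere API Guide for your version. The script only builds the request. Sending it also requires NSX Manager credentials and certificate handling, which are not shown.

```python
import urllib.request
from xml.etree import ElementTree as ET


def build_sysctl_request(nsx_manager, edge_id, properties):
    """Build (but do not send) a PUT request carrying a <systemControl> body."""
    root = ET.Element("systemControl")
    for prop in properties:
        # Each property is a "sysctl.<name>=<value>" string, as in the body above.
        ET.SubElement(root, "property").text = prop
    body = ET.tostring(root, encoding="unicode").encode("utf-8")

    # Assumed endpoint path; confirm it in the API guide for your NSX version.
    url = f"https://{nsx_manager}/api/4.0/edges/{edge_id}/systemcontrol/config"
    return urllib.request.Request(
        url,
        data=body,
        method="PUT",
        headers={"Content-Type": "application/xml"},
    )
```

For example, `build_sysctl_request("nsxmgr.example.com", "edge-1", ["sysctl.net.ipv4.tcp_mem=65536 63355 62333"])` returns a request object whose body matches the XML shown above. Note that a PUT typically replaces the existing configuration, so retrieve the current sysctl properties first and include any that must be preserved.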