VMware NSX
Required Scoping Info :
Gather the following Scoping Details
Triage Checklist
Detailed Problem statement:
Issue Timeline and Scope
Problem Start Time: [Date and Time]
Problem End Time (if applicable): [Date and Time]
Scope of issue:
Scope: Is all VIP traffic hosted on the impacted Load Balancer affected? (Yes/No)
Specific VIPs: If only specific VIPs are impacted, please list the specific VIP names/IPs.
Accessibility: Are the affected VIPs completely inaccessible or is the inaccessibility intermittent?
User Impact: Are all users attempting to use the LB impacted, or is it isolated to only a specific subnet of users?
The following basic network tests must be performed to check for reachability, ping loss, or latency issues:
Execute Ping tests to the Virtual IP (VIP):
Execute Ping tests to each Pool Member IP address
Capture the exact error message observed when accessing the services directly and via the VIP:
Access the VIP and capture the complete error message displayed on the browser.
Access each pool member directly by IP address/hostname and capture the complete error message displayed on the browser.
Capture HAR results (HTTP Archive) while reproducing the issue by following the instructions in the referenced Broadcom KB: How to generate HAR file (KB 205795).
Enable verbose logging on the NSX Edge to capture detailed flow information during the issue timeframe.
Enable DEBUG logging on the Load Balancer page in the NSX-T UI.
Enable Access Logs on the Virtual Server page in the NSX-T UI.
Crucial Note: Gather logs from the Edge BEFORE disabling debug logging, as debug logs are immediately deleted upon disabling the setting.
Revert Logging: Ensure debug logging is reverted to the default level immediately after log capture is complete.
The following deep inspection data and logs are mandatory for complete datapath analysis:
Packet Capture (PCAP): Perform a packet capture on the LB service interface on the active Edge node. Use the exclusive packet capture method defined in the following Broadcom KB: Configuring Exclusive Packet Capture for Load Balancer (KB 345763).
Application TCP Dump: Capture a TCP dump on the application hosted pool member simultaneously during the time the error is reproduced.
Stage 5: Essential historical data from VROPs :
Incase if VROPs is configured on your infrastructure Please collect the required graphs outlined in the below KB:
Troubleshooting NSX Native Load Balancer Issues using VMware Aria Operations
Edge Log Bundle: Download the Edge log bundle with core file enabled.
Manager Log Bundle: Download the NSX Manager log bundle.