/common/logs/admin/app.log:<timestamps> UTC [NetworkStretchService_SvcThread-154, j: ########, s: ########, , TxId: ########-####-####-####-############] ERROR c.v.v.h.n.i.AbstractJobInt- InterconnectServiceJobs workflow InterconnectServiceConfigJob failed. Error: Interconnect Service Workflow GenerateAndPostConfig failed. Error: Operation timedout in state POST_CONFIG_VIX
<timestamps> UTC [NetworkStretchService_SvcThread-154, j: ########, s: ########, , TxId: ########-####-####-####-############] ERROR c.v.v.h.n.i.UnstretchNetworkJobInt- Error encountered in Unstretch network job
java.lang.RuntimeException: Interconnect Service Workflow GenerateAndPostConfig failed. Error: Operation timedout in state POST_CONFIG_VIX
HCX Manager UI, under Interconnect -> Service Mesh, when viewing appliances and clicking the "i - info" icon, you see the alarm:System state is criticalConfig engine is in systemdBad stateMemory usage is highadmin user.cclilistgo # (where # is the NE appliance ID)show system memory' to check memory.[admin@HCX-NE-R#] show system memory
MemTotal: 3075532 kB
MemFree: 75913 kB
MemAvailable: 15120 kB >>>>>>>
sshtop'Shift + M' >> To check top memory used process.

/var/log/messages.<timestamp> <Fleet-Appliance> cgw 1098 - - [Info-Tasker] : Timeout vmware-toolbox-cmd stat balloon
<timestamp> <Fleet-Appliance> cgw 1098 - - [Err-Tasker] : cmd (/usr/bin/vmware-toolbox-cmd stat balloon) done, error: Timeout
<timestamp> <Fleet-Appliance> cgw 1098 - - [Err-ops] : getBalloonStat() failed, /usr/bin/vmware-toolbox-cmd stat balloon: Timeout
<timestamp> <Fleet-Appliance> cgw 1098 - - [Warning-ops] : Memory usage is probably high (free: %3)
<timestamp> <Fleet-Appliance> cgw 1098 - - [Info-opsEvent] : new system event: SystemEvent[<timestamp>, <timestamp>, 60002, critical, Memory usage is high, map[balloon:0 MB cache:32772096 free:102031360 total:3149344768 used:3047313408]]
VMware HCX
A memory leak affecting the ndd process has been found on the NE appliance.
This causes high memory usage, and the NE appliance is unable to allocate resources, causing tasks to fail.
This issue is resolved in VMware HCX 4.11.1, available at Broadcom downloads.
If you are having difficulty finding and downloading software, please review the Download Broadcom products and software KB.
Workaround:
Config engine is in systemdBad state:
Memory usage is high AND not showing the Config engine is in systemBad state, proceed with the following workaround:
admin user.cclilistgo # (where # is the NE appliance ID)sshsystemctl stop nddsystemctl disable nddNote: After disabling the ndd service on the NE Appliance VM, there will be no impact on the system from a traffic forwarding and stability perspective. However, the Transport Analytics feature will be non-functional for those NE Appliances. On-demand bandwidth testing can be used as an alternative to the Transport Analytics feature instead.
Note: If you are running HCX 4.11.0 or below, we recommend proactively implementing Workaround 2 to prevent this issue in the future until we release a patch.
This needs to be done on both the HCX NE-I (source/Initiator) and NE-R (target/receiver) appliances.
VMware HCX 4.11.1 Release Notes, see:
Fixed Issue 3528977: Long running Network Detection Daemon (ndd) process can cause the system to run out of memory on Network Extension (NE) and Interconnect (IX) appliances.