This document is a reference for unexpected memory consumption on the HCX Network Extension (NE) appliance and how to recover from it.
Symptoms:
The memory allocated to a Network Extension (NE) appliance may be exhausted at runtime, and errors such as the following may appear in the appliance log:
2023-02-21T12:11:23+00:00 HCX-NE-I1 GatewayLogs[1057]: [Warning-ops] : Memory usage is probably high (free: %4)
2023-02-22T12:25:04+00:00 HCX-NE-I1 kernel: ip: page allocation failure: order:4, mode:0x6000c0(GFP_KERNEL), nodemask=(null)
2023-02-22T12:25:04+00:00 HCX-NE-I1 kernel: ip cpuset=/ mems_allowed=0
Access to the NE appliance via CCLI/SSH may or may not be available, depending on the memory condition.
To verify current memory consumption for a given NE appliance:
Log in to the HCX Manager admin console >> ccli >> list >> go [NE_Appliance] >> ssh
root@HCX-NE-I1 [ ~ ]# cat /proc/meminfo
MemTotal: 3075532 kB
MemFree: 75913 kB
MemAvailable: 0 kB >>>>>>>
Note: If SSH is inaccessible via CCLI, run "show system memory" directly from CCLI:
admin@hcx [ ~ ]$ ccli
Welcome to HCX Central CLI
[admin@hcx] list
[admin@hcx] go 0
Switched to node 0.
[admin@HCX-NE-I1] show system memory
MemTotal: 3075532 kB
MemFree: 75913 kB
MemAvailable: 0 kB >>>>>>>
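The check above can also be done programmatically. Below is a minimal Python sketch that assumes the memory output has been captured as text, from either `cat /proc/meminfo` or `show system memory`. The function names and the 5% threshold are illustrative assumptions, not HCX-defined values.

```python
import re

def parse_meminfo(text):
    """Parse 'Key: <number> kB' lines into a dict of integers (values in kB)."""
    values = {}
    for line in text.splitlines():
        match = re.match(r"\s*(\w+):\s+(\d+)\s*kB", line)
        if match:
            values[match.group(1)] = int(match.group(2))
    return values

def memory_report(text, min_available_pct=5.0):
    """Summarize memory state; min_available_pct is an assumed threshold."""
    info = parse_meminfo(text)
    total = info["MemTotal"]
    # Fall back to MemFree if MemAvailable is not reported.
    available = info.get("MemAvailable", info.get("MemFree", 0))
    pct = 100.0 * available / total
    return {
        "total_kb": total,
        "available_kb": available,
        "available_pct": pct,
        "low": pct < min_available_pct,
    }

# Sample values taken from the output shown in this article.
SAMPLE = """MemTotal: 3075532 kB
MemFree: 75913 kB
MemAvailable: 0 kB"""

if __name__ == "__main__":
    print(memory_report(SAMPLE))
```

With the sample values, `MemAvailable` is 0 kB, so the report flags the appliance as low on memory.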
Location of appliance log:
HCX Manager: /tmp/Fleet-Appliances/Service Mesh/NE-Appliance/var/log/messages
NE appliance: /var/log/messages
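Either log file can be searched for the messages listed under Symptoms. The Python sketch below is one way to do that. It assumes the log has been copied somewhere Python can read it. The pattern names and helper functions are hypothetical, and the two patterns are taken only from the sample log lines in this article.

```python
import re

# Signatures taken from the sample log lines in the Symptoms section.
PATTERNS = {
    "high_memory_warning": re.compile(r"Memory usage is probably high"),
    "page_allocation_failure": re.compile(r"page allocation failure"),
}

def scan_log_lines(lines):
    """Return (timestamp, pattern_name) for each line matching a signature."""
    hits = []
    for line in lines:
        for name, pattern in PATTERNS.items():
            if pattern.search(line):
                # The timestamp is the first space-separated field of the line.
                timestamp = line.split(" ", 1)[0]
                hits.append((timestamp, name))
    return hits

def scan_log_file(path):
    """Scan a log file, e.g. /var/log/messages on the NE appliance."""
    with open(path, errors="replace") as handle:
        return scan_log_lines(handle)

SAMPLE_LINES = [
    "2023-02-21T12:11:23+00:00 HCX-NE-I1 GatewayLogs[1057]: [Warning-ops] : Memory usage is probably high (free: %4)",
    "2023-02-22T12:25:04+00:00 HCX-NE-I1 kernel: ip: page allocation failure: order:4, mode:0x6000c0(GFP_KERNEL), nodemask=(null)",
    "2023-02-22T12:25:04+00:00 HCX-NE-I1 kernel: ip cpuset=/ mems_allowed=0",
]

if __name__ == "__main__":
    for timestamp, name in scan_log_lines(SAMPLE_LINES):
        print(timestamp, name)
```

Matching on fixed substrings keeps the scan simple. Other memory-related kernel messages may exist and are not covered by these two patterns.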