This issue occurs due to slow disk performance, which adversely impacts the NSX controller cluster. The controller zookeeper process handles all I/O events in a single thread. If file write operations are consuming resources, controller keep-alive messages may be starved.