./kafka-consumer-groups.sh --bootstrap-server kafka:9092 --group intelligence-nsx-config-update --describe --command-config /root/adminclient.props
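The output of the describe command above can be condensed into a single backlog figure. The following is a minimal sketch, not part of the product tooling: the helper name total_lag is hypothetical, and it assumes the standard kafka-consumer-groups.sh table layout with a header row containing a LAG column.

```shell
# Hypothetical helper: sum the LAG column of `kafka-consumer-groups.sh --describe`
# output read from stdin. Assumes a header row that names a column "LAG".
total_lag() {
  awk '
    # Until the header row is seen, look for the position of the LAG column.
    col == 0 {
      for (i = 1; i <= NF; i++) {
        if ($i == "LAG") col = i
      }
      next
    }
    # Add numeric lag values only; rows showing "-" are skipped.
    $col ~ /^[0-9]+$/ { sum += $col }
    END { print sum + 0 }
  '
}
```

To use it, pipe the describe command into the function, for example by appending `| total_lag` to the command above. A total that keeps growing between runs suggests the consumer is not keeping up.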
VMware NSX
To cope with high churn, reduce the batch size at which updates are processed and increase the time allowed per batch, using the steps below:
kafka:
  configUpdateConsumer:
    maxPollRecords: 300        # old value
    maxPollIntervalMs: 300000  # old value
Set maxPollRecords to 100 and maxPollIntervalMs to 600000 (10 minutes). Both are suggested values.
This would look like:
kafka:
  configUpdateConsumer:
    maxPollRecords: 100        # new value
    maxPollIntervalMs: 600000  # new value
Note: Use the pod name returned by kubectl get pods -n nsxi-platform | grep nsx-config.
kubectl delete pod nsx-config-XXXXX -n nsxi-platform
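The effect of the old and suggested values can be checked with quick arithmetic. This sketch assumes that maxPollRecords and maxPollIntervalMs map to the Kafka consumer settings max.poll.records and max.poll.interval.ms, and that a full batch is returned on each poll; the function name per_record_budget_ms is hypothetical.

```shell
# Average time available per record before the consumer exceeds the poll
# interval: maxPollIntervalMs / maxPollRecords (integer milliseconds).
per_record_budget_ms() {
  echo $(( $1 / $2 ))
}

per_record_budget_ms 300000 300   # old values: prints 1000
per_record_budget_ms 600000 100   # suggested values: prints 6000
```

Under these assumptions, the suggested values give each update roughly six times as long to be processed before the poll interval expires.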
Workaround:
An immediate workaround is to restart the nsx-config pod. The restart refreshes the pod's memory and cache so that it can process the next set of polled messages within the stipulated interval, max.poll.interval.ms (default is 5 minutes). However, this workaround is temporary and is not guaranteed to resolve the issue. The steps are:
# get nsx-config pod name
kubectl get pods -n nsxi-platform | grep nsx-config
Output: nsx-config-XXXXX
# delete the nsx-config pod to restart it
Note: Use the pod name from the command above.
kubectl delete pod nsx-config-XXXXX -n nsxi-platform
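The two workaround steps above can be combined so that the pod name does not have to be copied by hand. This is a minimal sketch using only the kubectl commands already shown; the function names nsx_config_pod_name and restart_nsx_config are hypothetical, and it assumes the first pod whose name starts with nsx-config is the one to restart.

```shell
# Print the name of the first pod whose name starts with "nsx-config",
# given `kubectl get pods` output on stdin.
nsx_config_pod_name() {
  awk '$1 ~ /^nsx-config/ { print $1; exit }'
}

# Look up the nsx-config pod in the nsxi-platform namespace and delete it
# so that it restarts, as described in the workaround.
restart_nsx_config() {
  pod=$(kubectl get pods -n nsxi-platform | nsx_config_pod_name)
  if [ -z "$pod" ]; then
    echo "no nsx-config pod found" >&2
    return 1
  fi
  kubectl delete pod "$pod" -n nsxi-platform
}
```

Running restart_nsx_config performs both steps. If no matching pod is found, it prints an error and deletes nothing.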