NSX Intelligence: Redis data directory gets filled up and roll-up doesn’t work as expected
search cancel

NSX Intelligence: Redis data directory gets filled up and roll-up doesn’t work as expected

book

Article ID: 319092

calendar_today

Updated On: 06-05-2023

Products

VMware NSX

Issue/Introduction

Symptoms:
NSX Application Platform is degraded on NSX Intelligence 3.2.0 or 3.2.0.1 after upgrading from 1.2.x versions.
- Intelligence is showing Down
- Review pod data directory utilization using below command on Redis pods: redis-master-0, redis-slave-0, redis-slave-1
kubectl exec -it -n nsxi-platform redis-slave-0 –df -kh /data
or, from NSX Manager
napp-k exec -it -n nsxi-platform redis-slave-0 –df -kh /data

When this issue is present, the output indicates the data utilization to be near 100%.


Environment

VMware NSX-T Data Center

Cause

During an upgrade from NSX Intelligence 1.2.x, a script is used to persist data from druid to Redis. Once the upgrade is complete, latestflow service takes care of maintaining up to one million flows in Redis. When latestflow tries to evict data once it reaches one million flows, it fails with "java.lang.ClassCastException". The data in Redis keeps growing, and rewrite operations get stuck due to the large data size. 

Resolution

This issue is resolved in NSX Intelligence 3.2.1 and later releases.

Workaround:
Manually delete the files from redis-master pod and delete the redis-master-0 pod. The pod will start up automatically and the slave pod will be synced to master.
1. Delete the files - appendonly.aof and dump.rdb, in redis-master-0
  kubectl exec -it -n nsxi-platform redis-master-0 -- rm /data/appendonly.aof
  kubectl exec -it -n nsxi-platform redis-master-0 -- rm /data/dump.rdb
2. Delete the redis-master-0 pod itself
  kubectl delete pod -n nsxi-platform redis-master-0

OR, to delete the files through the NSX Manager.
1. Delete the files - appendonly.aof and dump.rdb, in redis-pod
  napp-k exec -it redis-master-0 -- rm /data/appendonly.aof
  napp-k exec -it redis-master-0 -- rm /data/dump.rdb
2. Delete the redis-master-0 pod itself
  napp-k delete pod -n nsxi-platform redis-master-0


Additional Information

Impact/Risks:
NSX Application Platform and applications will not function properly and no visualization will be shown for "NOW" period in the Intelligence UI.