NSX intelligence Appliance /var/log partition 100% full, due to kafka-authorizer "Denied Operation" events
search cancel

NSX intelligence Appliance /var/log partition 100% full, due to kafka-authorizer "Denied Operation" events

book

Article ID: 324404

calendar_today

Updated On:

Products

VMware NSX

Issue/Introduction

  • VMware NSX Intelligence appliance becomes unmanageable
  • ESXi hosts can no longer establish new connections to NSX Intelligence appliance
  • /var/log partition is 100% full
  • The size of the /var/log/kafka directory exceeds 28GB
  • Issue seen in NSX-T 3.1.0 and below. NSX Intelligence 1.2
  • /var/log/kafka-authorizer.log filled with events similar to below:
    <timestamps> UTC  INFO data-plane-kafka-request-handler-5 logger - Principal = User:UID=18163772-####-####-####-########c29,CN=VMware-NSX-Host,1.2.840.113549.1.9.1=#161b73736c2d636####36174657340766d776172#####,O=VMware\, Inc.,L=Palo Alto,ST=California,C=US is Denied Operation = Describe from host = <host> on resource = Topic:LITERAL:

Environment

VMware NSX-T Data Center 3.x
VMware NSX-T Data Center 2.x

Cause

  • Guest metric topic is not sent today as part of config msg. However, the kafka client was still creating guest metrics kafka topic object. This leads to the broker flooding the kafka-authorizer logs in Intelligence appliance.
  • The issue is seen in scale setup in a stable state after 26 days
  • Without the fix the kafka-authorizer continuously logs "Denied Operation" statements thereby flooding the logs.

Resolution

This issue is resolved in VMware NSX-T 3.1.1, available at Broadcom downloads.
If you are having difficulty finding and downloading software, please review the Download Broadcom products and software KB.

Workaround:

  • Clear up the NSX Intelligence /var/log folder with the following steps:
    • Navigate to the /var/log/kafka folder : 
      cd /var/log/kafka
    • Check the files size:
      ls -lh
    • Look for larger file (Approx. 790M). The file name starts with: kafka-authorizer.log
    • Delete the oldest files.
Sample file for deletion:
-rw-r--r-- 1 kafka kafka         0 May 15  2020 kafka-authorizer.log.2020-05-15
  • This is a short-term measure and may need to be repeated frequently depending on how quickly logs increase.