Aria Operations for Networks streaming collector failure alarm
search cancel

Aria Operations for Networks streaming collector failure alarm

book

Article ID: 375789

calendar_today

Updated On:

Products

VMware NSX

Issue/Introduction

Title: Alarm for infrastructure_service.vrni_streaming_failure
Event ID: infrastructure_service.vrni_streaming_failure
Alarm Description:

  • Purpose: To remind users that streaming to Aria Operations for Networks streaming collector failed.
  • Impact: Edge metrics could not be streamed to the configured Aria Operations for Networks streaming collector.

Environment

VMware NSX

Resolution

Steps to Resolve
For VCF 9.0 and higher

Recommended Action:

Check the status code in the alarm description.
Remediation based on status code:

  1. UNAUTHENTICATED:
    1. Use the following API to get the global collector(s)' setting and locate the one which 'collector_type' is VRNI_STREAMING.

      Example:

      GET https://{MP_IP}/policy/api/v1/infra/ops-global-config

      "operation_collectors": [
          {
              "username": "<user>",
              "password": "<password>",
              "root_certificate": "-----BEGIN CERTIFICATE-----<certificate contents>-----END CERTIFICATE-----",
              "collector_type": "VRNI_STREAMING",
              "collector_ip": "<IP>",
              "collector_port": 50000
          }
      ]
    2. Check if the "username" is configured correctly.
    3. Check if the "password" is configured correctly.
    4. Check if the "root_certificate" is configured correctly.
    5. If any of above fields are not configured correctly, using the following API to correct its value:

      Example:

      PATCH https://{MP_IP}/policy/api/v1/infra/ops-global-config

      "operation_collectors": [
          {
              "username": "<user>",
              "password": "<password>",
              "root_certificate": "-----BEGIN CERTIFICATE-----<certificate contents>-----END CERTIFICATE-----",
              "collector_type": "VRNI_STREAMING",
              "collector_ip": "<IP>",
              "collector_port": 50000
          }
      ]

       

  2. UNAVAILABLE or DEADLINE_EXCEED:
    1. If there is an extra firewall between Edge node and Aria Operations for Networks streaming collector. Check if there is policy denying the traffic from TN to the collector, since the traffic requires HTTPS(443). 
    2. Using the following API to get the global collector(s)' setting and locate the one which 'collector_type' is VRNI_STREAMING.

      Example:

      GET https://{MP_IP}/policy/api/v1/infra/ops-global-config

      "operation_collectors": [
          {
              "username": "<user>",
              "password": "<password>",
              "root_certificate": "-----BEGIN CERTIFICATE-----<certificate contents>-----END CERTIFICATE-----",
              "collector_type": "VRNI_STREAMING",
              "collector_ip": "<IP>",
              "collector_port": 50000
          }
      ]
    3. Check if the "collector_port" field in the API's response is same as {metrics_target_port} in the alarm description.
    4. Check if the "collector_ip" field in the API's response is same as {metrics_target_address} in the alarm description.
    5. If any of above fields are not configured correctly, using the following API to correct its value:

      Example:

      PATCH https://{MP_IP}/policy/api/v1/infra/ops-global-config

      "operation_collectors": [
          {
              "username": "<user>",
              "password": "<password>",
              "root_certificate": "-----BEGIN CERTIFICATE-----<certificate contents>-----END CERTIFICATE-----",
              "collector_type": "VRNI_STREAMING",
              "collector_ip": "<IP>",
              "collector_port": 50000
          }
      ]

Maintenance required for remediation? No