Custom monitoring processes are not getting properly monitored on Aria Operations
search cancel

Custom monitoring processes are not getting properly monitored on Aria Operations

book

Article ID: 407670

calendar_today

Updated On:

Products

VCF Operations/Automation (formerly VMware Aria Suite)

Issue/Introduction

Monitoring a Linux process in Aria Operations with Telegraf Agent fails to update the 'AVAILABILITY|Resource Availability' metric.

This has been observed with RedHat Enterprise Linux (RHEL) 10 in the following scenario:

  • Process monitoring has been configured with 'Executable Name' filter type as per Monitor Linux Processes
  • A simple filter value such as 'sshd' is used with filter type 'Executable Name'

Environment

Aria Operations: 8.18.x

Cause

A too simple string such as 'sshd' is used with the 'Executable Name' filter type. 

In the case of RHEL 10. When an SSH session is opened against the RHEL 10 system, one or more processes with name starting with 'sshd-session: <username>...' is started. In lab environment we found that this process remained active even when the SSHD service was stopped on the RHEL 10 server. The 'Executable Name' filter will look for any process containing the string 'sshd', and if other processes contains this string, the 'AVAILABILITY|Resource Availability' metric will not change, due to additional processes containing the string 'sshd'.

It was observed that the 'Tags:Processes|process_name' property on the process monitoring object changed from sshd to sshd-session. This is expected, as the 'Executable Name' filter type looks for any processes containing the string 'sshd', and returns a running process.

Resolution

Instead of using 'Executable Name' filter type in cases where you may have multiple processes containing the search string, use 'PID File' or 'Regex Pattern' filter types.

 

The PID (Process ID) file is normally found under /var/run/<process_name>.pid for system processes. Consult documentation for your application or operating system if process/application/service use PID file.

As an example for SSHD service on RHEL 10, you can use the following filter value for PID File:

/var/run/sshd.pid

 

Regex pattern is another alternative. Please consult with Regex documentation for more advanced regex searches.

As an example for SSHD service on RHEL 10, you can use the following filter value for Regex Pattern:

/usr/sbin/sshd*