Identifying and Reducing Backlog in Wavefront & DX OpenExplore
search cancel

Identifying and Reducing Backlog in Wavefront & DX OpenExplore

book

Article ID: 384271

calendar_today

Updated On:

Products

Observability DX OpenExplore

Issue/Introduction

Ingestion Backlog can impact Alerting and Dashboard accuracy in your observability environment. This guide provides steps to identify the presence of a backlog and strategies to mitigate it.

Out-of-the-Box Dashboards and Alerts are provided to help monitor your proxy ingestion and alert your team when data is not arriving as expected. These can be used As-Is or cloned and modified to focused on ingestion from individual segments of your business.

Resolution

Identifying Backlog - Dashboards & Charts.

Tanzu Observability Service and Proxy Data Dashboard is an out-of-the-box Dashboard that provides visibility to Ingestion.  

Broken down by sections this dashboard is used every day by Customers and Support alike to identify ingestion patterns and diagnose issues.

If you find data that is no longer needed, you can reduce that ingestion by creating filters and/or preprocessor rules.

Learn how to monitor Wavefront proxies. See Monitor Wavefront Proxies

 

 Proxies Overview Section

This section allows you to see Proxy Backlog Sizes for Points, Histograms and Spans. 

Review the "Info" section on the left for definition on the Metrics used in this section. 

For additional information on these and others internal ~proxy. metrics see Article, Monitor Wavefront Proxies Section: Proxy Internal Metrics.

 

Other charts in this section provide details on causes of backlog for example Max Burst Rate and Queuing Reasons, ect. 

 

Proxy Troubleshooting Section 

Monitor CPU/Memory Resources, Network latency, view Preprocessor Rules information that can impact performance on your Proxies.   

Review the "Info" section on the left for definition on the Metrics used in this section.

 

Ingest Rate by Source

If your combined Points-Per-Second (PPS) ingestion rate is above your Collector PPS rate the additional PPS will be "push-back" to the proxy where it will be buffered until it can be resent

Reviewing the sources that are sending data to your proxy, will allow you to identify any that are sending unexpectedly high amounts.

 


Analyze for Unwanted Metrics:

Our Developers and Technical writers have published multiple articles to help customers identify unused metrics. Here are a few for your review.


Filtering and Blocking Ingestion at the Proxy (and Operator) Level.

Using YAML Files to block unwanted data through your Container Proxies - Kubernetes, Docker, Operator Deployment methodology.

Non-containerized proxies also use preprocessor rules to allow you to block metrics at the proxy level. 

Note: Its best practice to regularly review and refine filtering rules to prevent unwanted metrics from contributing to high PPS.

Additional Information

For questions or further assistance, please contact Broadcom Support.