HTML and XML files may be deleted by the DLP Endpoint Agent during detection
search cancel

HTML and XML files may be deleted by the DLP Endpoint Agent during detection

book

Article ID: 242100

calendar_today

Updated On:

Products

Data Loss Prevention Endpoint Prevent

Issue/Introduction

HTML and XML files encoded as UTF-16 LE may be deleted by the DLP Endpoint Agent during detection.

Environment

Affects the following configurations:

  • Windows OS
  • DLP Endpoint Agent 15.8
  • HTML or XML file types, encoded as UTF-16 LE.
    • This is uncommon; these files are typically encoded as UTF-8
  • Advanced Agent Setting - Detection.MARKUP_AS_TEXT - set to ON (Default is OFF)

Cause

The issue is caused by an optimization on Windows for handling UTF-16 LE (Little Endian) encoded files during content extraction.

Resolution

Workaround

Set the Detection.MARKUP_AS_TEXT Advanced Agent Setting back to the default value of OFF.

Fix

A fix for this issue is planned for the 15.8 MP3 release of the DLP Endpoint Agent.