HTML and XML files may be deleted by the DLP Endpoint Agent during detection

book

Article ID: 242100

calendar_today

Updated On:

Products

Data Loss Prevention Endpoint Prevent

Issue/Introduction

HTML and XML files encoded as UTF-16 LE may be deleted by the DLP Endpoint Agent during detection.

Cause

The issue is caused by an optimization on Windows for handling UTF-16 LE (Little Endian) encoded files during content extraction.

Environment

Affects the following configurations:

  • Windows OS
  • DLP Endpoint Agent 15.8
  • HTML or XML file types, encoded as UTF-16 LE.
    • This is uncommon; these files are typically encoded as UTF-8
  • Advanced Agent Setting - Detection.MARKUP_AS_TEXT - set to ON (Default is OFF)

Resolution

Workaround

Set the Detection.MARKUP_AS_TEXT Advanced Agent Setting back to the default value of OFF.

Fix

A fix for this issue is planned for the 15.8 MP3 release of the DLP Endpoint Agent.