DLP Fails to Detect Data Hidden in XML Files within Office Documents
search cancel

DLP Fails to Detect Data Hidden in XML Files within Office Documents

book

Article ID: 397139

calendar_today

Updated On:

Products

Data Loss Prevention Network Monitor and Prevent for Email

Issue/Introduction

DLP does not detect sensitive data embedded in XML files, such as item1.xml, located in the customXML directory of Microsoft Office files (e.g., .docx, .xlsx, .pptx). This issue occurs even when keyword-based detection rules are configured, resulting in no incidents being generated for hidden data.

Cause

DLP is designed to scan the primary content and metadata of Office files, but does not include the capability to inspect embedded XML files within the customXML directory. These XML files, can contain sensitive information and are accessible when Office files are extracted (e.g., by renaming a .docx file to .zip and navigating to the customXML folder).

Resolution

A Feature Request has been created to enhance Symantec DLP’s detection capabilities.