Saving Word documents that contain size 1 text to PDF adds extra spaces, which may prevent DLP from matching

Article ID: 403516

Updated On:

Products

Data Loss Prevention Network Discover
Data Loss Prevention
Data Loss Prevention Sensitive Image Recognition
Data Loss Prevention Plus Suite
Data Loss Prevention Network Protect
Data Loss Prevention Network Prevent for Web Virtual Appliance
Data Loss Prevention Network Prevent for Email Virtual Appliance
Data Loss Prevention Network Prevent for Email
Data Loss Prevention Network Monitor and Prevent for Email and Web
Data Loss Prevention Network Monitor and Prevent for Web
Data Loss Prevention Network Monitor and Prevent for Email
Data Loss Prevention Network Monitor
Data Loss Prevention Form Recognition
Data Loss Prevention for Mobile
Data Loss Prevention Enterprise Suite
Data Loss Prevention Enforce
Data Loss Prevention Endpoint Prevent
Data Loss Prevention Endpoint Discover
Data Loss Prevention Discover Suite
Data Loss Prevention Data Access Governance
Data Loss Prevention Core Package
Data Loss Prevention Cloud Storage
Data Loss Prevention Cloud Service for Email
Data Loss Prevention Cloud Service for Discovery/Connector
Data Loss Prevention Cloud Prevent for Microsoft Office 365
Data Loss Prevention Cloud Package
Data Loss Prevention Cloud Detection Service for REST
Data Loss Prevention Cloud Detection Service for ICAP
Data Loss Prevention Cloud Detection Service for Endpoint
Data Loss Prevention API Detection
Data Loss Prevention API Detection for Developer Apps Virtual Appliance
Data Loss Prevention Cloud Detection Service
Data Loss Prevention Oracle Standard Edition 2

Issue/Introduction

When a Word document that contains size 1 text is saved in PDF format, the saving process adds space characters to the size 1 text. This may cause DLP to miss text-based conditions, because DLP looks for exact matches and "sees" the text and the space characters exactly as they appear in the file.

The screenshot below shows the saved PDF file, containing the size 1 font, at 1,000% zoom. The spaces that were introduced into the text are visible. The original text in the Word document was "bananamonkey", with no spaces.

 

Environment

Any DLP version

Cause

Saving the Microsoft Word document in PDF format causes extra spaces to be added to size 1 text. As a result, DLP policies may not match the text that now contains the extra spaces.

Resolution

Because the issue is caused by a third-party product, and DLP is working as designed by detecting the text as it appears, there is no specific resolution to this issue. However, one possible workaround to explore is adding a regex condition to the policy that accounts for the introduced space characters.
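
As an illustration of the regex idea, the following Python sketch builds a pattern that allows optional whitespace between every character of a keyword. The function name tolerant_pattern is hypothetical, and the sketch uses Python's re module only to demonstrate the pattern. It is an assumption that the DLP regex condition accepts the same \s* syntax, so verify the resulting pattern against the product before relying on it.

```python
import re


def tolerant_pattern(keyword: str) -> str:
    """Build a regex that matches `keyword` even when whitespace
    has been inserted between any of its characters.

    Hypothetical helper for illustration; not part of DLP.
    """
    # Escape each character so regex metacharacters in the keyword are
    # treated literally, then allow zero or more whitespace characters
    # between neighbouring characters.
    return r"\s*".join(re.escape(ch) for ch in keyword)


pattern = tolerant_pattern("bananamonkey")

# Text as written in Word, and text as it might appear after PDF export.
samples = [
    "bananamonkey",
    "ban ana mon key",
    "b a n a n a m o n k e y",
    "banana monk",  # incomplete keyword, should not match
]

for text in samples:
    matched = re.search(pattern, text) is not None
    print(f"{text!r}: {'match' if matched else 'no match'}")
```

A pattern this permissive also matches the keyword when it is legitimately written as separate words (for example "banana monkey"), so it may increase false positives. Limiting the tolerance to the specific keywords at risk is likely safer than applying it broadly.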