How are IDM detections affected by the ContentExtraction.TrackedChanges setting?


Article ID: 382639


Updated On:

Products

Data Loss Prevention

Issue/Introduction

Environment

DLP 15.8 - 16.1

Resolution

Microsoft Word supports a feature called Track Changes (see "Track changes in Word" on Microsoft Support).

By default, DLP does not extract tracked changes. When the setting ContentExtraction.TrackedChanges is enabled, DLP also extracts any tracked changes present in Microsoft Word files.

As a simple example, suppose you index the Word file "DocumentA" with a 50% match threshold for IDM. By default, with ContentExtraction.TrackedChanges disabled, there will be a match if someone emails a document that matches 50% of the indexed "DocumentA".

Now suppose that someone sends a document in which 50% of the live (visible) content matches the indexed file "DocumentA", but the document also contains tracked changes. If you enable ContentExtraction.TrackedChanges, the tracked changes are extracted as well, so the total size of the extracted text goes up. This can lower the match percentage of the file being scanned (in this example, below 50%), so there will no longer be a match at the 50% threshold.
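The arithmetic behind this example can be sketched as follows. This is only an illustration, not the actual IDM matching algorithm: it assumes the match percentage is simply the matching text divided by the total extracted text, and that none of the tracked-change text matches "DocumentA". The function name and all character counts are hypothetical.

```python
def match_percentage(matched_chars: int, extracted_chars: int) -> float:
    """Percent of the extracted text that matches the indexed document.

    Simplified model for illustration only; the real IDM calculation
    may differ.
    """
    if extracted_chars <= 0:
        raise ValueError("extracted_chars must be positive")
    return 100.0 * matched_chars / extracted_chars


THRESHOLD = 50.0  # IDM match threshold used in the example

# Hypothetical sizes, chosen only to illustrate the effect.
live_text_chars = 10_000      # visible (live) document text
matched_chars = 5_000         # portion of live text matching "DocumentA"
tracked_change_chars = 2_500  # extra text extracted from tracked changes

# Setting disabled: only the live text is extracted.
without_tracked = match_percentage(matched_chars, live_text_chars)

# Setting enabled: tracked-change text is added to the extracted text.
with_tracked = match_percentage(
    matched_chars, live_text_chars + tracked_change_chars
)

print(f"TrackedChanges disabled: {without_tracked:.1f}% "
      f"(match: {without_tracked >= THRESHOLD})")
print(f"TrackedChanges enabled:  {with_tracked:.1f}% "
      f"(match: {with_tracked >= THRESHOLD})")
```

Under these assumed numbers, the same document goes from 50% (a match) to 40% (no match) once the tracked-change text is included in the extracted content.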