search cancel

Incident highlighting is shifted or offset when the source file contains emojis or other multi-byte characters

book

Article ID: 257515

calendar_today

Updated On:

Products

Data Loss Prevention Enterprise Suite Data Loss Prevention Data Loss Prevention Enforce

Issue/Introduction

Incident highlighting is either offset (shift-left or shift-right by one or more characters) or just garbled. 
This happens when there are emojis or other multi-byte characters in the message body.
The policy is not triggering off the emoji or multi-byte characters.
The presence of emojis or other multi-byte characters in the file or email corrupts the incident display in the console.

This screenshot shows an EDM incident that was triggered by some SSNs.
Note that the incident highlighting is shifted to include non-triggering data and only part of the SSN.
All the data is test data, no actual PII data was used in this test.

Cause

The offset highlighting is caused by differences between the encoding of the multi-byte characters and the text in the source document.

Resolution

Due to the complexity of this issue, the fix will be included in a later release.

Attachments