EML import of Arabic emails fails Document Classification
Article ID: 107475
Products
CA Data Protection (DataMinder)
Issue/Introduction
EML files (Internet email events) that contain Arabic text fail analysis against Document Classification policy triggers when imported into Data Protection (DataMinder) r14.5.
For example, when triggering on the following sentences:
نزل السعر لايبور نزل سعر ايبور نزل سعر كايبور
by configuring a Document Classifier with Parameter 8 text:
The Data Protection code needed tightening in two areas: handling the character-set encoding when the email body is in HTML format, and handling the import failure of malformed EML files.
Resolution
FIX:RO93609 (incorporating Server_x64_14.5_HF0220 and Server_14.5_HF0219) has been released to address this issue.
After this fix is applied, malformed EML files are moved to the 'FAILED' folder during import and a log message is written. In addition, the conversion of EML text to Unicode has been corrected so that policy triggers can match the text.
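To check independently whether a given EML file decodes to the expected Arabic text, the two behaviors described above can be sketched in Python with the standard library. This is only an illustration and not the Data Protection implementation: the function names `body_to_unicode` and `import_eml`, the logger name, and the substring matching are all assumptions made for the example.

```python
import logging
import shutil
from email import message_from_bytes, policy
from pathlib import Path

log = logging.getLogger("eml_import_sketch")  # hypothetical logger name


def body_to_unicode(raw_eml: bytes) -> str:
    """Decode the HTML (or plain-text) body of an EML file to a Unicode str."""
    msg = message_from_bytes(raw_eml, policy=policy.default)
    part = msg.get_body(preferencelist=("html", "plain"))
    if part is None:
        raise ValueError("no text body found")
    # Undo the transfer encoding (base64 / quoted-printable) to get raw bytes.
    payload = part.get_payload(decode=True)
    # Use the charset declared in Content-Type; assume UTF-8 if none is given.
    charset = part.get_content_charset() or "utf-8"
    return payload.decode(charset)


def import_eml(path: Path, failed_dir: Path, phrases) -> bool:
    """Return True if any phrase occurs in the decoded body.

    Files whose body cannot be decoded are moved to failed_dir and logged.
    """
    try:
        text = body_to_unicode(path.read_bytes())
    except (ValueError, LookupError) as exc:
        # UnicodeDecodeError is a ValueError; an unknown charset is a LookupError.
        failed_dir.mkdir(parents=True, exist_ok=True)
        shutil.move(str(path), str(failed_dir / path.name))
        log.warning("Moved %s to %s: %s", path.name, failed_dir, exc)
        return False
    return any(phrase in text for phrase in phrases)
```

Calling `import_eml(Path("mail.eml"), Path("FAILED"), ["لايبور"])` on a sample file shows whether the body's declared character set yields text that a simple substring match can find. It is useful only as a sanity check on the test data, since the product's classifier applies its own matching rules.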