You are not getting detection for text contained within PDFs created by MultiFunction Printers

Article ID: 277772

Updated On:

Products

Data Loss Prevention Cloud Service for Email

Issue/Introduction

Multifunction printers (MFPs) have a "Scan to email" function that creates PDFs from scanned content.

You have enabled OCR for DLP and confirmed that scanning of these emails otherwise works, but DLP does not detect data contained in the PDF files that these devices generate.

Environment

DLP with OCR enabled for either on-premises or Cloud Services

Cause

Many common imaging devices, including HP LaserJet MFPs, store scanned pages as JBIG/JBIG2 images within PDF files. DLP does not support this image type for Content Extraction or OCR, so the text on those pages is never extracted for detection.
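
To confirm that a particular scanned PDF is affected, one option is to check whether it declares the JBIG2 image filter. The sketch below is a rough heuristic and not a DLP tool: it assumes the PDF stores JBIG2 images in the usual way, with the PDF filter name `/JBIG2Decode` written literally in the image stream dictionary, and it only searches the raw file bytes for that name. The function names `count_jbig2_filters`, `uses_jbig2`, and `report` are hypothetical and chosen for illustration.

```python
import re
from pathlib import Path

# /JBIG2Decode is the PDF filter name for JBIG2-compressed image streams.
# Assumption: the name appears literally in the file, which is the common case.
JBIG2_FILTER = re.compile(rb"/JBIG2Decode\b")


def count_jbig2_filters(pdf_bytes: bytes) -> int:
    """Count literal occurrences of the /JBIG2Decode filter name."""
    return len(JBIG2_FILTER.findall(pdf_bytes))


def uses_jbig2(pdf_bytes: bytes) -> bool:
    """Return True if the PDF bytes reference the JBIG2 filter at least once."""
    return count_jbig2_filters(pdf_bytes) > 0


def report(path: str) -> str:
    """Read a PDF from disk and summarize whether it references JBIG2."""
    count = count_jbig2_filters(Path(path).read_bytes())
    if count:
        return f"{path}: {count} JBIG2Decode reference(s) found"
    return f"{path}: no JBIG2Decode references found"
```

For example, `print(report("scan.pdf"))` (with `scan.pdf` standing in for a file saved from a "Scan to email" message) reports whether the file references the filter. A negative result does not prove that DLP can extract the content, since the check is purely textual.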

Resolution

A Feature Request has been created to track development of this functionality in a future release of DLP:

"PM-3176: Enable FR and OCR to support JBIG/JBIG2 images"