Breadths of a Data Identifier

book

Article ID: 159804

calendar_today

Updated On:

Products

Data Loss Prevention Network Monitor

Issue/Introduction

I am trying to create an SSN Data Identifier.  There is a brief description of the differences, but I would like more details.  How can I find out exactly what will match when I create the values of different breadths?

Resolution

The Breadths of the Data Identifiers are set so that the matches can be more or less refined based on the needs of the company deploying.  The Data Identifier can have matches for a specific pattern, for validators and for keywords.  The Data Identifier screen has a general overview of the differences.  On that screen, there is a blue more info link.  Clicking on that link will give exact details for the patterns, validators and keywords used.


For example, the SSN Wide Data Identifier uses patterns of:

\d{3.EN_US} \d{2.EN_US} \d{4.EN_US}
\d{3.EN_US}-\d{2.EN_US}-\d{4.EN_US}
\d{3.EN_US}.\d{2.EN_US}.\d{4.EN_US}
\d{3.EN_US}/\d{2.EN_US}/\d{4.EN_US}
\d{3.EN_US}\\\d{2.EN_US}\\\d{4.EN_US}
\d{9.EN_US}

Where \d represents a digit. and the number in brackets is the number of matching characters.

The Narrow Data Identifier has fewer patterns that match:

\d{3.EN_US} \d{2.EN_US} \d{4.EN_US}
\d{3.EN_US}-\d{2.EN_US}-\d{4.EN_US}
\d{9.EN_US}

However, it also matches on keywords: social security number, ssn, ss#

This information is available for all Data Identifiers.

Note: In v11, these patterns and validators can be changed by using the Custom Data Identifier feature.