If the email address data identifier is used as required field in a EMDI profile and the email IDs in the file used to create the EMDI profile do not fully match the regex pattern in the Email Address Data Identifier, the profile creation fails. Enforce localhost logs shows below error:
SEVERE [com.vontu.profiles.manager.database.DataSourceIndexCreator] Indexing error occurred while creating exact data profile "Test EMDI_Customer1". Too many invalid patterns
DLP 16.0 +
The email addresses in the CSV file include the domain abc.com.my. The current regex pattern in the email address data identifier only matches up to abc.com and does not match .my part which results in failure of indexing of the EMDI profile.
As a workaround if email addresses in the file to be indexed contain domains like abc.com.my or any other top-level domain, a new DI for email addresses can be created using the following regex pattern.
[-!#_$%&/''*+?^{}~a-zA-Z0-9]+(?:\.[-!#$%&' '*+?^_`{}~a-zA-Z0-9=]+)*@[-a-zA-Z0-9]+(\.[a-zA- Z]{2,11})+
Note: It is recommended to create a new DI using the above regex pattern instead of modifying the existing one.