Absolute paths in the include/exclude filters in the SharePoint Crawler do not work

Article ID: 159915

Products

Data Loss Prevention Network Discover, Data Loss Prevention

Issue/Introduction

The include and exclude filters of the SharePoint Crawler do not accept absolute paths, that is, full URLs that contain the protocol or hostname.

Resolution

In the console, go to Manage > Discover Scanning > Discover Targets > Add SharePoint Target > Filters tab.

You can provide filters using regular expressions, or paths relative to the location of the SharePoint site.
Filters can include a site collection, site, subsite, folder, file name, or file extension.
Filters cannot include the protocol or hostname.
Path filters are not applied to attachments of an item, such as a .doc file attached to a list item.

For example:
To exclude the site https://app-1234.foo.com/sites/travel/XYZ, use the exclude filter *XYZ.
The filters https://app* and https://app-1234.foo.com/sites/travel/XYZ* do not work because they include the protocol and hostname.
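
The behavior in the example can be modeled with a short Python sketch. This is only an illustration and not the crawler's actual matching logic: it assumes a filter is compared as a glob pattern against the part of the URL that follows the hostname, and that any filter containing "://" is rejected. The function names includes_protocol, relative_path, and would_match are hypothetical, and a bare hostname without a protocol is not detected.

```python
from fnmatch import fnmatchcase
from urllib.parse import urlsplit


def includes_protocol(filter_text: str) -> bool:
    """Return True if the filter contains a protocol separator such as https://."""
    return "://" in filter_text


def relative_path(url: str) -> str:
    """Drop the protocol and hostname, keeping only the path part of the URL."""
    return urlsplit(url).path


def would_match(filter_text: str, url: str) -> bool:
    """Assumed model: glob-match the filter against the host-relative path.

    Filters that contain a protocol never match, mirroring the rule that
    filters cannot include the protocol or hostname.
    """
    if includes_protocol(filter_text):
        return False
    return fnmatchcase(relative_path(url), filter_text)


site = "https://app-1234.foo.com/sites/travel/XYZ"
for candidate in ("*XYZ", "https://app*", site + "*"):
    print(candidate, would_match(candidate, site))
```

Under these assumptions, only the relative filter *XYZ matches the site; both filters that begin with https:// are rejected before any matching takes place.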

The information above is extracted from the following TechDoc:

Using include and exclude filters