This article contains recommendations and guidelines for configuring Discover Servers to scan Microsoft SharePoint repositories efficiently.
Symantec recommends the following settings for each Discover server.
crawler.threadpoolsize = 30 (default value - found in crawler.properties file)
where crawler.threadpoolsize represents the number maximum number of crawler threads.
Note: Use the recommended value only if your setup conforms to the recommended hardware configuration in the table below.
MessageChain.NumChains = 1 * No. of CPU cores (if the cores are hyper-threaded, then 2*no. of cores)
where MessageChains.NumChain represents the number of messages in parallel that the FileReader will process.
MessageChain.CacheSize = 2 * MessageChain.NumChains
where MessageChain.CacheSize represents the size of the Detection (MessageChain) queue.
FileReader.MaxFileSystemCrawlerMemory = (crawler.threadpoolsize + MessageChain.NumChains + MessageChain.CacheSize) * FileReader.MaxFileSize
where FileReader.MaxFileSystemCrawlerMemory represents the total run-time memory for all running threads.
BoxMonitor.FileReaderMemory = 4 * FileReader.MaxFileSystemCrawlerMemory
where BoxMonitor.FileReaderMemory represents a dynamic memory pool holding all run-time data about the FileReader. This value should be less than the assigned system memory.
You can use the attached spreadsheet to calculate the recommended values for these parameters.
Note: The grid scanning feature for Microsoft SharePoint Server target is available in Symantec Data Loss Prevention from version 15.1 onwards.
Symantec recommends the following guidelines for configuring SharePoint scan targets:
Scan mode guidelines:
Be aware that:
Parameters | Recommended Configuration (Single Server scan) |
Recommended Configuration (Grid scan mode) |
---|---|---|
Number of CP cores | 16 | 16 |
RAM (GB) | 32 | 32 |
FileReaderMemory (GB) | 16 | 16 |
FileReader.MaxFileSystemCrawlerMemory (MB) | 2048 | 2048 |
CrawlerThread |
30 | 30 |
MessageChain.NumChains | 32 | 32 |
MessageChain.CacheSize | 64 | 64 |
crawler.grid.follower.queuesize | NA | 60 |
crawler.grid.queuesize.multiplier | NA | 120 |
For more information, refer to the grid scanning performance guidelines in the Symantec Data Loss Prevention 15.1 Administration Guide.