While setting up a 10-node HSFS cluster, it appears that only 3 nodes participate in a scan (1 data node and 2 workers). The rest seem to be idle while the scan is in progress. All workers operate correctly and participate when tested individually.
The HSFS cluster divides its workload by folder, not by number of files or data size.
Balancing the files across folders resulted in faster scanning.
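
One way to plan such a rebalancing is sketched below. This is only an illustration, not part of HSFS: the function name `plan_balanced_folders`, the choice of balancing by total bytes, and the folder count are all assumptions. It uses a simple greedy rule (largest file first, always into the currently lightest folder) and only returns a plan; it does not move any files.

```python
import heapq
from typing import Dict, List, Tuple


def plan_balanced_folders(file_sizes: Dict[str, int], folder_count: int) -> List[List[str]]:
    """Assign files to folder_count folders so the total bytes per folder are roughly even.

    Hypothetical helper: file_sizes maps a file name to its size in bytes.
    Returns one list of file names per target folder.
    """
    if folder_count < 1:
        raise ValueError("folder_count must be at least 1")

    # Min-heap of (total_bytes_so_far, folder_index), so the lightest folder pops first.
    heap: List[Tuple[int, int]] = [(0, i) for i in range(folder_count)]
    heapq.heapify(heap)
    folders: List[List[str]] = [[] for _ in range(folder_count)]

    # Place the largest files first; ties are broken by name for a stable result.
    for name, size in sorted(file_sizes.items(), key=lambda kv: (-kv[1], kv[0])):
        total, idx = heapq.heappop(heap)
        folders[idx].append(name)
        heapq.heappush(heap, (total + size, idx))

    return folders


if __name__ == "__main__":
    # Made-up sizes, purely for illustration.
    sizes = {"a.bin": 50, "b.bin": 30, "c.bin": 20, "d.bin": 20, "e.bin": 10}
    for i, names in enumerate(plan_balanced_folders(sizes, 2)):
        print(f"folder_{i}: {names}")
```

Since the workload is split per folder, the number of target folders matters as much as their size; choosing at least as many folders as there are workers is an assumption worth testing against the actual cluster.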