[gpadmin@mdw ~]$ ls -lrth $MASTER_DATA_DIRECTORY/pg_log | tail
-rw-------. 1 gpadmin gpadmin 1.1G Jan 12 18:52 gpdb-2021-01-12_142811.csv
-rw-------. 1 gpadmin gpadmin 1.1G Jan 12 18:52 gpdb-2021-01-12_142856.csv
-rw-------. 1 gpadmin gpadmin 1.1G Jan 12 18:52 gpdb-2021-01-12_142943.csv
-rw-------. 1 gpadmin gpadmin 1.1G Jan 12 18:52 gpdb-2021-01-12_143032.csv
-rw-------. 1 gpadmin gpadmin 1.1G Jan 12 18:52 gpdb-2021-01-12_143121.csv
-rw-------. 1 gpadmin gpadmin 1.1G Jan 12 18:53 gpdb-2021-01-12_143212.csv
-rw-------. 1 gpadmin gpadmin 1.1G Jan 12 18:53 gpdb-2021-01-12_143317.csv
-rw-------. 1 gpadmin gpadmin 1.1G Jan 12 18:53 gpdb-2021-01-12_143432.csv
-rw-------. 1 gpadmin gpadmin 1.1G Jan 12 18:53 gpdb-2021-01-12_143545.csv
-rw-------. 1 gpadmin gpadmin 1.1G Jan 12 18:53 gpdb-2021-01-12_143701.csv
[gpadmin@mdw ~]$
[gpadmin@mdw pg_log]$ for I in *.csv ; do echo $I; grep ',con' $I | cut -d ',' -f17 | sort | uniq -c | sort -nr; grep ',con' $I | cut -d ',' -f17,18,19 | sort | uniq -c | sort -nr | head; done
gpdb-2021-01-12_080045.csv
13847 "LOG"
5 "WARNING"
4 "ERROR"
7527 "LOG","00000","HashJoin: Too many batches computed: nbatch=67108864. gp_workfile_limit_files_per_query=100000
1803 "LOG","00000","HashJoin: Too many batches computed: nbatch=2097152. gp_workfile_limit_files_per_query=100000
1012 "LOG","00000","HashJoin: Too many batches computed: nbatch=524288. gp_workfile_limit_files_per_query=100000
727 "LOG","00000","HashJoin: Too many batches computed: nbatch=262144. gp_workfile_limit_files_per_query=100000
579 "LOG","00000","HashJoin: Too many batches computed: nbatch=4194304. gp_workfile_limit_files_per_query=100000
397 "LOG","00000","HashJoin: Too many batches computed: nbatch=65536. gp_workfile_limit_files_per_query=100000
316 "LOG","00000","HashJoin: Too many batches computed: nbatch=1048576. gp_workfile_limit_files_per_query=100000
258 "LOG","00000","HashJoin: Too many batches computed: nbatch=8388608. gp_workfile_limit_files_per_query=100000
251 "LOG","00000","HashJoin: Too many batches computed: nbatch=33554432. gp_workfile_limit_files_per_query=100000
230 "LOG","00000","HashJoin: Too many batches computed: nbatch=131072. gp_workfile_limit_files_per_query=100000
gpdb-2021-01-12_080322.csv
14452 "LOG"
9207 "LOG","00000","HashJoin: Too many batches computed: nbatch=67108864. gp_workfile_limit_files_per_query=100000
1009 "LOG","00000","HashJoin: Too many batches computed: nbatch=4194304. gp_workfile_limit_files_per_query=100000
905 "LOG","00000","HashJoin: Too many batches computed: nbatch=2097152. gp_workfile_limit_files_per_query=100000
805 "LOG","00000","HashJoin: Too many batches computed: nbatch=262144. gp_workfile_limit_files_per_query=100000
729 "LOG","00000","HashJoin: Too many batches computed: nbatch=524288. gp_workfile_limit_files_per_query=100000
389 "LOG","00000","HashJoin: Too many batches computed: nbatch=1048576. gp_workfile_limit_files_per_query=100000
363 "LOG","00000","HashJoin: Too many batches computed: nbatch=33554432. gp_workfile_limit_files_per_query=100000
267 "LOG","00000","HashJoin: Too many batches computed: nbatch=65536. gp_workfile_limit_files_per_query=100000
211 "LOG","00000","HashJoin: Too many batches computed: nbatch=131072. gp_workfile_limit_files_per_query=100000
155 "LOG","00000","HashJoin: Too many batches computed: nbatch=8388608. gp_workfile_limit_files_per_query=100000
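To see at a glance how much of each log file these entries account for, the per-file breakdown above can be reduced to two counts. The following is a minimal sketch, not part of the original procedure: the function name `count_hashjoin_noise` is hypothetical, and it assumes the same `,con` filter used in the loop above is a reasonable way to select session log lines.

```shell
# Hypothetical helper (not a Greenplum utility): for one CSV log file,
# print the number of session log lines and how many of them are
# "HashJoin: Too many batches computed" entries.
count_hashjoin_noise() {
    chn_file="$1"
    # Lines that carry a session id (same filter as the loop above).
    chn_total=$(grep -c ',con' "$chn_file")
    # Of those, the lines produced by the HashJoin batch message.
    chn_noisy=$(grep ',con' "$chn_file" | grep -c 'HashJoin: Too many batches computed')
    echo "$chn_file total=$chn_total hashjoin=$chn_noisy"
}
```

It could be run from the pg_log directory as `for I in *.csv ; do count_hashjoin_noise "$I"; done`. A file in which the two counts are close to each other is dominated by these entries.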
The logging issue is fixed in GPDB v6.10.0. To resolve this issue, we recommend upgrading to the latest GPDB v6 release.
For more information on upgrading to the latest GPDB 6 release as of January 2022, refer to the VMware Tanzu Greenplum 6.x Release Notes.
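Whether a given cluster already includes the fix comes down to comparing its GPDB version against 6.10.0. The sketch below is an illustration only: `version_at_least` is a hypothetical helper, it relies on the `-V` (version sort) option of GNU `sort`, and it expects a bare version string such as `6.9.1` that you have extracted yourself (for example from the output of `SELECT version();`). It is meant for comparing GPDB 6.x versions.

```shell
# Hypothetical helper: succeeds (exit status 0) when version $1 is
# greater than or equal to version $2. Assumes GNU sort with -V.
version_at_least() {
    # After a version sort, the smaller version comes first; if that is
    # the required minimum ($2), then $1 is at least $2.
    [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1)" = "$2" ]
}

# Example with an assumed version string; replace it with your own.
if version_at_least "6.9.1" "6.10.0"; then
    echo "fix included"
else
    echo "upgrade needed"
fi
```

A plain string comparison would rank 6.9.1 above 6.10.0, which is why a version-aware sort is used here.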
GPDB v6.10.0 code fix summary
The `ExecChooseHashTableSize()` function is called by the query planner when it calculates the cost of join paths.
Before the fix, `ExecChooseHashTableSize()` logged its calculations to the GPDB master logs as "LOG","00000","HashJoin: Too many batches computed" entries.
This could greatly increase the size of the GPDB master logs.
The fix makes these entries visible in the GPDB logs only when 'DEBUG1' logging is set.
Note: With default logging, these entries are no longer captured.