The customer experienced a cluster freeze for few minutes in regular intervals. Everything, including
SELECT pg_backend_pid was hanging. Upon checking the master log with
tail -f we could see that every time the system halts it is on.
2020-06-04 19:44:36.505576 WIB,,,p22629,th579106592,,,,0,,,seg-1,,,,,"DEBUG1","00000","SNMP inform/trap alerts are disabled",,,,,,,,"send_snmp_inform_or_trap","sendalert.c",231,
2020-06-04 19:46:50.729581 WIB,"prd_etl","CVMDM",p8822,th579106592,"10.24.162.17","50437",2020-06-04 17:03:55 WIB,0,con1146,cmd2,seg-1,,,,,"LOG","08P01","unexpected EOF on client connection",,,,,,,0,,"postgres.c",451,
As we can see in the example above, the system was processing queries and would suddenly freeze for over 2 minutes then resume.