This article describes how to fix an issue where GemFire takes a very long time to start up and the gfsh command "list regions" appears to hang.
This issue may be indicated by the following symptoms:
With fine-level logging enabled, you see an exception similar to the following:
[fine 2016/07/26 17:28:33.529 EDT server1.1.staging <Function Execution Processor1> tid=0x181] GemFire:service=Region,name=/TESTREGION,type=Member,member=server1.1.staging
javax.management.InstanceNotFoundException: GemFire:service=Region,name=/TESTREGION,type=Member,member=server1.1.staging
    at com.sun.jmx.interceptor.DefaultMBeanServerInterceptor.getMBean(DefaultMBeanServerInterceptor.java:1095)
The region named in the exception (/TESTREGION in the example above) is not mentioned anywhere in cache.xml and is not created programmatically.
This problem occurs when:
In the above scenario, before the restart, because the cluster configuration service is enabled, the locator distributes a jar file to all the servers. This jar file contains an XML file that GemFire generated from all the gfsh commands that were run previously. For example, if a region /TESTREGION was created using gfsh previously, and the servers are now restarted with a cache.xml that has no mention of TESTREGION, then the startup hangs.
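The mismatch described above can be checked for by comparing the region names in the two XML documents. The following is a minimal Python sketch, not a GemFire tool. It assumes that both the cluster configuration XML and cache.xml declare regions as <region name="..."> elements, as cache.xml does. The function names and the sample XML strings are hypothetical, and how you obtain the cluster configuration XML from the locator is not shown.

```python
import xml.etree.ElementTree as ET


def region_names(xml_text):
    """Return the full paths of all <region> elements, e.g. '/TESTREGION'."""
    names = set()

    def walk(element, parent_path):
        for child in element:
            # Drop any XML namespace prefix such as '{http://...}region'.
            tag = child.tag.rsplit("}", 1)[-1]
            if tag == "region" and child.get("name"):
                path = parent_path + "/" + child.get("name")
                names.add(path)
                walk(child, path)  # subregions are nested <region> elements
            else:
                walk(child, parent_path)

    walk(ET.fromstring(xml_text), "")
    return names


def regions_only_in_cluster_config(cluster_xml, cache_xml):
    """List regions that the cluster configuration defines but cache.xml does not."""
    return sorted(region_names(cluster_xml) - region_names(cache_xml))


# Hypothetical sample data: TESTREGION was created earlier through gfsh,
# so it appears only in the cluster configuration XML.
CLUSTER_CONFIG_XML = """<cache>
  <region name="TESTREGION"><region-attributes data-policy="partition"/></region>
  <region name="orders"/>
</cache>"""

CACHE_XML = """<cache>
  <region name="orders"/>
</cache>"""

print(regions_only_in_cluster_config(CLUSTER_CONFIG_XML, CACHE_XML))
```

With the sample data, the script prints ['/TESTREGION']. Any region it reports is a candidate for the mismatch that leads to the hang described above.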
Follow these steps to resolve this issue: