Internal FAQ on GPText
Is there a script that can quickly collect all GPTEXT logs?
The latest gpmt release has included the ability to collect GPTEXT logs, see gpmt gp_log_collector
gpsupport in 7.x does not have the gptext option.
How much Solr instance(s) should I created per server?
See Determining the Number of Tanzu Greenplum Text Nodes to Deploy in the Best Practices Guide.
How can I find the configuration of GPTEXT cluster (Solr and Zookeeper)
The configuration is stored in file $MASTER_DATA_DIRECTORY/gptext.conf or $COORDINATOR_DATA_DIRECTORY/gptext.conf
Use command:
# gptext-state configs
How can I find the history of what operations have done to the indexes
SELECT * from gptext.admin_history;
How can I find the web URL of solr instance
### Option 1 ###
# SELECT distinct(base_url) from gptext.index_status();
### Option 2 ###
for i in `cat $MASTER_DATA_DIRECTORY/gptext.conf | grep -v "^id"`; do echo $i | awk -F"," '{print "http://"$2":"$3"/solr"}'; done
Where can I find all GPTEXT UDF(User defined functions) and the syntax?
# \df gptext.*
See VMware Tanzu Greenplum Text Function Reference for the detailed explanation of each function
How to add replica of the shard
# gptext-replica add -i [index_name] -s [shardID] Example: # gptext-replica add -i mydb.public.messages -s shard3
How to find the problematic index(in yellow or red) and the reason for it?
1. Use gptext-state to find out all problematic index (in yellow or red)
$ gptext-state -D
2. Run gptext-state --index=[Index_Name] to check the detail
$ gptext-state --index=gpadmin.public.comment
How many indexes can the user create on top?
How to check how much data has been loaded into the index?
See article How to check how much data has been loaded into the GPText index
Where can I find the definition and explanation of the options in managed-schema?
Refer to Apache Solr Reference Guide.