GPTEXT frequently asked questions (FAQ)
search cancel

GPTEXT frequently asked questions (FAQ)

book

Article ID: 296245

calendar_today

Updated On:

Products

VMware Tanzu Greenplum

Issue/Introduction

Internal FAQ on GPText

Resolution

Is there a script that can quickly collect all GPTEXT logs?

The latest gpmt release has included the ability to collect GPTEXT logs, see gpmt gp_log_collector 
gpsupport in 7.x does not have the gptext option.

 

How much Solr instance(s) should I created per server?

See Determining the Number of Tanzu Greenplum Text Nodes to Deploy in the Best Practices Guide.


How can I find the configuration of GPTEXT cluster (Solr and Zookeeper)

The configuration is stored in file $MASTER_DATA_DIRECTORY/gptext.conf or $COORDINATOR_DATA_DIRECTORY/gptext.conf

Use command:

# gptext-state configs


How can I find the history of what operations have done to the indexes 

SELECT * from gptext.admin_history;


How can I find the web URL of solr instance

### Option 1 ###
# SELECT distinct(base_url) from gptext.index_status();

### Option 2 ###
for i in `cat $MASTER_DATA_DIRECTORY/gptext.conf | grep -v "^id"`; do echo $i | awk -F"," '{print "http://"$2":"$3"/solr"}'; done


Where can I find all GPTEXT UDF(User defined functions) and the syntax?

# \df gptext.*

See VMware Tanzu Greenplum Text Function Reference for the detailed explanation of each function

How to add replica of the shard 

# gptext-replica add -i [index_name] -s [shardID]

Example: 
# gptext-replica add -i mydb.public.messages -s shard3 


How to find the problematic index(in yellow or red) and the reason for it?

1. Use gptext-state to find out all problematic index (in yellow or red)

 $ gptext-state -D

2. Run gptext-state --index=[Index_Name] to check the detail 

$ gptext-state --index=gpadmin.public.comment

 

How many indexes can the user create on top?

  • There is no physical limitation of the maximum count of indexes
  • Based on the QA test, ideally, we can have 50~100 indexes per cluster
  • The more index you have, the more memory may be consumed by Slor. Check the memory usage carefully if you have many indexes in the cluster

 

How to check how much data has been loaded into the index?

See article How to check how much data has been loaded into the GPText index

 

Where can I find the definition and explanation of the options in managed-schema?

Refer to Apache Solr Reference Guide.