After upgrading GPDB to 5.20 or later, when querying some PXF external tables, you get the following error:
ERROR: remote component error (500) from '127.0.0.1:5888': type Exception report message javax.servlet.ServletException: java.io.FileNotFoundException: File hdfs://hdp-26-master.datalab.local:8020/tmp/pxf* does not exist. description The server encountered an internal error that prevented it from fulfilling this request. exception javax.servlet.ServletException: javax.servlet.ServletException: java.io.FileNotFoundException: File hdfs://hdp-26-master.datalab.local:8020/tmp/pxf* does not exist. (libchurl.c:946) (seg0 slice1 10.193.102.13:30563 pid=20664) (cdbdisp.c:254)
The tables are using the profile
Hdfstextmulti and wildcard
* in the location clause, for example:
External location: pxf://tmp/pxf*?PROFILE=Hdfstextmulti
If you remove the wildcard, or change the profile to
Hdfstextsimple, the query works.