This article is a troubleshooting guide, with root cause analysis (RCA), for common problems encountered when configuring access to Amazon Simple Storage Service, popularly known as Amazon S3. Amazon S3 is an easy-to-use object storage service with a simple web service interface for storing and retrieving any amount of data from anywhere on the web.
Starting with Greenplum Database 4.3.8.0, the CREATE EXTERNAL TABLE command supports creating readable external tables that access files in Amazon S3. To use this feature, the s3 protocol must be configured on the Greenplum cluster. This guide covers some of the common problems seen during and after configuring the s3 protocol, and how to isolate them.
ERROR: Failed to init S3 extension, segid = 12, segnum = 24, please check your configurations and net connection (gps3ext.cpp:166) (seg12 slice1 sdw3:40000 pid=4741) (cdbdisp.c:1326) DETAIL: External table test_s3_table, file s3://s3-us-west-2.amazonaws.com/test_s3_bucket/test_s3_file.txt config=/home/gpadmin/s3/s3.conf
gpconfig -c log_min_messages -v debug
ERROR: s3_import: could not read data (gps3ext.cpp:185) (seg11 slice1 sdw5:40003 pid=9702) (cdbdisp.c:1326) DETAIL: External table test_s3_table, file s3://s3-us-west-2.amazonaws.com/test_s3_bucket/test_s3_file.txt config=/home/gpadmin/s3/s3.conf
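Both error messages above name the segment instance that raised the failure, in the form `(segN sliceN host:port pid=N)`. As a minimal sketch, assuming the error line keeps that layout, the helper below extracts the host, port, and process ID so that the investigation can start on the right segment host. The function name `segment_from_error` is hypothetical and is not part of Greenplum.

```shell
#!/bin/sh
# Sketch: print "host port pid" of the failing segment from an s3 error line.
# Assumption: the line contains "(segN sliceN host:port pid=N)" as in the
# errors quoted in this article. Reads the error text from standard input.
segment_from_error() {
    sed -n 's/.*(seg[0-9][0-9]* slice[0-9][0-9]* \([^: ]*\):\([0-9][0-9]*\) pid=\([0-9][0-9]*\)).*/\1 \2 \3/p'
}
```

For example, piping the second error above into `segment_from_error` prints `sdw5 40003 9702`, which points to the segment listening on port 40003 on host sdw5.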
If any of the above checks fails, Greenplum will not be able to perform read or write operations on Amazon S3.
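Since the init error asks you to check your configuration, one such check can be sketched as a small shell function. It assumes the s3 configuration file has a `[default]` section containing `accessid` and `secret` keys, as described in the Greenplum s3 protocol documentation. The function name `check_s3_conf` is hypothetical, and the check only confirms that the keys are present with some value, not that the credentials are valid.

```shell
#!/bin/sh
# Sketch: sanity-check an s3 protocol configuration file on the local host.
# Assumption: the file has a [default] section with accessid and secret keys.
check_s3_conf() {
    conf="$1"
    # The file must exist and be readable by the current user.
    if [ ! -r "$conf" ]; then
        echo "FAIL: $conf is missing or not readable"
        return 1
    fi
    # The [default] section header must be present.
    if ! grep -Eq '^[[:space:]]*\[default\]' "$conf"; then
        echo "FAIL: no [default] section in $conf"
        return 1
    fi
    # Each required key must appear with something after the equals sign.
    for key in accessid secret; do
        if ! grep -Eq "^[[:space:]]*${key}[[:space:]]*=[[:space:]]*[^[:space:]]" "$conf"; then
            echo "FAIL: key '$key' is missing or has no value in $conf"
            return 1
        fi
    done
    echo "OK: $conf"
}
```

A call such as `check_s3_conf /home/gpadmin/s3/s3.conf` would be run as gpadmin on each segment host, because the errors above are raised by individual segments reading the file at that path.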