gpload failed : ERROR: column "col01" named in 'DISTRIBUTED BY' clause does not exist
search cancel

gpload failed : ERROR: column "col01" named in 'DISTRIBUTED BY' clause does not exist

book

Article ID: 296884

calendar_today

Updated On:

Products

VMware Tanzu Greenplum

Issue/Introduction

When running gpload with MODE UPDATE or MERGE the following error is reported:
2023-03-17 10:18:08|INFO|gpload session started 2023-03-17 10:18:08
2023-03-17 10:18:08|INFO|started gpfdist -p 8000 -P 9000 -f "/home/gpadmin/gpload/data01.txt" -t 30 -m 32000000
2023-03-17 10:18:08|ERROR|could not run SQL "CREATE TEMP TABLE temp_staging_gpload_03361212_c4ad_11ed_90a2_0800270cdcee ("col02" integer,"col03" integer) DISTRIBUTED BY ("col01")": ERROR:  column "col01 named in 'DISTRIBUTED BY' clause does not exist
 
2023-03-17 10:18:08|INFO|rows Inserted          = 0
2023-03-17 10:18:08|INFO|rows Updated           = 0
2023-03-17 10:18:08|INFO|data formatting errors = 0
2023-03-17 10:18:08|INFO|gpload failed

The error message: "could not run SQL "CREATE TEMP TABLE temp_staging_gpload_03361212_c4ad_11ed_90a2_0800270cdcee ("col02" integer,"col03" integer) DISTRIBUTED BY ("col01")": ERROR: column "col01 named in 'DISTRIBUTED BY' clause does not exist"


Environment

Product Version: 6.23

Resolution

This is a code bug in gpload in Greenplum 6.x.
R&D are working on a fix and is expected to be released in 6.24.0