Version: 0.4 (Latest)

AWS Glue Reference

Reference material for AWS Glue Publishing: data type mappings, CLI arguments, and querying.

Data Type Mapping

Source types map automatically to Glue-compatible types.

CLI Arguments

Option	Type	Description
`--publish_target ID`	String	Credential ID for Glue publishing (required)
`--publish_schema_pattern PATTERN`	String	Database naming pattern (default: `{schema}`)
`--publish_table_pattern PATTERN`	String	Table naming pattern (default: `{table}`)
`--glue_skip_existing`	Flag	Skip tables that already exist instead of dropping and recreating them
`--n_jobs N`	Integer	Parallel workers for table creation (default: 1)
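
The naming patterns can be illustrated with a short sketch. It assumes the `{schema}` and `{table}` placeholders behave like Python format fields, which this reference does not confirm; `expand_pattern` and the `lx_{schema}` pattern are hypothetical names used only for illustration.

```python
def expand_pattern(pattern: str, schema: str, table: str) -> str:
    """Fill {schema} and {table} placeholders in a naming pattern.

    Assumption: placeholders are substituted like Python format fields.
    """
    return pattern.format(schema=schema, table=table)


# The documented defaults leave source names unchanged.
database = expand_pattern("{schema}", schema="tpch_1", table="customer")
table_name = expand_pattern("{table}", schema="tpch_1", table="customer")

# Hypothetical prefixed pattern, not taken from this reference.
prefixed = expand_pattern("lx_{schema}", schema="tpch_1", table="customer")

print(database, table_name, prefixed)  # tpch_1 customer lx_tpch_1
```

Under that assumption, a prefixed schema pattern would yield database names of the form `lx_tpch_1`, while the defaults publish each table under its source schema and table name.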

Querying

Amazon Athena:

SELECT * FROM lx_tpch_1.customer LIMIT 10;
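
The same Athena query could also be run programmatically. The sketch below is one possible approach, not part of the publishing tool: it assumes a boto3 Athena client is passed in, and `run_athena_query` and the S3 output location are hypothetical. It uses the boto3 calls `start_query_execution`, `get_query_execution`, and `get_query_results`.

```python
import time


def run_athena_query(client, sql, output_location, poll_seconds=1.0):
    """Run a query through an Athena client and return rows as lists of strings.

    `client` is expected to behave like boto3.client("athena").
    `output_location` is an S3 URI where Athena writes its result files.
    """
    started = client.start_query_execution(
        QueryString=sql,
        ResultConfiguration={"OutputLocation": output_location},
    )
    query_id = started["QueryExecutionId"]

    # Poll until the query reaches a terminal state.
    while True:
        execution = client.get_query_execution(QueryExecutionId=query_id)
        state = execution["QueryExecution"]["Status"]["State"]
        if state in ("SUCCEEDED", "FAILED", "CANCELLED"):
            break
        time.sleep(poll_seconds)

    if state != "SUCCEEDED":
        raise RuntimeError(f"Athena query {query_id} ended in state {state}")

    # Fetches the first page of results only, which is enough for a LIMIT 10
    # query. For SELECT statements the first row holds the column names.
    result = client.get_query_results(QueryExecutionId=query_id)
    return [
        [col.get("VarCharValue") for col in row["Data"]]
        for row in result["ResultSet"]["Rows"]
    ]
```

With boto3 installed and AWS credentials configured, a call might look like `run_athena_query(boto3.client("athena"), "SELECT * FROM lx_tpch_1.customer LIMIT 10", "s3://<your-results-bucket>/athena/")`, where the bucket is a placeholder you would replace with your own.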

Amazon Redshift Spectrum:

SELECT * FROM spectrum_schema.customer LIMIT 10;

Amazon EMR (Spark):

df = spark.table("lx_tpch_1.customer")
df.show(10)