Skip to main content

AWS Glue Reference

Reference material for AWS Glue Publishing -- data type mappings, CLI arguments, and querying.

Data Type Mapping

Source types map automatically to Glue-compatible types.

PostgreSQL to Glue

PostgreSQL TypeGlue Type
INTEGER, INT4int
BIGINT, INT8bigint
SMALLINT, INT2smallint
NUMERIC(p,s)decimal(p,s)
REAL, FLOAT4float
DOUBLE PRECISIONdouble
VARCHAR(n), TEXTstring
DATEdate
TIMESTAMPtimestamp
BOOLEANboolean
BYTEAbinary

SQL Server to Glue

SQL Server TypeGlue Type
INTint
BIGINTbigint
SMALLINTsmallint
TINYINTtinyint
DECIMAL(p,s)decimal(p,s)
FLOATdouble
REALfloat
VARCHAR(n), NVARCHAR(n)string
DATEdate
DATETIME, DATETIME2timestamp
BITboolean
VARBINARYbinary

CLI Reference

Glue Publishing Arguments

OptionTypeDescription
--publish_target IDStringCredential ID for Glue publishing (required)
--publish_schema_pattern PATTERNStringDatabase naming pattern (default: {schema})
--publish_table_pattern PATTERNStringTable naming pattern (default: {table})
--glue_skip_existingFlagSkip existing tables instead of drop and recreate
--n_jobs NIntegerParallel workers for table creation (default: 1)

Querying Glue Tables

Amazon Athena:

SELECT * FROM lx_tpch_1.customer LIMIT 10;

AWS Redshift Spectrum:

SELECT * FROM spectrum_schema.customer LIMIT 10;

Amazon EMR (Spark):

df = spark.table("lx_tpch_1.customer")
df.show(10)

See Also

Copyright © 2026 Architecture & Performance.