How to CREATE TABLE in Amazon Redshift

Galaxy Glossary

How do I use CREATE TABLE in Amazon Redshift?

CREATE TABLE defines a new, permanently stored table with chosen distribution and sort keys in Amazon Redshift.

Why is CREATE TABLE essential in Redshift?

CREATE TABLE materializes a dataset inside your Redshift cluster, letting you control column types, compression, distribution, and sort order for fast analytics.

What is the basic CREATE TABLE syntax?

The command starts with CREATE TABLE and an optional IF NOT EXISTS clause, followed by the table name and a parenthesized list of column definitions. Table-level options such as DISTKEY and SORTKEY come after the closing parenthesis.

How do I define columns correctly?

Specify column_name followed by a Redshift data type (INTEGER, VARCHAR(n), BOOLEAN, etc.). Add DEFAULT or IDENTITY to populate values automatically on load, and ENCODE to set the column's compression.
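As a rough sketch of these column options used together, the table below combines an IDENTITY column, DEFAULT values, and explicit ENCODE settings. The `public.customers` table and all of its columns are hypothetical names chosen for illustration.

```sql
-- Hypothetical table, for illustration only.
CREATE TABLE IF NOT EXISTS public.customers (
    customer_id BIGINT IDENTITY(1,1),               -- auto-generated key: seed 1, step 1
    email       VARCHAR(256) ENCODE zstd NOT NULL,  -- ZSTD suits character data
    is_active   BOOLEAN DEFAULT TRUE,               -- used when an INSERT omits the column
    signup_date DATE DEFAULT CURRENT_DATE ENCODE az64  -- AZ64 suits numeric and date/time types
);
```

IDENTITY values are unique but not guaranteed to be consecutive, so they are better treated as surrogate keys than as gap-free sequence numbers.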

When should I set DISTKEY and SORTKEY?

Use DISTKEY on columns heavily joined across large tables (e.g., customer_id). Choose SORTKEY columns frequently filtered or ordered (e.g., order_date) to speed up scans.
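The sketch below illustrates the idea with two hypothetical tables (`public.customer_profiles` and `public.customer_orders` are assumed names). Both distribute on customer_id, so matching rows sit on the same slice, and the larger table sorts on order_date.

```sql
-- Hypothetical tables: sharing a DISTKEY collocates rows that join on customer_id.
CREATE TABLE IF NOT EXISTS public.customer_profiles (
    customer_id INT NOT NULL,
    region      VARCHAR(32)
)
DISTKEY (customer_id);

CREATE TABLE IF NOT EXISTS public.customer_orders (
    order_id    BIGINT NOT NULL,
    customer_id INT NOT NULL,
    order_date  DATE NOT NULL
)
DISTKEY (customer_id)
SORTKEY (order_date);

-- The join uses the shared distribution key; the date-range filter on the
-- sort key lets Redshift skip blocks that fall outside the range.
SELECT p.region, COUNT(*) AS orders_in_2024
FROM public.customer_orders o
JOIN public.customer_profiles p ON p.customer_id = o.customer_id
WHERE o.order_date BETWEEN '2024-01-01' AND '2024-12-31'
GROUP BY p.region;
```

Specifying DISTKEY without a DISTSTYLE clause implies KEY distribution.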

Can I copy PostgreSQL DDL directly?

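Mostly yes, but add Redshift-specific options (DISTKEY, SORTKEY, ENCODE) and replace serial with IDENTITY. Also remove CREATE INDEX statements, which Redshift does not support, and remember that primary key, foreign key, and unique constraints are accepted but not enforced.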
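A before-and-after sketch may make the translation concrete. The `events` table is a hypothetical example, and the Redshift version shown is one possible translation rather than the only valid one.

```sql
-- PostgreSQL original (kept as a comment for comparison):
--   CREATE TABLE events (
--       id         serial PRIMARY KEY,
--       payload    text,
--       created_at timestamp DEFAULT now()
--   );
--   CREATE INDEX idx_events_created_at ON events (created_at);

-- One possible Redshift translation (hypothetical table):
CREATE TABLE IF NOT EXISTS public.events (
    id         INT IDENTITY(1,1),            -- replaces serial
    payload    VARCHAR(4096),                -- explicit length instead of text
    created_at TIMESTAMP DEFAULT GETDATE()   -- replaces now()
)
DISTSTYLE AUTO
SORTKEY (created_at);                        -- stands in for the dropped index on range filters
```

The VARCHAR(4096) length is an assumption; size it to the actual payloads, because Redshift treats an unqualified text column as a short VARCHAR.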

Example: Creating an Orders table

CREATE TABLE IF NOT EXISTS public.orders (
    id INT IDENTITY(1,1),
    customer_id INT NOT NULL,
    order_date DATE DEFAULT CURRENT_DATE,
    total_amount NUMERIC(10,2) ENCODE az64
)
-- Table-level keys follow the closing parenthesis of the column list.
DISTKEY(customer_id)
SORTKEY(order_date);

Best practices for CREATE TABLE

Choose compression by running ANALYZE COMPRESSION on a table loaded with representative sample data. Keep SORTKEYs to fewer than four columns. Use VARCHAR lengths just large enough for the data.
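For instance, assuming the `public.orders` table from the example above already holds representative rows, the compression check could look like this sketch:

```sql
-- Sample up to 100,000 rows and report a suggested encoding per column.
-- The command only reports; it does not change the table.
ANALYZE COMPRESSION public.orders COMPROWS 100000;

-- Limit the analysis to specific columns.
ANALYZE COMPRESSION public.orders (customer_id, total_amount);
```

ANALYZE COMPRESSION takes an exclusive lock on the table while it runs, so it is usually better run outside peak load windows.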

What are common mistakes?

Omitting distribution style, creating wide VARCHARs, forgetting BACKUP NO for transient tables, and mixing DISTKEY with EVEN distribution are typical issues.
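As an illustration of avoiding two of these mistakes, the sketch below defines a transient staging table (`public.orders_staging` and its columns are hypothetical) with BACKUP NO, modest VARCHAR lengths, and a single, consistent distribution choice.

```sql
-- Hypothetical staging table that is reloaded on every batch run.
CREATE TABLE IF NOT EXISTS public.orders_staging (
    order_id     BIGINT NOT NULL,
    customer_id  INT NOT NULL,
    status       VARCHAR(16),        -- sized to the longest expected status code
    total_amount NUMERIC(10,2)
)
BACKUP NO                            -- exclude from snapshots; the data can be reloaded
DISTKEY (customer_id);               -- implies DISTSTYLE KEY; do not also specify DISTSTYLE EVEN
```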

How to CREATE TABLE in Amazon Redshift Example Usage


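CREATE TABLE IF NOT EXISTS public.orderitems (
    id INT IDENTITY(1,1),
    order_id INT NOT NULL,
    product_id INT NOT NULL,
    -- ENCODE is a column attribute, so it belongs on the column it compresses.
    quantity SMALLINT DEFAULT 1 ENCODE zstd
)
-- Table-level keys follow the closing parenthesis of the column list.
DISTKEY(order_id)
SORTKEY(order_id, product_id);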

How to CREATE TABLE in Amazon Redshift Syntax


CREATE TABLE [IF NOT EXISTS] [schema.]table_name (
    column_name data_type [IDENTITY(seed, step)] [DEFAULT expression]
                [ENCODE encoding],
    ...
)
[BACKUP { YES | NO }]
[DISTSTYLE { AUTO | EVEN | KEY | ALL }]
[DISTKEY (dist_key_column)]
[[COMPOUND | INTERLEAVED] SORTKEY (sort_key_column [, ...])];
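To show the table-level clauses in the order the syntax lists them, here is a small sketch using a hypothetical `public.products` dimension table. It assumes the table is small enough that a full copy on every node (DISTSTYLE ALL) is affordable.

```sql
-- Hypothetical small dimension table.
CREATE TABLE IF NOT EXISTS public.products (
    product_id   INT NOT NULL,
    product_name VARCHAR(128),
    category     VARCHAR(64)
)
BACKUP YES
DISTSTYLE ALL                              -- no DISTKEY: every node holds a full copy
COMPOUND SORTKEY (category, product_id);   -- helps filters on category, then product_id
```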

Common Mistakes

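Ignoring distribution style: Redshift defaults to DISTSTYLE AUTO, and choosing EVEN for large tables that are joined often can cause costly data redistribution during joins. Fix by picking a DISTKEY on a high-cardinality join column such as customer_id.
Choosing oversized VARCHAR(65535) for every column: wastes space and slows scans. Fix by sizing VARCHAR to realistic limits. ENCODE az64 applies only to numeric and date/time types, so use an encoding such as ZSTD for character columns.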
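One way to find realistic VARCHAR limits is to measure the data that is already loaded. The sketch below assumes a hypothetical `public.customer_notes` table with a `note_text` column; the second query uses the SVV_TABLE_INFO system view to spot tables with very wide VARCHAR columns or skewed distribution.

```sql
-- Longest stored value in bytes (Redshift VARCHAR lengths are byte counts).
SELECT MAX(OCTET_LENGTH(note_text)) AS max_bytes
FROM public.customer_notes;   -- hypothetical table and column

-- Tables ranked by their widest VARCHAR column, with distribution style and row skew.
SELECT "schema", "table", max_varchar, diststyle, skew_rows
FROM svv_table_info
ORDER BY max_varchar DESC
LIMIT 20;
```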