What is Apache Iceberg?
The Open Table Format
Iceberg brings ACID transactions, time travel, schema evolution, and hidden partitioning to data lakes on any object storage — without locking you into a single query engine.
Quick Answer
Apache Iceberg is an open table format that adds a metadata layer on top of data files (typically Parquet, also ORC or Avro) in object storage (S3, GCS, ADLS). It enables ACID transactions, time travel via snapshots, schema evolution without rewrites, and hidden partitioning — making your data lake behave like a data warehouse while keeping data in open formats that any engine can read.
What is Apache Iceberg?
Iceberg was created at Netflix in 2017 to solve the limitations of the Hive table format at scale: slow partition discovery, unsafe concurrent writes, and the inability to change schemas without rewriting data. It was open-sourced and donated to the Apache Software Foundation in 2018. Today it is the dominant open table format, with native support in Spark, Flink, Trino, DuckDB, Snowflake, BigQuery, and Redshift.
Hive Table Format
The Old Way
Partitions are directories. Schema changes rewrite data. No ACID — concurrent writers corrupt tables. No time travel. Partition pruning requires explicit WHERE clauses. Slow partition discovery on large tables.
Iceberg Table Format
The Open Standard
Metadata tree enables fast planning even on tables with millions of files. ACID via optimistic concurrency control. Schema evolution without rewrites. Time travel via snapshots. Hidden partitioning — no manual partition filters needed.
Why Iceberg Matters
Before Iceberg (Hive)
- • Concurrent writes corrupt tables — no ACID
- • Schema changes require full table rewrites
- • Queries must filter on partition columns explicitly
- • No time travel — data overwritten is gone forever
- • Partition discovery scans all directories on every query
- • Rename a column → break all downstream queries
With Iceberg
- • ACID transactions via optimistic concurrency control
- • Add, drop, rename columns — no data rewrite
- • Hidden partitioning — automatic partition pruning
- • Time travel: query any past snapshot by timestamp or ID
- • Metadata tree: fast planning with no directory listing, even on tables with millions of files
- • Works with Spark, Flink, Trino, DuckDB, Snowflake
What You Can Build with Iceberg
Iceberg is the foundation of the modern lakehouse.
Multi-Engine Lakehouse
Ingest with Flink, transform with Spark, query with Trino and DuckDB — all on the same Iceberg tables. No proprietary format lock-in.
CDC + Row-Level Updates
Apply INSERT, UPDATE, DELETE, and MERGE INTO on lake data. Iceberg's copy-on-write and merge-on-read modes handle high-frequency CDC from Debezium.
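As a sketch, a Debezium-style change batch can be applied with a single MERGE INTO; the catalog.db.events table name, the cdc_batch staging view, and its op column are illustrative assumptions:

```sql
-- Apply one CDC micro-batch (cdc_batch and its op flag are hypothetical)
MERGE INTO catalog.db.events t
USING cdc_batch s
ON t.event_id = s.event_id
WHEN MATCHED AND s.op = 'D' THEN DELETE      -- tombstones become deletes
WHEN MATCHED THEN UPDATE SET *               -- upsert changed rows
WHEN NOT MATCHED AND s.op != 'D' THEN INSERT *
```

Each MERGE commits as one atomic snapshot, so readers never see a half-applied batch.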
Audit & Compliance
Time travel provides an immutable audit trail. Query data as it existed on any past date for regulatory compliance, GDPR deletion verification, or incident forensics.
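A minimal sketch of an audit workflow, assuming a catalog.db.events table: list the snapshot log via Iceberg's metadata tables, then query the table as it existed at a chosen snapshot (the snapshot ID shown is a placeholder):

```sql
-- Find an audit point in the snapshot log (metadata table)
SELECT snapshot_id, committed_at, operation
FROM catalog.db.events.snapshots;

-- Read the table exactly as it was at that snapshot
SELECT * FROM catalog.db.events VERSION AS OF 1234567890123456789;
```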
Data Lakehouse with Catalog
Use AWS Glue, Nessie, or Polaris as a catalog. Tables are discoverable across engines and environments with consistent schema and partition metadata.
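A hedged sketch of wiring Spark to a shared catalog — the catalog name `lake`, the REST endpoint URL, and the warehouse bucket are assumptions; the class names are Iceberg's standard Spark integration:

```properties
# Spark conf registering an Iceberg REST catalog (endpoint/bucket are hypothetical)
spark.sql.catalog.lake            org.apache.iceberg.spark.SparkCatalog
spark.sql.catalog.lake.type       rest
spark.sql.catalog.lake.uri        https://catalog.example.com
spark.sql.catalog.lake.warehouse  s3://my-bucket/warehouse
spark.sql.extensions              org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions
```

Point Flink and Trino at the same catalog endpoint and every engine sees the same tables, schemas, and snapshots.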
Streaming + Batch Unified
Flink writes streaming micro-batches to Iceberg; Spark reads the same table for batch analytics. No ETL between streaming and batch layers.
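A rough Flink SQL sketch of the streaming side, assuming a REST catalog at a hypothetical endpoint and an already-defined kafka_source table:

```sql
-- Flink SQL: attach the shared Iceberg catalog (uri is hypothetical)
CREATE CATALOG lake WITH (
  'type' = 'iceberg',
  'catalog-type' = 'rest',
  'uri' = 'https://catalog.example.com'
);

-- Continuously append the stream into the shared table
INSERT INTO lake.db.events
SELECT event_id, user_id, event_type, event_ts FROM kafka_source;
```

Spark batch jobs then read lake.db.events directly — no copy between a "speed layer" and a "batch layer".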
Safe Schema Migration
Evolve schemas on petabyte-scale tables without downtime. Add nullable columns, rename fields, reorder columns — all metadata-only operations that complete in milliseconds.
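The operations above can be sketched as plain DDL, assuming the catalog.db.events table from the earlier example; because Iceberg tracks columns by ID, each statement touches only metadata:

```sql
-- Metadata-only schema changes: no data files are rewritten
ALTER TABLE catalog.db.events RENAME COLUMN event_type TO event_name;
ALTER TABLE catalog.db.events ADD COLUMN device_type STRING;
ALTER TABLE catalog.db.events DROP COLUMN device_type;
ALTER TABLE catalog.db.events ALTER COLUMN event_name AFTER user_id;  -- reorder
```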
How Iceberg Works
Iceberg is a three-layer metadata hierarchy that sits on top of your data files.
CATALOG
- Table pointer
- Current metadata
- Glue / Nessie / REST
- Engine discovery
METADATA
- Snapshot log
- Schema history
- Partition spec
- Sort order
MANIFESTS
- Manifest list
- Manifest files
- File stats & bounds
- Partition data
DATA FILES
- Parquet / ORC
- S3 / GCS / ADLS
- Immutable files
- Delete files
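Each layer of this hierarchy is queryable. A sketch using Iceberg's built-in metadata tables (the catalog.db.events name is an assumption):

```sql
-- Inspect the metadata layers directly
SELECT * FROM catalog.db.events.snapshots;  -- snapshot log (METADATA layer)
SELECT * FROM catalog.db.events.manifests;  -- manifest files (MANIFESTS layer)
SELECT * FROM catalog.db.events.files;      -- data files with stats & bounds
```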
-- Create an Iceberg table with hidden partitioning
CREATE TABLE catalog.db.events (
  event_id BIGINT,
  user_id BIGINT,
  event_type STRING,
  event_ts TIMESTAMP
)
USING iceberg
PARTITIONED BY (days(event_ts));  -- hidden partition

-- Time travel: query yesterday's snapshot
SELECT * FROM catalog.db.events
TIMESTAMP AS OF '2026-03-22 00:00:00';

-- Schema evolution: add column (no rewrite)
ALTER TABLE catalog.db.events
ADD COLUMN session_id STRING;

Iceberg vs Delta Lake vs Hudi
Apache Iceberg
- • Engine-agnostic: Spark, Flink, Trino, DuckDB, Snowflake
- • Metadata tree for fast planning on huge tables
- • Partition evolution without data rewrites
- • Strong multi-engine community (Apple, Netflix, AWS)
- • REST catalog standard emerging as universal interface
Delta Lake
- • Deep Databricks/Spark integration, DeltaSharing
- • Liquid clustering (auto-optimized layout)
- • Strong in Databricks-centric organizations
- • Delta Universal Format (UniForm) adds Iceberg compat
- • Growing multi-engine support but Spark-first heritage
Verdict: Choose Iceberg for multi-engine architectures and open interoperability. Choose Delta Lake if you are all-in on Databricks. Both are production-ready; the choice depends on your query engine mix, not technical quality.
| Feature | Iceberg | Delta Lake | Hudi |
|---|---|---|---|
| ACID transactions | ✓ | ✓ | ✓ |
| Time travel | ✓ snapshots | ✓ versions | ✓ timeline |
| Schema evolution | ✓ all ops | ✓ most ops | ✓ limited |
| Hidden partitioning | ✓ | ✗ | ✗ |
| Partition evolution | ✓ no rewrite | ✗ (liquid clustering instead) | ✓ limited |
| Multi-engine support | Best | Growing | Limited |
| Row-level deletes | ✓ CoW / MoR | ✓ | ✓ MoR-first |
Common Mistakes
Not running compaction
Iceberg writes small files on every commit (especially from streaming). Without regular compaction (rewriting small files into larger ones), query performance degrades. Run CALL catalog.system.rewrite_data_files(table => 'db.events') on a schedule.
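A sketch of a scheduled compaction call via Iceberg's Spark procedure — the table name and the 512 MB target size are illustrative choices, not defaults you must use:

```sql
-- Rewrite small files into ~512 MB files (sizes/names are illustrative)
CALL catalog.system.rewrite_data_files(
  table => 'db.events',
  options => map('target-file-size-bytes', '536870912')
);
```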
Forgetting to expire snapshots
Every write creates a new snapshot. Old snapshots keep data files alive that cannot be garbage collected. Run CALL catalog.system.expire_snapshots(table => 'db.events') weekly to free up storage and reduce metadata size.
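A sketch of a weekly expiry job — the table name, cutoff timestamp, and retention count are example values to tune against your time-travel requirements:

```sql
-- Drop snapshots older than the cutoff, but always keep the last 10
CALL catalog.system.expire_snapshots(
  table => 'db.events',
  older_than => TIMESTAMP '2026-03-15 00:00:00',
  retain_last => 10
);
```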
Using wrong write mode for CDC
Copy-on-write (CoW) rewrites entire data files on every update — fast reads, slow writes. Merge-on-read (MoR) appends delete files — fast writes, slightly slower reads. Choose based on your update frequency.
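The mode is a per-table, per-operation setting. A sketch of switching a high-churn CDC table (name assumed) to merge-on-read:

```sql
-- Favor write throughput: append delete files instead of rewriting data files
ALTER TABLE catalog.db.events SET TBLPROPERTIES (
  'write.delete.mode' = 'merge-on-read',
  'write.update.mode' = 'merge-on-read',
  'write.merge.mode'  = 'merge-on-read'
);
```

Pair merge-on-read with regular compaction so accumulated delete files don't slow reads.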
Choosing partition spec before understanding query patterns
Partitioning by days(event_ts) is useless if all queries filter by user_id. Profile your queries first, then define the partition spec. Iceberg's partition evolution lets you change it later without a rewrite.
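Partition evolution itself is a pair of metadata-only DDL statements (table name assumed; old files keep their old spec, new writes use the new one):

```sql
-- Repartition for user_id-heavy queries without rewriting any data
ALTER TABLE catalog.db.events ADD PARTITION FIELD bucket(16, user_id);
ALTER TABLE catalog.db.events DROP PARTITION FIELD days(event_ts);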
Skipping the catalog
Running Iceberg without a catalog (Hadoop catalog, file-based) limits you to one engine. Use a REST catalog (AWS Glue, Nessie, Polaris) to share tables across Spark, Trino, and DuckDB simultaneously.
Who Should Learn Iceberg?
Junior Engineer
Understand the lakehouse
Learn the Iceberg metadata model, create tables, run time travel queries, and understand why hidden partitioning matters. Foundation for any data lake role.
Senior Engineer
Build production lakehouses
Design partition strategies, implement CDC pipelines with MERGE INTO, tune compaction and expiry schedules, and configure multi-engine catalogs (Glue, Nessie).
Staff / Architect
Define table format strategy
Choose between Iceberg, Delta, and Hudi for the org. Design catalog architecture, govern schema evolution policies, and migrate legacy Hive tables to Iceberg at scale.
FAQ
- What is Apache Iceberg?
- Apache Iceberg is an open table format that adds ACID transactions, time travel, schema evolution, and hidden partitioning to data lakes on object storage (S3, GCS, ADLS). It works with Spark, Flink, Trino, DuckDB, Snowflake, and BigQuery.
- What is the difference between Apache Iceberg and Delta Lake?
- Both add ACID and time travel to data lakes. Iceberg is engine-agnostic with a metadata tree for fast planning — preferred for multi-engine architectures. Delta Lake has deep Databricks/Spark integration — preferred for Databricks-centric shops. Both are production-ready.
- How does Iceberg time travel work?
- Iceberg keeps an immutable snapshot for every write. You can query past snapshots with TIMESTAMP AS OF or VERSION AS OF (snapshot ID) for auditing, debugging, or data recovery.
- What is hidden partitioning in Iceberg?
- Hidden partitioning lets you define partition specs (e.g. month(event_date)) without requiring query filters on the partition column. Iceberg applies pruning automatically, eliminating the #1 cause of slow queries on Hive-style tables.
- What is schema evolution in Iceberg?
- Schema evolution lets you add, drop, rename, or reorder columns without rewriting data files. Iceberg uses column IDs internally (not names), so renaming a column never breaks existing files or queries.
What You'll Build with AI-DE
The Iceberg Lakehouse project builds a production-grade open lakehouse from scratch — catalog setup through streaming CDC ingestion and multi-engine query federation.
- • Part 1: Catalog setup, first Iceberg tables, schema evolution, hidden partitioning
- • Part 2: MERGE INTO for CDC, copy-on-write vs merge-on-read, compaction strategy
- • Part 3: Flink streaming writes, Spark batch reads, Trino ad-hoc queries — same tables
- • Part 4: Time travel audit queries, snapshot expiry, data lifecycle governance