Iceberg vs Delta Lake: What's the Difference?
Both are open table formats with ACID, time travel, and schema evolution. Iceberg leads on multi-engine interoperability — Spark, Flink, Trino, DuckDB, Snowflake, and BigQuery all support it natively. Delta Lake leads on Databricks integration and features like Liquid Clustering. The choice depends on your engine mix, not technical quality.
Side-by-Side Comparison
Apache Iceberg
- Engine-agnostic: Spark, Flink, Trino, DuckDB, Snowflake
- Metadata tree: fast planning on tables with millions of files
- Hidden partitioning + partition evolution without rewrites
- REST catalog: universal interface for multi-engine discovery
- Backed by Apple, Netflix, AWS, Dremio, Tabular
- Copy-on-write and merge-on-read write modes
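To make "hidden partitioning" concrete: Iceberg stores a transform (such as `day()` or `bucket(N)`) on a source column, so queries filter on the raw column and the engine derives the partition value itself. The sketch below is a toy illustration in plain Python, not the real library — in particular, Iceberg's actual `bucket` transform uses a 32-bit Murmur3 hash, and `md5` here is only a stand-in:

```python
from datetime import datetime, timezone
import hashlib

# Toy illustration of Iceberg-style partition transforms: each transform
# maps a source column value to a partition value, so users never filter
# on a derived partition column directly.

def day_transform(ts: datetime) -> str:
    """day(): truncate a timestamp to its calendar date."""
    return ts.date().isoformat()

def bucket_transform(value: str, n_buckets: int) -> int:
    """bucket(N): stable hash of the value modulo N.
    (Real Iceberg uses Murmur3; md5 is an illustrative stand-in.)"""
    digest = hashlib.md5(value.encode()).hexdigest()
    return int(digest, 16) % n_buckets

event_time = datetime(2024, 3, 15, 9, 30, tzinfo=timezone.utc)
print(day_transform(event_time))        # 2024-03-15
print(bucket_transform("user_42", 16))  # a stable bucket in [0, 16)
```

Because the transform is recorded in table metadata rather than baked into directory paths, Iceberg can later change it (partition evolution) without rewriting existing data files.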
Delta Lake
- Deep Databricks + Spark integration
- Liquid Clustering: auto-optimized layout, no partition spec needed
- Delta Live Tables: declarative streaming pipelines
- Delta Sharing: share data across orgs without copying
- UniForm: exposes an Iceberg-compatible metadata layer
- Backed by Databricks, Microsoft, Linux Foundation
Mental Model
Think of Iceberg as the USB-C of data lake table formats — a universal standard that any engine can plug into. Think of Delta Lake as Apple Lightning — deep, polished integration within the Apple (Databricks) ecosystem, with adapters available for other systems. If you only ever use Apple devices, Lightning is seamless. If you switch between brands, USB-C is more practical.
When to Use Each
Choose Iceberg when:
- Multiple query engines need the same tables
- You want engine-agnostic open standards
- Building on AWS, GCP, or Azure without Databricks
- Tables have millions of small files (metadata tree wins)
- Streaming ingestion with Flink + batch queries with Trino
Choose Delta Lake when:
- Your team is all-in on Databricks
- You use Delta Live Tables for streaming pipelines
- You want Liquid Clustering without managing partition specs
- You share data with Delta Sharing
- Deep Photon engine optimizations matter
Feature Comparison
| Feature | Iceberg | Delta Lake |
|---|---|---|
| ACID transactions | ✓ | ✓ |
| Time travel | ✓ `FOR TIMESTAMP/VERSION AS OF` | ✓ `TIMESTAMP/VERSION AS OF` |
| Schema evolution | ✓ add/drop/rename/reorder | ✓ add/drop; rename via column mapping |
| Hidden partitioning | ✓ native | ✓ via Liquid Clustering |
| Partition evolution | ✓ no rewrite | ✗ (Liquid replaces this) |
| Multi-engine support | ✓ best | ✓ growing (UniForm) |
| Streaming writes | ✓ Flink, Spark | ✓ Spark Structured Streaming |
| Row-level deletes | ✓ CoW + MoR | ✓ CoW + MoR |
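The last table row distinguishes two delete strategies both formats offer: copy-on-write (CoW) rewrites the affected data file at write time, while merge-on-read (MoR) records a small delete file that readers apply at scan time. A simplified sketch of the idea, not either format's actual file layout:

```python
# Simplified sketch of copy-on-write vs merge-on-read row deletes.
# Real table formats track data files and delete files/vectors in
# metadata; plain lists and a set stand in for them here.

data_file = [{"id": 1, "v": "a"}, {"id": 2, "v": "b"}, {"id": 3, "v": "c"}]

# Copy-on-write: deleting id=2 rewrites the whole data file immediately.
# Expensive writes, zero extra work at read time.
cow_file = [row for row in data_file if row["id"] != 2]

# Merge-on-read: the write only records a small delete file; readers
# filter against it when scanning, and compaction folds it in later.
# Cheap writes, slight overhead per read until compaction.
delete_file = {2}
mor_scan = [row for row in data_file if row["id"] not in delete_file]

assert cow_file == mor_scan  # same logical table, different write cost
```

This is why MoR suits streaming or frequent-update workloads (fast writes) while CoW suits read-heavy tables (no merge work per scan).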
Common Mistakes
Choosing based on hype, not engine mix
The right choice depends entirely on which query engines your team uses. Profile your stack first. If everything is Databricks, Delta is probably fine. If you use Trino, DuckDB, or Snowflake alongside Spark, Iceberg is the safer bet.
Assuming you have to pick one forever
Delta UniForm lets Delta tables surface Iceberg-compatible metadata. You can run both formats in the same data lake. Some teams use Iceberg for external/shared tables and Delta for internal Databricks pipelines.
Ignoring compaction for both formats
Both Iceberg and Delta accumulate small files from streaming writes, and neither compacts them automatically in its open-source defaults. Schedule regular compaction (`OPTIMIZE` in Delta, the `rewrite_data_files` procedure in Iceberg) or query performance degrades as per-file planning and scan overhead grows.
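To see why compaction matters, consider what a compaction job conceptually does: greedily pack many small files into rewrite groups near a target file size. A toy planner, assuming a common 128 MB target (the actual target is configurable in both formats):

```python
# Toy sketch of small-file compaction planning: greedily pack files
# into groups close to a target size, which is conceptually what
# OPTIMIZE (Delta) and rewrite_data_files (Iceberg) do before rewriting.

TARGET_BYTES = 128 * 1024 * 1024  # assumed 128 MB target; configurable

def plan_compaction(file_sizes: list[int]) -> list[list[int]]:
    groups, current, current_size = [], [], 0
    for size in sorted(file_sizes):
        if current and current_size + size > TARGET_BYTES:
            groups.append(current)
            current, current_size = [], 0
        current.append(size)
        current_size += size
    if current:
        groups.append(current)
    return groups

# 1000 tiny 1 MB streaming files collapse into 8 rewrite groups,
# turning 1000 file opens per scan into 8.
plan = plan_compaction([1024 * 1024] * 1000)
print(len(plan))  # 8
```

The payoff is the same in both formats: fewer files means fewer metadata entries to plan over and fewer objects to open per scan.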
FAQ
- What is the difference between Iceberg and Delta Lake?
- Both add ACID, time travel, and schema evolution to data lakes. Iceberg is engine-agnostic (Spark, Flink, Trino, DuckDB, Snowflake, BigQuery). Delta Lake has deep Databricks/Spark integration. Choose based on your engine mix.
- Should I use Iceberg or Delta Lake?
- Iceberg for multi-engine architectures or non-Databricks environments. Delta Lake for Databricks-centric teams or when Delta Live Tables/Delta Sharing matters. Both are production-ready.
- Can Iceberg and Delta Lake work together?
- Yes. Delta UniForm exposes Iceberg-compatible metadata from Delta tables. Some orgs run both: Iceberg for shared external tables, Delta for internal Databricks pipelines.