Delta Lake

Open storage framework that brings reliability and performance to data lakes.

MCPCLI

About Delta Lake

Open storage framework that brings reliability and performance to data lakes. Explore how Delta Lake integrates with the agentic data stack ecosystem and supports autonomous data operations.

Key Features

  • ACID transactions with optimistic concurrency control on object storage
  • Schema enforcement on write and schema evolution support
  • Time travel and data versioning for auditing, rollbacks, and reproducibility
  • Unified batch and streaming processing on the same tables
  • MERGE/UPDATE/DELETE operations (upserts) for incremental data management
  • Broad engine support (Spark, Flink, Trino, Presto, Hive, Athena, Snowflake, BigQuery)
  • Small file compaction and Z-Order indexing for query performance
  • Change Data Feed for capturing row-level changes between table versions

Agent Integration

CLIspark-sql / deltalake (Python)

$ pip install deltalake
CLI Documentation