What is an Open Table Format in a Lakehouse Architecture? (ft. Apache Iceberg)
In this video, Cloudera's Director of Developers, Dipankar breaks down what an open table format actually is in a Lakehouse architecture. Table formats are not new and Databases have had their own storage formats for decades. But in the lakehouse world, things have changed.
In this video, Dipankar walks through:
Where table formats originated (from the relational model to modern systems)
Why proprietary storage formats led to engine lock-in
How Hive introduced one of the first open table abstractions in data lakes
What a table format actually is in a lakehouse
The core components: schema, partitions, stats, commit history, file paths
Why modern formats like Apache Iceberg, Delta Lake, and Apache Hudi bring openness and interoperability
Join the Cloudera Community (https://community.cloudera.com) to learn more!