Lakehouse Days: March 2025, Bengaluru

March 22, 2025 from 09:30 AM to 02:00 PM IST

Want to see e6data in action?

Learn how data teams power their workloads.

About the Event

Join us for an exclusive in-person event on “Introduction, optimization, integrations, table management, streaming data” hosted by e6data in Bengaluru!

Lakehouse Days - in collaboration with RisingWave, is designed specifically for data engineers, data architects, and senior software engineers who constantly seek to optimize their data architecture to make it more price-performant while delivering the best user experience. In this edition, we will dive deep into the internal architecture of open table formats like Apache Iceberg, merge-on-read query, serverless compaction and Iceberg table sharing with RisingWave, optimizing query performance, handling data transfers with Apache Arrow Flight, and Iceberg’s integration with GCP.

Lakehouse Days - in collaboration with RisingWave, is designed to enable fellow data geeks to meet, network, and have insightful discussions on the entropic world of data.

‍

Register Now!

Reserve your spot through this link: https://lu.ma/pd0r4bmr?utm_source=website

Venue - Accel LaunchPad, Koramangala

Date and time - Mar 22, 2025, from 09:30 AM to 2:00 PM

‍‍

Meet the Speakers

Rayees Pasha, CPO, RisingWave Labs

Topic: Streaming-first Approach to Iceberg with RisingWave

Summary: The session will provide an overview of the technical challenges of building a new Iceberg Table engine that is purpose-built for streaming workloads. The talk will highlight how RisingWave has built end-to-end key capabilities for Iceberg table management, including Iceberg’s merge-on-read query, Serverless Compaction, and Iceberg table sharing to allow direct queries from other engines. A key feature in this project is the native Iceberg compaction service written in Rust using Apache DataFusion and Apache Iceberg-Rust as foundational components.

Time: 09:30 - 10:15 AM IST
‍

Ankur Ranjan, Sr Software Engineer, e6data

Topic: Apache Arrow Flight: Reshaping How We Handle Data Transfers

‍Summary: In this talk, we will explore how Apache Arrow Flight overcomes the challenges of traditional protocols like ODBC and JDBC by providing a columnar-native transport that maintains data in its original format throughout the transfer process. Arrow Flight promises to enhance analytical workloads and align perfectly with modern data architectures by eliminating unnecessary conversions and streamlining data transfers. Join us to discover how this innovative approach can substantially improve data processing efficiency.

Time: 10:30 - 11:15 AM IST
‍

Sai Vineel Thamishetty, Sr Data Engineer, Walmart

Topic: Apache Iceberg with Google Cloud Platform (GCP)

‍Summary: This talk will explore the exciting developments with Apache Iceberg and its integration with Google Cloud Platform. Iceberg is now allowing users to store tables on Google Cloud Storage, which means we can use GCP’s scalable infrastructure alongside Iceberg’s performance enhancements. Popular data processing engines like Apache Spark and Trino have improved their support for Iceberg, making it easier for us to work with these tables directly in the cloud. There’s also a lot of buzz around improving interoperability with BigQuery, which could facilitate smoother data transfers and queries.

Time: 11:30 AM - 12:15 PM IST

Build future-proof data products

Try e6data for your heavy workloads!

Get Started for Free

Frequently asked questions (FAQs)

How do I integrate e6data with my existing data infrastructure?

How does billing work?

What kind of file formats does e6data support?

What kind of performance improvements can I expect with e6data?

What kinds of deployment models are available at e6data ?

How does e6data handle data governance rules?