Lakehouse Days: March 2025, Bengaluru

Want to see e6data in action?

Learn how data teams power their workloads.

Get Demo
Get Demo

About the Event

Join us for an exclusive in-person event on “Introduction, optimization, integrations, table management, streaming data” hosted by e6data in Bengaluru!

Lakehouse Days - in collaboration with RisingWave, is designed specifically for data engineers, data architects, and senior software engineers who constantly seek to optimize their data architecture to make it more price-performant while delivering the best user experience. In this edition, we will dive deep into the internal architecture of open table formats like Apache Iceberg, merge-on-read query, serverless compaction and Iceberg table sharing with RisingWave, optimizing query performance, handling data transfers with Apache Arrow Flight, and Iceberg’s integration with GCP.

Lakehouse Days - in collaboration with RisingWave, is designed to enable fellow data geeks to meet, network, and have insightful discussions on the entropic world of data.

Register Now!

Reserve your spot through this link: https://lu.ma/pd0r4bmr?utm_source=website 

Venue - Accel LaunchPad, Koramangala

​Date and time - Mar 22, 2025, from 09:30 AM to 2:00 PM

Meet the Speakers

Rayees Pasha, CPO, RisingWave Labs

Topic: Streaming-first Approach to Iceberg with RisingWave

Summary: The session will provide an overview of the technical challenges of building a new Iceberg Table engine that is purpose-built for streaming workloads. The talk will highlight how RisingWave has built end-to-end key capabilities for Iceberg table management, including Iceberg’s merge-on-read query, Serverless Compaction, and Iceberg table sharing to allow direct queries from other engines. A key feature in this project is the native Iceberg compaction service written in Rust using Apache DataFusion and Apache Iceberg-Rust as foundational components.

Time: 09:30 - 10:15 AM IST

Ankur Ranjan, Sr Software Engineer, e6data

Topic: Apache Arrow Flight: Reshaping How We Handle Data Transfers

Summary: In this talk, we will explore how Apache Arrow Flight overcomes the challenges of traditional protocols like ODBC and JDBC by providing a columnar-native transport that maintains data in its original format throughout the transfer process. Arrow Flight promises to enhance analytical workloads and align perfectly with modern data architectures by eliminating unnecessary conversions and streamlining data transfers. Join us to discover how this innovative approach can substantially improve data processing efficiency.

Time: 10:30 - 11:15 AM IST

Sai Vineel Thamishetty, Sr Data Engineer, Walmart

Topic: Apache Iceberg with Google Cloud Platform (GCP)

Summary: This talk will explore the exciting developments with Apache Iceberg and its integration with Google Cloud Platform. Iceberg is now allowing users to store tables on Google Cloud Storage, which means we can use GCP’s scalable infrastructure alongside Iceberg’s performance enhancements. Popular data processing engines like Apache Spark and Trino have improved their support for Iceberg, making it easier for us to work with these tables directly in the cloud. There’s also a lot of buzz around improving interoperability with BigQuery, which could facilitate smoother data transfers and queries.

Time: 11:30 AM - 12:15 PM IST

Read more about Apache Iceberg

Share on

Build future-proof data products

Try e6data for your heavy workloads!

Get Started for Free
Get Started for Free
Frequently asked questions (FAQs)
How do I integrate e6data with my existing data infrastructure?

We are universally interoperable and open-source friendly. We can integrate across any object store, table format, data catalog, governance tools, BI tools, and other data applications.

How does billing work?

We use a usage-based pricing model based on vCPU consumption. Your billing is determined by the number of vCPUs used, ensuring you only pay for the compute power you actually consume.

What kind of file formats does e6data support?

We support all types of file formats, like Parquet, ORC, JSON, CSV, AVRO, and others.

What kind of performance improvements can I expect with e6data?

e6data promises a 5 to 10 times faster querying speed across any concurrency at over 50% lower total cost of ownership across the workloads as compared to any compute engine in the market.

What kinds of deployment models are available at e6data ?

We support serverless and in-VPC deployment models. 

How does e6data handle data governance rules?

We can integrate with your existing governance tool, and also have an in-house offering for data governance, access control, and security.