Join us for an exclusive in-person event on “Introduction, optimization, integrations, table management, streaming data” hosted by e6data in Bengaluru!
Lakehouse Days, in collaboration with RisingWave, is designed specifically for data engineers, data architects, and senior software engineers who constantly seek to optimize their data architecture to make it more price-performant while delivering the best user experience. In this edition, we will dive deep into the internal architecture of open table formats like Apache Iceberg, merge-on-read queries, serverless compaction and Iceberg table sharing with RisingWave, optimizing query performance, handling data transfers with Apache Arrow Flight, and Iceberg’s integration with GCP.
Lakehouse Days is also designed to enable fellow data geeks to meet, network, and have insightful discussions on the entropic world of data.
Reserve your spot through this link: https://lu.ma/pd0r4bmr?utm_source=website
Venue - Accel LaunchPad, Koramangala
Date and time - Mar 22, 2025, from 09:30 AM to 02:00 PM IST
Topic: Streaming-first Approach to Iceberg with RisingWave
Summary: The session will provide an overview of the technical challenges of building a new Iceberg table engine that is purpose-built for streaming workloads. The talk will highlight how RisingWave has built key end-to-end capabilities for Iceberg table management, including Iceberg’s merge-on-read queries, serverless compaction, and Iceberg table sharing to allow direct queries from other engines. A key feature of this project is the native Iceberg compaction service, written in Rust using Apache DataFusion and Apache Iceberg-Rust as foundational components.
Time: 09:30 - 10:15 AM IST
Topic: Apache Arrow Flight: Reshaping How We Handle Data Transfers
Summary: In this talk, we will explore how Apache Arrow Flight overcomes the challenges of traditional protocols like ODBC and JDBC by providing a columnar-native transport that keeps data in its original format throughout the transfer process. By eliminating unnecessary conversions and streamlining data transfers, Arrow Flight promises to enhance analytical workloads and align well with modern data architectures. Join us to discover how this approach can substantially improve data processing efficiency.
Time: 10:30 - 11:15 AM IST
Topic: Apache Iceberg with Google Cloud Platform (GCP)
Summary: This talk will explore recent developments with Apache Iceberg and its integration with Google Cloud Platform. Iceberg now allows users to store tables on Google Cloud Storage, which means we can use GCP’s scalable infrastructure alongside Iceberg’s performance enhancements. Popular data processing engines like Apache Spark and Trino have improved their support for Iceberg, making it easier for us to work with these tables directly in the cloud. There’s also a lot of buzz around improving interoperability with BigQuery, which could facilitate smoother data transfers and queries.
Time: 11:30 AM - 12:15 PM IST
We are universally interoperable and open-source friendly. We can integrate with any object store, table format, data catalog, governance tool, BI tool, and other data applications.
We use a usage-based pricing model based on vCPU consumption. Your billing is determined by the number of vCPUs used, ensuring you only pay for the compute power you actually consume.
We support all major file formats, such as Parquet, ORC, JSON, CSV, Avro, and others.
e6data promises 5 to 10 times faster query speeds at any concurrency, at over 50% lower total cost of ownership across workloads, compared to any compute engine in the market.
We support serverless and in-VPC deployment models.
We can integrate with your existing governance tool, and we also have an in-house offering for data governance, access control, and security.