e6data + Fabric: Run queries 33% faster, no data movement
We have been following the buzz around Microsoft Fabric for a while and recently integrated e6data directly into Fabric. Now you can query your existing OneLake data about 33% faster, cutting analytics costs by 2-3x without changing your Fabric workflow. e6data sits alongside Fabric with zero data duplication. It layers in extra security (column-level masking, GDPR/SOC2/HIPAA compliance) on top of your current Fabric setup.
Doubling down on the cost aspects, in this blog post, we’ll cover:
Microsoft Fabric uses a capacity-based pricing model: you purchase a Fabric capacity (measured in capacity units, CUs) from which all your workloads draw. Fabric supports bursting (temporarily boosting resources) and smoothing (averaging usage over time) to handle peaks. While the simplicity of this model is appreciated, the fixed nature of a purchased capacity can sometimes lead to higher spending in certain workload patterns. A few examples of such scenarios include:
e6data is a lakehouse compute engine that can query data directly in OneLake at a 33% faster speed and 2-3x lower TCO than other query engines, as validated by internal TPC-DS benchmark testing. There’s no data silo or fork, you could have Fabric’s own SQL engine and e6data querying the same OneLake data simultaneously for different workloads.
Here’s how Fabric alone compares to Fabric + e6data in real-world enterprise analytics.
(For reference, approximate pricing: Fabric F2 costs about $263/month pay-as-you-go, so an F32 is around $4,200/month. e6data usage is ~$0.10 per vCPU-hour in these examples.)
e6data’s architecture is built on an atomic architecture that scales granularly with stateless services, with scaling granularity down to 1 vCPU increments to handle petabyte-scale of analytics workloads:
Here's the deal: Fabric’s great for your core workloads, but pairing it with e6data significantly reduces Microsoft Fabric costs during bursty, ad-hoc, or high-concurrency scenarios.
Think of e6data as a specialized compute layer that slides neatly into Fabric whenever your workloads get tricky without replacing Fabric itself.
You get the best of both worlds:
Win-Win!
If you’re a data engineer working with Fabric, give e6data a try on your OneLake data. One-click connect, run your queries, and see the query performance immediately. If you want to gain early access or learn more about our upcoming performance upgrades, reach out to us here, and we’ll be in touch soon!
We are universally interoperable and open-source friendly. We can integrate across any object store, table format, data catalog, governance tools, BI tools, and other data applications.
We use a usage-based pricing model based on vCPU consumption. Your billing is determined by the number of vCPUs used, ensuring you only pay for the compute power you actually consume.
We support all types of file formats, like Parquet, ORC, JSON, CSV, AVRO, and others.
e6data promises a 5 to 10 times faster querying speed across any concurrency at over 50% lower total cost of ownership across the workloads as compared to any compute engine in the market.
We support serverless and in-VPC deployment models.
We can integrate with your existing governance tool, and also have an in-house offering for data governance, access control, and security.