🎸 We read the news today. Oh boy! (with apologies to the Beatles…) 📰 The June Apache Hudi newsletter is out with exciting updates: 🎉 The release of Hudi 0.15.0, with a wide range of new features 💲 The recent $35M Series B funding round for Onehouse 🔎 Lakeview, a free monitoring and management solution for the lakehouse ⚙️ Table Optimizer, a managed service to automatically optimize lakehouse management tasks 🎼 (Both new solutions support Apache Hudi, with planned support for Apache Iceberg and Delta Lake.) ➕ And much more, including contributions from Harsh Dahiya, Soumil S., Gatsby Lee, and Nishant Kumar. With thanks to Dipankar Mazumdar, M.Sc 🥑 ✍️ Sign up for the Apache Hudi newsletter today! https://lnkd.in/gBZD8E_j #onehouse #dataengineering #nolockin #datalakehouse #apachehudi #opensource
Onehouse
Software Development
Menlo Park, California 6,800 followers
The Universal Data Lakehouse
About us
Onehouse delivers a universal data lakehouse through a cloud-native managed lakehouse service built on Apache Hudi, which was created by the founding team while they were at Uber. Onehouse makes it possible to blend the ease of use of a warehouse with the scale of a data lake, by offering a seamless experience for engineers to get their data lakes up and running. Onehouse offers the widest interoperability for your data in the market across table formats, multiple compute engines and multiple cloud providers. We have a stellar team of inspired, seasoned professionals including data, distributed systems, and platform engineers from Uber, LinkedIn, Confluent, and Amazon. Our product team has helped build enterprise data products at major enterprises including Azure Databricks.
- Website
-
https://onehouse.ai
External link for Onehouse
- Industry
- Software Development
- Company size
- 11-50 employees
- Headquarters
- Menlo Park, California
- Type
- Privately Held
- Founded
- 2021
Locations
-
Primary
2550 Sand Hill Rd
STE 200
Menlo Park, California 94025, US
Employees at Onehouse
Updates
-
Onehouse reposted this
At one of my sessions at the Data + AI Summit last month I asked the almost full house audience a live poll of what query engines they were using. I obviously knew Apache Spark would dominate in the audience, but the additional diversity was exciting to see! Almost as many Snowflake users as Databricks users and Trino was not far behind. When it comes to data query engines, ONE SIZE DOES NOT FIT ALL... Each of these compute engines has a specialized match to certain workload types and this is why you see very prevalent diversity even in a room full of random engineers at the Data + AI Summit. <note for the nerds 🤓 , this isn't a scientific measurement of the industry or representative of actual popularity of engines, just a fun live survey of a random community audience. yes, databricks is spark, guess what? snowflake, emr, athena, fabric also can run spark> #apachespark #databricks #snowflake #trino #apacheflink #bigquery #awsathena #awsemr #msfabric #dataengineering #datalakehouse
-
-
Great article from Toplyne on implementing a data lakehouse to: 🔋 Power AI/ML use cases… 🌺 …with fresh data… 🤑 …while eliminating vendor lock-in and reducing overall costs. 🕵️ Read on to learn more about Toplyne's architecture, including Apache Kafka ingestion to the lakehouse, with Dagster and Apache Spark to run feature engineering pipelines. https://lnkd.in/ggkt_i36 #onehouse #dataengineering #nolockin #ai #datalakehouse #apachehudi #opensource
-
-
Onehouse reposted this
Over the past month or so, both me and Shirshanka Das have been inundated with curious questions on #datalakehouse and #data #catalogs. We thought it ll be fun to exchange ideas and think out loud across both these areas together. So this Thursday, July 11th - 9am PST. Its on. We’re going to try something different. See you there.
”Taming the Chaos: A Deep Dive into Table Formats and Catalogs” - Tune in to the next round of DDL on July 11th at 9am US PT to find out more… Join host Shirshanka Das as he sits down with @Vinoth Chandar, Founder/CEO Onehouse as they demystify the recent explosion of table format and catalog tooling options in the data industry. Here is what they’ll discuss: 🗺 How did we get here? Tracing the history of data lakes, table formats, and catalogs. ⚡ The open source resurgence - Databricks’ Unity Catalog, Snowflake’s Polaris, Iceberg Rest Catalog, and beyond. 🔮 Forecasts and predictions of what’s to come in this space. We look forward to seeing you there!
DDL (Ep 08): Taming the Chaos: A Deep Dive into Table Formats and Catalogs
www.linkedin.com
-
Onehouse reposted this
Did you miss my webinar last week on how Onehouse creates the fastest Apache Iceberg tables for Snowflake? Yes it's true, Onehouse provides the fastest time to operational Iceberg tables in production. We offer the fastest streaming ingestion to Iceberg tables, and after our advanced table optimizations you will have the fastest Snowflake queries. Attached are my slides and you can watch the full recording here: 👇 https://lnkd.in/g7Wx7S83 Read more about the solution architecture here: 👇 https://lnkd.in/gkzpC4bU #apacheiceberg #snowflake #datalakehouse #streaminganalytics #realtimeanalytics #dataintegration #apachehudi #deltalake #apachespark
-
💡 What if you could have a single source of truth… ❓...for your data warehouse queries… 🤖 …and your data science & ML/AI queries? 🔒 Open, locked-down secure, and performant. ♾️ That's what the universal data lakehouse is all about. https://lnkd.in/gbefYRxG #onehouse #dataengineering #nolockin #universaldatalakehouse #apachehudi #opensource
-
-
🔥 The data lakehouse is on fire. And Onehouse is gaining recognition as a lakehouse leader. 😁 Amazon S3 is an outstanding service to host your data lakehouse. Fast, cost-efficient, secure, and adaptable to almost any use case. 🌆 Come to the free AWS Summit in New York next Wednesday, free in-person and online. 🤝 Meet with Onehouse in booth 1040 to learn how the data lakehouse will help you get the most out of S3 and scores of other AWS services. https://lnkd.in/gM4kReS6 #onehouse #dataengineering #nolockin #datalakehouse #apachehudi #opensource
-
-
Need help managing your data lakehouse? Here are two new power tools to monitor and optimize your lakehouse tables, for major cost savings and gains in performance: 🔬 Onehouse LakeView, a free tool for data lakehouse observability and management 📐 Onehouse Table Optimizer, a managed service for optimizing your data lakehouse tables in production 🗣️ Join us on 7/16 for live demos and Q&A. #onehouse #dataengineering #nolockin #lakehouse #datalakehouse #universaldatalakehouse #apachehudi #apachextable #opensource
Introducing Onehouse LakeView and Table Optimizer
www.linkedin.com
-
😳 That time when you realized everything you need to know… 💸 …to save money on Snowflake was right in front of you! 🫸🫷 Yes, the hit Onehouse webinar that we all wanted and waited for is now available on demand. ⚡️ That means you can get it right now. No waiting. See you there! https://lnkd.in/gnUmkyCv #onehouse #dataengineering #nolockin #datalakehouse
#ApacheXTable gives you an open data architecture - great for sharing data between Databricks, Snowflake, and more. But what's the best way to manage #ApacheIceberg data for Snowflake? Hear all about it in our upcoming webinar.
Iceberg for Snowflake: Implementing the fastest, most open data lakehouse
www.linkedin.com
-
🤔 Here at Onehouse, we thought we had come up with something original with our Series B announcement and our two new products from Wednesday... 😁 …then we found this old clip from the Oprah Winfrey show. (Thanks, Oprah!) https://lnkd.in/gxyAiHQC #onehouse #dataengineering #nolockin #datalakehouse #universaldatalakehouse #apachehudi #apachextable #opensource
-