Admin

Onehouse Raises $35M to Enhance Open Data Lakehouse Technology

$35M, advance, lakehouse, Onehouse, open data, Secures, technology



In the world of data management and analytics, having a flexible and interoperable infrastructure is key. This is where Onehouse, a data lakehouse vendor, comes in. The company is focused on enabling interoperable data lake technologies and has recently secured $35 million in Series B funding to accelerate its product development and market penetration. This latest funding round was led by Craft Ventures, with participation from Addition and Greylock Partners. To date, Onehouse has raised a total of $68 million, including a seed round of $8 million and a $25 million Series A round.

Onehouse originated from the open-source project Apache Hudi, which was initially developed at Uber. Apache Hudi is an open data lake table format that competes with other open-source formats like Apache Iceberg and Delta Lake. However, Onehouse’s goal is not to compete directly but rather to focus on interoperability between these different formats. In November 2023, Microsoft and Google joined forces with Onehouse to back the OneTable open-source data lake table format interoperability technology. This effort was later moved to the Apache Software Foundation and rebranded as Apache XTable.

With the new funding, Onehouse aims to continue contributing to the development of Apache XTable and advance its Universal Data Lakehouse platform. The Universal Data Lakehouse platform is a managed product offering that provides a neutral, efficient, and interoperable solution for data management. It allows organizations to use different table formats, data catalogs, query engines, and cloud providers seamlessly.

One of the key advantages of Apache XTable is its ability to solve the challenge of having multiple different data lake table formats. It provides interoperability across data lake table metadata in an omnidirectional way. Microsoft has already embraced XTable, using it as a key component for translating between different formats like Snowflake, Apache Iceberg, and Delta Lake.

In addition to focusing on interoperability, Onehouse is also dedicated to advancing the Apache Hudi technology. Work is currently underway on the new Apache Hudi 1.0 release, which will bring a new concurrency model and support for unstructured data. One of the exciting features being developed is the addition of vector search indexes in the extensible indexing subsystem. This will enable vector as well as text searches against data in the data lake, making Apache Hudi more like a database layer on top of data lakes. The company expects Apache Hudi 1.0 to become generally available in the next few months.

Onehouse’s approach of prioritizing interoperability and flexibility sets it apart in the data management landscape. By allowing organizations to use different table formats and query engines without sacrificing performance, Onehouse empowers businesses to leverage their data effectively. With the latest funding, the company is poised to continue making advancements in data lake technologies, enabling seamless data management and analytics.



Source link

Leave a Comment