Skip to content

Apache Hudi

Hudi stands for — Hadoop Upsert Deletes and Incrementals

Apache Hudi (Hadoop Upserts Deletes and Incrementals) is an open-source data management framework that is designed to simplify incremental data processing and data pipeline management for large-scale, high-performance data lakes. It helps us in managing large volumes of data with high velocity.

References

  • https://asrathore08.medium.com/apache-hudi-d259c1f202db