Great Expectations: Implement on Databricks

Great Expectations (GE) is a great Python library for data quality. It comes with an integration for Apache Spark and dozens of preconfigured data expectations. Databricks is a top-tier data platform built on Spark. So you’d expect the two to integrate seamlessly, but that is not quite the case.

References