Getting Started

pip install datapact

datapact is a Python library for verifying your data.

  • works with Pandas + Dask DataFrames

  • dozens of existing assertions, easy to add your own

  • great in-editor documentation via docstrings + types

  • two severity levels (.should for warnings, .must for failures)

  • failure notifications by email, MS Teams, Slack, or PagerDuty (through Datapact Track)
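
To make the two severity levels concrete, here is a minimal plain-Python sketch of the idea: a "should" check only warns, while a "must" check raises. It does not use datapact and does not reproduce its API; the names `Check`, `should`, `must`, and `be_between` are hypothetical and chosen only for illustration.

```python
import warnings


class Check:
    """Hypothetical helper illustrating warn-vs-fail checks (not datapact's API)."""

    def __init__(self, values, fail):
        self.values = list(values)
        self.fail = fail  # True: raise on violation, False: only warn

    def be_between(self, low, high):
        # Collect every value that falls outside the inclusive range.
        bad = [v for v in self.values if not (low <= v <= high)]
        if not bad:
            return True
        message = f"{len(bad)} value(s) outside [{low}, {high}]: {bad}"
        if self.fail:
            raise AssertionError(message)
        warnings.warn(message, UserWarning)
        return False


def should(values):
    """Warning-level check: violations are reported but do not stop execution."""
    return Check(values, fail=False)


def must(values):
    """Failure-level check: violations raise an AssertionError."""
    return Check(values, fail=True)


ages = [22, 38, 26, 190]

# Warning level: emits a UserWarning and execution continues.
should(ages).be_between(0, 150)

# Failure level: the same violation raises.
try:
    must(ages).be_between(0, 150)
except AssertionError as err:
    print(f"check failed: {err}")
```

A split like this lets a pipeline keep running on soft violations while hard violations stop it. See the Tutorial for the library's actual interface.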

Tutorial

Comparison to Great Expectations, TDDA, Monte Carlo, Metaplane

[Image: Jupyter screenshot]