This script uses ydata_profiling to generate a report for all tables within a schema or a select subset of tables, plus a simple HTML table with the percentage of missing and present values per column for quick reference.
Use uv. If you don't have it:
pipx install uv # or pipCreate your .env and set the required values. Run the script:
uv run python main.py
| Value | Description | Default |
|---|---|---|
| DATABASE_URL | Database URL for SQLAlchemy | - |
| OUTPUT_DIR | Output directory | - |
| SCHEMA | Database schema | public |
| TABLES | Comma-separated list of tables | All tables |