Skip to content

danilo-alm/data-profiling

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Data Profiling

This script uses ydata_profiling to generate a report for all tables within a schema or a select subset of tables, plus a simple HTML table with the percentage of missing and present values per column for quick reference.

Running

Use uv. If you don't have it:

pipx install uv  # or pip

Create your .env and set the required values. Run the script:

uv run python main.py

Config (.env)

Value Description Default
DATABASE_URL Database URL for SQLAlchemy -
OUTPUT_DIR Output directory -
SCHEMA Database schema public
TABLES Comma-separated list of tables All tables

About

Generate reports with ydata_profiling

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages