Skip to content

raulcd/datanomy

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

53 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Datanomy

Explore the anatomy of your columnar data files

Datanomy is a terminal-based tool for inspecting and understanding data files. It provides an interactive view of your data's structure, metadata, and internal organization.

Currently only Parquet available:

Parquet demo

Features for Parquet view

General Structure

General Structure

Schema

Schema

Data

Data

Metadata

Metadata

Stats

Stats

Installation

# From PyPI
uv tool install datanomy
## with pip
pip install datanomy

# From source
uv tool install "datanomy @ git+https://github.com/raulcd/datanomy.git"
## cloning the repo 
git clone https://github.com/raulcd/datanomy.git
cd datanomy
uv sync

Usage

# Run without installing using uvx
uvx datanomy data.parquet

# Inspect a Parquet file
datanomy data.parquet

You can also use from source using uvx. This uses the development version:

uvx "git+https://github.com/raulcd/datanomy.git" data.parquet

Keyboard Shortcuts

  • q - Quit the application

Development

# Install dependencies
uv sync

# Run from source
uv run datanomy path/to/file.parquet
# Install dev dependencies
uv sync --extra dev

# Run tests
uv run pytest

# Format code
uv run ruff format .

# Lint
uv run ruff check .

# Lint
uv run mypy .

License

Apache License 2.0

Contributing

Contributions welcome! Please open an issue or PR.


Built with Textual and PyArrow

About

Dissecting data structures

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages