Skip to content

HakumenNC/datashear

Repository files navigation

DataShear

codecov PyPI - Version PyPI - Python Version

A Python package that allows you to split CSV files

Installation

pip install datashear

Usage

from datashear import Splitter

# Split by number of rows
splitter = Splitter("large_file.csv", output_dir="output")
splitter.by_rows(1000)  # 1000 rows per file

# Split by file size  
splitter.by_size(1024*1024)  # 1MB per file

Development

Setup

  1. Clone the repository:
git clone https://github.com/HakumenNC/datashear.git
cd datashear
  1. Install in development mode:
pip install -e ".[dev]"

Testing

Writing tests means you don’t trust your code.

Running test with pytest

pytest
pytest --cov=src --cov-report=xml

Go further

pytest

The pytest framework makes it easy to write small, readable tests, and can scale to support complex functional testing for applications and libraries.

https://docs.pytest.org/en/stable

Codecov

Codecov doesn’t just measure code coverage—it helps you improve code quality at every step.

https://about.codecov.io

Building

Build the package:

python -m build

Publish

Publishing with twine via github-actions on release

License

MIT License - see LICENSE file for details.

About

A Python package that allows you to split CSV files

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published