Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Handle on the fly decompression of data files supported by pandas #657

Merged
merged 5 commits into from
Jan 5, 2021

Conversation

sbrugman
Copy link
Collaborator

@sbrugman sbrugman commented Jan 5, 2021

#652 and #649 with unit tests

scholarsmate and others added 3 commits January 5, 2021 17:22
Pandas can handle on the fly decompression from the following extensions:
‘.bz2’, ‘.gz’, ‘.zip’, or ‘.xz’ (otherwise no decompression). Now the
profiler can as well.
@sbrugman sbrugman force-pushed the handle_compression branch from 1a66a11 to 3c8079b Compare January 5, 2021 16:22
@codecov
Copy link

codecov bot commented Jan 5, 2021

Codecov Report

Merging #657 (70caf1b) into develop (1e086f4) will increase coverage by 0.04%.
The diff coverage is 100.00%.

Impacted file tree graph

@@             Coverage Diff             @@
##           develop     #657      +/-   ##
===========================================
+ Coverage    89.40%   89.44%   +0.04%     
===========================================
  Files          132      132              
  Lines         3699     3714      +15     
===========================================
+ Hits          3307     3322      +15     
  Misses         392      392              
Flag Coverage Δ
py3.6-macos-latest-pandas 89.25% <100.00%> (+0.04%) ⬆️
py3.6-ubuntu-latest-pandas 89.25% <100.00%> (+0.04%) ⬆️
py3.7-ubuntu-latest-pandas 89.25% <100.00%> (+0.04%) ⬆️
py3.8-ubuntu-latest-pandas 89.44% <100.00%> (+0.04%) ⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files Coverage Δ
src/pandas_profiling/utils/dataframe.py 75.86% <100.00%> (+4.43%) ⬆️
tests/unit/test_utils.py 100.00% <100.00%> (ø)

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 1e086f4...70caf1b. Read the comment docs.

@sbrugman sbrugman merged commit 64ad4cc into develop Jan 5, 2021
@sbrugman sbrugman deleted the handle_compression branch January 5, 2021 17:16
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants