Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DEPR: SparseDtype #56518

Open
jbrockmendel opened this issue Dec 15, 2023 · 1 comment
Open

DEPR: SparseDtype #56518

jbrockmendel opened this issue Dec 15, 2023 · 1 comment
Labels
Deprecate Functionality to remove in pandas Needs Discussion Requires discussion from core team before further action Sparse Sparse Data Type

Comments

@jbrockmendel
Copy link
Member

jbrockmendel commented Dec 15, 2023

Discussed briefly at the sprint. Sparse is not much used, has a bunch of xfailed tests, and adds about 1MB of .so files.

The suggestion here is to deprecate Sparse internally and help [TBD] to implement a 3rd party EA.

IIRC the main user of Sparse is one of the scikits (or pydata/something?) and their actual use case would be better handled with a 2D implementation. Anyone know the appropriate person to ping to get a view on this?

People who use Sparse, please comment so we know you exist!

@jbrockmendel jbrockmendel added Bug Needs Triage Issue that has not been reviewed by a pandas team member Sparse Sparse Data Type Deprecate Functionality to remove in pandas Needs Discussion Requires discussion from core team before further action and removed Bug Needs Triage Issue that has not been reviewed by a pandas team member labels Dec 15, 2023
@dudelstein
Copy link

I use it! I find the functionality quite essential for working with high-cardinality categorical features, where it can reduce memory usage 100x or so. I do find the documentation a bit short and some behavior suprising, for example in this question https://stackoverflow.com/questions/77931609/why-does-pandas-sum-give-wrong-answers-for-sparse-dataframe/77932016#77932016.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Deprecate Functionality to remove in pandas Needs Discussion Requires discussion from core team before further action Sparse Sparse Data Type
Projects
None yet
Development

No branches or pull requests

2 participants