Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Boolean columns should be able to use any of the categorical transformers #527

Closed
npatki opened this issue Jul 12, 2022 · 0 comments · Fixed by #531
Closed

Boolean columns should be able to use any of the categorical transformers #527

npatki opened this issue Jul 12, 2022 · 0 comments · Fixed by #531
Assignees
Labels
feature request Request for a new feature
Milestone

Comments

@npatki
Copy link
Contributor

npatki commented Jul 12, 2022

Problem Description

Right now, I can only use the BinaryEncoder for boolean columns. Meanwhile, categorical columns have many options such as LabelEncoder (with and without noise), OneHotEncoder and more. These options are nice because they can improve the synthetic data quality.

Expected behavior

We can think of boolean data as a special case of categorical data where number of categories = 2. Any categorical transformer should be able to work on boolean data.

We should allow any of the categorical transformers to be used on boolean data. The HyperTransformer should not throw an error if I try to apply a categorical transformer to the boolean sdtype.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature request Request for a new feature
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants