Dataset Improvements #278
Signed-off-by: Samuel Hoffman <hoffman.sc@gmail.com>
* minor change to usecols/dropcols usage ([] -> None)
* use fetch_openml `as_frame=True` option
* binary_race only affects protected attribute unless numeric_only
Signed-off-by: Samuel Hoffman <hoffman.sc@gmail.com>
This pull request introduces 2 alerts and fixes 3 when merging 37f3345 into 963df2e - view on LGTM.com
This pull request fixes 3 alerts when merging 78a5c3d into 963df2e - view on LGTM.com
@monindersingh - FYI, this may address some of the questions you had.
This pull request fixes 3 alerts when merging da6d549 into 963df2e - view on LGTM.com
This pull request fixes 3 alerts when merging cf0c6c3 into 963df2e - view on LGTM.com
@monindersingh I have a question about MEPS. In the pre-processing, you included
but in the `features_to_keep` arg of `__init__` they're not there: AIF360/aif360/datasets/meps_dataset_panel19_fy2015.py, lines 80 to 85 in 963df2e
Essentially this means those columns are included when we drop NAs but not in the final dataset. This only affects a few rows, but what's the reasoning for this? Was it intentional?
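To make the effect in question concrete, here is a minimal sketch on toy data. The column names (`age`, `income`, `extra`) are hypothetical and are not the MEPS schema; `extra` stands in for a column that is present during pre-processing but absent from `features_to_keep`. It only illustrates how the order of dropping NAs and selecting columns can change the row count.

```python
import numpy as np
import pandas as pd

# Toy data; column names are hypothetical, not the MEPS schema.
df = pd.DataFrame({
    "age": [30, 41, 25, 58],
    "income": [1.0, 2.0, 3.0, 4.0],
    "extra": [0.5, np.nan, 0.1, 0.2],  # present in pre-processing only
})
features_to_keep = ["age", "income"]

# Order 1: drop NAs across all columns, then select the kept features.
# The row with a missing "extra" value is lost even though "extra" is not kept.
drop_then_select = df.dropna()[features_to_keep]

# Order 2: select the kept features first, then drop NAs.
# The same row survives because its kept features are complete.
select_then_drop = df[features_to_keep].dropna()

print(len(drop_then_select), len(select_then_drop))
```

In this toy case the two orders differ by one row, which matches the "only affects a few rows" observation in spirit.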
This pull request fixes 3 alerts when merging 6a7bbef into 963df2e - view on LGTM.com
This pull request fixes 3 alerts when merging 88d2e1c into 963df2e - view on LGTM.com
This pull request fixes 3 alerts when merging 9d5a8dd into faa75ee - view on LGTM.com
This pull request fixes 3 alerts when merging 2e93e9c into faa75ee - view on LGTM.com
This pull request fixes 3 alerts when merging 28986da into faa75ee - view on LGTM.com
This pull request fixes 3 alerts when merging 230a93b into faa75ee - view on LGTM.com
* allow explicit arrays for prot_attr, target
* add MEPS and violent recidivism datasets
* option to skip cache
* binary_race only affects protected attribute unless numeric_only
* remove unused categories after dropping
* minimum python version >= 3.7; scikit-learn >= 1.0
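The first item, explicit arrays for `prot_attr`, can be illustrated with a rough sketch. `resolve_prot_attr` is a hypothetical helper written for this example and is not the library's actual function. It assumes protected attribute values are either stored in the index of `y` under a name or passed directly as one value per sample, and it handles only a single array, not a list of arrays.

```python
import numpy as np
import pandas as pd


def resolve_prot_attr(y, prot_attr):
    """Hypothetical helper: return per-sample protected attribute values.

    `prot_attr` may be the name of an index level of `y`, or an explicit
    array-like holding one value per sample.
    """
    if isinstance(prot_attr, str):
        # Name-based lookup: the values live in the index of y.
        return np.asarray(y.index.get_level_values(prot_attr))
    values = np.asarray(prot_attr)
    if values.shape[0] != len(y):
        raise ValueError("prot_attr must have one value per sample")
    return values


# Toy labels indexed by a protected attribute called "sex" (assumed name).
index = pd.Index(["a", "b", "a", "b"], name="sex")
y = pd.Series([1, 0, 1, 1], index=index)

by_name = resolve_prot_attr(y, "sex")                   # index name
by_array = resolve_prot_attr(y, ["a", "b", "a", "b"])   # explicit array
```

Both calls yield the same per-sample values here; the explicit form is useful when the data does not carry the protected attribute in its index.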
Major improvements:
* How `prot_attr` arguments are handled. Now, when processing a dataset and running metrics, an explicit array (or list of arrays) containing protected attribute values per sample may be passed instead of requiring an index name.

Small changes:
* `cache=False` option to dataset fetching functions to skip caching
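As a rough illustration of what a `cache=False` option could mean for a fetching function, the sketch below uses a hypothetical `fetch_text` helper with a caller-supplied `loader`. None of these names come from the library, and the exact semantics in the PR may differ. Here, `cache=False` is assumed to bypass the on-disk copy entirely, for both reading and writing.

```python
import os
import tempfile


def fetch_text(name, loader, cache=True, cache_dir=None):
    """Hypothetical fetcher: return loader() output, optionally cached on disk."""
    if not cache:
        # Skip the cache entirely: nothing is read from or written to disk.
        return loader()
    cache_dir = cache_dir or tempfile.gettempdir()
    path = os.path.join(cache_dir, name + ".txt")
    if os.path.exists(path):
        with open(path, encoding="utf-8") as f:
            return f.read()
    data = loader()
    os.makedirs(cache_dir, exist_ok=True)
    with open(path, "w", encoding="utf-8") as f:
        f.write(data)
    return data


calls = []


def loader():
    # Stand-in for a download; records each invocation.
    calls.append(1)
    return "a,b\n1,2\n"


with tempfile.TemporaryDirectory() as tmp:
    fetch_text("toy", loader, cache=True, cache_dir=tmp)   # loads and writes
    fetch_text("toy", loader, cache=True, cache_dir=tmp)   # served from disk
    fetch_text("toy", loader, cache=False, cache_dir=tmp)  # loads again
n_calls = len(calls)
```

The counter shows the loader runs once for the two cached calls and once more for the uncached call.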