Skip to content

Conversation

@AlbertoEAF
Copy link

This util serves to extract CSV data into MockInstance / MockDataset objects:

  • Easy CSV parsing of data into OpenML API objects with arbitrary schema

  • Respects all of OpenML API's parameter conventions

  • Automatic encoding into target schema (even with categoricals)

This util serves to extract CSV data into MockInstance / MockDataset objects:

 + Easy CSV parsing of data into OpenML API objects with arbitrary schema

 + Respects all of OpenML API's parameter conventions

 + Automatic encoding into target schema (even with categoricals)
++categoryIndex;
}

throw new RuntimeException("Invalid input value for categorical field!");
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@AlbertoEAF
Copy link
Author

Hello, I'm using this code in several tests for an internal machine learning provider.

I saw more times the need to take raw data and generate MockDatasets from it for tests and found myself using this code more than once.

I can create test code for all of this in case you find it necessary, and also please tell me if you find any other issues you believe I should address, or overall comments for improvement.

Thank you! :)

@JPDSousa
Copy link
Contributor

This PR is on hold until we find some time to tackle the missing unit tests.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants