Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Demo/PoC only - xgboost/generalized caching support for model_selection #606

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

JohnZed
Copy link

@JohnZed JohnZed commented Jan 31, 2020

This is a PoC meant to spark discussion for Issue #605 .
It allows users to pass a custom extraction function (extract_fn) to gridsearch, which will allow them to cache custom datatypes. It includes an example of using this to support Dask DMatrix objects. It would need a lot of cleanup before merging, but it shows the general idea.

For a wide (1000 columns x 100k rows) training matrix for a classification problem with max_depth=4, this shaves 45% of the runtime off a hyperparameter sweep on a single worker.

Base automatically changed from master to main February 2, 2021 03:43
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant