feat: HuggingFace integration #286
Comments
Ideally, I guess, HuggingFace datasets would be Pytrees directly. I haven't found out how to do this (maybe because I am not an expert on Pytrees). However, one can easily "view" HuggingFace datasets as dictionaries:
Now, an alternative would be to implement some function
When making this change, we could introduce the possibility of using dicts for Dataset.X (and potentially Dataset.y), which blends nicely with the semantic naming of input (output) dimensions, as is done in HuggingFace datasets and pandas DataFrames.
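To make the dictionary idea concrete, here is a minimal sketch, not the prototype mentioned in this thread. It assumes the dataset has already been viewed as a plain mapping from column names to equal-length sequences (for a HuggingFace dataset, slicing such as `ds[:]` should give a dictionary of this shape). The helper name `columns_to_xy` and the toy column names are hypothetical. The resulting row-major lists could then be converted to arrays and passed on as `gpjax.Dataset(X=..., y=...)`; that last step is left out so the sketch runs with the standard library only.

```python
def columns_to_xy(columns, input_names, output_names):
    """Stack named columns into row-major X (n x d_in) and y (n x d_out) lists.

    `columns` is any mapping from column name to an equal-length sequence.
    This is a hypothetical helper, not part of GPJax or HuggingFace datasets.
    """
    names = list(input_names) + list(output_names)
    missing = [name for name in names if name not in columns]
    if missing:
        raise KeyError(f"missing columns: {missing}")
    lengths = {len(columns[name]) for name in names}
    if len(lengths) != 1:
        raise ValueError("all selected columns must have the same length")
    n = lengths.pop()
    # One row per observation, one entry per named input (output) dimension.
    X = [[float(columns[name][i]) for name in input_names] for i in range(n)]
    y = [[float(columns[name][i]) for name in output_names] for i in range(n)]
    return X, y


# Toy stand-in for the column dictionary of a tabular dataset.
columns = {
    "temperature": [20.0, 22.5, 19.0],
    "humidity": [0.3, 0.5, 0.4],
    "yield": [1.2, 1.9, 1.1],
}
X, y = columns_to_xy(columns, ["temperature", "humidity"], ["yield"])
```

Selecting columns by name keeps the semantic labels of the input and output dimensions visible at the call site, which is the main appeal of the dict-based view.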
There has been no recent activity on this issue. To keep our issues log clean, we remove old and inactive issues.
Feature Request
Demonstrate how a GPJax dataset may be used to hold a HuggingFace dataset.
Describe Preferred Solution
A generic method that can coerce a HuggingFace dataset to a GPJax dataset would be ideal. If this is infeasible, a notebook would also be most welcome.
Tagging @ingmarschuster who has developed a prototype of this.