Skip to content

Conversation

@iPieter
Copy link

@iPieter iPieter commented Jun 27, 2025

Fixes #7650.

The metadata files generated by the DatasetDict.save_to_file function are not included in the folder_based_builder's metadata list, causing issues when only 1 actual data file is present, as described in issue #7650.

This PR adds these filenames to the builder, allowing correct loading.

The metadata files generated by the `DatasetDict.save_to_file` function are not included in the folder_based_builder's metadata list, causing issues when only 1 actual data file is present, as described in issue huggingface#7650.
@iPieter iPieter changed the title Extended metadata file names for folder_based_builder fix: Extended metadata file names for folder_based_builder Jun 30, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

load_dataset defaults to json file format for datasets with 1 shard

1 participant