[Megatron dataset] Support loading megatron dataset (#6489)
* Adapt to Megatron
* fix_print_dataset
* fix BlendableDataset
* fix BlendableDataset
* fix skip_warmup
* fix
* fix
* fix
* fix
* fix
* fix
* cache fix
* make new dataset
* fix loss mask
* fix model_zoo/gpt
* fix model_zoo/gpt
* fix model_zoo/gpt
* fix gpt test
* fix legacy
* fix legacy
* hf_model
* remove legacy
* merge develop gpt
* fix model_zoo/gpt for megatron
* merge develop
* resolve conflict
* fix check_rank_flag for data_cache_path
* fix check_rank_flag for data_cache_path
* remove hcg
* fix model_zoo/gpt eval
Showing 15 changed files with 2,728 additions and 1,060 deletions.