Skip to content

Conversation

@conglongli
Copy link

@conglongli conglongli commented Oct 28, 2021

A replicate of the curriculum learning support I built in https://github.com/bigscience-workshop/Megatron-DeepSpeed. Tested that it works correctly in this repo as well. Also applied some tensorboard logging improvements from https://github.com/bigscience-workshop/Megatron-DeepSpeed.

@conglongli conglongli requested a review from jeffra October 28, 2021 03:49
@jeffra jeffra merged commit db97cd2 into main Oct 28, 2021
@conglongli conglongli deleted the curriculum_learning branch October 30, 2021 04:32
samadejacobs pushed a commit that referenced this pull request Aug 23, 2023
Add support for DeepSpeed's sequence parallelism
saforem2 referenced this pull request in saforem2/Megatron-DeepSpeed Oct 11, 2024
Merge in `tokenizer-tests` branch into `main`
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants