DeepSpeed Universal Checkpointing: Blog and Tutorial #5711

Merged on Jul 1, 2024 (20 commits)

@samadejacobs (Contributor) commented Jul 1, 2024

Train {GPT, LLaMA, Phi}-like models (or any model) at ultra-low cost with DeepSpeed Universal Checkpointing (UCP). UCP abstracts away the complexities of saving and loading model states. See the arXiv paper, blog, and tutorial in this PR for details.

@tohtana merged commit 121efdb into master on Jul 1, 2024
10 of 12 checks passed
mauryaavinash95 pushed a commit to DataStates/DeepSpeed that referenced this pull request Jul 10, 2024
Train {GPT,LLaMA, Phi}-like models (or any model) at ultra low-cost with
DeepSpeed Universal Checkpointing (UCP). UCP abstracts away the
complexities of saving and loading model states. See arxiv paper, blog
and tutorial in this PR for details.

---------

Co-authored-by: Masahiro Tanaka <mtanaka@microsoft.com>
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Logan Adams <loadams@microsoft.com>
Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com>

4 participants