Skip to content

Navigation Menu

Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

microsoft / DeepSpeed Public

Notifications You must be signed in to change notification settings
Fork 4.2k
Star 36.2k

Code
Issues 995
Pull requests 110
Discussions
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Pull requests: microsoft/DeepSpeed

Labels 33 Milestones 0

Labels 33 Milestones 0

New pull request New

Clear current search query, filters, and sorts

110 Open 3,015 Closed

110 Open 3,015 Closed

Author

Filter by author

Loading

Label

Filter by label

Loading

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Loading

Milestones

Filter by milestone

Loading

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Loading

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Remove symlinks

#4323 opened Sep 13, 2023 by mrwyattii

Loading…

2 of 3 tasks

3

Allow TiedLayerSpec to have multiple tied weights

#4216 opened Aug 24, 2023 by zphang

Loading…

1

Training ops kernels: Speeding up the Llama-based MoE architectures

#6734 opened Nov 8, 2024 by RezaYazdaniAminabadi • Draft

1

move CPU_Accelerator --> Xeon_Accelerator

#5126 opened Feb 13, 2024 by mrwyattii

Loading…

17

support autoTP with weight only quantization in DS inference path

#4750 opened Nov 29, 2023 by ftian1

Loading…

6

apply reduce_scatter_coalesced op

#5224 opened Mar 4, 2024 by inkcherry

Loading…

3

fix: RuntimeError for UCP large DP

#6918 opened Dec 29, 2024 by saforem2

Loading…

1

Add FALCON-40B Inference-Kernel Support

#3656 opened Jun 1, 2023 by RezaYazdaniAminabadi

Loading…

1 task done

27

SCR checkpoint engine

#2972 opened Mar 8, 2023 by adammoody

Loading…

2

Optimizer state loading fix for bitsandbytes 8-bit optimizers.

#1582 opened Nov 22, 2021 by TimDettmers

Loading…

Modify unit test to cover more cases

#1775 opened Feb 15, 2022 by RezaYazdaniAminabadi

Loading…

Configure fused fp16 mode

#1882 opened Apr 6, 2022 by tjruwase

Loading…

1

Unify benchmark scripts and knowledge

#2374 opened Sep 28, 2022 by mrwyattii • Draft

Create GPU library abstraction layer

#2437 opened Oct 20, 2022 by mrwyattii • Draft

Add 4-bit quantized inference to run BLOOM-176B on 2 A100 GPUs

#2526 opened Nov 18, 2022 by RezaYazdaniAminabadi

Loading…

3

Remove all unused quantize settings and flags.

#2555 opened Nov 28, 2022 by awan-10

Loading…

Migrate W8A16 Inference to Dequantization Utility

#2580 opened Dec 7, 2022 by cmikeh2

Loading…

[DEBUG] Add diagnostics for cpu-torch-latest intermittent hang

#6942 opened Jan 10, 2025 by loadams • Draft

Add correctness check for sharded checkpoint test

#2643 opened Dec 22, 2022 by mrwyattii

Loading…

2

Preserve prior op builder backend api

#2705 opened Jan 14, 2023 by jeffra

Loading…

5

Refactor/Pydantify compression config

#2748 opened Jan 25, 2023 by mrwyattii • Draft

Refactor/Pydantify profiling config

#2749 opened Jan 25, 2023 by mrwyattii • Draft

Refactor/Pydantify autotuning config

#2769 opened Jan 31, 2023 by mrwyattii • Draft

pre/post forward calls to engine + generate method

#2832 opened Feb 14, 2023 by jeffra

Loading…

2 of 3 tasks

Add validation for injection policy in inference config

#2630 opened Dec 20, 2022 by mrwyattii • Draft

Previous 1 2 3 4 5 Next

Previous Next

ProTip! Follow long discussions with comments:>50.

Footer

© 2025 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.