Pull requests: deepspeedai/DeepSpeed

Pull requests list

Fix minor comment/docstring typos in runtime and inference modules
#8046 opened Jun 3, 2026 by nathon-lee Contributor
zero3: defer param release during retain_graph backward #7352
#8045 opened Jun 3, 2026 by nathon-lee Contributor
Remove AutoSP assertion against Transformers version
#8044 opened Jun 2, 2026 by tohtana Collaborator
zero3: invalidate coordinator trace on hook re-registration
#8043 opened Jun 2, 2026 by roycho96 Contributor
Normalize ZeRO-3 DeepCompile grad dtype before reduction
#8038 opened May 30, 2026 by tohtana Collaborator
Fix DeepCompile ZeRO-1 grad target lifetime
#8036 opened May 29, 2026 by tohtana Collaborator
Enable bf16 check_grad_overflow by default (matching fp16)
#8035 opened May 29, 2026 by yongzhe-wang
2 tasks done
Stop obsolete CI jobs on workflow cancellation
#8034 opened May 28, 2026 by tohtana Collaborator
[Draft] Add On-Policy Distillation (OPSD) Trainer in DeepSpeed
#8027 opened May 26, 2026 by PKUWZP Collaborator
3 of 5 tasks
Add Qwen 3.5 preset to AutoTP
#7978 opened Apr 16, 2026 by tohtana Collaborator Draft
Fix/warnings stacklevel mvapich runner
#7949 opened Apr 2, 2026 by nathon-lee Contributor Draft
Refactor/torch autocast encapsulate global state
#7946 opened Apr 2, 2026 by nathon-lee Contributor
Add AutoEP
#7938 opened Mar 31, 2026 by tohtana Collaborator
Add torch_xla TPU support for ZeRO-1/2
#7917 opened Mar 21, 2026 by PKUWZP Collaborator
doc: Remove suggestion to build extensions in parallel
#7899 opened Mar 12, 2026 by Flamefire Contributor