Skip to content

Pull requests: deepspeedai/DeepSpeed

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Default gradient_clipping to 1.0
#8068 opened Jun 15, 2026 by sfc-gh-truwase Collaborator Loading…
2 tasks
Add configurable engine log level
#8067 opened Jun 15, 2026 by sfc-gh-truwase Collaborator Loading…
2 tasks
Mixed-precision: per-policy param/buffer dtype cast (preserve fp32 buffers)
#8066 opened Jun 15, 2026 by sfc-gh-truwase Collaborator Loading…
4 tasks
Add AutoEP + AutoTP parallel folding
#8064 opened Jun 13, 2026 by tohtana Collaborator Loading…
Support AutoEP with ZeRO-3 zero.Init source modules
#8060 opened Jun 11, 2026 by tohtana Collaborator Loading…
[DeepCompile] fix gather params in dynamo skipped frames for ZeRO3
#8059 opened Jun 11, 2026 by XAheli Loading…
7 tasks done
feat(zenflow): run the overlapped CPU optimizer in a native process
#8058 opened Jun 10, 2026 by Antlera Collaborator Loading…
Fix eigenvalue parsing for compression-only quantize configs
#8057 opened Jun 10, 2026 by sowndappan5 Contributor Loading…
Fix incorrect variable name
#8051 opened Jun 7, 2026 by Muneerali199 Loading…
fix: log eigenvalue monitor values
#8049 opened Jun 5, 2026 by he-yufeng Loading…
fix: log block eigenvalue summary events
#8048 opened Jun 4, 2026 by he-yufeng Loading…
Fix minor comment/docstring typos in runtime and inference modules
#8046 opened Jun 3, 2026 by nathon-lee Contributor Loading…
zero3: defer param release during retain_graph backward #7352
#8045 opened Jun 3, 2026 by nathon-lee Contributor Loading…
Enable bf16 check_grad_overflow by default (matching fp16)
#8035 opened May 29, 2026 by yongzhe-wang Loading…
2 tasks done
Stop obsolete CI jobs on workflow cancellation
#8034 opened May 28, 2026 by tohtana Collaborator Loading…
[Draft] Add On-Policy Distillation (OPSD) Trainer in DeepSpeed
#8027 opened May 26, 2026 by PKUWZP Collaborator Loading…
3 of 5 tasks
Add Qwen 3.5 preset to AutoTP
#7978 opened Apr 16, 2026 by tohtana Collaborator Draft
Fix/warnings stacklevel mvapich runner
#7949 opened Apr 2, 2026 by nathon-lee Contributor Draft
Refactor/torch autocast encapsulate global state
#7946 opened Apr 2, 2026 by nathon-lee Contributor Loading…
ProTip! What’s not been updated in a month: updated:<2026-05-16.