-
Notifications
You must be signed in to change notification settings - Fork 4.7k
Open
Description
This is a living document! For each item here, we intend to link the RFC as well as discussion Slack channel in the DeepSpeed Slack.
New Accelerator Support
- DeepSpeed support on TPU
Emergent Model Architectures
- SuperOffloading for Mixture-of-Expert (MoE) Training
Reinforcement Learning
- DeepSpeed backend integration as the training engine for verl
New Optimizer Support
- Muon Optimizer Support for ZeRO3
Software quality
- CI
- V1 release
delock and eternalNight
Metadata
Metadata
Labels
No labels