https://arxiv.org/abs/2201.05596

DeepSpeed-MoE: Advancing Mixture-of-Experts Inference and Training to Power Next-Generation AI Scale (Samyam Rajbhandari, Conglong Li, Zhewei Yao, Minjia Zhang, Reza Yazdani Aminabadi, Ammar Ahmad Awan, Jeff Rasley, Yuxiong He)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

220114 DeepSpeed-MoE.md

220114 DeepSpeed-MoE.md

Files

220114 DeepSpeed-MoE.md

Latest commit

History

220114 DeepSpeed-MoE.md

File metadata and controls