DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
machine-learning
compression
deep-learning
gpu
inference
pytorch
zero
data-parallelism
model-parallelism
mixture-of-experts
pipeline-parallelism
billion-parameters
trillion-parameters
-
Updated
Nov 21, 2024 - Python