DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
machine-learning compression deep-learning gpu inference pytorch zero data-parallelism model-parallelism mixture-of-experts pipeline-parallelism billion-parameters trillion-parameters
-
Updated
Dec 20, 2024 - Python