-
Notifications
You must be signed in to change notification settings - Fork 4.3k
Issues: deepspeedai/DeepSpeed
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[BUG] Enabling hpZ causes an abnormally large loss.
bug
Something isn't working
training
#7164
opened Mar 21, 2025 by
alex-ht
[BUG] circular import on Something isn't working
inference
DeepSpeedTransformerInference
bug
#7159
opened Mar 20, 2025 by
jamesbraza
[BUG] AttributeError: module 'deepspeed' has no attribute 'init_inference'
bug
Something isn't working
deepspeed-chat
Related to DeepSpeed-Chat
#7157
opened Mar 20, 2025 by
Gaop970222
[REQUEST] Support for Expert Optimizer State Partitioning with ZeRO Optimization in DeepSpeed MoE
enhancement
New feature or request
#7156
opened Mar 20, 2025 by
leeruibin
[BUG] Receiving CUDA error: invalid argument using pytorch 2.7 with deepspeed 0.16.4 with Cuda 12.8
bug
Something isn't working
training
#7150
opened Mar 19, 2025 by
rpgmaker
[REQUEST]Does the current version support distributed fine-tuning on mac devices (M2-M4)?
enhancement
New feature or request
#7148
opened Mar 18, 2025 by
hsoftxl
[REQUEST] Support for Nvidia 50 Series GPUs: Pytorch >=2.6 and CUDA 12.8 required
enhancement
New feature or request
#7144
opened Mar 17, 2025 by
elkay
[BUG]DeepSpeed MoE hangs with DDP inference
bug
Something isn't working
inference
#7141
opened Mar 16, 2025 by
JessePrince
[REQUEST]Does DeepSpeed support multi-node inference?
enhancement
New feature or request
#7137
opened Mar 14, 2025 by
zyyyyy5
[BUG]When I use deepspeed ZeRO3 to train the vision-language-action model ,it met error of loading weights
bug
Something isn't working
training
#7136
opened Mar 14, 2025 by
hahans
[REQUEST] Is there any plan to support deepseek v3's MOE structure
enhancement
New feature or request
#7129
opened Mar 11, 2025 by
glowwormX
[BUG] Batch inference DDP + zero stage 3 = inference code hangs
#7128
opened Mar 11, 2025 by
ShengYun-Peng
[BUG] deepspeed gets re-initialized 4x, causing CPU RAM to blow up and OOM using c10d/flyte
bug
Something isn't working
training
#7127
opened Mar 11, 2025 by
j93hahn
AttributeError: partially initialized module 'deepspeed' has no attribute 'init_inference'
bug
Something isn't working
inference
#7121
opened Mar 9, 2025 by
JocelynPanPan
[BUG] OOM when train 70B models using deepspeed 0.16.4
bug
Something isn't working
deepspeed-chat
Related to DeepSpeed-Chat
#7116
opened Mar 8, 2025 by
hijkzzz
Previous Next
ProTip!
Adding no:label will show everything without a label.