-
Notifications
You must be signed in to change notification settings - Fork 74
Issues: aws-samples/awsome-distributed-training
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Organize SM-modelparallelv2 per orchestrator
enhancement
New feature or request
#436
opened Sep 20, 2024 by
mhuguesaws
Pin NCCL and EFA version in FSDP
enhancement
New feature or request
#435
opened Sep 20, 2024 by
mhuguesaws
Warning for maximum sequence length when running FSDP Llama2 example
stale
#354
opened Jun 10, 2024 by
amanshanbhag
SageMaker Hyperpod "Target not connected"
Troubleshooting Tips
These are informational to make it easier to troubleshoot common issues.
#280
opened Apr 22, 2024 by
sean-smith
Add Ubuntu 22.04 support for ansible roles
enhancement
New feature or request
#82
opened Dec 19, 2023 by
mhuguesaws
ProTip!
Add no:assignee to see everything that’s not assigned.