forked from NVIDIA/Megatron-LM
-
Notifications
You must be signed in to change notification settings - Fork 348
Issues: deepspeedai/Megatron-DeepSpeed
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Does it support deployment and invocation under the Ascend system & CANN?
#465
opened Feb 21, 2025 by
RyanOvO
Train GPT2 with 2nodes, 2gpu, tp=2 IndexError at megatron/core/tensor_parallel/cross_entropy.py
#457
opened Jan 13, 2025 by
yuanpeng-zhu
[TRACKER] Customer support related PR tracker for Intel devices
#446
opened Sep 20, 2024 by
delock
7 of 12 tasks
A tutorial to help you finetune LLama-2-7b using this repository full of garbarge code with ZeRO2/3 enabled.
#430
opened Jul 25, 2024 by
LLMChild
How to resume training between GPTModel() checkpoint and GPTModelPipe() checkpoint?
#405
opened Jun 27, 2024 by
tiggerwu
Inquiry on Sequence Parallel Support for VocabParallelEmbedding
#389
opened May 18, 2024 by
qinxiangyujiayou
Previous Next
ProTip!
Adding no:label will show everything without a label.