You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I don't understand how I should set the configs in order to fine-tune 32B or 70B model when I want to do full SFT on 1 8xH100 node. I need to use a specific deepspeed z3 config?
Currently, I run CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 llamafactory-cli train config.yaml where config.yaml looks like:
Reminder
System Info
llamafactory
version: 0.9.2.dev0Reproduction
I don't understand how I should set the configs in order to fine-tune 32B or 70B model when I want to do full SFT on 1 8xH100 node. I need to use a specific deepspeed z3 config?
Currently, I run
CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 llamafactory-cli train config.yaml
where config.yaml looks like:and I get OOM.
Others
No response
The text was updated successfully, but these errors were encountered: