Does scheduler mandatory for control job number? #739
-
I am trying to running dpgen in my local desktop machine. (one 12-core cpu, 2 gpu) As far I figured out, number of task in each job is controlled by "group_size" in machine json file. I try large "group_size" after read "Note that group_size should be set as large as possible to ensure there is only one job and avoid running multiple jobs at the same time." In doc/examples/shell.md , but it didn't help. I found some local shell machine examples in this github, (discussions #666 , machine-local-4GPU.json , machine-local.json ) , but all use old dispatcher style. If controlling number of running processes is impossible without scheduler, I considering to learning how to use slurm. This might be most common, easy option, right? Thanks. |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 1 reply
-
I think that setting |
Beta Was this translation helpful? Give feedback.
-
Added support to infinite |
Beta Was this translation helpful? Give feedback.
I think that setting
group_size
to be a large number can solve your problem. It's probable that there's wrong with yourmachine.json
. Could you post yourmachine.json
?