Cancel doesn't always work - issue is slurmId #53
Comments
The array job numbering may be off; however, it has been working for me over the last day. Not sure what might be different. Are you using the newer array grunter script found in grunt/grunter_array.py? There was an older version that did things a bit differently.
Yes, the latest version. It only happens sometimes, but I don't have much data because I don't often cancel. I can start watching to see whether the ID is correct and update the ticket at some point.
slurmId is definitely off by one. I have the latest grunt code. Is there anything in the grunter code that might be out of date and that I should change?
Here is some likely helpful information. Here is what squeue shows:
I believe the failure to cancel is due to the wrong slurmId, which always seems to be one less than the real ID (e.g. the real ID is 268743 but grunt has it as 268742). This may only be a problem with array jobs. Has anyone else seen this occasional problem?
The workaround is, of course, to run scancel on the server with the correct ID.
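In case it helps anyone else doing the workaround by hand, here is a rough sketch of looking up the real ID from squeue by job name instead of trusting the stored slurmId. This is not how grunt does it; the function names, the `user` and `job_name` arguments, and the assumption that the job name is unique per user are all hypothetical. It uses the `%F` squeue format field, which reports the base job ID for array jobs.

```python
import subprocess


def parse_squeue_job_ids(squeue_output, job_name):
    """Return the sorted base job IDs whose name matches job_name.

    Expects output from: squeue -h -o "%F %j"
    (%F = base job ID of an array job, %j = job name).
    """
    ids = set()
    for line in squeue_output.splitlines():
        parts = line.split(None, 1)
        # Skip blank or malformed lines; keep only exact name matches.
        if len(parts) == 2 and parts[0].isdigit() and parts[1].strip() == job_name:
            ids.add(int(parts[0]))
    return sorted(ids)


def cancel_by_name(user, job_name):
    """Hypothetical helper: cancel every job of `user` named `job_name`."""
    out = subprocess.run(
        ["squeue", "-h", "-u", user, "-o", "%F %j"],
        capture_output=True, text=True, check=True,
    ).stdout
    job_ids = parse_squeue_job_ids(out, job_name)
    for job_id in job_ids:
        # Cancelling the base ID cancels all tasks of an array job.
        subprocess.run(["scancel", str(job_id)], check=True)
    return job_ids
```

Comparing the ID returned by `parse_squeue_job_ids` with the slurmId that grunt recorded would also be a quick way to confirm whether the off-by-one only shows up for array jobs.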