-
Notifications
You must be signed in to change notification settings - Fork 4
Problems with all tasks execution #2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Which model are you testing? Is it running locally (e.g. via vLLM) or served via a cloud-based API? |
Qwen-0.5B-Instruct and llama3.1-8B-Instruct, running locally via vLLM, and MiniMaxAI/MiniMax-Text-01, running via OpenRouter (for MiniMaxAI/MiniMax-Text-01 I specified vLLM and changed the tokenizer to the repository from hugging face, so that it loaded without errors) |
Is it possible to fix this bug? |
Have you tried lowering down the timeout? (e.g. using 120 instead of 700 seconds; we opted for the larger number in longer contexts)
It is possible that the hang is just some API request failing and then ends up with retries with long timeouts. |
So your solution is to change timeout from 700 to 120, right? |
Yes, exactly. |
When running the bash run_tests.sh command from the evaluation folder, the test starts on context 250, and then hangs instead of moving to contexts 500, 1000, 2000,...
Here is the Traceback when the keyboard is interrupted:
The text was updated successfully, but these errors were encountered: