-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
28 create simplest running azure batch kick off script #30
28 create simplest running azure batch kick off script #30
Conversation
Note that this skeleton calls some function which do not yet exist, so pre-commit will fail on ruff check
This tests that calls are made to `client.add_task()`. Still need to add tests that check more specifically what is called, but this is a good start
Still to do:
The hard part is figuring out exactly how to name the model configuration files, and the post production configuration files. I think that part will depend on the geographies that the models are running on, and the dates. Once that part is better defined, I think this will be easy to do. |
Hey @natemcintosh, what's the plan for this? Do you need anything from me? |
Hi George, I've mostly been working on setting up the gold tables for the NSSP pull. The list of tasks above is pretty much what still needs to be done. If you have any input on the best way to name the configuration files (they each need to have a unique name for when we upload to Azure blob storage), that would be helpful. The other major improvement that could be made is to improve the testing. Right now |
I'll think about names and start planning on an e2e example |
NamingFor naming, I think we need to consider the following:
Idea: Name files using both the date and a descriptive handle, e.g., Another thought, since the number of files will be large, perhaps each project should be within a folder named
|
@gvegayon I like the idea of having top level folder with a date and "run" name, e.g. Something we have been doing at the top level of the Rt pipeline is using two storage containers: However, if we think it would be more advantageous to store both input and output under the same run folder (which I think is what you are suggesting above?), it would certainly make it easier to see both the inputs and outputs for a model run in one place. I'm not 100% sure that
What do you think? |
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## main #30 +/- ##
==========================================
- Coverage 92.24% 90.89% -1.35%
==========================================
Files 34 34
Lines 683 802 +119
==========================================
+ Hits 630 729 +99
- Misses 53 73 +20
Flags with carried forward coverage won't be shown. Click here to find out more. ☔ View full report in Codecov by Sentry. |
Closing this due to lack of updates and near-term plans to separate pyrenew codebase from particular models and their fitting workflows. |
Initial take on this, based on discussion here.