feat: finemapping template and DAG for UKB PPP #10

tskir · 2024-07-09T16:10:18Z

The idea is to have a common finemapping template, which specific DAGs can reuse and modify according to their needs.

tskir · 2024-09-18T14:26:47Z

@project-defiant This has been quite substantially rewritten compared to the first draft. Could you do another round of reviews, please?

project-defiant

I think it is a good setup for me to use it during the gwas_catalog etl step (also for other ones that require finemapping).

project-defiant · 2024-09-18T14:32:32Z

src/ot_orchestration/dags/ukb_ppp_finemapping.py

+    **common.shared_dag_kwargs,
+) as dag:
+    (
+        FinemappingBatchOperator.partial(


The partial will not work beyond the threshold, and I have tested it on local airflow DAG, this breaks on around ~5k partial tasks even with the threshold increase.

https://airflow.apache.org/docs/apache-airflow/stable/authoring-and-scheduling/dynamic-task-mapping.html#placing-limits-on-mapped-tasks

Indeed, but just to clarify, in this PR the partial/expand routine iterates not on individual loci (of which there are potentially 100,000s in the worst case), but on chunks of the manifest, of which there are <10 in either case

* feat: template for creating finemapping jobs * feat: example DAG for creating finemapping jobs * fix: quote parameters containing = for Hydra * chore: add GENTROPY_DOCKER_IMAGE to common layer * feat: always use a list of jobs in the DAG * refactor: use manifest as input * feat: implement generate_manifests_for_finemapping * refactor: rewrite the DAG to use new functions * fix: import errors in DAG * fix: multiple fixes following test runs

tskir requested review from d0choa, ireneisdoomed and project-defiant July 9, 2024 16:10

project-defiant approved these changes Jul 11, 2024

View reviewed changes

tskir added 7 commits September 18, 2024 10:12

feat: template for creating finemapping jobs

9e9ca75

feat: example DAG for creating finemapping jobs

0a24cfc

fix: quote parameters containing = for Hydra

c64baa2

chore: add GENTROPY_DOCKER_IMAGE to common layer

6662014

feat: always use a list of jobs in the DAG

260ab6a

refactor: use manifest as input

4943cc9

feat: implement generate_manifests_for_finemapping

300425b

tskir force-pushed the tskir-finemapping branch from 99e0990 to 300425b Compare September 18, 2024 10:28

tskir added 3 commits September 18, 2024 11:39

refactor: rewrite the DAG to use new functions

c877191

fix: import errors in DAG

a7ff30a

fix: multiple fixes following test runs

080cc78

tskir marked this pull request as ready for review September 18, 2024 14:25

tskir requested a review from project-defiant September 18, 2024 14:25

project-defiant approved these changes Sep 18, 2024

View reviewed changes

tskir merged commit a838e93 into dev Sep 18, 2024
2 checks passed

tskir deleted the tskir-finemapping branch September 18, 2024 15:14

This was referenced Sep 19, 2024

Implement parallel finemapping computation opentargets/issues#3302

Closed

feat(airflow): prototype finemapping batch job opentargets/gentropy#581

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: finemapping template and DAG for UKB PPP #10

feat: finemapping template and DAG for UKB PPP #10

tskir commented Jul 9, 2024 •

edited

Loading

tskir commented Sep 18, 2024

project-defiant left a comment

project-defiant Sep 18, 2024

tskir Sep 18, 2024

feat: finemapping template and DAG for UKB PPP #10

feat: finemapping template and DAG for UKB PPP #10

Conversation

tskir commented Jul 9, 2024 • edited Loading

tskir commented Sep 18, 2024

project-defiant left a comment

Choose a reason for hiding this comment

project-defiant Sep 18, 2024

Choose a reason for hiding this comment

tskir Sep 18, 2024

Choose a reason for hiding this comment

tskir commented Jul 9, 2024 •

edited

Loading