fix: update sequenza #601

DivyaratanPopli · 2025-01-30T13:00:12Z

Modified the environment, wrapper, and Snakefile of the CNV calling step so that a new version of sequenza (@07116cc) can be installed and run.

github-actions · 2025-01-30T13:02:17Z

Please format your Python code with ruff: make fmt
Please check your Python code with ruff: make check
Please format your Snakemake code with snakefmt: make snakefmt

You can trigger all lints locally by running make lint

coveralls · 2025-01-30T14:21:17Z

coverage: 85.808%. remained the same
when pulling de8b4df on 599-sequenza-version-is-outdated
into f49fafb on main.

ericblanc20

Very good. What is left to do:

- Move the package descriptions to the wrapper side of snappy.
- Create a model for sequenza.extract & sequenza.fit.

ericblanc20 · 2025-01-30T15:22:46Z

snappy_pipeline/workflows/somatic_targeted_seq_cnv_calling/Snakefile

As discussed, the definition of the package(s) should be put near the wrappers, not in the Snakefile.

I suggest creating an R_environment.json file in the snappy_wrappers/wrappers/sequenza/install folder, such as:

    [
        {
            "name": "Runuran",
            "repository": "cran"
        },
        {
            "name": "sequenza",
            "repository": "bitbucket",
            "url": "sequenzatools/sequenza@07116cc"
        }
    ]
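A file like this could be checked before any installation is attempted. The following is only a minimal sketch: the function name `validate_r_environment` and the two sets of repository names are assumptions made for illustration (the repository names are taken from the cases handled in the suggested `install_R_package` function), not part of snappy.

```python
import json

# Repository kinds handled by the installation helper suggested in this review.
# This set is an assumption for illustration, not a fixed schema.
KNOWN_REPOSITORIES = {"cran", "bioconductor", "github", "bitbucket", "local"}
# Repository kinds for which a "url" entry is mandatory.
URL_REQUIRED = {"github", "bitbucket", "local"}


def validate_r_environment(text: str) -> list:
    """Parse the JSON text and return the package entries, raising ValueError on bad input."""
    packages = json.loads(text)
    if not isinstance(packages, list):
        raise ValueError("Expected a JSON list of package entries")
    for entry in packages:
        if not isinstance(entry, dict) or not entry.get("name"):
            raise ValueError(f"Package entry without a name: {entry!r}")
        repository = entry.get("repository", "cran")
        if repository not in KNOWN_REPOSITORIES:
            raise ValueError(f"Unknown repository '{repository}' for '{entry['name']}'")
        if repository in URL_REQUIRED and not entry.get("url"):
            raise ValueError(f"Package '{entry['name']}' needs a 'url' entry")
    return packages


EXAMPLE = """
[
    {"name": "Runuran", "repository": "cran"},
    {"name": "sequenza", "repository": "bitbucket", "url": "sequenzatools/sequenza@07116cc"}
]
"""

packages = validate_r_environment(EXAMPLE)
print([p["name"] for p in packages])
```

Failing early on a malformed entry gives a clearer error than a failed R call further down the line.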

ericblanc20 · 2025-01-30T15:54:31Z

snappy_wrappers/utils.py

I would rewrite the function to make use of the R_environment.json information. Because the package name is provided by the user, the code can be simplified considerably, for example:

    import os
    import subprocess


    def install_R_package(
        dest: str, name: str, repository: str = "cran", url: str | None = None
    ) -> subprocess.CompletedProcess:
        assert dest, "Missing R package destination folder"
        # Make sure the library folder exists before R installs into it
        os.makedirs(dest, mode=0o750, exist_ok=True)
        # Build the R installation command for the requested repository type
        match repository:
            case "cran":
                if not url:
                    url = "https://cloud.r-project.org"
                install_cmd = f"install.packages('{name}', lib='{dest}', repos='{url}', update=FALSE, ask=FALSE)"
            case "bioconductor":
                install_cmd = f"BiocManager::install('{name}', lib='{dest}', update=FALSE, ask=FALSE)"
            case "github":
                assert url, f"Can't install R package '{name}' from github, URL is missing"
                install_cmd = f"remotes::install_github('{url}', lib='{dest}', upgrade='never')"
            case "bitbucket":
                assert url, f"Can't install R package '{name}' from bitbucket, URL is missing"
                install_cmd = f"remotes::install_bitbucket('{url}', lib='{dest}', upgrade='never')"
            case "local":
                assert url, f"Can't install local R package '{name}', missing path"
                assert os.path.exists(url), f"Can't find local R package '{name}' at location '{url}'"
                install_cmd = f"install.packages('{url}', repos=NULL, lib='{dest}', update=FALSE, ask=FALSE)"
            case _:
                raise ValueError(f"Unknown repository '{repository}'")
        # Install, then exit with status 1 if the package cannot be found in dest
        R_script = [
            f".libPaths(c(.libPaths(), '{dest}'))",
            install_cmd,
            f"status <- try(find.package('{name}', lib.loc='{dest}', quiet=FALSE, verbose=TRUE))",
            "status <- ifelse(is(status, 'try-error'), 1, 0)",
            "quit(save='no', status=status, runLast=FALSE)",
        ]
        cmd = ["R", "--vanilla", "-e", "; ".join(R_script)]
        return subprocess.run(cmd, text=True, check=True)

(Please check my code, I haven't tested it...)

Also, I would perhaps add another function, for example install_R_packages, which would read the json file & loop over all packages for installation. For example:

    import json


    def install_R_packages(dest: str, filename: str):
        # Read the list of package descriptions from the JSON file
        with open(filename, "rt") as f:
            packages = json.load(f)
        # Install the packages one by one, stopping at the first failure
        for package in packages:
            status = install_R_package(
                dest,
                name=package["name"],
                repository=package["repository"],
                url=package.get("url", None),
            )
            status.check_returncode()

Then, the wrapper snappy_wrappers/wrappers/sequenza/install/wrapper.py might just read:

    # -*- coding: utf-8 -*-
    """Installation of sequenza non-standard packages"""

    import os
    import sys

    # The following is required for being able to import snappy_wrappers modules
    # inside wrappers. These run in an "inner" snakemake process which uses its
    # own conda environment which cannot see the snappy_pipeline installation.
    base_dir = os.path.abspath(os.path.dirname(__file__))
    while os.path.basename(base_dir) != "snappy_wrappers":
        base_dir = os.path.dirname(base_dir)
    sys.path.insert(0, os.path.dirname(base_dir))

    from snappy_wrappers.utils import install_R_packages

    __author__ = "Eric Blanc <eric.blanc@bih-charite.de>"

    dest = os.path.dirname(str(snakemake.output.done))
    install_R_packages(dest, os.path.join(os.path.dirname(__file__), "R_environment.json"))

These are just suggestions. I believe they would make the R packages easier to maintain, but there may be unforeseen problems with them, or you may have a simpler solution. (For example, I don't like JSON files and would prefer YAML, but that would require adding a YAML Python library to environment.yaml, which I think would only make the environment more difficult to maintain.)

ericblanc20 · 2025-01-30T16:02:49Z

snappy_wrappers/wrappers/sequenza/run/wrapper.py

Please check the arguments of the R functions sequenza.extract & sequenza.fit, and build a model from them.

The man pages for the functions can be viewed from R using:

    library(sequenza)
    ?sequenza.extract
    ?sequenza.fit

But I suspect that the man pages have not yet been updated for the new code.
Typing sequenza.extract at the R prompt (without parentheses) lists the function's complete code. You can check the argument list there, and decide which arguments should be left out of the model (verbosity, input file(s) selected by the pipeline, plot size & format, ...).
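Once the argument list is known, the model's values have to be turned into R call arguments, leaving out those the pipeline sets itself. The following is a rough sketch only: `to_r_literal`, `render_r_args` and `PIPELINE_CONTROLLED` are hypothetical helpers, and the argument names used (`window`, `assembly`, `breaks`, `file`, `verbose`) are placeholders that would need to be checked against the code of sequenza@07116cc.

```python
def to_r_literal(value) -> str:
    """Render a Python value as an R literal (only a few scalar types are handled)."""
    if value is None:
        return "NULL"
    # bool must be tested before int, because bool is a subclass of int
    if isinstance(value, bool):
        return "TRUE" if value else "FALSE"
    if isinstance(value, (int, float)):
        return repr(value)
    if isinstance(value, str):
        escaped = value.replace("\\", "\\\\").replace('"', '\\"')
        return f'"{escaped}"'
    raise TypeError(f"Unsupported type for R argument: {type(value).__name__}")


# Placeholder names of arguments that the pipeline would set itself,
# and that should therefore not be exposed in the model (assumption).
PIPELINE_CONTROLLED = {"file", "verbose"}


def render_r_args(params: dict) -> str:
    """Build the 'name=value, ...' part of an R call from user-configurable parameters."""
    parts = [
        f"{name}={to_r_literal(value)}"
        for name, value in params.items()
        if name not in PIPELINE_CONTROLLED
    ]
    return ", ".join(parts)


# Placeholder parameter values, for illustration only
print(render_r_args({"window": 1000000, "verbose": True, "assembly": "hg19", "breaks": None}))
```

Keeping the exclusion list in one place makes it easy to review which arguments remain under pipeline control when the sequenza version changes again.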

tedil · 2025-01-31T10:55:30Z

@DivyaratanPopli we have a pre-commit config which runs formatting (on python and snakemake files) on each commit automatically, see https://github.com/bihealth/snappy-pipeline/blob/main/docs/installation.rst#installing-pre-commit-hooks which can be very convenient ;)

tedil · 2025-02-05T09:44:02Z

Another possibility, though I am not sure how well this works with wrappers:
Snakemake provides so-called post-deployment scripts, which allow you to install additional things into the freshly created conda environment.

Divya Ratan Popli and others added 4 commits January 24, 2025 16:35

fix: added the latest version of sequenza on bitbucket for installation, and removed github

929a357

fix: Modified the regular experession for installing R package from github and bitbucket

aa89cff

fix: Added required R packages for sequenza@07116cc

531d8aa

fix: Commented out inputing of parameters not required by sequenza@07116cc

632c9f1

DivyaratanPopli linked an issue Jan 30, 2025 that may be closed by this pull request

Sequenza version is outdated #599

Open

ericblanc20 changed the title ~~599 sequenza version is outdated~~ fix: sequenza version is outdated Jan 30, 2025

test: fix R installation test (including regex)

de8b4df

ericblanc20 self-requested a review January 30, 2025 15:04

ericblanc20 reviewed Jan 30, 2025

tedil changed the title ~~fix: sequenza version is outdated~~ fix: update sequenza Jan 31, 2025

tedil assigned DivyaratanPopli Feb 5, 2025

Divya Ratan Popli added 2 commits February 11, 2025 14:57

fix: Changed sequenza installation from Snakefile to the wrapper

1706863

test: modified tests for installation of R packages

816eb35
