Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

create Docker containers and remove envmodules #102

Merged
merged 112 commits into from
Sep 12, 2024
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
112 commits
Select commit Hold shift + click to select a range
e91d2e6
chore: auto-add issues to user kanbans
kelly-sovacool Mar 7, 2024
8d73967
feat: create initial charlie-specific docker recipes
kelly-sovacool Mar 28, 2024
bd349c4
chore: Merge branch 'master' into iss-99
kelly-sovacool Mar 28, 2024
80ea1a3
refactor: switch modules to containers
kelly-sovacool Mar 29, 2024
90ac9c9
fix: don't use conda for python 2.7
kelly-sovacool Apr 2, 2024
057dd94
fix: wget to specific filename
kelly-sovacool Apr 3, 2024
096d1ba
fix: install python 3.6 for cufflinks
kelly-sovacool Apr 3, 2024
112cb83
fix: typo
kelly-sovacool Apr 3, 2024
0c0d01c
ci: name run based on docker image path
kelly-sovacool Apr 3, 2024
94ca47c
feat: add bowtie1 container
kelly-sovacool Apr 3, 2024
82bd8b4
refactor: remove tools resource file, set containers in configfile
kelly-sovacool Apr 3, 2024
214653a
ci: debug docker/dcc/Dockerfile
kelly-sovacool Apr 3, 2024
2dc5a7c
chore: Merge branch 'master' into iss-99
kelly-sovacool Apr 3, 2024
71a3c41
fix: untar dir name
kelly-sovacool Apr 3, 2024
fd8f568
fix: install setuptools
kelly-sovacool Apr 3, 2024
077042d
fix: install setuptools
kelly-sovacool Apr 3, 2024
84de62a
fix: get pip for python2
kelly-sovacool Apr 3, 2024
976256b
fix: use apt-get to install pip for py2
kelly-sovacool Apr 3, 2024
c578e55
fix: DCC v0.5.0 actually requires python 3
kelly-sovacool Apr 4, 2024
b9e153d
feat: install nclscan from github
kelly-sovacool Apr 4, 2024
49dd5fd
fix: remove exact python for star docker
kelly-sovacool Apr 4, 2024
ecca3c2
refactor: rename nclscan docker
kelly-sovacool Apr 4, 2024
aeb5ca3
feat: try activating non-base conda env
kelly-sovacool Apr 4, 2024
ff9e977
refactor: remove py27 from DCC docker name
kelly-sovacool Apr 4, 2024
6b49770
fix: roll novoalign back to v3
kelly-sovacool Apr 4, 2024
4c2111d
feat: add conda activate to container's bashrc
kelly-sovacool Apr 4, 2024
72baf3c
fix: add compbiocore conda channel for novoalign v3
kelly-sovacool Apr 4, 2024
133180b
fix: specify exact compbiocore::novoalign version
kelly-sovacool Apr 4, 2024
1c34da4
refactor: update NCLscan config for docker
kelly-sovacool Apr 4, 2024
2f386d9
refactor: use nclscan in path
kelly-sovacool Apr 4, 2024
4539a9b
feat: use singularity with snakemake
kelly-sovacool Apr 10, 2024
14a59cd
docs: add KLS to authors
kelly-sovacool Apr 10, 2024
3989c3c
fix: remove "module load" statements
kelly-sovacool Apr 11, 2024
efcfec2
fix: add containers for local rules
kelly-sovacool Apr 11, 2024
dbf0dc5
fix: remove tools var
kelly-sovacool Apr 12, 2024
9fcb3ee
fix: use container for cutadapt
kelly-sovacool Apr 12, 2024
5664d0a
fix: use fastqc & multiqc containers
kelly-sovacool Apr 12, 2024
dbd895b
chore: delete commented-out code
kelly-sovacool Apr 12, 2024
f44ea80
fix: url for multiqc docker
kelly-sovacool Apr 15, 2024
f32dd4d
fix: docker URLs for cutadapt & fastqc
kelly-sovacool Apr 15, 2024
b4fca9e
fix: do not use conda inside rule code
kelly-sovacool Apr 15, 2024
96013b0
style: indent for readability
kelly-sovacool Apr 15, 2024
4d94781
feat: install fastq-filter in cutadapt container
kelly-sovacool Apr 18, 2024
587cff2
ci: don't run docker auto on PRs
kelly-sovacool Apr 18, 2024
ca297b3
fix: need pandas & argparse for create_index
kelly-sovacool Apr 18, 2024
a37351a
ci: move nclscan to star docker
kelly-sovacool Apr 26, 2024
d34bfeb
refactor: switch nclscan container to star_ucsc_cufflinks everywhere
kelly-sovacool Apr 26, 2024
d623c9f
fix: keep path intact
kelly-sovacool Apr 26, 2024
43cd16a
fix: external tools are in PATH, don't use absolute paths
kelly-sovacool Apr 29, 2024
b46f345
feat: only load modules on biowulf & frce
kelly-sovacool Apr 29, 2024
fd80a94
feat: create docker for ciri2
kelly-sovacool Apr 29, 2024
9890132
refactor: deprecate nclscan_dir
kelly-sovacool Apr 29, 2024
c8c9a1e
feat: create circRNA_finder docker
kelly-sovacool Apr 29, 2024
effb089
fix: docker container name must be lowercase
kelly-sovacool Apr 29, 2024
1b2fb9a
feat: add find_circ to star container
kelly-sovacool Apr 29, 2024
496c74a
fix: install process for circRNA_finder
kelly-sovacool Apr 29, 2024
f4abd10
fix: find_circ version string
kelly-sovacool Apr 29, 2024
d0ead61
fix: find_circ doesn't have setup.py script
kelly-sovacool Apr 29, 2024
1f3570e
feat: install HTSeq
kelly-sovacool Apr 30, 2024
b3ebb45
fix: ucsc is now part of star_ucsc_cufflinks container
kelly-sovacool Apr 30, 2024
45d981a
fix: picard has a different CLI when installed via conda
kelly-sovacool Apr 30, 2024
fbdb664
fix: just specify major version for snakemake
kelly-sovacool Apr 30, 2024
09f8bae
ci: attempt to install DCC 0.5.0 & python 3.5 via docker
kelly-sovacool May 2, 2024
afb84d4
ci: don't specify python version for dcc conda env
kelly-sovacool May 3, 2024
da7d573
fix: install DCC from dev version
kelly-sovacool May 3, 2024
3670d65
fix: syntax for picard installed via conda
kelly-sovacool May 6, 2024
2863f7a
fix: sacct not available in container
kelly-sovacool May 6, 2024
d7949d0
fix: use output directory in STAR prefix
kelly-sovacool May 6, 2024
29cc85c
fix: remove hard-coded file path
kelly-sovacool May 6, 2024
1d06750
fix: remove unused imports
kelly-sovacool May 7, 2024
65be7ba
fix: switch to container with ucsc bedSort
kelly-sovacool May 7, 2024
203879c
fix: bash variable access -- must escape curly braces in snakemake
kelly-sovacool May 9, 2024
ded2eb7
fix: typo in snakemake variable inside bash script
kelly-sovacool May 9, 2024
3605cbb
feat: create new container for clear using conda
kelly-sovacool May 10, 2024
3ef58b7
chore: Merge branch 'iss-99' of https://github.com/CCBR/charlie into …
kelly-sovacool May 10, 2024
4f593d1
fix: remove stray add statement
kelly-sovacool May 10, 2024
8c47ee3
fix: loosen python version restriction
kelly-sovacool May 10, 2024
7db26c3
fix: try using python 2.7 with no other versions specified
kelly-sovacool May 10, 2024
d23568e
feat: pip install clear after conda
kelly-sovacool May 10, 2024
bf35156
feat: use newer docker for CLEAR
kelly-sovacool May 13, 2024
3e14732
chore: Merge branch 'iss-99' of https://github.com/CCBR/charlie into …
kelly-sovacool May 13, 2024
e89af63
docs: draft docs for using charlie on different platforms
kelly-sovacool May 15, 2024
27bf691
feat: create smk_base() python function
kelly-sovacool May 16, 2024
d766df0
ci: setup testing infra
kelly-sovacool May 16, 2024
600aaa3
feat: run spooker on workflow completion
kelly-sovacool May 16, 2024
c745698
feat: add version function
kelly-sovacool May 16, 2024
7d51e06
chore: don't track pyc files
kelly-sovacool May 16, 2024
1098231
feat: create different config template files for different platforms
kelly-sovacool May 16, 2024
86a207c
fix: make sure spooker is in the path on frce
kelly-sovacool May 16, 2024
7780ab0
fix: frce/fnlcr config template subdir name
kelly-sovacool May 16, 2024
9933fb4
fix: ccr partition doesn't exist on FRCE
kelly-sovacool May 16, 2024
46a32e9
fix: PLATFORM global var is all caps
kelly-sovacool May 16, 2024
80f1718
feat: raise specific errors rather than generic sys.exit
kelly-sovacool May 16, 2024
14995e2
fix: charlie dir is all-caps
kelly-sovacool May 16, 2024
0cf3ae1
fix: activate conda env on frce
kelly-sovacool May 17, 2024
4651323
fix: remove python load statements
kelly-sovacool May 17, 2024
785f38b
fix: load different modules and conda envs depending on platform
kelly-sovacool May 17, 2024
7e3b51b
fix: remove newlines from bash string command
kelly-sovacool May 17, 2024
e1d4e7a
fix: set python & snakemake versions before module statements
kelly-sovacool May 17, 2024
7403fc3
fix: use snakemake installation in shared conda env on FRCE
kelly-sovacool May 20, 2024
c3ea715
docs: create CHANGELOG from old release notes
kopardev Jun 3, 2024
5cef3a7
docs: add changelog entry for dockerization
kelly-sovacool Jun 3, 2024
9e1608c
feat: set tempdir depending on HPC
kelly-sovacool Jun 3, 2024
035d453
fix: indent typo
kelly-sovacool Jun 3, 2024
df1ac0f
fix: indent typo
kelly-sovacool Jun 3, 2024
f3bdf1a
chore: Merge branch 'iss-99' of https://github.com/CCBR/charlie into …
kelly-sovacool Jun 3, 2024
9e2a442
fix: simplify singularity binds function
kelly-sovacool Jun 4, 2024
f51e7a9
fix: put back code for getting bind paths from config
kelly-sovacool Jun 5, 2024
b5cea0a
fix: set container for merge_per_sample rule
kelly-sovacool Jun 10, 2024
0b417c9
chore: unneeded remove comment
kelly-sovacool Jun 10, 2024
8e2c529
feat: add --singcache and set default singularity cache dir
kelly-sovacool Sep 10, 2024
65b73ad
Merge branch 'main' into iss-99
kelly-sovacool Sep 12, 2024
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
39 changes: 39 additions & 0 deletions .github/workflows/build.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,39 @@
name: build

on:
push:
branches:
- master
- main
- develop
pull_request:

jobs:
build:
runs-on: ubuntu-latest
strategy:
matrix:
python-version: ["3.11"]
snakemake-version: ["7.32.3"]
steps:
- uses: actions/checkout@v4
- uses: mamba-org/setup-micromamba@v1
with:
environment-name: build
cache-environment: true
condarc: |
channels:
- conda-forge
- bioconda
create-args: >-
python=${{ matrix.python-version }}
snakemake=${{ matrix.snakemake-version }}
setuptools
pip
pytest
- name: Test
run: |
python -m pytest
env:
TMPDIR: ${{ runner.temp }}
shell: micromamba-shell {0}
5 changes: 0 additions & 5 deletions .github/workflows/docker-auto.yml
Original file line number Diff line number Diff line change
Expand Up @@ -6,11 +6,6 @@ on:
- main
paths:
- "docker/**"
pull_request:
branches:
- main
paths:
- "docker/**"

jobs:
generate-matrix:
Expand Down
14 changes: 14 additions & 0 deletions .github/workflows/projects.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,14 @@
name: Add issues/PRs to user projects

on:
issues:
types:
- assigned
pull_request:
types:
- assigned

jobs:
add-to-project:
uses: CCBR/.github/.github/workflows/auto-add-user-project.yml@v0.1.0
secrets: inherit
2 changes: 2 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -22,3 +22,5 @@ resources/mouse_mm9_circRNAs_putative_spliced_sequence.fa.gz
workflow/scripts/copy_strand_info.py
.tests/lint_workdir
**/tmp*
**__pycache__/**
*.pyc
75 changes: 75 additions & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,75 @@
# CHARLIE development version

- Major updates to convert CHARLIE from a biowulf-specific to a platform-agnostic pipeline (#102, @kelly-sovacool):
- All rules now use containers instead of envmodules.
- Default config and cluster config files are provided for use on biowulf and FRCE.
- New entry `TEMPDIR` in the config file sets the temporary directory location for rules that require transient storage.
- New `--singcache` argument to provide a singularity cache dir location. The singularity cache dir is automatically set inside `/data/$USER/` or `$WORKDIR/` if `--singcache` is not provided.

# CHARLIE 0.10.1

- strand are reported together, strand from all callers are reported,
- both + and - flanking sites are reported,
- rev-comp function updated,
- updated versions of tools to match available tools on BIOWULF.

# CHARLIE 0.9.0

Significant upgrades since the last release:

- updates to wrapper script, many new arguments/options added
- new per-sample counts table format
- new all-sample master counts matrix with min-nreads filtering and ntools column to show number of tools supporting the circRNA call
- new version of Snakemake
- cluster_status script added for forced completion of pipeline upon TIMEOUTs
- updated flowchart from lucid charts
- added circRNAfinder, find_circ, circExplorer2_bwa and other tools
- optimized execution and resource requirements
- updated viral annotations (Thanks Sara!)
- new method to extract linear counts, create linear BAMs using circExplorer2 outputs
- new job reporting using jobby and its derivatives
- separated creation of BWA and BOWTIE2 index from creation of STAR index to speed things up
- parallelized find_circ
- better cleanup (eg. deleting \_STARgenome folders, etc.) for much smaller digital footprint
- multitude of comments throughout the snakefiles including listing of output file column descriptions
- preliminary GH actions added

# CHARLIE 0.7.0

- 5 circRNA callers
- all-sample counts matrix with annotations

# CHARLIE 0.6.9

- Optimized pysam scripts
- fixed premature completion of singularity rules

# CHARLIE 0.6.5

- updated config.yaml to use the latest HSV-1 annotations received from Sarah (050421)

# CHARLIE 0.6.4

- create linear reads BAM file
- create linear reads BigWigs for each region in the .regions file.

# CHARLIE 0.6.3

- QOS not working for Taka... removed from cluster.json
- recall rule requires python/3.7 ... env module updated

# CHARLIE 0.6.2

- BSJ files are in BSJ subfolder... bug fix for v0.6.1

# CHARLIE 0.6.1

- customBSJs recalled from STAR alignments
- only for PE
- removes erroneously called CircExplorer BSJs
- create sense and anti-sense BSJ BAMs and BW for each reference (host+viruses)
- find reads which contribute to CIRI BSJs but not on the STAR list of BSJ reads, see if they contribute to novel (not called by STAR) BSJs and append novel BSJs to customBSJ list

# CHARLIE 0.6.0

cutadapt_min_length to cutadapt rule... setting it to 15 in config (for miRNAs, Biot and short viral features)
5 changes: 5 additions & 0 deletions CITATION.cff
Original file line number Diff line number Diff line change
Expand Up @@ -6,6 +6,11 @@ authors:
orcid: https://orcid.org/0000-0001-8978-8495
affiliation: Advanced Biomedical Computational Science, Frederick National Laboratory
for Cancer Research, Frederick, MD 21702, USA
- family-names: Sovacool
given-names: Kelly
orcid: https://orcid.org/0000-0003-3283-829X
affiliation: Advanced Biomedical Computational Science, Frederick National Laboratory
for Cancer Research, Frederick, MD 21702, USA
title: 'CHARLIE: Circrnas in Host And viRuses anaLysis pIpEline'
url: https://ccbr.github.io/CHARLIE/
repository-code: https://github.com/CCBR/CHARLIE
Expand Down
108 changes: 55 additions & 53 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,49 +1,50 @@
# CHARLIE
![img](https://img.shields.io/github/issues/CCBR/CHARLIE?style=for-the-badge)![img](https://img.shields.io/github/forks/CCBR/CHARLIE?style=for-the-badge)![img](https://img.shields.io/github/stars/CCBR/CHARLIE?style=for-the-badge)![img](https://img.shields.io/github/license/CCBR/CHARLIE?style=for-the-badge)

![img](https://img.shields.io/github/issues/CCBR/CHARLIE?style=for-the-badge)![img](https://img.shields.io/github/forks/CCBR/CHARLIE?style=for-the-badge)![img](https://img.shields.io/github/stars/CCBR/CHARLIE?style=for-the-badge)![img](https://img.shields.io/github/license/CCBR/CHARLIE?style=for-the-badge)

### Table of Contents

- [CHARLIE - **C**ircrnas in **H**ost **A**nd vi**R**uses ana**L**ysis p**I**p**E**line](#charlie)
- [Table of Contents](#table-of-contents)
- [1. Introduction](#1-introduction)
- [2. Flowchart](#2-flowchart)
- [3. Software Dependencies](#3-software-dependencies)
- [4. Usage](#4-usage)
- [5. License](#5-license)
- [6. Testing](#6-testing)
- [6.1 Test data](#61-test-data)
- [6.2 Expected output](#62-expected-output)
- [Table of Contents](#table-of-contents)
- [1. Introduction](#1-introduction)
- [2. Flowchart](#2-flowchart)
- [3. Software Dependencies](#3-software-dependencies)
- [4. Usage](#4-usage)
- [5. License](#5-license)
- [6. Testing](#6-testing)
- [6.1 Test data](#61-test-data)
- [6.2 Expected output](#62-expected-output)

### 1. Introduction

**C**ircrnas in **H**ost **A**nd vi**R**uses ana**L**ysis p**I**p**E**line
**C**ircrnas in **H**ost **A**nd vi**R**uses ana**L**ysis p**I**p**E**line

Things to know about CHARLIE:

- Snakemake workflow to detect, annotate and quantify (DAQ) host and viral circular RNAs.
- Primirarily developed to run on [BIOWULF](https://hpc.nih.gov/)
- Reach out to [Vishal Koparde](mailto:vishal.koparde@nihgov) for questions/comments/requests.


This circularRNA detection pipeline uses CIRCExplorer2, CIRI2 and many other tools in parallel to detect, quantify and annotate circRNAs. Here is a list of tools that can be run using CHARLIE:

| circRNA Detection Tool | Aligner(s) | Run by default |
| ---------------------- | ---------- | -------------- |
| [CIRCExplorer2](https://github.com/YangLab/CIRCexplorer2) | STAR<sup>1</sup> | Yes |
| [CIRI2](https://sourceforge.net/projects/ciri/files/CIRI2/) | BWA<sup>1</sup> | Yes |
| [CIRCExplorer2](https://github.com/YangLab/CIRCexplorer2) | BWA<sup>1</sup> | Yes |
| [CLEAR](https://github.com/YangLab/CLEAR) | STAR<sup>1</sup> | Yes |
| [DCC](https://github.com/dieterich-lab/DCC) | STAR<sup>2</sup> | Yes |
| [circRNAFinder](https://github.com/bioxfu/circRNAFinder) | STAR<sup>3</sup> | Yes |
| [find_circ](https://github.com/marvin-jens/find_circ) | Bowtie2 | Yes |
| [MapSplice](https://github.com/merckey/MapSplice2) | BWA<sup>2</sup> | No |
| [NCLScan](https://github.com/TreesLab/NCLscan) | NovoAlign | No |
| circRNA Detection Tool | Aligner(s) | Run by default |
| ----------------------------------------------------------- | ---------------- | -------------- |
| [CIRCExplorer2](https://github.com/YangLab/CIRCexplorer2) | STAR<sup>1</sup> | Yes |
| [CIRI2](https://sourceforge.net/projects/ciri/files/CIRI2/) | BWA<sup>1</sup> | Yes |
| [CIRCExplorer2](https://github.com/YangLab/CIRCexplorer2) | BWA<sup>1</sup> | Yes |
| [CLEAR](https://github.com/YangLab/CLEAR) | STAR<sup>1</sup> | Yes |
| [DCC](https://github.com/dieterich-lab/DCC) | STAR<sup>2</sup> | Yes |
| [circRNAFinder](https://github.com/bioxfu/circRNAFinder) | STAR<sup>3</sup> | Yes |
| [find_circ](https://github.com/marvin-jens/find_circ) | Bowtie2 | Yes |
| [MapSplice](https://github.com/merckey/MapSplice2) | BWA<sup>2</sup> | No |
| [NCLScan](https://github.com/TreesLab/NCLscan) | NovoAlign | No |

> Note: STAR<sup>1</sup>, STAR<sup>2</sup>, STAR<sup>3</sup> denote 3 different sets of alignment parameters, etc.

> Note: BWA<sup>1</sup>, BWA<sup>2</sup> denote 2 different alignment parameters, etc.

### 2. Flowchart

![](docs/images/CHARLIE_v0.8.x.png)

For complete documentation with tutorial go [here](https://CCBR.github.io/CHARLIE/).
Expand All @@ -54,33 +55,32 @@ For complete documentation with tutorial go [here](https://CCBR.github.io/CHARLI

The following version of various bioinformatics tools are using within CHARLIE:

| tool | version |
| ------------- | --------- |
| blat | 3.5 |
| bedtools | 2.30.0 |
| bowtie | 2-2.5.1 |
| bowtie | 1.3.1 |
| bwa | 0.7.17 |
| circexplorer2 | 2.3.8 |
| cufflinks | 2.2.1 |
| cutadapt | 4.4 |
| fastqc | 0.11.9 |
| hisat | 2.2.2.1 |
| java | 18.0.1.1 |
| multiqc | 1.9 |
| parallel | 20231122 |
| perl | 5.34 |
| picard | 2.27.3 |
| python | 2.7 |
| python | 3.8 |
| sambamba | 0.8.2 |
| samtools | 1.16.1 |
| STAR | 2.7.6a |
| stringtie | 2.2.1 |
| ucsc | 450 |
| R | 4.0.5 |
| novocraft | 4.03.05 |

| tool | version |
| ------------- | -------- |
| blat | 3.5 |
| bedtools | 2.30.0 |
| bowtie | 2-2.5.1 |
| bowtie | 1.3.1 |
| bwa | 0.7.17 |
| circexplorer2 | 2.3.8 |
| cufflinks | 2.2.1 |
| cutadapt | 4.4 |
| fastqc | 0.11.9 |
| hisat | 2.2.2.1 |
| java | 18.0.1.1 |
| multiqc | 1.9 |
| parallel | 20231122 |
| perl | 5.34 |
| picard | 2.27.3 |
| python | 2.7 |
| python | 3.8 |
| sambamba | 0.8.2 |
| samtools | 1.16.1 |
| STAR | 2.7.6a |
| stringtie | 2.2.1 |
| ucsc | 450 |
| R | 4.0.5 |
| novocraft | 4.03.05 |

### 4. Usage

Expand Down Expand Up @@ -155,6 +155,7 @@ Required Arguments:

Optional Arguments:

--singcache|-c : singularity cache directory. Default is `/data/${USER}/.singularity` if available, or falls back to `${WORKDIR}/.singularity`. Use this flag to specify a different singularity cache directory.
--host|-g : supply host at command line. hg38 or mm39. (--runmode=init only)
--additives|-a : supply comma-separated list of additives at command line. ERCC or BAC16Insert or both (--runmode=init only)
--viruses|-v : supply comma-separated list of viruses at command line (--runmode=init only)
Expand Down Expand Up @@ -219,8 +220,8 @@ This will create the folder provided by `-w=`. The user should have write permis

Test data (1 paired-end subsample and 1 single-end subsample) have been including under the `.tests/dummy_fastqs` folder. After running in `-m=init`, `samples.tsv` should be edited to point the copies of the above mentioned samples with the column headers:

- sampleName
- path_to_R1_fastq
- sampleName
- path_to_R1_fastq
- path_to_R2_fastq

Column `path_to_R2_fastq` will be blank in case of single-end samples.
Expand All @@ -234,6 +235,7 @@ bash <path to charlie> -w=<path to output dir> -m=dryrun
This will create the reference fasta and gtf file based on the selections made in the `config.yaml`.

#### Run

If `-m=dryrun` was sucessful, then simply do `-m=run`. The output will look something like this

```
Expand Down Expand Up @@ -307,5 +309,5 @@ Expected output from the sample data is stored under `.tests/expected_output`.
More details about running test data can be found [here](https://ccbr.github.io/CHARLIE/tutorial).

> DISCLAIMER:
>
>
> CHARLIE is built to be run only on [BIOWULF](https://hpc.nih.gov). A newer HPC-agnostic version of CHARLIE is planned for 2024.
1 change: 1 addition & 0 deletions VERSION
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
0.10.1-dev
Loading