Pipeline content

Global workflow

This workflow takes FASTQ files, genome sequences, and annotations as input, and returns abundance estimates along with optional quality metrics.

If you use this pipeline, please cite all of the tools listed below.

MultiQC

MultiQC, like FastQC, is used solely to report quality metrics. It gathers the individual Flagstat and FastQC metrics into a single report.

Citation:

Ewels, Philip, et al. "MultiQC: summarize analysis results for multiple tools and samples in a single report." Bioinformatics 32.19 (2016): 3047-3048.

Salmon

Salmon is a tool for transcript quantification from RNA-seq data. It uses pseudo-mapping to estimate the abundance of each transcript.
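Abundance estimates of this kind are commonly expressed in TPM (transcripts per million). The following is a minimal sketch of the standard TPM formula, assuming per-transcript estimated read counts and effective lengths are already available. The function name and the toy values are hypothetical, and this is not Salmon's own implementation.

```python
def tpm(num_reads, effective_lengths):
    """Convert estimated read counts to TPM (transcripts per million)."""
    # Reads per base of effective length, for each transcript
    rates = {tx: num_reads[tx] / effective_lengths[tx] for tx in num_reads}
    total = sum(rates.values())
    # Rescale so that the values sum to one million
    return {tx: rate / total * 1e6 for tx, rate in rates.items()}


# Hypothetical toy values, not real Salmon output
reads = {"tx1": 100.0, "tx2": 300.0, "tx3": 0.0}
lengths = {"tx1": 1000.0, "tx2": 1500.0, "tx3": 500.0}
tpm_values = tpm(reads, lengths)
```

Because the values are normalized by length and rescaled to a fixed total, they can be compared between transcripts within one sample.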

Citation:

Patro, Rob, et al. "Salmon provides fast and bias-aware quantification of transcript expression." Nature Methods (2017).

tximport

tximport imports transcript-level quantifications from Salmon and summarizes them into gene-level quantifications for DESeq2.
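The idea of going from transcripts to genes can be illustrated with a short sketch: transcript-level counts are summed per gene using a transcript-to-gene map. This is only an illustration in Python, not tximport's R implementation. The function name, the identifiers, and the choice to skip unmapped transcripts are assumptions made for the example.

```python
from collections import defaultdict


def summarize_to_gene(tx_counts, tx2gene):
    """Sum transcript-level counts into gene-level counts."""
    gene_counts = defaultdict(float)
    for tx, count in tx_counts.items():
        gene = tx2gene.get(tx)
        if gene is None:
            # Assumption of this sketch: transcripts absent from the map are skipped
            continue
        gene_counts[gene] += count
    return dict(gene_counts)


# Hypothetical identifiers and counts
tx_counts = {"tx1": 10.0, "tx2": 5.5, "tx3": 4.0, "tx4": 1.0}
tx2gene = {"tx1": "geneA", "tx2": "geneA", "tx3": "geneB"}
gene_counts = summarize_to_gene(tx_counts, tx2gene)
```

Here `tx1` and `tx2` both contribute to `geneA`, while `tx4` has no gene assignment and is left out.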

Citation:

Love, Michael I., Charlotte Soneson, and Mark D. Robinson. "Importing transcript abundance datasets with tximport." (2017).
Soneson, Charlotte, Michael I. Love, and Mark D. Robinson. "Differential analyses for RNA-seq: transcript-level estimates improve gene-level inferences." F1000Research 4 (2015).

DESeq2

DESeq2 is a widely used bioinformatics tool that performs differential gene expression analysis.

Citation:

Love, Michael I., Wolfgang Huber, and Simon Anders. "Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2." Genome biology 15.12 (2014): 550.
Love, Michael, Simon Anders, and Wolfgang Huber. "Differential analysis of count data–the DESeq2 package." Genome Biol 15.550 (2014): 10-1186.

PCAExplorer

PCAExplorer is an R/Bioconductor package that eases the analysis and exploration of principal component analyses (PCA), their axes, and the gene counts.

Citation:

Marini, Federico, and Harald Binder. "pcaExplorer: an R/Bioconductor package for interacting with RNA-seq principal components." BMC bioinformatics 20.1 (2019): 1-8.

EnhancedVolcano

EnhancedVolcano is a program that eases the construction and annotation of volcano plots.
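A volcano plot places each gene at its log2 fold change on the x axis and at -log10 of its p-value on the y axis, then highlights genes that pass both thresholds. The sketch below shows that coordinate and labelling logic only. The function name, the cutoff values, and the example results are hypothetical and are not taken from EnhancedVolcano.

```python
import math


def volcano_point(log2_fc, p_value, fc_cutoff=1.0, p_cutoff=0.05):
    """Return (x, y, label) for one gene on a volcano plot."""
    # y axis: small p-values become large positive values (p_value must be > 0)
    y = -math.log10(p_value)
    significant = p_value < p_cutoff
    if significant and log2_fc >= fc_cutoff:
        label = "up"
    elif significant and log2_fc <= -fc_cutoff:
        label = "down"
    else:
        label = "ns"  # not significant, or fold change too small
    return log2_fc, y, label


# Hypothetical differential expression results: gene -> (log2 fold change, p-value)
results = {"geneA": (2.5, 1e-4), "geneB": (-1.8, 0.01), "geneC": (0.3, 0.5)}
points = {gene: volcano_point(fc, p) for gene, (fc, p) in results.items()}
```

Genes labelled `up` or `down` are the ones that would typically be coloured and annotated on the plot.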

Citation:

Blighe, K, S Rana, and M Lewis. 2018. "EnhancedVolcano: Publication-ready volcano plots with enhanced colouring and labeling."

Bioinfokit

Bioinfokit is a Python library for producing common plots and performing routine bioinformatics analyses.

Citation:

Renesh Bedre. (2020, July 29). bioinfokit: Bioinformatics data analysis and visualization toolkit. Zenodo. doi

Snakemake

Snakemake is a pipeline/workflow manager written in Python. It handles tool interactions, dependencies, command lines, and cluster reservation, and it forms the skeleton of this pipeline. This pipeline is powered by the Snakemake-Wrappers, the Snakemake Workflows, and the conda project.
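Dependency handling in a workflow manager comes down to matching the input files of one step with the output files of another, then running the steps in an order that respects those links. The sketch below illustrates that idea with Python's standard `graphlib` module (Python 3.9 or later). It is not Snakemake's implementation, and the rule and file names are hypothetical rather than taken from this pipeline.

```python
from graphlib import TopologicalSorter

# Hypothetical rules: name -> (input files, output files)
rules = {
    "salmon_index": (["transcriptome.fasta"], ["index"]),
    "salmon_quant": (["index", "sample.fastq"], ["quant.sf"]),
    "tximport": (["quant.sf", "tx2gene.tsv"], ["txi.rds"]),
    "deseq2": (["txi.rds"], ["results.tsv"]),
}


def execution_order(rules):
    """Order rules so that each one runs after the rules producing its inputs."""
    # Map every produced file to the rule that creates it
    producer = {
        out: name for name, (_, outputs) in rules.items() for out in outputs
    }
    # Inputs without a producer are source files and add no dependency
    graph = {
        name: {producer[f] for f in inputs if f in producer}
        for name, (inputs, _) in rules.items()
    }
    return list(TopologicalSorter(graph).static_order())


order = execution_order(rules)
```

In this toy example each rule depends on the previous one, so the resulting order is a simple chain from indexing to differential analysis.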

Citation: