Skip to content

Cancer Data Science's go to place for excellent genomics tools and packages

License

Notifications You must be signed in to change notification settings

broadinstitute/AwesomeGenomics

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 

Repository files navigation

AwesomeGenomics

Cancer Data Science's go to place for excellent genomics tools and packages

If something needs to be changed or added, feel free to create a pull request.

Awesome awesomeness

Here is a list of lists of awesome bio-informatics tools!

Awesome Bio reads

Others

Awesome python

Computational

Worklow Management

  • Terra: a very low barrier to entry, workflow and data management platform for medical and research genomics
    • Dalmatian: to interact with Terra in python
  • Nextflow
  • Google Genomics Pipelines
  • reflow
  • snakemake

Dataset Management

bash Basics

python Basics

  • bokeh: Best Interactive Plot with JS

R Basics

Neat python

  • POT: library for solving optimal transport optimization problems.
  • Itrask: set of differential privacy tools for analyzing data!
  • JKBio: Jeremie Kalfon's python scripts for genomics.
  • CDSpy: some plotting scripts for genomic analysis.
  • Selene: a framework for training sequence-level deep learning networks
  • nbstripout: removes notebook outputs before git pushing
  • voila: turns your jupyter notebook into awesome web apps
  • ngrock: secure introspectable tunnels: transforms http://8.8.8.8:8888 into https://www.ngrock.id.com
  • SublimeJEDI: python+sublime
  • functional-python: learn!
  • python parser
  • plot in your terminal

Neat R

Neat Bash

Other

Genomics

read mapper

  • NextGenMap: NextGenMap is a flexible highly sensitive short read mapping tool that handles much higher mismatch rates than comparable algorithms
  • [bwa]
  • [bowtie]
  • []
  • []

Mutations

  • Mutect1: _an awesome cancer mutation caller, especially for calling point mutations)
  • Lancet: Based on microAssembly with a decision model
  • [R] ACE: Absolute CN estimation from low coverage WGS
  • Absolute: Absolute CN estimation from WES/WGS giving off many predictions to choose from (need prior knowledge)
    • DoAbsolute: an R package to Automate the Absolute algorithm
  • Strelka: small variant caller (germline/somatic)
  • Manta: the SV caller version of Strelka
  • DeepVariant: Fast Variant Calling with DL

Effect Prediction

  • [py] DeepSea: Prediction of Effect of non coding variant on ChIP seq binding/Expression/.. with DL
  • [py] SpliceAI: Predicting Effect of Variant on the splicing (modeling the spliceosome) with DL
  • [py] ExPecto: Predicting effect of NC variant on Expression with DL

Annotators

  • OncoKB: annotates MAF from oncoKB DB
  • Oncotator: annotates MAF from many DBs (not very well documented)

Expression

Next gen Expression

Differential Expression

Others

Single Cell

some competitors: [py] awesome single cell

Expression

  • novosparc: reconstruct 3D disposition from scRNAseq and some known location+expression atlases

Differential Expression

Epigenomics

A lot is available for ChIPseq from crazyhottommy's repo

ChIPseq and related

  • MACS2: go to peak caller
  • EPIC2: calling peaks from ChIP seq
  • RSEG: another peak caller
  • coda: denoising ChIPseq data with CNNs
  • CREAM: identifying clusters of functional regions within the genome from ChIPseq data
  • ngsplot: multi omics viz tool at specific locus
  • epi-corr: correlation tool for pairs of ChIP seq data
  • SUPERmerge: a ChIP-seq read pileup analysis and annotation algorithm for investigating alignment (BAM) files of diffuse histone modification ChIP-seq datasets with broad chromatin domains at a single base pair resolution level
  • nf-core/chipseq: Complete ChIPseq pipeline on nextflow
  • pyBigWig: interacts with bigwig from python
  • deepTools: a set of cmd line tools for epigenomics data
  • EnrichedHeatmap: make the famous enrichment at locus heatmap plots.

diff binding

Predictors

  • DeepBind: predicting binding location from previous binding data with CNN -DeeperBind: a deeper version
  • Basenji: Predicts Binding from Mutations with CNN
  • ABC model: Predicts Enhancer-gene links

ATACseq

HiCseq and related

  • HiCExplorer: process, normalize and visualize HiC data

About

Cancer Data Science's go to place for excellent genomics tools and packages

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published