-
Notifications
You must be signed in to change notification settings - Fork 104
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Fix old pages from 2018 / 2019 events
- Loading branch information
Showing
35 changed files
with
493 additions
and
451 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,21 +1,23 @@ | ||
title=Migrating legacy workflows to Nextflow | ||
date=2019-05-28 | ||
type=col8 | ||
tags=nextflow,nfcamp,2019,workshop | ||
status=published | ||
~~~~~~ | ||
--- | ||
title: Migrating legacy workflows to Nextflow | ||
date: 2019-05-28 | ||
type: col8 | ||
tags: nextflow,nfcamp,2019,workshop | ||
status: published | ||
layout: "@layouts/Page.astro" | ||
--- | ||
|
||
## Migrating legacy workflows to Nextflow | ||
|
||
### Alexander Peltzer | ||
*Bioinformatics Research Scientist, Quantitative Biology Center Tübingen, Germany* | ||
*Bioinformatics Research Scientist, Quantitative Biology Center Tübingen, Germany* | ||
|
||
Many researchers and institutions face similar issues: Having in-house legacy workflows written in bash or other formats and now facing issues with reproducibility, maintenance of these pipelines and the increasing computational demands poses severe threats to being able to address modern computational questions using such legacy workflows. In this talk, I intend to highlight the efforts we took at the QBIC to maintain compatibility between existing workflows but simultaneously porting all of our existing legacy pipelines to the Nextflow and nf-core frameworks to be able to answer these threats. With a specific application case on one of the most widely used ancient DNA pipelines, I intend to highlight the benefits of these migrated pipeline in comparison to the previously existing workflow in a 1:1 setting. | ||
|
||
### Bio | ||
|
||
Alexander Peltzer is a bioinformatics research scientist at the Quantitative Biology Center in Tübingen and is working in maintaining and developing modern solutions for data management and analysis for various omics technologies there. Before, he obtained a PhD in bioinformatics at the University of Tübingen and the Max Planck Institute for the Science of Human History where he worked on computational methods for ancient DNA reconstruction. | ||
|
||
### Registration | ||
### Registration | ||
|
||
To attend Nextflow Camp 2019 register at [this link](https://www.crg.eu/en/event/coursescrg-nextflow-2019). | ||
To attend Nextflow Camp 2019 register at [this link](https://www.crg.eu/en/event/coursescrg-nextflow-2019). |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,33 +1,35 @@ | ||
title=DolphinNext: A graphical user interface for distributed data processing of high throughput genomics | ||
date=2019-05-28 | ||
type=col8 | ||
tags=nextflow,nfcamp,2019,workshop | ||
status=published | ||
~~~~~~ | ||
--- | ||
title: DolphinNext: A graphical user interface for distributed data processing of high throughput genomics | ||
date: 2019-05-28 | ||
type: col8 | ||
tags: nextflow,nfcamp,2019,workshop | ||
status: published | ||
layout: "@layouts/Page.astro" | ||
--- | ||
|
||
## DolphinNext: A graphical user interface for distributed data processing of high throughput genomics | ||
|
||
### Alper Kucukural | ||
*Assistant Professor, UMass Medical School, USA* | ||
*Assistant Professor, UMass Medical School, USA* | ||
|
||
Emergence of new biomedical technologies, like next-generation sequencing (NGS) which is producing vast amounts of genomic data every day, is driving a big data revolution in biology. The dramatic increase in the volume, as well as the production rate of genomic data, has now made the data analysis new bottleneck for scientific discovery. Naturally, the need for highly-parallel data processing frameworks is greater than ever. It is also important for these frameworks to have certain design characteristics such as flexibility, portability, and reproducibility. Processing of sequencing data usually involves many different programs, each of which performs a specific step in the overall pipeline. Flexibility ensures that the pipelines can support a variety of use cases or data types without the need to modify existing pipelines or create new ones. Portability gives user the freedom to choose computational resources as he/she deems fit. Reproducibility across computing environments, which warrants credibility of the results, is a particularly important feature in the face of the sheer volume of data and complexity of the pipelines. | ||
|
||
There exist several platforms that offer graphical user interfaces for designing and execution of complex pipelines (e.g. Galaxy, GenePattern, GeneProf). Unfortunately, none of these platforms supports parallelism or portability across computing environments. To address these and additional shortcomings discussed in this paper, we have created DolphinNext, an easy-to-use graphical user interface for creating and deploying complex workflows for parallel processing of high throughput genomic data. DolphinNext relies on Nextflow which is a framework enabling scalable and reproducible workflows using software containers. The central idea behind the creation of DolphinNext is to facilitate building and deployment of complex pipelines using a graphically-enabled modular approach. DolphinNext provides: | ||
|
||
1. A drag and drop user interface that abstracts Nextflow pipelines and allows users to create pipelines without familiarity with Nextflow. | ||
2. Reproducible pipelines by providing version tracking, and by creating a stand-alone version of any pipeline instance to run independently or to share in the publications. | ||
3. Seamless portability to distributed computational environments such as high performance clusters or cloud based solutions. | ||
1. A drag and drop user interface that abstracts Nextflow pipelines and allows users to create pipelines without familiarity with Nextflow. | ||
2. Reproducible pipelines by providing version tracking, and by creating a stand-alone version of any pipeline instance to run independently or to share in the publications. | ||
3. Seamless portability to distributed computational environments such as high performance clusters or cloud based solutions. | ||
4. A graphical user interface to monitor pipeline execution that allows restarting of intermediate processes even after parameter changes. | ||
|
||
### Bio | ||
|
||
Dr. Kucukural designs and implements reusable, robust and production grade bioinformatics analysis pipelines and pipeline generation tools for processing next-generation sequencing data. | ||
Dr. Kucukural designs and implements reusable, robust and production grade bioinformatics analysis pipelines and pipeline generation tools for processing next-generation sequencing data. | ||
He mainly works on NGS data analysis; RNA-Seq, RIP-Seq, Chip-Seq, CLIP-Seq and derivatives. He implemented algorithms to reduce noise by calling the peaks caused by experimental and alignment biases especially for RIP and CLIP-Seq data. | ||
|
||
Dr. Kucukural worked on analyzing deep sequencing data to discover key elements of splicing of pre-mRNAs to have better understanding of post-transcriptional regulations of RNAs. Moreover, he has deep knowledge of finding genome wide mRNA targets of RNA binding proteins (RBPs). Analyzing RNA targets of tdp43 RBP with deep sequencing on Rat and human was another focus of his research to understand the mechanisms of neuro-degenerative diseases such as Alzheimer and ALS. | ||
Dr. Kucukural worked on analyzing deep sequencing data to discover key elements of splicing of pre-mRNAs to have better understanding of post-transcriptional regulations of RNAs. Moreover, he has deep knowledge of finding genome wide mRNA targets of RNA binding proteins (RBPs). Analyzing RNA targets of tdp43 RBP with deep sequencing on Rat and human was another focus of his research to understand the mechanisms of neuro-degenerative diseases such as Alzheimer and ALS. | ||
|
||
He also worked for protein structure characterization and prediction. He applied techniques from graph theory on protein structure analysis and implemented the theories from computer sciences to biology to find solutions in drug design and small molecular docking fields. He developed applications using genetic algorithms to discover biomarkers and implemented feature detection methods using various clustering, classification and machine learning algorithms such as hidden markov models and support vector machines. | ||
|
||
### Registration | ||
### Registration | ||
|
||
To attend Nextflow Camp 2019 register at [this link](https://www.crg.eu/en/event/coursescrg-nextflow-2019). | ||
To attend Nextflow Camp 2019 register at [this link](https://www.crg.eu/en/event/coursescrg-nextflow-2019). |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,25 +1,27 @@ | ||
title=Scientific workflows beyond the Ivory Tower | ||
date=2019-05-28 | ||
type=col8 | ||
tags=nextflow,nfcamp,2019,workshop | ||
status=published | ||
~~~~~~ | ||
--- | ||
title: Scientific workflows beyond the Ivory Tower | ||
date: 2019-05-28 | ||
type: col8 | ||
tags: nextflow,nfcamp,2019,workshop | ||
status: published | ||
layout: "@layouts/Page.astro" | ||
--- | ||
|
||
## Scientific workflows beyond the Ivory Tower | ||
|
||
### Evan Floden | ||
*Seqera Labs & Centre for Genomic Regulation (CRG), Spain* | ||
*Seqera Labs & Centre for Genomic Regulation (CRG), Spain* | ||
|
||
Evan will discuss the progress of Seqera Labs over the past 12 months | ||
as well unveil some exciting new developments for the Nextflow project. | ||
|
||
### Bio | ||
|
||
Evan is a workflow developer helping scientists and engineers to find solutions to their problems using Nextflow. | ||
Evan is a workflow developer helping scientists and engineers to find solutions to their problems using Nextflow. | ||
His background is in Biotechnology and Bioinformatics and he worked on the Nextflow project | ||
during his PhD from 2014 before co-founding Seqera Labs. Evan's interests include genomics, scientific workflow | ||
optimization, home automation and bicycle touring. | ||
during his PhD from 2014 before co-founding Seqera Labs. Evan's interests include genomics, scientific workflow | ||
optimization, home automation and bicycle touring. | ||
|
||
### Registration | ||
### Registration | ||
|
||
To attend Nextflow Camp 2019 register at [this link](https://www.crg.eu/en/event/coursescrg-nextflow-2019). |
Oops, something went wrong.