Skip to content

sk7-dotcom/Data_Integration_Exercise

Repository files navigation

CHIP-seq and RNA-seq integration tutorial

Presented by:

Meagan Archer, Meghan Pepler, Sreedevi Kesavan, and Stephanie Ali Fairbairn


April 7th 2021

Welcome to the second tutorial for Biology 722!

We are going to be spending this lesson:

  1. Introducing Streptomyces venezuelae and the biological question we want to answer today.
  2. Discussing the ChIP-seq analysis pipeline and how integrating these results into the RNA-seq data can give greater biological clarity to data.
  3. Workshop on ChIP-seq pipeline and integration.
  4. Results and Discussion on biological question.

Streptomyces venezuelae

Streptomyces are soil-dwelling Gram-positive bacteria that have a complex multi-stage lifecycle and secondary metabolism. They specialize in antibiotic production and are the organism for todays analysis.

Biological question for today

Antibiotic production in this bacteria is goverened by ~30 biosynthetic gene clusters. The products of these genes coordinate together to produce a specific antibiotic. Each cluster will encode: enzymes responsible for the catalysis and modification of the core antibiotic backbone, one or more gene product(s) to regulate the activity of the cluster, and resistance factors or efflux pumps to prevent cell suicide when the cluster is active. Lsr2 is a nucleoid associated protein that behaves similarly to Histone proteins in eucaryotic systems. It has the ability to bend and contort DNA to regulate gene expression. Here we investigate whether Lsr2 binding has an repressive effect on the biosynthetic clusters.

Chromatin Immunoprecipitation Sequencing (ChIP-Seq)

By combining chromatin immunoprecipitation (ChIP) assays with sequencing, ChIP sequencing (ChIP-Seq) is a powerful method for identifying genome-wide DNA binding sites for transcription factors and other proteins. Following ChIP protocols, DNA-bound protein is immunoprecipitated using a specific antibody. The bound DNA is then coprecipitated, purified, and sequenced.

Figure Can be found here

The application of next-generation sequencing (NGS) to ChIP has revealed insights into gene regulation events that play a role in various diseases and biological pathways, such as development and cancer progression. ChIP-Seq enables thorough examination of the interactions between proteins and nucleic acids on a genome-wide scale.

How to ChIP

This is a page for going into more detail on the ChIP-seq analysis process. This page details the process our experiment took, but for more detail on file types and even pipelines that are run entirely in the cluster go to HBC training website above.

Tools used in this tutorial (does not incluse packages for RNA-seq pipeline)

Most tools have been run with Bioconductor tools but can just as easily be run with shell scripting. Look into the HBC training page for alternative code to several tools listed below:

  1. Rbowtie2
  2. Rsamtools
  3. FastQC
  4. MACS3 (MACSr is supposed to come out soon for the Bioconductor environment.)
  5. BedTools or bedr

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •  

Languages