Skip to content

biomed-AI/SCHAP

Repository files navigation

SCHAP: Single Cell High-performance Analysis Platform

Overview

SCHAP is a Single-Cell High-performance Analysis Platform primarily designed for analyzing scRNA-seq and scATAC-seq data. It provides users with the option of one-click or step-by-step data analysis control, as well as user-friendly visualization of the results on the website. The platform is supported by the Starlight cloud platform on Tianhe-2, supporting CPU and GPU heterogeneous computing scheduling. It can allocate different types of computing resources for computing tasks based on CPU-intensive and memory-intensive computing requirements.

Accessing SCHAP

The website link is available at https://bio-web1.nscc-gz.cn/database/SCHAP. The application is free and open to all users with no login requirement. The analyzed results are easily downloaded via URL.

Data Availability

  • The scRNA-seq (GSE145230 and GSE132509) and scATAC-seq (mouse brain 4k) datasets used for this manuscript are hosted by the following service: Google Drive & FigShare

  • The source code is already packaged and hosted on FigShare

Requirements

You'll need to install the following packages in order to run the codes in your local environment.

  • Centos 7.6.1810

  • cellranger 7.1.0
  • cellranger-atac 2.1.0
  • Seurat v4
  • Harmony
  • clusterprofile
  • CellChat
  • velocyto
  • Signac
  • Monocle 3
  • Cireco
  • SNPsea
  • liftover
  • ADClust
  • SingleR
  • GraphCS
  • SANGO

Installation

Algorithm Installation

Environment Installation

  • To run scRNA-generic / preprocess & visualization for ADClust and GraphCS
conda create -n singler --file spec-scrna-list.txt
  • To run the visualization for SANGO
conda create -n monocle --file spec-monocle-list.txt
conda create -n motif --file spec-motif-list.txt
conda create -n snpsea --file spec-snpsea-list.txt
  • To run scATAC-generic
conda create -n atacenv --file spec-scatac-list.txt
  • To run velocyto trajectory analysis
conda create -n R4velocyto --file spec-R4velocyto-list.txt

Usage

Due to the upload size limits for website, you can also utilize our code in your local environment.

  • cellranger count

The code to process the fastq files of scRNA-seq data.

bash entrypoint_cellranger.sh
  • cellranger-atac count

The code to process the fastq files of scATAC-seq data.

bash entrypoint_cellranger_atac.sh
  • scRNA-generic

The workflow mainly includes Seurat, Harmony, SingleR, and CellChat. We used Seurat and Harmony were utilized for data integration and clustering, SingleR for cell annotation, and CellChat for calculating cell communication.

bash entrypoint_scRNA_generic.sh
  • scATAC-generic

The workflow mainly includes Seurat and Signac. Seurat and Signac were utilized for data integration and clustering, with Signac used for annotation and computing QC metrics.

bash entrypoint_scatac.sh
  • ADClust

The workflow mainly includes Seurat, Harmony, ADClust, SingleR, and CellChat. We used Seurat and Harmony were utilized for data integration, ADClust for clustering, and SingleR for cell annotation. Finally, CellChat was employed to calculate cell communication.

Preprocess the raw data to satisfy the input of ADClust

bash entrypoint_ADClust_1.sh

Visualization for the result generated by ADClust

bash entrypoint_ADClust_2.sh
  • GraphCS

The workflow mainly includes Seurat, Harmony, GraphCS, and CellChat. We used Seurat and Harmony were utilized for data integration and clustering, GraphCS for cell annotation, and CellChat for calculating cell communication.

Preprocess the raw data to satisfy the input of GraphCS

bash entrypoint_GraphCS_1.sh

Visualization for the result generated by GraphCS

bash entrypoint_GraphCS_2.sh
  • Velocyto trajectory analysis

The tool mainly includes Seurat and velocyto, we can use it for trajectory analysis.

bash entrypoint_velocyto.sh
  • SANGO

This is a workflow for cell annotation of scATAC-seq data using the SANGO tool, and we also perform downstream analysis of the annotated data, including enrichment analysis, chromatin proximity analysis, and chromatin interaction analysis.

The reference datasets used for the visualization of SANGO are hosted by the following service: Google Drive

Visualization for the result generated by SANGO

bash entrypoint_SANGO.sh

Draw the motifplot for the result generated by SANGO

bash entrypoint_motifplot.sh

Tutorial

For the website tutorial, please refer to: https://bio-web1.nscc-gz.cn/database/SCHAP/documentation#1

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published