diff --git a/README.md b/README.md index 2ee03b6..99380c4 100644 --- a/README.md +++ b/README.md @@ -10,9 +10,8 @@ RNA-Seq Standard Pipeline is designed to perform standard RNA-Seq analysis for I 2. This pipeline not only process RNA-seq data from local directory, but also be able to download data from SRA(download .sra file and convert to fastq.gz file) and TCGA(download .bam file and convert to fastq.gz file). Please follow the following instruction to setup the [meta_data/download_list.txt](meta_data/download_list.txt) file: - * a. For local stored RNA-seq data in "fastq.gz" format, the files name must follow the following naming convention: - + ID_L001_R1.fastq.gz (L001 means lane 1, R1 means read 1 for paired-end sequencing). - In the [meta_data/download_list.txt](meta_data/download_list.txt) file, the first column should be the sample name, the second column shold be the ID, and the third column should be the directory to the fastq.gz file + * a. For local stored RNA-seq data in "fastq.gz" format. In the [meta_data/download_list.txt](meta_data/download_list.txt) file, the first column should be the sample name, the second column shold be the ID, and the third column should be the directory to. the fastq.gz file the files name must follow the following naming convention: + + ID_L001_R1.fastq.gz (L001 means lane 1, R1 means read 1 for paired-end sequencing) * b. For GEO samples from SRA, sample names should be in the first column in [meta_data/download_list.txt](meta_data/download_list.txt), and the corresponding SRX number should be in the second column and the third column you need to write "SRA".