Skip to content

Custom pipeline for denoising and processing Illumina paired-end data by combining paired and unpaired reads

Notifications You must be signed in to change notification settings

camillethuyentruong/Illumina_paired_end

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

29 Commits
 
 
 
 
 
 

Repository files navigation

Pipeline for denoising and processing Illumina paired-end data by combining paired and unpaired reads

Programs needed

Trimmomatic

QIIME

ITSx

Denoising with Trimmomatic

1_make_Trimmomatic.pl: Perl script to automate the Trimmomatic command for all files in a folder

1.1 Make the script executable

chmod +x my_Make_Trimmomatic.sh ./my_Make_Trimmomatic.sh

1.2 Execute the script

make_Trimmomatic.pl -d /path_to_Illmina_read_folder -p/path_to_Trimmomatic_application

1.3 Move paired reads, singe forward reads R1 (reverse read was descarded) and single reverse reads R2 (forward read was descarded) into separate folders

mv *paired.fastq ./paired mv *R1_unpaired.fastq ./unpairedR1 mv *R2_unpaired.fastq ./unpairedR2

Pair reads from paired folder in QIIME

multiple_join_paired_ends.py -i /path_to_paired_read_folder -o join_paired_ends -p join_paired_ends_parameters.txt

2_make_separate_forSplitlibraries.sh: Perl script to reorganize the data from the output directories of join_paired_ends.py in Qiime, moving joined and unjoined forward reads to separate folders

2.1 Make the script executable

chmod +x make_separate_forSplitlibraries.sh ./make_separate_forSplitlibraries.sh

2.2 Execute the script

make_separate_forSplitlibraries.pl -d /path_to_join_paired_ends_folder

Convert to fasta format in QIIME

joined reads from joined_paired_ends

multiple_split_libraries_fastq.py -i /path_to_folder_joined_fastq -o split_libraries_join --demultiplexing_method sampleid_by_file -p split_libraries_parameters.txt

unjoined forwards reads from joined_paired_ends

multiple_split_libraries_fastq.py -i /path_to_folder_un1_fastq -o split_libraries_un1 --demultiplexing_method sampleid_by_file -p split_libraries_parameters.txt --sampleid_indicator .fastq

single forwards reads R1 from Trimmomatic

multiple_split_libraries_fastq.py -i /path_to_folder_unpairedR1 -o split_libraries_singleR1 --demultiplexing_method sampleid_by_file -p split_libraries_parameters.txt --sampleid_indicator _unpaired

single reverse reads R2 from Trimmomatic

multiple_split_libraries_fastq.py -i /path_to_folder_unpairedR2 -o split_libraries_singleR2 --demultiplexing_method sampleid_by_file -p split_libraries_parameters.txt --sampleid_indicator _unpaired

Eliminate reads of size <150 bp (for each read type) in QIIME

filter_short_reads.py input_file.fasta > output_file.fasta

Eliminate chimeric sequences (for each read type) in QIIME

identify_chimeric_seqs.py -i input_file.fasta -o output_file.fasta -m usearch61 --non_chimeras_retention intersection -r /path_to_UNITE_dynamic_ITS_reference_database.fna filter_fasta.py -f input_file.fasta -s output_file.fasta/chimeras.txt -o output_file.nochimera.fasta -n

Filter non-ITS and non-fungal reads (for each read type) in ITSx

itsx -i /path_to_output_file.nochimera.fasta -o ITSx_output_file -t F --preserve T --save_regions ITS1 --reset T --graphical

Create OTU table by merging all read type in QIIME

pick_open_reference_otus.py -m -m sortmerna_sumaclust -i ITSx_output_file_join,ITSx_output_file_un1,ITSx_output_file_singleR1,ITSx_output_file_singleR2 -o output_folder/ -p pick_otu_parameters.txt -s 0.5 --min_otu_size 1 --suppress_align_and_tree --new_ref_set_id name_of_new_sequence_reference_database -r /path_to_custom/UNITE_sequence_reference_database.fna

Filter rare OTUs

3_my_BIOM_filter_by_sample.sh: Perl script to filter OTUs from an OTU table (in csv format) based upon a percent threshold of reads per sample [default = 0.05%]

3.1 Make the script executable chmod +x my_BIOM_filter_by_sample.sh ./my_BIOM_filter_by_sample.sh

3.2 Execute the script BIOM_filter_by_sample.pl -t 0.0005 -c otu_table.csv

About

Custom pipeline for denoising and processing Illumina paired-end data by combining paired and unpaired reads

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages