Pankaj Kumar edited this page Sep 4, 2019 · 14 revisions

Welcome to the Circle_finder wiki!

### Step-by-step guide to running Circle_finder

Step 1: Clone the repository

git clone https://github.com/pk7zuva/Circle_finder.git

Step 2: Change to the "Circle_finder" directory

cd Circle_finder

In this directory you will find four types of files: 1) *.c, 2) *.sh, 3) *.txt, and 4) compiled C executables, which have no extension

Note: Although the compiled C executables are provided, it is advisable to recompile them on your own machine

Step 3: Compile each program by running the following commands in your terminal, one at a time

cc -o ADDRESS2PROFILEPAIREND address2profile.pairend.c

cc -o DIRECT.REPEAT.FINDER1 direct.repeat.finder1.c

cc -o JUNCTIONAL.TAG junctional.tag.c

cc -o LEFT.ALIGNMENT left.alignment.c

cc -o MIDNA_START_END_SCORE midna_start_end_score.c
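The five compile commands above can also be run in a single pass. The following is a minimal sketch, not part of the repository: `compile_all` is a hypothetical helper name, the source and executable names are copied from Step 3, and the compiler defaults to `cc` unless the `CC` environment variable is set.

```shell
#!/usr/bin/env bash
# Sketch only: compile_all is a hypothetical helper, not shipped with Circle_finder.
# Each entry is "source file:executable name", copied from the Step 3 commands.
compile_all() {
    local src_dir="${1:-.}"
    local pair src exe
    for pair in \
        "address2profile.pairend.c:ADDRESS2PROFILEPAIREND" \
        "direct.repeat.finder1.c:DIRECT.REPEAT.FINDER1" \
        "junctional.tag.c:JUNCTIONAL.TAG" \
        "left.alignment.c:LEFT.ALIGNMENT" \
        "midna_start_end_score.c:MIDNA_START_END_SCORE"
    do
        src="${pair%%:*}"   # text before the colon
        exe="${pair##*:}"   # text after the colon
        # Stop at the first failure so a broken build is not overlooked.
        if ! "${CC:-cc}" -o "$src_dir/$exe" "$src_dir/$src"; then
            echo "compilation failed: $src" >&2
            return 1
        fi
    done
}

# Usage (run from inside the Circle_finder directory):
#   compile_all .
```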

Step 4: Download the whole-genome files and bowtie index files from the links given in the file "download-link-hg38-and-bowtie-index.txt"

cat download-link-hg38-and-bowtie-index.txt

http://genome.bioch.virginia.edu/CIRCLE_FINDER_MASTER/hg38.fa

http://genome.bioch.virginia.edu/CIRCLE_FINDER_MASTER/hg38.1.bt2

http://genome.bioch.virginia.edu/CIRCLE_FINDER_MASTER/hg38.2.bt2

http://genome.bioch.virginia.edu/CIRCLE_FINDER_MASTER/hg38.3.bt2

http://genome.bioch.virginia.edu/CIRCLE_FINDER_MASTER/hg38.4.bt2

http://genome.bioch.virginia.edu/CIRCLE_FINDER_MASTER/hg38.fa.amb

http://genome.bioch.virginia.edu/CIRCLE_FINDER_MASTER/hg38.fa.ann

http://genome.bioch.virginia.edu/CIRCLE_FINDER_MASTER/hg38.fa.bwt

http://genome.bioch.virginia.edu/CIRCLE_FINDER_MASTER/hg38.fa.fai

http://genome.bioch.virginia.edu/CIRCLE_FINDER_MASTER/hg38.fa.pac

http://genome.bioch.virginia.edu/CIRCLE_FINDER_MASTER/hg38.fa.sa

http://genome.bioch.virginia.edu/CIRCLE_FINDER_MASTER/hg38.rev.1.bt2

http://genome.bioch.virginia.edu/CIRCLE_FINDER_MASTER/hg38.rev.2.bt2

Example download command: wget http://genome.bioch.virginia.edu/CIRCLE_FINDER_MASTER/hg38.fa
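To fetch every file listed in a link file rather than one URL at a time, a small loop can help. This is a minimal sketch under stated assumptions: `list_urls` and `download_all` are hypothetical helper names, and the link file is assumed to contain URLs separated by spaces or newlines. `wget -c` resumes a partially downloaded file, which is useful for the large genome files.

```shell
#!/usr/bin/env bash
# Sketch only: list_urls and download_all are hypothetical helpers.

# Print one URL per line, whether the link file separates URLs
# with spaces or with newlines.
list_urls() {
    tr -s '[:space:]' '\n' < "$1" | grep -E '^https?://'
}

# Download every URL in the link file into the current directory.
download_all() {
    local url
    while IFS= read -r url; do
        # -c resumes an interrupted download instead of starting over.
        wget -c "$url" || { echo "download failed: $url" >&2; return 1; }
    done < <(list_urls "$1")
}

# Usage:
#   download_all download-link-hg38-and-bowtie-index.txt
```

The same helper should work for any other link file in the repository that lists URLs in this way.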

Step 5: Download the fastq files. The links to these files are given in the file "fastq-file-download-link.txt"

cat fastq-file-download-link.txt

http://genome.bioch.virginia.edu/CIRCLE_FINDER_MASTER/Index11_1.fq

http://genome.bioch.virginia.edu/CIRCLE_FINDER_MASTER/Index11_2.fq

Step 6: You are all set to run the pipeline

bash /path-of-the-"Circle_finder"-directory/microDNA.InOne.sh /path-of-the-"Circle_finder"-directory/hg38 Index11_1.fq Index11_2.fq 24 C4-2 49 10000 /path-of-the-"Circle_finder"-directory &
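Because the pipeline is launched in the background, a missing input may only show up later as an obscure failure. The sketch below checks for the downloaded files before launching. It is an illustration, not part of the pipeline: `missing_inputs` is a hypothetical helper, and the assumption that the run needs `hg38.fa` plus all six `.bt2` index files from Step 4 has not been confirmed against `microDNA.InOne.sh`.

```shell
#!/usr/bin/env bash
# Sketch only: missing_inputs is a hypothetical helper.
# Assumption: the run needs <prefix>.fa and the six bowtie index files
# listed in Step 4, plus the fastq files passed on the command line.

# Print every expected input that is absent or empty.
# $1 = genome prefix (e.g. /some/dir/hg38); remaining args = fastq files.
missing_inputs() {
    local prefix="$1" f
    shift
    for f in "$prefix.fa" \
             "$prefix.1.bt2" "$prefix.2.bt2" "$prefix.3.bt2" "$prefix.4.bt2" \
             "$prefix.rev.1.bt2" "$prefix.rev.2.bt2" \
             "$@"
    do
        [ -s "$f" ] || echo "$f"
    done
}

# Usage: launch the pipeline only if nothing is reported missing.
#   missing=$(missing_inputs ./hg38 Index11_1.fq Index11_2.fq)
#   [ -z "$missing" ] || { echo "missing inputs:"; echo "$missing"; }
```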

Step 7: The final output file is "microDNA.JT.postmotif.fa"

head microDNA.JT.postmotif.fa

chr1 28761 29551 0 1 NOMOTIF

chr1 199385 199915 0 1 GTC

chr1 631932 632604 0 1 NOMOTIF

chr1 632019 632252 1 0 CA

chr1 632112 632242 0 1 T

chr1 889483 890225 4 0 C

chr1 897103 898784 2 0 C

chr1 980217 981339 0 1 G

chr1 982484 982697 1 0 NOMOTIF

chr1 983705 984358 0 2 C

Step 8: Explanation of output

Column 1: Chromosome name

Column 2: Start position of the circle

Column 3: End position of the circle

Column 4: Number of reads mapping to the circle junction from the "+" strand

Column 5: Number of reads mapping to the circle junction from the "-" strand

Column 6: Microhomology (if any) at the junction of the circle; "NOMOTIF" appears where none is reported
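Given the column layout above, the total junction support for a circle is the sum of columns 4 and 5. A minimal sketch that appends this sum as a seventh column follows. `add_junction_total` is a hypothetical helper name, and the tab-separated output is a choice made for this example rather than the pipeline's own format.

```shell
#!/usr/bin/env bash
# Sketch only: add_junction_total is a hypothetical helper.
# Appends column 7 = column 4 + column 5 (total junction reads on both strands).
# Output fields are tab-separated; lines with fewer than six fields are skipped.
add_junction_total() {
    awk 'BEGIN { OFS = "\t" }
         NF >= 6 { print $1, $2, $3, $4, $5, $6, $4 + $5 }' "$1"
}

# Usage:
#   add_junction_total microDNA.JT.postmotif.fa | head
```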

Step 9: To extract only those circular DNAs supported by at least one junction read in each of the "+" and "-" orientations

awk '$4>0 && $5>0' microDNA.JT.postmotif.fa | head

chr1 1069854 1070524 1 2 C

chr1 1069934 1071919 6 2 NOMOTIF

chr1 1070501 1070786 1 2 GAGTC

chr1 1428170 1428595 5 5 NOMOTIF

chr1 1459119 1460224 6 2 NOMOTIF

chr1 1459425 1462380 3 1 GGG

chr1 1495168 1495816 1 3 GG

chr1 1579383 1580962 1 1 GTA

chr1 1667878 1668245 9 6 C

chr1 1772882 1773318 2 3 A
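The same filter can be made stricter by requiring more than one junction read per strand. The sketch below wraps the awk command in a function with an adjustable threshold. `filter_both_strands` is a hypothetical helper name, and with the default threshold of 1 it selects the same lines as the command in Step 9, since the counts are whole numbers.

```shell
#!/usr/bin/env bash
# Sketch only: filter_both_strands is a hypothetical helper.
# Keeps lines where both strand counts (columns 4 and 5) are >= the threshold.
# $1 = output file from the pipeline; $2 = minimum reads per strand (default 1).
filter_both_strands() {
    local min="${2:-1}"
    awk -v min="$min" '$4 >= min && $5 >= min' "$1"
}

# Usage:
#   filter_both_strands microDNA.JT.postmotif.fa      # same as Step 9
#   filter_both_strands microDNA.JT.postmotif.fa 3    # at least 3 reads per strand
```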
