GenomeIndexing is a Java tool for extracting indices and seeds from genome sequences using the Minimizer and Genome-on-Diet algorithms.
This project applies the concepts explained in this lecture, which is part
of the Introduction to Bioinformatics
course taught at the Islamic University of Gaza (IUG).
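As a rough illustration of the Minimizer idea named above, the sketch below uses the common textbook definition: among every w consecutive k-mers, keep the lexicographically smallest one. It is not taken from this repository's source. The class name MinimizerSketch, the method minimizerPositions, the lexicographic ordering, and the leftmost tie-break are all assumptions made for this example, and the project's own implementation may order or report k-mers differently.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration; not the project's actual implementation.
final class MinimizerSketch {

    /**
     * Returns the start positions of the (w, k)-minimizers of seq.
     * For every window of w consecutive k-mers, the lexicographically
     * smallest k-mer is selected (leftmost on ties). A position is
     * reported once even when neighbouring windows select it again.
     */
    static List<Integer> minimizerPositions(String seq, int k, int w) {
        List<Integer> positions = new ArrayList<>();
        int kmerCount = seq.length() - k + 1; // number of k-mers in seq
        for (int start = 0; start + w <= kmerCount; start++) {
            int best = start;
            for (int i = start + 1; i < start + w; i++) {
                String candidate = seq.substring(i, i + k);
                String current = seq.substring(best, best + k);
                if (candidate.compareTo(current) < 0) {
                    best = i; // strictly smaller k-mer found
                }
            }
            // Selected positions never decrease, so comparing with the
            // last reported position is enough to avoid duplicates.
            if (positions.isEmpty() || positions.get(positions.size() - 1) != best) {
                positions.add(best);
            }
        }
        return positions;
    }

    public static void main(String[] args) {
        String seq = "ACGTACGT"; // made-up sequence for the example
        int k = 3;
        int w = 2;
        for (int pos : minimizerPositions(seq, k, w)) {
            System.out.println(pos + "\t" + seq.substring(pos, pos + k));
        }
    }
}
```

For ACGTACGT with k = 3 and w = 2, this sketch selects the k-mers starting at positions 0, 1, 2, and 4. A sequence shorter than k + w - 1 yields an empty list.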
- The first line of the input contains one integer n, the number of test cases.
- Each test case consists of three lines:
  - The first line of each test case contains one integer t, the type of the algorithm to be used on the following genome sequence. This project covers only two types:
    - Minimizer: t must be 1.
    - Genome-on-Diet: t must be 2.
  - The second line contains the genome sequence.
  - The third line contains the integers k, w, and p (if applicable), in that order.
Example: sample-input.txt
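The three-line layout described above can be read with a few lines of Java. The following is a minimal sketch rather than the project's actual parser: the names InputSketch, TestCase, and parse are hypothetical, and it assumes that the integers on the third line are separated by whitespace and that the file contains no blank lines between test cases. The sample text in main is made up for illustration and is not copied from sample-input.txt.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration; not the project's actual parser.
final class InputSketch {

    /** One test case: algorithm type, genome sequence, and the integers on the third line. */
    record TestCase(int type, String sequence, int[] params) {}

    /** Parses the whole input text, assuming exactly three lines per test case. */
    static List<TestCase> parse(String text) {
        String[] lines = text.strip().split("\\R"); // split on any line break
        int n = Integer.parseInt(lines[0].strip()); // number of test cases
        List<TestCase> cases = new ArrayList<>();
        for (int c = 0; c < n; c++) {
            int base = 1 + 3 * c; // index of this test case's first line
            int type = Integer.parseInt(lines[base].strip());
            if (type != 1 && type != 2) {
                throw new IllegalArgumentException("Unknown algorithm type: " + type);
            }
            String sequence = lines[base + 1].strip();
            // Assumption: k, w, and (if present) p are whitespace-separated.
            int[] params = Arrays.stream(lines[base + 2].strip().split("\\s+"))
                                 .mapToInt(Integer::parseInt)
                                 .toArray();
            cases.add(new TestCase(type, sequence, params));
        }
        return cases;
    }

    public static void main(String[] args) {
        // Made-up input: one Minimizer case (t = 1) and one Genome-on-Diet case (t = 2).
        String sample = "2\n1\nACGTACGT\n3 2\n2\nACGTACGT\n3 2 2\n";
        for (TestCase tc : parse(sample)) {
            System.out.println("type=" + tc.type()
                    + " sequence=" + tc.sequence()
                    + " params=" + Arrays.toString(tc.params()));
        }
    }
}
```

Keeping the third line as a plain integer array lets the same record hold both the two-integer and the three-integer form; how the project itself interprets p is not covered by this sketch.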
To run this project, you need:
- Java Development Kit (JDK) 18 or later installed.
- An IDE (e.g., IntelliJ IDEA, Eclipse, or VS Code) or the command line for running Java applications.
To run this project, follow these steps:
- Clone the repository:
  git clone https://github.com/MohammedOH/GenomeIndexing.git
  cd GenomeIndexing
- Create the input file, following the input file structure described above:
  touch input.txt
  or
  echo "" > input.txt
  For more help, see the sample-input.txt file.
- Compile the project:
  javac -source 18 -target 18 -d out src/*.java
- Run the project:
  java -cp out Main
- Open the output file output.txt.
This project is licensed under the MIT License.
Mohammed O. AlHabbash,
mohammad.o.a1999@gmail.com