-
Notifications
You must be signed in to change notification settings - Fork 1
Input
Jorge edited this page Jan 8, 2022
·
1 revision
- GenBank files (.gb or .gbk format). They need to have the
CDS
feature, containing thetranslation
qualifier, e.g.:
CDS join(9347..9379,9435..10346)
/codon_start=1
/gene="estA2"
/note="aflatoxin formation"
/product="esterase"
/protein_id="AAN76353.1"
/transl_table=1
/translation="METPFAAPWHQFVEDLGQTPCLPGKDLDSILAGWGQLAGTLATRY
GFPPPDESVTTEDVQLDGLWLRCYTPPNATGQEPVGLYFHGGGWVMGGVKEEDGFCRVI
SRQCQMRLVSVEYRKAPQTRYPGALNDGVSAALWVLSRYENQPLVLMGTSAGGNLAFGT
ALRLIDQDMVDKVSGVVALAPITVHPDAVPEYLKEQYTAYEENAELTVNSRAAMQVFFD
CYKAPVDDVYTSCLLHPRLLALPKVYIAELGLDTLRDDARLMKGALDTAKVPVMYDAYP
GYPHCLFMFPFKSLGEHQRPFLGGVAKAVRWMS"
- Amino acid sequences (.fasta format)
- Binary BGC files (.bgc or .bgccase)
- Binary Protein files (.proteincase)
To enter the above, use parameters --inputfolders
or --files
. The former will only read GenBank files (it does so recursively, starting from the given folder).
Learn more about internal file formats here