Skip to content
Steve Bond edited this page Nov 3, 2015 · 2 revisions

--consensus, -con

Description

Condense alignments down to a single majority-rule consensus sequence. If two or more residues are tied for the highest frequency at a given column, then an ambiguous 'X' (protein) or 'N' (nucleotide) is used at that position.

Example(s)

Input file: Panx_C-terms.stklm

# STOCKHOLM 1.0
#=GF SQ 3
Mle-Panxα9  ---atgttaga------catactttcaaagtttaaaggagttactccttttaaaggtataacgatag
Mle-Panxα7A atgggggtggaaattctgtttcccataatcaacagagccaccgctccgatcaagtctgttaacatcg
Mle-Panxα4  atggttattga------gctgctagctggatacaaaggtctgtccccgtttaaagacgcgactgttg
//
# STOCKHOLM 1.0
#=GF SQ 3
Mle-Panxα9  -mldilskf--kgvtpfkgitiddgwdqlnrsfmfvllvvmgttvtvr-qytgsviscdgfkkfg--stfaedycwtqg
Mle-Panxα7A mgveilfpiinratapiksvniddlssqlnrtfmfylsltfaititirqqlggayiacdgfsrdeeyerfaeewcwssg
Mle-Panxα4  mviellagy--kglspfkdatvddswdqinrcyvfiamvvmgavttmr-qysgtliacdgftkfh--pqfaedycwsig
//

Usage example

$: alb Panx_C-terms.stklm -con

Output

# STOCKHOLM 1.0
#=GF SQ 1
consensus atggtgNtNga------gNtNctNNcaaNNtacaaaggNNtNNctccgtttaaagNtgtNacNatNg
#=GS consensus AC consensus
#=GS consensus DE Original sequences: Mle-Panxα9, Mle-Panxα7A, Mle-Panxα4
//
# STOCKHOLM 1.0
#=GF SQ 1
consensus mXXeilXXX--kgXXpfkXXtiddXwdqlnrXfmfXlXvvmgXtXtXr-qyXgXXiacdgfXkfX--XXfaedycwsXg
#=GS consensus AC consensus
#=GS consensus DE Original sequences: Mle-Panxα9, Mle-Panxα7A, Mle-Panxα4
//

Main Toolkit Pages





Further Reading

Clone this wiki locally