You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I was trying to run the perl script to have output from many sequences, is there an example input for the perl script (I have already successfully compiled everything needed). I want to compare OMH estimated Jaccard index with exact jaccard for 1000 sequences, so all versus all.
Thanks,
Jianshu
The text was updated successfully, but these errors were encountered:
Happy to hear you're looking to use OMH, thanks for pointing out this issue. I think it stems mainly from some missing documentation.
I made the random sequences README a little more verbose, and I hope it helps. At the last minute we added the k-mer size to the input of the perl script and this was not documented. The input file is expecting the captured standard output from the generator python script, but as long as its a tab delimited list of pairs of sequences it should capture it okay and be able to output a list of OMH values (in the 4th column of another tab delimited file).
As an example, the output from generate_random_pairs.py would be something like (here this is k=5, n=10 with --trim on):
Hello Team,
I was trying to run the perl script to have output from many sequences, is there an example input for the perl script (I have already successfully compiled everything needed). I want to compare OMH estimated Jaccard index with exact jaccard for 1000 sequences, so all versus all.
Thanks,
Jianshu
The text was updated successfully, but these errors were encountered: