Unable to Find Taxon_ID of the Top-hit Species in Kaiju Output File Despite Being the Highest-Reads Species in Kaiju2table Output #265

Open
SenyuanGu opened this issue Jun 29, 2023 · 4 comments

Comments

@SenyuanGu

Hi, I ran kaiju and kaiju2table. Why can't I find the taxon_id of the species with the highest number of reads in the Kaiju output file, even though it is the top hit in the kaiju2table output?

@pmenzel
Member

pmenzel commented Jun 29, 2023

That should normally not be the case. Did you run kaiju2table at rank species? How about the other species?

@SenyuanGu
Author

Yes, I ran kaiju2table at the species level. Other species can be found in the Kaiju output txt file. I am wondering if this result is due to the fact that I annotated the contigs using Kaiju?

@pmenzel
Member

pmenzel commented Jul 2, 2023

Which database did you use? Sometimes there is a mismatch between the taxonomy info in the source database and the names.dmp / nodes.dmp files.

You can find out the correct taxon ID by searching for the species name in names.dmp.
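
For anyone who wants to script that lookup, here is a minimal Python sketch. It assumes the standard NCBI taxdump layout for names.dmp, where fields are separated by a tab, a pipe, and a tab, in the order tax ID, name, unique name, name class. The function names `split_dmp_line` and `find_taxon_ids` are made up for this example and are not part of Kaiju.

```python
def split_dmp_line(line):
    """Split one NCBI taxdump line into its fields."""
    line = line.rstrip("\n")
    # Each record ends with a trailing "\t|" terminator; drop it first.
    if line.endswith("\t|"):
        line = line[:-2]
    return line.split("\t|\t")


def find_taxon_ids(names_dmp_path, query):
    """Return (tax_id, name, name_class) for every exact, case-insensitive name match."""
    wanted = query.strip().lower()
    hits = []
    with open(names_dmp_path, encoding="utf-8") as handle:
        for line in handle:
            fields = split_dmp_line(line)
            if len(fields) < 4:
                continue  # skip blank or malformed lines
            tax_id, name, _unique_name, name_class = (f.strip() for f in fields[:4])
            if name.lower() == wanted:
                hits.append((int(tax_id), name, name_class))
    return hits


if __name__ == "__main__":
    # Path is an assumption: point it at the names.dmp shipped with your Kaiju database.
    for hit in find_taxon_ids("names.dmp", "Fragilariopsis cylindrus"):
        print(hit)
```

A name can appear under several name classes (scientific name, synonym, and so on), so the function returns all matches rather than only the first one.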

@SenyuanGu
Author

Thank you for your help! It works!! The ID and species name are indeed mismatched. The txid for Fragilariopsis cylindrus is 635003, but the result obtained from running kaiju2table is 186039.
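
To see how two taxon IDs such as the ones above relate to each other, one option is to walk each ID up to the root using nodes.dmp and compare the lineages. This is only a sketch: it assumes the standard NCBI layout in which the first three pipe-separated fields are tax ID, parent tax ID, and rank. `load_nodes` and `lineage` are hypothetical helper names, not Kaiju functions.

```python
def load_nodes(nodes_dmp_path):
    """Read nodes.dmp into two dicts: tax_id -> parent tax_id, tax_id -> rank."""
    parents, ranks = {}, {}
    with open(nodes_dmp_path, encoding="utf-8") as handle:
        for line in handle:
            fields = [f.strip() for f in line.split("|")]
            if len(fields) < 3 or not fields[0]:
                continue  # skip blank or malformed lines
            tax_id = int(fields[0])
            parents[tax_id] = int(fields[1])
            ranks[tax_id] = fields[2]
    return parents, ranks


def lineage(tax_id, parents, ranks):
    """Return [(tax_id, rank), ...] from the given ID up to the root."""
    path = []
    seen = set()
    # The root node lists itself as its own parent, so stop on any repeated ID.
    while tax_id in parents and tax_id not in seen:
        seen.add(tax_id)
        path.append((tax_id, ranks[tax_id]))
        tax_id = parents[tax_id]
    return path


if __name__ == "__main__":
    # Path is an assumption: use the nodes.dmp that belongs to your Kaiju database.
    parents, ranks = load_nodes("nodes.dmp")
    for tax_id in (635003, 186039):
        print(tax_id, lineage(tax_id, parents, ranks))
```

If one ID shows up in the lineage of the other, the two entries sit at different levels of the same branch. If an ID is missing from nodes.dmp altogether, `lineage` returns an empty list.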
