Need Help - #1880

Open
taranglute opened this issue Sep 18, 2018 · 0 comments

Next-generation sequencing data are usually distributed as multiple compressed files.

I want to pass multiple input files (compressed or uncompressed FASTQ files) to two khmer scripts: (i) abundance-dist-single.py, which generates the full k-mer abundance histogram, and (ii) unique-kmers.py, which estimates the total number of distinct k-mers (F0) for large k.
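To make the two quantities concrete, here is a minimal pure-Python sketch of what I am after. It is not khmer's implementation: it counts k-mers exactly in memory with `collections.Counter`, so it is only practical for small inputs. The function names are my own, and the choices to skip k-mers containing `N` and to count k-mers as they appear on the read (without collapsing reverse complements) are assumptions made for illustration.

```python
import gzip
from collections import Counter


def open_maybe_gzip(path):
    """Open a plain or gzip-compressed text file, detected by the gzip magic bytes."""
    with open(path, "rb") as fh:
        magic = fh.read(2)
    if magic == b"\x1f\x8b":
        return gzip.open(path, "rt")
    return open(path, "rt")


def fastq_sequences(path):
    """Yield the sequence line of each record, assuming strict 4-line FASTQ records."""
    with open_maybe_gzip(path) as fh:
        for i, line in enumerate(fh):
            if i % 4 == 1:
                yield line.strip().upper()


def count_kmers(paths, k):
    """Exact k-mer counts pooled over all input files (illustration only)."""
    counts = Counter()
    for path in paths:
        for seq in fastq_sequences(path):
            for i in range(len(seq) - k + 1):
                kmer = seq[i:i + k]
                # Assumption: k-mers containing an ambiguous base are skipped.
                if "N" not in kmer:
                    counts[kmer] += 1
    return counts


def abundance_histogram(counts):
    """Map each abundance to the number of distinct k-mers seen that many times."""
    return Counter(counts.values())
```

With `counts = count_kmers(["SRR072005.fastq", "SRR072006.fastq"], 25)`, the histogram I want is `abundance_histogram(counts)` and F0 is `len(counts)`, both computed over the two files together rather than per file.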

The khmer help document states: "To count k-mers in multiple files use load-into-counting.py and abundance-dist.py". I therefore ran the following commands on two input FASTQ files:

./load-into-counting.py -k 25 -x 5e7 -T 16 out123 SRR072005.fastq SRR072006.fastq

./abundance-dist.py out123 SRR072005.fastq SRR072006.fastq histo

usage: abundance-dist.py [--version] [--info] [-h] [-z] [-s] [-b] [-f] [-q]
input_count_graph_filename input_sequence_filename
output_histogram_filename

abundance-dist.py: error: unrecognized arguments: histogram

What is the correct sequence of khmer commands to generate the full k-mer abundance histogram and the total number of distinct k-mers (F0) when the input consists of multiple (compressed or uncompressed) FASTQ files?

Please help.
