Using bcalm2 on a set of k-mers where k is a multiple of 4 #46

Closed
amatur opened this issue Oct 16, 2019 · 2 comments

Comments

@amatur

amatur commented Oct 16, 2019

Hi, I have a set of k-mers (with k=20) from which I want the compacted unitigs. I was going to make a FASTA file out of them, with one k-mer per sequence, since I am okay with losing the abundance information. The problem is that bcalm2 does not work when the k-mer size is a multiple of 4. Do you know of any workaround for this case?
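For readers following along, the "one k-mer per sequence" FASTA file described above could be produced roughly as follows. This is only a sketch: the function name `kmers_to_fasta` and the `kmer_<index>` record headers are made up for illustration and are not part of bcalm2 or DSK. It only builds the input file and does not work around the restriction on k itself.

```python
from typing import Iterable


def kmers_to_fasta(kmers: Iterable[str], k: int = 20) -> str:
    """Return FASTA text with one record per k-mer.

    Hypothetical helper for illustration; header names are arbitrary.
    """
    records = []
    for i, kmer in enumerate(kmers):
        # Normalize whitespace and case so length checks are reliable.
        kmer = kmer.strip().upper()
        if len(kmer) != k:
            raise ValueError(f"record {i}: expected length {k}, got {len(kmer)}")
        # Each k-mer becomes its own two-line FASTA record.
        records.append(f">kmer_{i}\n{kmer}\n")
    return "".join(records)
```

Writing the returned string to a file gives a FASTA input in which every k-mer appears exactly once, so any minimum-abundance filtering applied downstream would presumably need to accept k-mers seen a single time.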

I saw in the discussion in issue #30 that bcalm2 can take as input a custom HDF5 file that contains counted k-mers. Can I use the HDF5 file generated by DSK as input to bcalm2? I get an error when I try that.

@rchikhi
Member

rchikhi commented Oct 17, 2019

Hi Amatur, I'd have to investigate that again. For now, could you perhaps use k=19?

@amatur
Author

amatur commented Nov 3, 2019

@rchikhi for the few datasets that I had, I used bcalm2 with k=20 anyway (allowing multiples of 4 by modifying bcalm_1.cpp), and thankfully I did not run into any bugs.

amatur closed this as completed Nov 3, 2019