K-mer counting with threads #107

Merged
standage merged 6 commits into master from feature/thread-count
Sep 18, 2017

@standage commented Aug 8, 2017 (Collaborator)

In #102 I started exploring parallel k-mer counting. With the 1-thread-per-sample approach, the most I can hope for is a 3-4x speedup for processing a simplex. Looking more closely at some of the khmer scripts, I figured it would be much better to process one sample at a time, using as many threads as the user can spare. This does seem to give great speedups. Will post numbers soon.

Runtime for kevlar count and kevlar novel is one of the big pain points right now, and this PR should bring the required processing time down from days to hours.
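
For readers unfamiliar with the pattern, here is a minimal sketch of the "one sample at a time, many threads" strategy described above. It is not the code in this PR and does not use the khmer API: the function name `count_kmers_threaded`, its parameters, and the plain `Counter` table are all hypothetical stand-ins. It shows only the structure, in which several workers pull reads from one shared stream and update one shared count table. In pure Python the GIL prevents any real speedup; the gains reported here presumably come from the counting work being done in native code.

```python
import threading
from collections import Counter


def count_kmers_threaded(reads, ksize, num_threads):
    """Count k-mers in one sample using several worker threads.

    Hypothetical illustration only: a Counter stands in for a real
    count table, and an in-memory list stands in for a read parser.
    """
    counts = Counter()
    read_iter = iter(reads)
    read_lock = threading.Lock()   # guards the shared read stream
    count_lock = threading.Lock()  # guards the shared count table

    def worker():
        while True:
            # Each worker pulls the next unprocessed read from the
            # single shared stream.
            with read_lock:
                read = next(read_iter, None)
            if read is None:
                return  # stream exhausted
            # Count this read's k-mers locally, then merge under the lock.
            local = Counter(
                read[i:i + ksize] for i in range(len(read) - ksize + 1)
            )
            with count_lock:
                counts.update(local)

    threads = [threading.Thread(target=worker) for _ in range(num_threads)]
    for thread in threads:
        thread.start()
    for thread in threads:
        thread.join()
    return counts


# Usage: one sample, processed by 4 threads.
sample = ['ACGTACGT', 'CGTACGTA']
counts = count_kmers_threaded(sample, ksize=4, num_threads=4)
```

With one thread per sample, the number of samples caps the parallelism. With this layout the thread count is independent of the number of samples, so it can scale with whatever cores the user can spare.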

@standage commented Aug 8, 2017 (Collaborator, Author)

Will need to wait to merge this until dib-lab/khmer#1753 is merged.

@codecov (bot) commented Sep 18, 2017

Codecov Report

Merging #107 into master will increase coverage by 0.95%.
The diff coverage is 100%.

@@            Coverage Diff             @@
##           master     #107      +/-   ##
==========================================
+ Coverage   89.62%   90.57%   +0.95%     
==========================================
  Files          32       32              
  Lines        1552     1560       +8     
  Branches      247      249       +2     
==========================================
+ Hits         1391     1413      +22     
+ Misses        112       98      -14     
  Partials       49       49
| Impacted Files | Coverage Δ |
|---|---|
| kevlar/count.py | 94.11% <ø> (ø) ⬆️ |
| kevlar/novel.py | 78.26% <ø> (ø) ⬆️ |
| kevlar/cli/count.py | 100% <100%> (ø) ⬆️ |
| kevlar/counting.py | 82.27% <100%> (+20.63%) ⬆️ |
| kevlar/cli/novel.py | 100% <100%> (ø) ⬆️ |

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 4700328...3095e31.

@standage merged commit 1672f70 into master Sep 18, 2017
@standage deleted the feature/thread-count branch September 18, 2017 21:48