Multithread forest application to matrices #175

salbert83 · 2022-06-18T11:09:16Z

I guess one could multithread application of a forest to a vector instead. I think it is better to do it at this level.

Master

codecov-commenter · 2022-06-18T11:11:59Z

Codecov Report

Merging #175 (bd1a773) into master (7e090bb) will not change coverage.
The diff coverage is 100.00%.

@@           Coverage Diff           @@
##           master     #175   +/-   ##
=======================================
  Coverage   89.51%   89.51%           
=======================================
  Files          10       10           
  Lines         992      992           
=======================================
  Hits          888      888           
  Misses        104      104

Impacted Files	Coverage Δ
src/classification/main.jl	`97.56% <100.00%> (ø)`

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 7e090bb...bd1a773. Read the comment docs.

ablaom · 2022-06-20T04:12:08Z

For prediction in a forest I agree that it makes sense to always use multithreading. @OkonSamuel Your thoughts? (And can you please review?)

There are possiblities for multithreading in training both forests and an individual tree, but let's leave that for a separate PR. In that case we might want to make the mode of acceleration switchable (between CPU1() and CPUThreads()).l

OkonSamuel · 2022-06-20T23:07:18Z

src/classification/main.jl

@@ -271,7 +271,7 @@ end
 function apply_forest(forest::Ensemble{S, T}, features::AbstractMatrix{S}) where {S, T}
    N = size(features,1)
    predictions = Array{T}(undef, N)
-    for i in 1:N
+    Threads.@threads for i in 1:N


I think we should check for the case of Threads.nthreads() == 1. For this case due to task overhead single threaded implementation is better.

OkonSamuel · 2022-06-20T23:11:16Z

src/classification/main.jl

@@ -271,7 +271,7 @@ end
 function apply_forest(forest::Ensemble{S, T}, features::AbstractMatrix{S}) where {S, T}
    N = size(features,1)
    predictions = Array{T}(undef, N)
-    for i in 1:N
+    Threads.@threads for i in 1:N
        predictions[i] = apply_forest(forest, features[i, :])


Note: With the current implementation, the speed improvements from using multithreading will come with significantly increased memory imprint especially for large forests. We could re-write the existing codebase to reduce this.

Do these criticisms apply similarly to the current use of multithreading in building a forest?

Do these criticisms apply similarly to the current use of multithreading in building a forest?

Yes. The current implementation might be allocation heavy. Adding multithreading affects this.

OkonSamuel

I think we should check for the case of Threads.nthreads() == 1. For this case due to task overhead single threaded implementation is better.
Other than this, LGTM.
I think there is room for improving performance, but this can be addressed later.

salbert83 added 2 commits June 14, 2022 21:07

Merge pull request #1 from JuliaAI/master

f635bd5

Master

Multithread forest application to a matrix

bd1a773

ablaom requested a review from OkonSamuel June 20, 2022 04:06

OkonSamuel reviewed Jun 20, 2022

View reviewed changes

OkonSamuel requested changes Jun 20, 2022

View reviewed changes

salbert83 closed this Jun 21, 2022

ablaom mentioned this pull request Jun 26, 2022

Multithreaded support for apply_forest #176

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Multithread forest application to matrices #175

Multithread forest application to matrices #175

Uh oh!

salbert83 commented Jun 18, 2022

Uh oh!

codecov-commenter commented Jun 18, 2022 •

edited

Loading

Uh oh!

ablaom commented Jun 20, 2022 •

edited

Loading

Uh oh!

OkonSamuel Jun 20, 2022

Uh oh!

OkonSamuel Jun 20, 2022

Uh oh!

salbert83 Jun 20, 2022

Uh oh!

OkonSamuel Jun 21, 2022

Uh oh!

OkonSamuel left a comment

Uh oh!

Uh oh!

Multithread forest application to matrices #175

Multithread forest application to matrices #175

Uh oh!

Conversation

salbert83 commented Jun 18, 2022

Uh oh!

codecov-commenter commented Jun 18, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

ablaom commented Jun 20, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

OkonSamuel Jun 20, 2022

Choose a reason for hiding this comment

Uh oh!

OkonSamuel Jun 20, 2022

Choose a reason for hiding this comment

Uh oh!

salbert83 Jun 20, 2022

Choose a reason for hiding this comment

Uh oh!

OkonSamuel Jun 21, 2022

Choose a reason for hiding this comment

Uh oh!

OkonSamuel left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

codecov-commenter commented Jun 18, 2022 •

edited

Loading

ablaom commented Jun 20, 2022 •

edited

Loading