Evaluate pending nodes in parallel #524
Thank you for this PR. Parallelizing KCFG exploration is a feature we've been contemplating for a long time but so far no one actually attempted to do it. This is a step in the right direction.
Based on your submission, though, I now think that introducing this feature into the codebase might require further planning, and possibly multiple enabling changes. Below are a few thoughts I have on the issue.
- We need an approach that makes locking in the client unnecessary. Each thread in the pool should be assigned its own server (more precisely, the port number of a running server). Instantiating a client is not a big overhead, so it can be done on demand inside the handler. We can even introduce an abstraction for this so that algorithms can be coded against it (e.g. `KoreServerPool`).
- Class `KCFGExplore` has too many responsibilities: on one hand it implements logic, on the other it manages resources. Resource management should be factored out of this class (i.e. `KCFGExplore` should receive a `KoreClient` as a constructor argument). If this is done, instantiating `KCFGExplore` inside the handler also becomes an option.
- We need a thread-safe data model. It must be ensured that no race condition occurs on a data write.
- If I understand correctly, parallelism in this PR is utilized on branching, i.e. at each moment, the nodes submitted to the pool have a common parent. The ideal model is one where a pending node is submitted to the pool as soon as it is discovered. (This is a remark for the future, though; the branching parallelism is already an improvement.)
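The first point above could be sketched roughly as follows. This is a hypothetical `KoreServerPool` (the name is taken from the suggestion above, but the class does not exist in pyk); each worker thread lazily binds to its own server port, so a client instantiated inside a task never shares a connection with another thread and needs no locking:

```python
import threading
from concurrent.futures import Future, ThreadPoolExecutor
from typing import Callable

class KoreServerPool:
    """Hypothetical sketch: one dedicated server port per worker thread."""

    def __init__(self, ports: list[int]):
        # One worker per available server, so ports are never shared.
        self._executor = ThreadPoolExecutor(max_workers=len(ports))
        self._ports = iter(ports)
        self._local = threading.local()
        self._lock = threading.Lock()

    def _port_for_thread(self) -> int:
        # Lazily assign a dedicated port to the calling worker thread.
        if not hasattr(self._local, 'port'):
            with self._lock:
                self._local.port = next(self._ports)
        return self._local.port

    def submit(self, task: Callable[[int], object]) -> Future:
        # The task receives the port of the thread that runs it and can
        # instantiate its own client on demand (client creation is cheap).
        return self._executor.submit(lambda: task(self._port_for_thread()))

    def close(self) -> None:
        self._executor.shutdown(wait=True)
```

A task submitted to the pool would then create its own client from the port it is handed, e.g. `pool.submit(lambda port: run_step(KoreClient('localhost', port)))`, keeping all locking out of the client itself.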
@ehildenb, this is probably interesting for you too.
@iFrostizz, I discussed this change a bit with @tothtamas28. I think the main thing we want is to leave the parallelization up to the client, and to have a thread-safe data model. This is important for avoiding race conditions and for getting maximal utilization of the available processes (instead of launching threads for each pending node and waiting for them all to synchronize). I think a better design is to break the proof into subproofs based on the pending nodes: each time we have a branch, if parallelism is turned on, turn it into a subproof. That way, we are synchronizing (and locking) on the

I think it's best if @tothtamas28 takes the first attempt at this, because parallel/thread-safe programming is closely related to data modelling, and that is what he has been working on with these data structures anyway.
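The subproof idea above could be sketched as follows. The `Proof` class and `split_at_branch` helper here are illustrative placeholders, not pyk's actual proof API; the point is only that each branch child becomes the root of an independent subproof, so subproofs share no mutable state and can be advanced by separate workers without locking:

```python
from dataclasses import dataclass, field

@dataclass
class Proof:
    """Placeholder proof: a root node and a child map (not pyk's API)."""
    root: str
    children: dict[str, list[str]] = field(default_factory=dict)

def split_at_branch(proof: Proof, branch_node: str) -> list[Proof]:
    # One subproof per branch child. Subproofs are disjoint, so each can
    # be handed to its own worker; synchronization only happens when the
    # finished subproofs are merged back into the parent proof.
    return [Proof(root=child) for child in proof.children.get(branch_node, [])]
```

Under this design, the parent proof is only touched when a subproof completes, which is the single point that needs a lock.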
When there is a lot of branching, evaluating proofs on the pending nodes can take a long time because it is done sequentially. In this case, we can run the proofs on the pending leaves in parallel at a small cost.

This feature adds a new parameter `max_workers` (defaulting to 1, i.e. the old behavior) to `advance_proof` of the `Prover`, which caps the number of proving processes running in parallel whenever there are pending nodes.
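A minimal sketch of what such a cap looks like, assuming a per-node `advance_node` callable (a stand-in for the real proof step, not pyk's actual API): a `ThreadPoolExecutor` bounded by `max_workers` drains the pending nodes, and `max_workers=1` reproduces the sequential behavior.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed
from typing import Callable

def advance_pending(
    pending: list[str],
    advance_node: Callable[[str], object],
    max_workers: int = 1,
) -> dict[str, object]:
    # At most `max_workers` nodes are advanced concurrently; with the
    # default of 1 this degenerates to the old sequential loop.
    results: dict[str, object] = {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = {pool.submit(advance_node, node): node for node in pending}
        for fut in as_completed(futures):
            results[futures[fut]] = fut.result()
    return results
```

Note that `as_completed` collects results as workers finish, so a slow branch does not block the others from being recorded.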