Conversation

@torfjelde
Member

Changes to DPPL can often have quite significant effects on the compilation time and performance of both DPPL itself and downstream packages. It's also sometimes difficult to discover these performance regressions.

E.g. in #221 we made a small simplification to the compiler, and it ended up taking quite a while to figure out what was going wrong; we had to test several models to identify the issue.

So, this is a WIP PR for including a small set of models which we can `weave` into a document where we can look at the changes. It's unclear to me whether this should go in DPPL itself or in a separate package. I found it useful myself and figured I'd put it here so we can maybe start building up some "standard" benchmarks to run for testing purposes. IMO we don't need many of them, as we will add more as we go along.

For each model, the following will be included in the document:

  • Benchmarked evaluation of the model on untyped and typed `VarInfo` (see the sketch after this list).
  • Timing of the compilation of the model with the typed `VarInfo`.
  • Lowered code for the model.
    • If `:prefix` is provided to `weave`, the string representation of `code_typed` for the evaluation of the model will be saved to a file `$(prefix)_(model.name)`. Furthermore, if `:prefix_old` is provided, pointing to the `:prefix` used for a previous run (likely using a different version of DPPL), we will `diff` the `code_typed` for the two models by loading the saved files.
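
To make the first two bullets concrete, here is a minimal sketch of the kind of measurements meant above, using BenchmarkTools and a toy model. The model, variable names, and exact `VarInfo` calls are illustrative assumptions (call signatures differ between DynamicPPL versions) and may not match what ends up in `benchmarks/`.

```julia
# Illustrative sketch only; exact constructors and call signatures depend on
# the DynamicPPL version.
using DynamicPPL, Distributions, BenchmarkTools

# A toy model, purely for demonstration.
@model function demo(x)
    m ~ Normal()
    x ~ Normal(m, 1)
end

model = demo(1.0)

# Evaluation with an untyped VarInfo (generic, untyped storage).
vi_untyped = VarInfo()
model(vi_untyped)               # first call populates the VarInfo
@benchmark $model($vi_untyped)

# Evaluation with a typed VarInfo (concretely typed storage).
vi_typed = VarInfo(model)
@benchmark $model($vi_typed)

# Rough proxy for compilation time: in a fresh session, constructing the
# typed VarInfo forces the model evaluator to be compiled.
@time VarInfo(model)
```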

@devmotion
Member

I assume this can be very helpful 👍

Maybe put it in `/benchmarks/`? Since it's for benchmarking DynamicPPL specifically, it seems fine to me to put it in this repo.

@torfjelde
Member Author

> Maybe put it in `/benchmarks/`? Since it's for benchmarking DynamicPPL specifically, it seems fine to me to put it in this repo.

Good point 👍

I'll also add a README.md to the folder specifying how it can be used, roughly along the lines of the sketch below.
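
For illustration, a run of the benchmarks might look something like this. The file name is hypothetical, and `:prefix`/`:prefix_old` are assumed to be passed through Weave's `args` keyword (available inside the document as `WEAVE_ARGS`).

```julia
using Weave

# First run, e.g. on the current master of DPPL; saves the code_typed output
# under the "master" prefix.
weave("benchmarks.jmd"; doctype = "github", args = Dict(:prefix => "master"))

# Second run on the branch under test; diffs its code_typed output against
# the files saved by the previous run.
weave("benchmarks.jmd"; doctype = "github",
      args = Dict(:prefix => "branch", :prefix_old => "master"))
```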

@torfjelde
Member Author

@devmotion Could you maybe have another look at this? I found it extremely useful when evaluating whether #269 and #271 would cause issues, so it would be nice to just get it merged. Then we can develop it further once it's there, and potentially separate it out into its own project if it reaches sufficient complexity/size.

@torfjelde torfjelde marked this pull request as ready for review July 8, 2021 17:06
@torfjelde torfjelde changed the title from [WIP] Benchmarking to Benchmarking Jul 10, 2021
@yebai
Member

yebai commented Jul 13, 2021

bors r+

bors bot pushed a commit that referenced this pull request Jul 13, 2021
@bors
Contributor

bors bot commented Jul 13, 2021

Build failed.

@yebai
Member

yebai commented Jul 13, 2021

@torfjelde Some tests are failing in loglikelihoods.jl. Can you take a look?

Related #268

@torfjelde
Member Author

torfjelde commented Jul 13, 2021

> @torfjelde Some tests are failing in loglikelihoods.jl. Can you take a look?

Seems like there have been some downstream changes that are causing these failures. Having a look.

> Related #268

Can you elaborate on why #268 is related? Not quite seeing the connection 😕

EDIT: Did you mean to point to #272?

@torfjelde
Member Author

Also, this PR makes no changes to DynamicPPL's functionality; it only touches `benchmarks/`, so it shouldn't be held back by failing tests that it's not responsible for.

Still need to figure out what is breaking the tests though!

@yebai
Member

yebai commented Jul 13, 2021

> Can you elaborate on why #268 is related? Not quite seeing the connection 😕

Sorry for the confusion - I meant that we are experiencing similar failing tests there.

@yebai yebai merged commit 4de6f54 into master Jul 13, 2021
@yebai yebai deleted the tor/benchmarks branch July 13, 2021 21:51