
Conversation

@devmotion
Member

So far, sampling with DynamicHMC just calls DynamicHMC and obtains all samples before any sampling step in Turing is performed. In each sampling step in Turing, we then just pop one sample and save it in a Transition.

This approach has some clear disadvantages, such as being memory inefficient, invalidating any progress logging in Turing, not being able to resume sampling, and also not being able to perform sampling until a convergence criterion is met (not supported in Turing yet but supported in AbstractMCMC).

The disadvantage of making use of the iteration interface in DynamicHMC is that the interface is not part of the official API and hence allowed to break at any time.

In principle, it would be trivial to update the log density function (which is now implemented more clearly as a callable struct that wraps the model and the sampler, instead of wrapping a closure) in the sampling step to support (some kind of) Gibbs sampling. It is not included yet since I'm not sure how reasonable it would be to do that without adapting the step size and/or the metric of the HMC algorithm (or how they could be updated using DynamicHMC).
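The callable log-density struct mentioned above might look roughly like this (a sketch only; `LogDensity`, `logjoint`, and the field names are illustrative, not Turing's actual internals):

```julia
# Sketch of a callable log-density struct wrapping model and sampler.
# All names here are illustrative, not Turing's actual internals.
struct LogDensity{M,S}
    model::M
    sampler::S
end

# Evaluating the struct at a parameter vector returns the log density,
# so it can be passed directly to DynamicHMC.
function (ℓ::LogDensity)(θ)
    return logjoint(ℓ.model, ℓ.sampler, θ)  # hypothetical helper
end
```

For Gibbs sampling, updating the conditioned values would then amount to rebuilding or mutating this struct between steps.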

-if Int === Int64 && Pkg.installed()["DynamicHMC"].major == 2
-    include("contrib/inference/dynamichmc.jl")
-end
+include("contrib/inference/dynamichmc.jl")
Member Author

Pkg.installed is deprecated on Julia 1.4. I assume it is unlikely that anyone will run the tests with DynamicHMC < 2, and according to the comment the bug (?) was observed on 32-bit systems with DynamicHMC < 2, so I guess it should be safe to remove the check completely.

Member

Happy to remove this check.

@devmotion
Member Author

not being able to resume sampling

I just noticed that it won't be possible even with these changes, since we deliberately only save the samples and their log probability (the full transition is only kept until the next step is performed). That means the resulting Chains object is not sufficient for resuming sampling (the same is true for Gibbs sampling, BTW). In general, I think basing resuming on a Chains object is not the most intuitive approach (it's not even the only possible output format); I guess it would be more reasonable to have a general implementation that resumes sampling based on a model, a sampler, and a transition. This general approach could even be added to AbstractMCMC, I assume, and might allow us to remove all/most checks for resume_from === nothing and set_resume! in the initializations of the samplers. We would just somehow need the possibility to obtain the final transition from sample; then we could save it (maybe even as part of a Chains object).
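A minimal sketch of such a general resume implementation, under the assumption of an AbstractMCMC-style stepping function (the name `step!` and its signature are assumptions, not the actual API):

```julia
# Hypothetical generic resume: continue from a saved transition instead of
# re-initializing the sampler. `step!` stands in for AbstractMCMC's
# stepping function; the exact signature is an assumption.
function resume(rng, model, sampler, last_transition, N; kwargs...)
    transitions = Vector{typeof(last_transition)}(undef, N)
    t = last_transition
    for i in 1:N
        t = step!(rng, model, sampler, t; kwargs...)
        transitions[i] = t
    end
    return transitions
end
```

Resuming would then require saving only the model, the sampler, and the last transition, rather than reconstructing sampler state from a Chains object.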

@devmotion
Member Author

Maybe we could just make mcmcsample (and mcmcpsample?) return the final transition together with the bundled samples. In the standard sample (and psample?) implementation in AbstractMCMC we could just throw it away (?), but in Turing, e.g., we could try to save it somehow.

@cpfiffer
Member

In the standard sample (and psample?) implementation in AbstractMCMC we could just throw it away (?), but in Turing, e.g., we could try to save it somehow.

Maybe chains.info? It's a little hacky but it might work.

@devmotion
Member Author

We could do that, but I'm a bit worried that we would have to silently remove the transition if the last sample of the chain is modified or removed. Maybe it should be more explicit; maybe we need an alternative to sample that returns the last transition as well.

@cpfiffer
Member

Perhaps it could be added to chains as a first-class citizen, such that we introduce a new field chain.state that holds whatever thing (Transition) you need to resume sampling.
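Such a first-class field might look like this (an illustrative sketch, not the actual MCMCChains layout):

```julia
# Sketch: a chain type that carries the final sampler transition/state
# as a first-class field, so it survives alongside the samples.
struct Chains{A,S}
    value::A   # stored samples and statistics
    state::S   # last transition, needed to resume sampling
end

# Resuming could then read the stored state, e.g. (hypothetical API):
# sample(model, sampler, N; resume_from = chain.state)
```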

@devmotion
Member Author

I like that! But still it wouldn't resolve the issue that you might lose the ability to resume sampling at some point without noticing, would it?

@cpfiffer
Member

Maybe the only time you'd lose it would be chain saving/loading, but that needs to be overhauled to use JLD2 or something better than what we have now. Chain subsetting would just carry over chain.state, so resumed sampling after subsetting might not mean much.

One "pie in the sky" way to solve this, which is a very big way to solve this, is to just remove the entire axis array setup, and just store all the raw Transition structs. As long as Transitions support getproperty/getfield/getindex and some metadata, you should be able to retrieve all the individual parameter values quite quickly. This way you can always resume no matter what you do to the chain, but it would be a lot of work and might not be very robust.

@devmotion
Member Author

This way you can always resume no matter what you do to the chain, but it would be a lot of work and might not be very robust.

Sure, that's something one could do, similar to just specifying chain_type = nothing. But the main issue with Gibbs and DynamicHMC (and possibly other samplers in the future) is that one wants to include method-specific stuff in the transition that is only needed for the next step (things that are neither samples nor statistics of samples, but rather a state that can't be computed a priori and is modified during sampling), while preferably saving only the samples and their statistics. The same setup could, e.g., allow us to remove the state-related hacks and type instabilities in the other HMC samplers as well.

Maybe that shouldn't be included in the transitions at all; rather, we should have something like SamplerWithState that, similar to the transitions, is not initialized and computed a priori when Sampler is defined (since we haven't computed a sample, and hence a dependent state, yet) but only after the first sampling step. In all subsequent steps we would then call step! not with a Sampler but with a SamplerWithState. Then it would also make sense to return both the samples and the SamplerWithState, which could be used for resuming sampling again.
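A rough sketch of how that could look, assuming a `step!` that returns both a transition and a state (all names and signatures here are illustrative, not an actual AbstractMCMC API):

```julia
# Hypothetical SamplerWithState: constructed only after the first step,
# since the state depends on the first sample.
struct SamplerWithState{S,T}
    sampler::S
    state::T
end

function mcmcsample(rng, model, sampler, N)
    # Initial step: no state exists yet, so the plain sampler is used.
    t, state = step!(rng, model, sampler)
    transitions = [t]
    spl = SamplerWithState(sampler, state)
    # All subsequent steps receive the sampler together with its state.
    for _ in 2:N
        t, state = step!(rng, model, spl)
        spl = SamplerWithState(sampler, state)
        push!(transitions, t)
    end
    # Returning `spl` as well makes resuming straightforward.
    return transitions, spl
end
```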

@cpfiffer
Member

Did you want to park this until #31 goes through, or merge it and revisit it if the tooling there makes it into AbstractMCMC? I'm fine with this code as-is.

@cpfiffer
Member

Also, do you have any idea what the performance cost is of not running all the samples at once?

@yebai yebai force-pushed the master branch 2 times, most recently from 2661769 to 701f5d2 Compare May 4, 2020 21:46
@yebai
Member

yebai commented Jun 29, 2020

Shall we give this PR another try?

@devmotion
Member Author

I'd say it's more reasonable to open a new PR that upgrades the DynamicHMC sampler (and probably the other samplers at the same time?) to the proposed interface in TuringLang/AbstractMCMC.jl#42, if we agree on it. I wanted to update the AbstractMCMC integration in DynamicPPL and Turing first before merging that PR, to see if it causes any problems (shouldn't be the case but I assumed it would be better to check it first). I kept this PR open mainly as a reminder to fix the integration.

@devmotion
Member Author

Closed in favour of #1497.

@devmotion devmotion closed this Dec 22, 2020
@devmotion devmotion deleted the dynamichmc_iteration branch December 22, 2020 21:03