
Conversation

@trappmartin (Member) commented Apr 13, 2019

This is a work-in-progress PR refactoring the `assume` and `observe` interface (#634). Do not merge!


Todo:

  • Define abstract runner interface (see the sketch after this list).
  • Define runner for particle-based inference.
  • Define runner to evaluate logjoint(m::Model, v::VarInfo).
  • Define runner to evaluate logpdf(m::Model, v::VarInfo).
  • Separate compiler and other core parts from SampleFromPrior.
  • Use runners inside samplers instead of overloading assume and observe.
  • Fix vectorisation issues: assume in vectorisation of HMC bug (#760); observe vectorisation issue (#761).
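
For context, here is a minimal sketch of the kind of runner hierarchy the list above describes. All names and signatures are illustrative assumptions, not the final API:

```julia
using Distributions

# Illustrative only; none of these names are final.
abstract type AbstractRunner end

# Runner backing particle-based inference (SMC, PG, ...).
struct ParticleRunner <: AbstractRunner end

# Runner that accumulates the log joint while the model executes.
mutable struct LogJointRunner <: AbstractRunner
    logp::Float64
end
LogJointRunner() = LogJointRunner(0.0)

# Each runner supplies one `observe` implementation; samplers share it
# instead of overloading `observe` per sampler type.
function observe(runner::LogJointRunner, dist::Distribution, value)
    runner.logp += logpdf(dist, value)
    return nothing
end
```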

@trappmartin trappmartin self-assigned this Apr 13, 2019
@yebai (Member) commented Apr 23, 2019

> Why do we call `setgid!` only if `assume` is called for a single distribution during HMC sampling?
> Why do we increase `vi.num_produce` on `observe` only for a single distribution and not during vectorisation? Is this a bug?

These are likely bugs in the vectorisation implementation (related #476).

> We sometimes increase the logp of `VarInfo` using `acclogp!(vi, ...)` and sometimes not. Is there a rationale behind this?

I'm not sure I understand the issue here. Can you clarify a bit?

> I'd prefer it if we had a more flexible and consistent interface for manipulating the `lp_` variable. Currently, the user has no way to manipulate this if needed. I feel something similar to `acclogp!(vi, ...)`, but in the form of a macro without the need to pass `vi`, would be a solution. Any objections?

Having an API for manipulating logp sounds good!
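
A minimal sketch of what such a macro could look like (the name `@acclogp!` and the `_vi` variable convention are assumptions for illustration, not an agreed design):

```julia
# Hypothetical macro that forwards to acclogp!(vi, ...) using the
# VarInfo that the compiler-generated model code has in scope,
# here assumed to be bound to the variable `_vi`.
macro acclogp!(expr)
    # Escape the whole call so `_vi` resolves in the caller's scope
    # rather than in the module defining the macro.
    return esc(:(acclogp!(_vi, $expr)))
end

# Inside a model body this would allow, e.g.
#     @acclogp! logpdf(Normal(), x)
# without the user ever touching the VarInfo object directly.
```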

@yebai yebai self-requested a review April 23, 2019 15:57
@trappmartin (Member, Author) commented:

> We sometimes increase the logp of `VarInfo` using `acclogp!(vi, ...)` and sometimes not. Is there a rationale behind this?
>
> I'm not sure I understand the issue here. Can you clarify a bit?

We increase logp inside the compiler using `$vi.logp += $lp` and additionally increase this value inside the particle Gibbs sampler:

./src/inference/pgibbs.jl:174:        acclogp!(vi, logpdf_with_trans(dist, r, istrans(vi, vn)))

I suspect this is a bug.

@yebai (Member) commented Apr 23, 2019

Places where we modify logp:

  • inside the compiler (L104)
  • inside PG (src/inference/pgibbs.jl, L174)

It does seem L174 in PG is a duplicate of L104 in the compiler. Since PG is not making use of VarInfo.logp in the resampling step, this is not hurting us yet...

@yebai (Member) commented Apr 23, 2019

UPDATE:

Actually, the following line "cancels" the increment of logp in the compiler (i.e. L104), since it returns 0.

return r, zero(Real)
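
In other words, the compiler-side accumulation becomes a no-op: `assume` has already added the log density via `acclogp!` and then returns zero. A sketch of the interaction (not the actual generated code):

```julia
# What the compiler-generated code roughly does with the return value:
r, lp = assume(spl, dist, vn, vi)  # PG's assume calls acclogp!(vi, ...)
                                   # internally and returns lp == 0
vi.logp += lp                      # so this increment is a no-op
```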

@trappmartin (Member, Author) commented:

It seems it's a very good idea to refactor that bit of the code base.

@yebai yebai mentioned this pull request Apr 29, 2019
end

#################################
# Compute the log joint Runner. #

Suggested change:
-# Compute the log joint Runner. #
+# Compute the particle filtering Runner. #

@mohdibntarek (Contributor) commented Jul 5, 2019

Thanks Martin for this PR.

I suggest we keep the vectorization issues out of this, since that's a totally different beast.

About this PR and its underlying issue: while I am not exactly against them, I just don't get the appeal of introducing a plethora of new types to do exactly the same thing we are doing now without these types at all. The abstractions we are introducing here don't add much functional value for now, as far as I can see. If anything, they are a bit confusing.

For example, the relation between `SampleFromDistribution`, `ComputeLogJointDensity` and `Sampler` is a bit grey in my eyes, since they all subtype `AbstractRunner` but are used in very different situations. For instance, why is `ComputeLogJointDensity` not just a function? Why do I have to do `Sampler(ComputeLogJointDensity(), selector)`, which is unnecessarily complicated and obscure imo? What are we sampling? What are we selecting? Can any `AbstractRunner` be passed to `Sampler` as a first argument?

Another question is when we should use `SampleFromPrior` vs `nothing` when trying to run the model initially. Why was `nothing` reintroduced when we removed it earlier and replaced it with `SampleFromPrior` or `SampleFromUniform` everywhere?

I think what we need to do is work backwards from the use cases. First we specify the API functions that we want to work, e.g. `rand(::Sampler(::Model, ::InferenceAlgorithm))`, and then we write minimal code to define these functions; see the sketch below. There is no need to introduce a new type if it doesn't add functional value via dispatch or data organization. There might be a philosophical appeal to abstracting everything and introducing a new type for everything, but that's just code that needs to be maintained later, so simple is usually better.
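
To illustrate the use-case-first approach (everything below is a hypothetical sketch, not a proposal for concrete names):

```julia
# Work backwards from the call we want users to write, e.g.
#
#     lp = logjoint(model, vi)
#
# and then implement only the minimal function behind it, introducing
# a new type only where dispatch or data organization demands one.
function logjoint(model, vi)
    # run `model` once in scoring mode and return the accumulated logp
    # (implementation elided)
end
```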

I may be just failing to see the value of this re-structuring so please explain it to me if I got it all wrong. Sorry for the late night rant :)

@yebai (Member) commented Jul 5, 2019

> About this PR and its underlying issue, while I am not exactly against them, I just don't get the appeal of introducing a plethora of new types to do exactly the same thing we are doing now without these types at all.

This is meant to avoid dispatching `assume` and `observe` directly on `Sampler` types, and to implement a standard set of APIs (e.g. `logpdf`, `rand`). The current code base contains redundant `assume` and `observe` implementations, which are largely unnecessary. We could keep modifying the signature, e.g.

assume(spl::Union{HMC, NUTS, SGLD}, ...)

But this looks a bit ugly and doesn't support plug-and-play inference. Introducing an intermediate type such as `Runner` allows samplers to share the same set of `assume` and `observe` implementations for many use cases.
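
Roughly, the intermediate type works like this (a sketch under assumed signatures; `SampleFromDistribution` is a type from this PR, but the method bodies and the forwarding rule are illustrative):

```julia
using Distributions

abstract type AbstractRunner end
struct SampleFromDistribution <: AbstractRunner end

# One shared `assume` per runner instead of one method per sampler:
function assume(::SampleFromDistribution, dist::Distribution, vn, vi)
    r = rand(dist)                 # draw a fresh value
    return r, logpdf(dist, r)      # caller accumulates the log density
end

# HMC, NUTS and SGLD could then all forward to the same implementation:
# assume(spl::Sampler, dist, vn, vi) = assume(spl.runner, dist, vn, vi)
```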

> For instance, why is `ComputeLogJointDensity` not just a function?

This is meant to reduce the number of functions the Turing compiler has to generate. By lowering the model into IRs and allowing users to overload `assume` and `observe`, we save a lot of compiler transformations. I'd like to keep the compiler as simple as possible (though it's already quite complex) and do most things via standard Julia features such as multiple dispatch. I think this is beneficial in the longer term.
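
For example, a dedicated runner can reuse the same lowered model code that sampling uses (a sketch; the method bodies are assumptions about how `ComputeLogJointDensity` might be wired up, not the PR's actual code):

```julia
using Distributions

struct ComputeLogJointDensity end

# Look the value up in vi instead of sampling, and return its log
# density so the compiler-generated code can accumulate it.
function assume(::ComputeLogJointDensity, dist::Distribution, vn, vi)
    r = vi[vn]
    return r, logpdf(dist, r)
end

# Score an observed data point.
observe(::ComputeLogJointDensity, dist::Distribution, value) =
    logpdf(dist, value)
```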

@xukai92 (Member) commented Jul 17, 2019

It seems that the issue is not the introduction of these new runner types, but that the way these types are currently used to construct `Sampler` is not very intuitive. Not sure how to change this, though.

@trappmartin (Member, Author) commented:

I agree. I'll create a new PR in which I'll try to find a solution that doesn't require constructing `Sampler`.

@xukai92 (Member) commented Aug 23, 2019

What's the plan for this issue?

@trappmartin (Member, Author) commented:

Once the inference changeover is merged I’ll work on a new PR that contains the main features of this PR.

@yebai (Member) commented Nov 22, 2019

I suspect that a lot of the features planned in this PR are now included in #965, e.g. the `Runner` type is replaced by a similar type, `Context`. I propose we close this PR for now and move any useful code it contains to a new PR if needed.

@yebai yebai closed this Nov 22, 2019
@trappmartin trappmartin deleted the martint/#634 branch September 24, 2020 13:00