Fix log evidence computation #1266
Conversation
Codecov Report
```diff
@@            Coverage Diff             @@
##           master    #1266      +/-   ##
==========================================
+ Coverage   66.84%   67.08%   +0.24%
==========================================
  Files          25       25
  Lines        1327     1343      +16
==========================================
+ Hits          887      901      +14
- Misses        440      442       +2
```
Continue to review the full report at Codecov.
The test error on Windows is still the HMC error that #1264 is supposed to fix.
yebai left a comment:
Thanks @devmotion for fixing this - it is a very subtle issue and I'm glad that we now got it correct.
```julia
else
    # Increase the unnormalized logarithmic weights, accounting for the variables
    # of other samplers.
    increase_logweight!(pc, i, score + getlogp(p.vi))
```
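For context, a minimal sketch of what this update does: it adds the increment to particle `i`'s unnormalized logarithmic weight. The struct and its layout are hypothetical stand-ins, not Turing's actual `ParticleContainer`.

```julia
# Toy stand-in for the particle container; only the field `logWs` (the
# unnormalized logarithmic weights) matters for this sketch.
mutable struct ToyParticleContainer
    logWs::Vector{Float64}
end

# Add `logw` to the unnormalized logarithmic weight of particle `i`.
increase_logweight!(pc::ToyParticleContainer, i, logw) = (pc.logWs[i] += logw)

pc = ToyParticleContainer(zeros(3))
increase_logweight!(pc, 2, -0.7)  # e.g. score + getlogp(p.vi) in the diff above
```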
As a side note, `getlogp(p.vi)` will always return 0, since the `assume` and `observe` functions for particle samplers do not modify `vi.logp` by default. This doesn't affect correctness, but it's worth paying attention to.
See:
That's not completely true (or maybe I misunderstand you): in this line `getlogp` can actually return nonzero values due to

Turing.jl/src/inference/AdvancedSMC.jl, line 286 in 35e3fe8:

```julia
acclogp!(vi, logpdf_with_trans(dist, r, istrans(vi, vn)))
```

However, since `resetlogp!` is called in one of the following lines, this won't show up in the saved transitions.
I must have missed that line, thanks for the pointer!
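To make the accumulate-then-reset behavior discussed in this thread concrete, here is a minimal toy sketch; `ToyVarInfo` is a hypothetical stand-in, and only the function names mirror DynamicPPL's `acclogp!`, `resetlogp!`, and `getlogp`.

```julia
mutable struct ToyVarInfo
    logp::Float64
end

acclogp!(vi::ToyVarInfo, logp) = (vi.logp += logp; vi)  # accumulate into vi.logp
resetlogp!(vi::ToyVarInfo) = (vi.logp = 0.0; vi)        # reset to zero
getlogp(vi::ToyVarInfo) = vi.logp

vi = ToyVarInfo(0.0)
acclogp!(vi, -1.2)  # e.g. the `logpdf_with_trans` contribution quoted above
getlogp(vi)         # nonzero here, as pointed out in the reply
resetlogp!(vi)      # reset later in the sweep ...
getlogp(vi)         # ... so 0 is what ends up in the saved transitions
```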
```julia
params = tonamedtuple(particle.vi)

# This is pretty useless since we reset the log probability continuously in the
# particle sweep.
```
Thanks for the note; also see my comment above about the `assume` and `observe` functions.
* Fix particle filters with adaptive resampling and add documentation (fixes TuringLang/DynamicPPL.jl#104)
* Fix and extend tests of `ParticleContainer`
* Move logevidence tests from DynamicPPL
* Add more convenient constructors for Particle Gibbs
* Relax type annotations
* Check for approximate equality only
* Add docstring and reference
This PR fixes TuringLang/DynamicPPL.jl#104 and transfers the corresponding tests to Turing. The main problem is that the formula on which I based the computation of the log evidence in #1237 is only valid if resampling is performed in every time step, or rather if the weights are reset to 1/N before every reweighting step. A reference for the more general formula (which was used before) is eq. (14) in "Sequential Monte Carlo Samplers" by P. Del Moral, A. Doucet, and A. Jasra. The fix is particularly important since by default we don't resample in every step but only adaptively, based on the estimated effective sample size (ESS), as sketched below.
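To make the adaptive criterion concrete, here is a minimal sketch of an ESS-based resampling decision computed from the unnormalized logarithmic weights. The function names and the 0.5 threshold are illustrative, not Turing's internals.

```julia
# Numerically stable log-sum-exp (StatsFuns.jl also provides one).
logsumexp(x) = (m = maximum(x); m + log(sum(exp.(x .- m))))

# Effective sample size of the normalized weights, ESS = 1 / sum(W_i^2),
# computed from the unnormalized logarithmic weights.
function ess(logweights)
    logw = logweights .- logsumexp(logweights)  # normalize in log space
    return exp(-logsumexp(2 .* logw))           # 1 / sum(W_i^2)
end

# Resample only when the ESS drops below a fraction of the particle count.
should_resample(logweights; threshold=0.5) =
    ess(logweights) < threshold * length(logweights)
```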
More formally, we save only the unnormalized logarithmic weights and accumulate them until the next resampling is performed, at which point they are reset to 0. Let `logw_k^i` denote the unnormalized logarithmic weight of the `i`th particle in the `k`th step. Hence, in the notation of Del Moral, Doucet, and Jasra, we have

    logw_k^i = logw_{k-1}^i + log(w_k^i),

and hence by induction normalizing `logw_k^i` yields

    exp(logw_k^i) / \sum_{j=1}^N exp(logw_k^j)
        = w_k^i * exp(logw_{k-1}^i) / \sum_{j=1}^N w_k^j * exp(logw_{k-1}^j)
        = w_k^i * W_{k-1}^i / \sum_{j=1}^N w_k^j * W_{k-1}^j
        = W_k^i,

i.e., the normalized weights in algorithm 3.1.1, as desired. Hence from eq. (14) we can compute the increase of the log evidence by

    log(Z_k) - log(Z_{k-1}) = log(Z_k / Z_{k-1})
        = log(\sum_{i=1}^N W_{k-1}^i * w_k^i)
        = log(\sum_{i=1}^N exp(logw_{k-1}^i) * w_k^i) - log(\sum_{i=1}^N exp(logw_{k-1}^i))
        = log(\sum_{i=1}^N exp(logw_{k-1}^i + log(w_k^i))) - log(\sum_{i=1}^N exp(logw_{k-1}^i))
        = log(\sum_{i=1}^N exp(logw_k^i)) - log(\sum_{i=1}^N exp(logw_{k-1}^i)).

Thus only if the `logw_{k-1}^i` are all 0 (such as in the initial step and after resampling) do we obtain the formula

    log(Z_k) - log(Z_{k-1}) = log(\sum_{i=1}^N exp(logw_k^i)) - log(N)

from the reference on which the linked PR was based.

As a final remark, I'm a bit unsatisfied with the function name `logZ` since, as shown above, it does not compute `log(Z)` but the logarithm of the normalization factor of the current unnormalized weights.
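The general update derived above translates directly into code. The following is a minimal sketch under the notation of this PR description; `update_logevidence` is a hypothetical name, not Turing's actual implementation.

```julia
# Numerically stable log-sum-exp (StatsFuns.jl also provides one).
logsumexp(x) = (m = maximum(x); m + log(sum(exp.(x .- m))))

# `logw` holds the unnormalized logarithmic weights logw_{k-1}^i and `logincr`
# the incremental log-weights log(w_k^i) of the current step.
function update_logevidence(logZ, logw, logincr)
    logw_new = logw .+ logincr  # logw_k^i = logw_{k-1}^i + log(w_k^i)
    logZ_new = logZ + logsumexp(logw_new) - logsumexp(logw)
    return logZ_new, logw_new
end

# Right after resampling all logw_{k-1}^i are 0 and logsumexp(logw) = log(N),
# recovering the special case logsumexp(logw_new) - log(N).
logZ, logw = update_logevidence(0.0, zeros(4), [-1.0, -2.0, -1.5, -0.5])
```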