
Conversation

@xukai92 (Member) commented May 12, 2017

Address #237

Also fix #245

Almost done. But I also want to implement gradient caching - we can reuse some of the gradients computed inside leapfrog. Gradient use in NUTS is especially inefficient because our leapfrog re-computes the gradient at the start of each call, and the NUTS recursion calls leapfrog one step at a time:

  • With our trick of caching the gradient inside leapfrog, a t-step trajectory costs t + 1 gradient evaluations.
  • Calling leapfrog one step at a time costs 1 + 1 evaluations per call, so t one-step calls cost 2t evaluations in total.

I think it's fine to just cache the gradient in a dictionary inside spl.info?
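As a minimal sketch of what this could look like (assuming `spl.info` is a `Dict` and a `gradient(θ, model, spl)` function is available; the names here are hypothetical):

```julia
# Hypothetical gradient cache stored inside spl.info, keyed by position.
function cached_gradient(θ::Vector{Float64}, model, spl)
    cache = get!(spl.info, :grad_cache, Dict{Vector{Float64},Vector{Float64}}())
    # Compute and store the gradient only on a cache miss.
    get!(cache, copy(θ)) do
        gradient(θ, model, spl)
    end
end
```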

@xukai92 (Member, Author) commented May 12, 2017

HMC on LDA before caching was 429s (https://travis-ci.org/yebai/Turing.jl/jobs/231713149) and after is 369s (https://travis-ci.org/yebai/Turing.jl/jobs/231726143). Caching saves one gradient evaluation whenever the previous step is not rejected.

@xukai92 (Member, Author) commented May 12, 2017

218s after setting chunksize to 60 (https://travis-ci.org/yebai/Turing.jl/jobs/231734804)
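For reference, the setting would look roughly like this, assuming Turing exposes a `setchunksize` helper for the ForwardDiff chunk size (treat the exact API as an assumption):

```julia
using Turing

# Larger ForwardDiff chunks mean fewer forward passes per gradient,
# at the cost of more work per pass; 60 was the sweet spot here.
Turing.setchunksize(60)
```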

@yebai (Member) commented May 12, 2017

> I think it's fine to just cache the gradient in a dictionary inside spl.info?

Yes, we can do that first.

> 218s after setting chunksize to 60

Nice!

@xukai92 (Member, Author) commented May 12, 2017

I think AppVeyor fails because the cache dict uses Vector{Dual} as keys, which somehow isn't supported on win32. I will fix it tomorrow.
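One possible workaround would be to strip the `Dual` wrappers and key the cache on plain `Float64` values; a hedged sketch with a hypothetical helper (the actual fix may differ):

```julia
using ForwardDiff: Dual, value

# Hypothetical helper: convert a vector of Duals to plain Float64s
# so the cache key type is portable across platforms.
to_key(θ::Vector{<:Dual}) = Float64[value(x) for x in θ]
to_key(θ::Vector{Float64}) = θ
```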

@yebai (Member) commented May 13, 2017

UPDATE:

Collecting 1000 samples for LDA:

Turing.HMCDA takes 58.7 seconds.
Turing.NUTS takes 301.90 seconds.
Stan.NUTS takes 7.66134 seconds.

@xukai92 (Member, Author) commented May 13, 2017

@yebai I guess I will leave the vectorization of assume for the future. The reason is that vectorization only makes things faster if reconstruct, vectorize, link, and invlink are all vectorized, and vectorizing link and invlink seems to be tricky for SimplexDistribution (which LDA needs).
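For context, here is a generic stick-breaking-style simplex transform (an illustration, not Turing's exact link): each coordinate depends on the running remainder of the stick, which is why a naive element-wise vectorization doesn't apply.

```julia
# Map a point on the K-simplex to R^{K-1}. Each step consumes part of the
# remaining "stick", so the loop is inherently sequential, not element-wise.
# (Stan's version adds a centering shift; omitted here for brevity.)
function simplex_link(x::Vector{Float64})
    K = length(x)
    y = Vector{Float64}(undef, K - 1)
    remaining = 1.0
    for k in 1:(K - 1)
        z = x[k] / remaining
        y[k] = log(z) - log(1 - z)   # logit of the stick fraction
        remaining -= x[k]
    end
    return y
end
```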

Shall we merge this PR then?

@xukai92 (Member, Author) commented May 13, 2017

Another related issue I was stuck on is the matrix convention: the default functions all treat things as matrices, while we currently write things as a vector of vectors because of our chain interface issue (https://github.com/yebai/Turing.jl/issues/207).
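A small illustration of the mismatch (generic Julia, not Turing code): samples stored as a vector of vectors have to be concatenated before matrix-oriented functions can consume them.

```julia
# Per-iteration samples stored as a vector of vectors...
samples = [rand(3) for _ in 1:5]

# ...must be stitched into a 3×5 matrix for matrix-oriented functions.
S = reduce(hcat, samples)
@assert size(S) == (3, 5)
```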

@xukai92 (Member, Author) commented May 14, 2017

Can we merge this to master?
