Fix some corner cases when sampling a SumTree #83

findmyway · 2020-06-24T11:55:09Z

Although the fix here seems trivial, it fixes some big problems in the prioritized experience buffer.

Before this fix, the loss of Rainbow on Atari Pong:

After this fix:

The reason is that, in some cases, get(t::SumTree, v) might return ind, p where p = 0.0. A sampled priority of zero makes the rescaled weights of the other samples drop to nearly zero.
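To make the corner case concrete, here is a minimal Python sketch. It is not the package's Julia code and not the change made in this PR. The `SumTree` class, `rescaled_weights`, and `sample_nonzero` are hypothetical names, the capacity is assumed to be a power of two, and the weight formula is the usual prioritized-replay form `(N * P(i))^(-beta)` divided by the maximum weight. The sketch shows a lookup landing on a zero-priority leaf, what that does to the rescaled weights, and one possible guard (resampling).

```python
import math
import random


class SumTree:
    """Array-backed sum tree (illustrative; capacity must be a power of two)."""

    def __init__(self, capacity):
        self.capacity = capacity
        # tree[1] is the root; leaves live in tree[capacity : 2 * capacity].
        self.tree = [0.0] * (2 * capacity)

    def update(self, ind, p):
        """Set the priority of leaf `ind` and refresh the sums above it."""
        i = ind + self.capacity
        self.tree[i] = p
        i //= 2
        while i >= 1:
            self.tree[i] = self.tree[2 * i] + self.tree[2 * i + 1]
            i //= 2

    def total(self):
        return self.tree[1]

    def get(self, v):
        """Descend from the root to the leaf whose prefix-sum range covers v."""
        i = 1
        while i < self.capacity:
            left = 2 * i
            if v <= self.tree[left]:
                i = left
            else:
                v -= self.tree[left]
                i = left + 1
        return i - self.capacity, self.tree[i]


def rescaled_weights(ps, total, n, beta=0.4):
    """Importance weights (n * p / total)^(-beta), divided by their maximum."""
    raw = [math.inf if p == 0.0 else (n * p / total) ** (-beta) for p in ps]
    m = max(raw)
    return [w / m for w in raw]


def sample_nonzero(tree, rng=random, max_tries=100):
    """One possible guard (an assumption): redraw until the priority is positive."""
    for _ in range(max_tries):
        ind, p = tree.get(rng.random() * tree.total())
        if p > 0.0:
            return ind, p
    raise RuntimeError("no leaf with positive priority found")


tree = SumTree(4)
for ind, p in enumerate([0.0, 1.0, 0.0, 2.0]):
    tree.update(ind, p)

# v = 0.0 satisfies `v <= left sum` all the way down, so the lookup
# ends on leaf 0, whose priority is zero.
print(tree.get(0.0))  # (0, 0.0)

# A zero priority gives an infinite raw weight; after dividing by the
# maximum, every other weight in the batch collapses to 0.0.
print(rescaled_weights([0.0, 1.0, 2.0], tree.total(), n=4))
```

Switching the comparison to a strict `<` moves the problem rather than removing it: a zero-priority leaf at the right edge can still be reached when `v` equals the total or when floating-point rounding leaves a small remainder. That is why the sketch guards at the sampling site instead.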

findmyway added 2 commits June 24, 2020 02:41

fix bug in sum tree

dbfd899

sampling trick =。=

e8f3961

findmyway mentioned this pull request Jun 24, 2020

fix IQN JuliaReinforcementLearning/ReinforcementLearningZoo.jl#53

Merged

findmyway merged commit ab738de into JuliaReinforcementLearning:master Jun 24, 2020
