Improve tabular learner #140

findmyway · 2020-10-08T17:05:55Z

Changes:

TabularLearner returns the value it stores only. Previously it returns the probability with length equals to get_actions(env).
Add an alias of TabularRandomPolicy
expected_policy_values is removed. (This is a breaking change but I think no one has used it)

improve tabular learner

17569c8

findmyway merged commit 1c23c91 into JuliaReinforcementLearning:master Oct 8, 2020

Provide feedback