Skip to content
This repository was archived by the owner on May 6, 2021. It is now read-only.

Conversation

@findmyway
Copy link
Member

Changes:

  • TabularLearner returns the value it stores only. Previously it returns the probability with length equals to get_actions(env).
  • Add an alias of TabularRandomPolicy
  • expected_policy_values is removed. (This is a breaking change but I think no one has used it)

@findmyway findmyway merged commit 1c23c91 into JuliaReinforcementLearning:master Oct 8, 2020
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant