Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Thanks for the tutorial/video. This was fun. May have gone a bit overboard. xD
Wish I'd made it smarter, might play with the epsilon idea in the future. I've reinforced my hunger for reinforcement learning!
Elements of my solution:
random_start()start anywhere on the board.difficultyparameter that scales up the number of walls.create_reds(),create_greens()andcreate_walls()that add more special squares to help and hinder the agent.(x,y)scaling for larger boardsmax_q()changes for more randomized/Q-sensitive agent decision making.