Multi-step Prediction using Tree Generation for Reinforcement Learning

Publication Year2022

0
Citations
286
Usage
0
Captures
0
Mentions
0
Social Media

Metric Options: Counts1 Year3 Year

Metrics Details

Usage
286
- Downloads
  186
- Abstract Views
  100

Thesis / Dissertation Description

The goal of reinforcement learning is to learn a policy that maximizes a reward function. In some environments with complete information, search algorithms are highly useful in simulating action sequences in a game tree. However, in many practical environments, such effective search strategies are not applicable since their state transition information may not be available. This paper proposes a novel method to approximate a game tree that enables reinforcement learning to use search strategies even in incomplete information environments. With an approximated game tree, the agent predicts all possible states multiple steps into the future and evaluates the states to determine the best action sequences with the highest return. Our proposal differs from deep reinforcement learning in that it uses deep learning for not only the state evaluation but also game tree approximation. This allows it to perform better in completing complex tasks as well as learning in sparse reward environments.

Bibliographic Details

DOI10.31979/etd.63be-cbuw

REPOSITORY URLhttps://scholarworks.sjsu.edu/etd_projects/1195

URL IDhttps://scholarworks.sjsu.edu/etd_projects/1195; http://dx.doi.org/10.31979/etd.63be-cbuw; https://scholarworks.sjsu.edu/cgi/viewcontent.cgi?article=2195&context=etd_projects; https://dx.doi.org/10.31979/etd.63be-cbuw; https://scholarworks.sjsu.edu/etd_projects/1195/

AUTHOR(S)

Kevin Prakash

PUBLISHER(S)

San Jose State University Library

Provide Feedback

Have ideas for a new metric? Would you like to see something else here?Let us know