Knowledge Transfer in Reinforcement Learning: How agents should benefit from prior knowledge

Zhaodong   Wang

Back

Knowledge Transfer in Reinforcement Learning: How agents should benefit from prior knowledge

Dissertation

Open access

Knowledge Transfer in Reinforcement Learning: How agents should benefit from prior knowledge

Zhaodong Wang

Doctor of Philosophy (PhD), Washington State University

01/2019

Handle:

https://hdl.handle.net/2376/17895

Files and links (1)

pdf

Zhaodong Thesis5.94 MBDownload View

Open Access

Abstract

Learning from Demonstration

Reinforcement Learning

Transfer Learning

Machine Learning

Reinforcement learning (RL) has had many successes in different tasks, but in practice, it often requires significant amounts of data or training time to learn high-performing policies. For complicated tasks, the learning speed may be too slow to be feasible. To speed up the learning, external prior knowledge could be leveraged by the RL agent. Knowledge transfer could be performed to allow a trained (source) agent to assist a new (target) agent. It is also possible that a human demonstrator could behave as the source. With help from the prior, the new agent is expected to learn faster than learning from scratch. In most cases, prior knowledge cannot be directly copied from the source due to the different internal learning representations (e.g., from a human to a virtual agent). We need to re-summarize the usable transferred policy by observing the state-action behavior of the source agent. We propose methods that could analyze the offline confidence in summarized knowledge, which would help the agent better understand the source policy. Given the limited observation data, the agent can estimate which parts of the policy are trustworthy based on the confidence level. However, only measuring the offline confidence based on observed data is not enough. This dissertation extends the confidence analysis to the online learning setting, by further updating the confidence values based on how well the prior policies could adapt to the target tasks. Our online confidence mechanism could also remove manually defined sensitive parameters tuning and let the learner effectively adjust by itself. Confidence measurement could be done by leveraging supervised learning techniques but the demonstrations data could contain more sequential information like spatial or temporal structure. In addition to the confidence measurement of supervised methods, we further explore model-based methods to summarize the policy. Knowledge transfer is more challenging in real-life tasks, this dissertation would discuss techniques that enables the transfer from a model-based simulation to real robots. For extremely complicated learning areas like deep learning (DL), we propose correlated-feature networks that enable the agent to progressively reuse weights considering the feature relations between source and target tasks.

Metrics

124 File views/ downloads

88 Record Views

Details

Title: Knowledge Transfer in Reinforcement Learning
Creators: Zhaodong Wang
Contributors: Matthew E Taylor (Advisor)
Assefaw Gebremedhin (Committee Member)
Aaron Crandall (Committee Member)
John P Swensen (Committee Member)
Awarding Institution: Washington State University
Academic Unit: School of Electrical Engineering and Computer Science
Theses and Dissertations: Doctor of Philosophy (PhD), Washington State University
Number of pages: 136
Identifiers: 99900581813301842
Language: English
Resource Type: Dissertation