You are in : Home » Results & submissions » Select UOA » 11 - Computer Science and Informatics » View submission: University of York » Outputs » Detail

Output details

11 - Computer Science and Informatics

University of York

Return to search Previous output Next output

Output 45 of 139 in the submission

Output title

Dynamic Potential-Based Reward Shaping

Type

E - Conference contribution

DOI

Name of conference/published proceedings

Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems

Volume number

Issue number

First page of article

433

ISSN of proceedings

Year of publication

2012

URL

http://www.ifaamas.org/Proceedings/aamas2012/papers/2C_3.pdf

Number of additional authors

Additional information

<12>This paper shows that the theoretical guarantees of potential-based reward shaping (PBRS), an important and useful reinforcement learning (RL) tool, extend to cases where reward shaping is dynamically changing

during learning. This is especially crucial for cases where the reward shaping is being learned online.

As one conference referee noted, "this is a big result". Many researchers in private

correspondence have expressed surprise at our result, and the work already led to a joint publication at

AAMAS with researchers from the University of Nebraska. The paper shows rigour, containing both theoretical proofs and empirical demonstrations of the theoretical guarantees.

Interdisciplinary

Cross-referral requested

Research group

G - Artificial Intelligence

Citation count

Proposed double-weighted

Double-weighted statement

Reserve for a double-weighted output

Non-English

English abstract