For the current REF see the REF 2021 website REF 2021 logo

Output details

11 - Computer Science and Informatics

University of York

Return to search Previous output Next output
Output 45 of 139 in the submission
Output title

Dynamic Potential-Based Reward Shaping

Type
E - Conference contribution
DOI
-
Name of conference/published proceedings
Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems
Volume number
-
Issue number
-
First page of article
433
ISSN of proceedings
-
Year of publication
2012
Number of additional authors
1
Additional information

<12>This paper shows that the theoretical guarantees of potential-based reward shaping (PBRS), an important and useful reinforcement learning (RL) tool, extend to cases where reward shaping is dynamically changing

during learning. This is especially crucial for cases where the reward shaping is being learned online.

As one conference referee noted, "this is a big result". Many researchers in private

correspondence have expressed surprise at our result, and the work already led to a joint publication at

AAMAS with researchers from the University of Nebraska. The paper shows rigour, containing both theoretical proofs and empirical demonstrations of the theoretical guarantees.

Interdisciplinary
-
Cross-referral requested
-
Research group
G - Artificial Intelligence
Citation count
-
Proposed double-weighted
No
Double-weighted statement
-
Reserve for a double-weighted output
No
Non-English
No
English abstract
-