Output details
11 - Computer Science and Informatics
Heriot-Watt University
Hybrid reinforcement/supervised learning of dialogue policies from fixed data sets
<20>This work became the basis of an EC FP7 project "CLASSiC" (Computational Learning in Adaptive Systems for Spoken Conversation, 5m euros, 2008-2011), coordinated by Lemon, which developed an end-to-end statistical approach to Spoken Dialogue Systems. This then led to EC project "PARLANCE" (FP7, 2011,1m euro), which extends these methods to incremental dialogue. The work also generated an ESPRC project (£358,000, 2009), with Lemon as PI, exploring state compression methods for complex extensions of the problem. This research also led to special sessions at Interspeech 2009 and 2011 organised by Lemon, and an invited talk at ECAI 2012.