Reward Shaping with Recurrent Neural Networks for Speeding up On-Line Policy Learning in Spoken Dialogue Systems

Pei-Hao Su | David Vandyke | Milica Gašić | Nikola Mrkšić | Tsung-Hsien Wen | Steve Young |

Paper Details:

Month: September
Year: 2015
Location: Prague, Czech Republic
Venue: SIGDIAL | WS |
SIG: SIGDIAL