Abstract
We present and evaluate a new model for Natural Language Generation (NLG) in Spoken Dialogue Systems, based on statistical planning, given noisy feedback from the current generation context (e.g. a user and a surface realiser). The model is adaptive and incremental at the turn level, and optimises NLG actions with respect to a data-driven objective function. We study its use in a standard NLG problem: how to present information (in this case a set of search results) to users, given the complex trade-offs between utterance length, amount of information conveyed, and cognitive load. We set these trade-offs in an objective function by analysing existing match data. We then train a NLG policy using Reinforcement Learning (RL), which adapts its behaviour to noisy feedback from the current generation context. This policy is compared to several baselines derived from previous work in this area. The learned policy significantly outperforms all the prior approaches.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Baddeley, A.: Working memory and language: an overview. Journal of Communication Disorder 36(3), 189–208 (2001)
Boidin, C., Rieser, V., van der Plas, L., Lemon, O., Chevelu, J.: Predicting how it sounds: Re-ranking dialogue prompts based on TTS quality for adaptive spoken dialogue systems. In: Proceedings of the Interspeech Special Session: Machine Learning for Adaptivity in Spoken Dialogue (2009)
Branavan, S., Chen, H., Zettlemoyer, L., Barzilay, R.: Reinforcement learning for mapping instructions to actions. In: Proceedings of ACL, pp. 82–90 (2009)
van Deemter, K.: What game theory can do for NLG: the case of vague language (keynote paper). In: 12th European Workshop on Natural Language Generation (ENLG), pp. 154–161 (2009)
Demberg, V., Moore, J.D.: Information presentation in spoken dialogue systems. In: Proceedings of EACL, pp. 65–72 (2006)
Gasic, M., Keizer, S., Mairesse, F., Schatzmann, J., Thomson, B., Young, S.: Training and Evaluation of the HIS POMDP Dialogue System in Noise. In: Proceedings of SIGdial Workshop on Discourse and Dialogue, pp. 112–119 (2008)
Henderson, J., Lemon, O.: Mixture Model POMDPs for Efficient Handling of Uncertainty in Dialogue Management. In: Proceedings of ACL, pp. 73–76 (2008)
Henderson, J., Lemon, O., Georgila, K.: Hybrid reinforcement / supervised learning of dialogue policies from fixed datasets. Computational Linguistics 34(4), 487–512 (2008)
Janarthanam, S., Lemon, O.: Learning Adaptive Referring Expression Generation Policies for Spoken Dialogue Systems. In: Krahmer, E., Theune, M. (eds.) Empirical Methods in NLG. LNCS (LNAI), vol. 5790, pp. 67–84. Springer, Heidelberg (2010)
Janarthanam, S., Lemon, O.: User simulations for online adaptation and knowledge-alignment in Troubleshooting dialogue systems. In: Proceedings of SEMdial, pp. 133–134 (2008)
Koller, A., Petrick, R.: Experiences with planning for natural language generation. In: ICAPS (2008)
Koller, A., Stone, M.: Sentence generation as planning. In: Proceedings of ACL, pp. 336–343 (2007)
Lemon, O.: Adaptive Natural Language Generation in Dialogue using Reinforcement Learning. In: Proceedings of SEMdial (2008)
Lemon, O.: Learning what to say and how to say it: joint optimization of spoken dialogue management and Natural Language Generation. Computer Speech and Language (to appear)
Liu, X., Rieser, V., Lemon, O.: A Wizard-of-Oz interface to study Information Presentation strategies for Spoken Dialogue Systems. In: Proceedings of the 1st International Workshop on Spoken Dialogue Systems Technology (2009)
Moore, J., Foster, M.E., Lemon, O., White, M.: Generating tailored, comparative descriptions in spoken dialogue. In: Proceedings of FLAIRS (2004)
Nakatsu, C., White, M.: Learning to say it well: Reranking realizations by predicted synthesis quality. In: Proceedings of ACL (2006)
Oh, A., Rudnicky, A.: Stochastic natural language generation for spoken dialog systems. Computer, Speech & Language 16(3/4), 387–407 (2002)
Paek, T., Horvitz, E.: Conversation as action under uncertainty. In: Proceedings of the 16th Conference on Uncertainty in Artificial Intelligence, pp. 455–464 (2000)
Polifroni, J., Walker, M.: Intensional Summaries as Cooperative Responses in Dialogue Automation and Evaluation. In: Proceedings of ACL, pp. 479–487 (2008)
Rieser, V., Lemon, O.: Does this list contain what you were searching for? Learning adaptive dialogue strategies for Interactive Question Answering. J. Natural Language Engineering 15(1), 55–72 (2008)
Rieser, V., Lemon, O.: Learning Effective Multimodal Dialogue Strategies from Wizard-of-Oz data: Bootstrapping and Evaluation. In: Proceedings of ACL, pp. 638–646 (2008)
Rieser, V., Lemon, O.: Learning and evaluation of dialogue strategies for new applications: Empirical methods for optimization from small data sets. Computational Linguistics (subm)
Singh, S., Litman, D., Kearns, M., Walker, M.: Optimizing dialogue management with Reinforcement Learning: Experiments with the NJFun system. Journal of Artificial Intelligence Research (JAIR) 16, 105–133 (2002)
Stent, A., Prasad, R., Walker, M.: Trainable sentence planning for complex information presentation in spoken dialog systems. In: Proceedings of ACL, pp. 79–86 (2004)
Stent, A., Walker, M., Whittaker, S., Maloor, P.: User-tailored generation for spoken dialogue: an experiment. In: Proceedings of ICSLP (2002)
Sutton, R., Barto, A.: Reinforcement Learning. MIT Press, Cambridge (1998)
Wahlster, W., Andre, E., Finkler, W., Profitlich, H.J., Rist, T.: Plan-based integration of natural language and graphics generation. Artificial Intelligence 16(63), 387–427 (1993)
Walker, M., Stent, A., Mairesse, F., Prasad, R.: Individual and domain adaptation in sentence planning for dialogue. Journal of Artificial Intelligence Research (JAIR) 30, 413–456 (2007)
Walker, M., Whittaker, S., Stent, A., Maloor, P., Moore, J., Johnston, M., Vasireddy, G.: User tailored generation in the match multimodal dialogue system. Cognitive Science 28, 811–840 (2004)
Walker, M.A., Kamm, C.A., Litman, D.J.: Towards developing general models of usability with PARADISE. Natural Language Engineering 6(3) (2000)
Whittaker, S., Walker, M., Maloor, P.: Should I Tell All? An Experiment on Conciseness in Spoken Dialogue. In: Proceedings of Eurospeech (2003)
Whittaker, S., Walker, M., Moore, J.: Fish or Fowl: A Wizard of Oz evaluation of dialogue strategies in the restaurant domain. In: Proceedings of LREC (2002)
Winterboer, A., Hu, J., Moore, J.D., Nass, C.: The influence of user tailoring and cognitive load on user performance in spoken dialogue systems. In: Proceedings of Interspeech/ICSLP (2007)
Young, S., Schatzmann, J., Weilhammer, K., Ye, H.: The Hidden Information State Approach to Dialog Management. In: ICASSP (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Rieser, V., Lemon, O. (2010). Natural Language Generation as Planning under Uncertainty for Spoken Dialogue Systems. In: Krahmer, E., Theune, M. (eds) Empirical Methods in Natural Language Generation. EACL ENLG 2009 2009. Lecture Notes in Computer Science(), vol 5790. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15573-4_6
Download citation
DOI: https://doi.org/10.1007/978-3-642-15573-4_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15572-7
Online ISBN: 978-3-642-15573-4
eBook Packages: Computer ScienceComputer Science (R0)