Toward Few-Shot Response Generation for Conversational Agents

Placeholder Show Content

Abstract/Contents

Abstract

One of the major impediments to the deployment of task-oriented response generation models in real-world scenarios is a lack of labeled in-domain training data. In this thesis, we explore a multi-task learning approach in which we pre-train a model with unlabeled dialogues, and then fine-tuned in a few-shot setting to generate factually accurate and natural agent responses.

We formulate our primary task as utterance paraphrasing, where a template-based agent utterance in the context of the preceding dialogue is converted into a more natural-sounding one. We adapt unlabeled dialogues as pre-training data for this task by using automatically generated paraphrases of the last agent utterance as templates to predict the utterance in the context of the preceding dialogue turns. Since the real templates lack the contextual information embedded in the paraphrases, we introduce an auxiliary task, next sentence prediction, where the last turn of a dialogue is masked in the input and used as the predicted gold output.

Evaluation across several conversational datasets shows that our pre-training strategy improves the quality of responses in an in-domain context, indicating that leveraging unlabeled datasets can be an effective method for improving the quality of responses.

Description

Type of resource text
Date created June 2021

Creators/Contributors

Author Yu, Dhara
Degree granting institution Stanford University, Program in Symbolic Systems
Primary advisor Lam, Monica
Advisor Potts, Christopher

Subjects

Subject stanford
Subject symbolic systems
Subject response generation
Subject conversational agent
Subject virtual assistant
Subject natural language processing
Subject task oriented dialogue
Genre Thesis

Bibliographic information

Access conditions

Use and reproduction
User agrees that, where applicable, content will not be used to identify or to otherwise infringe the privacy or confidentiality rights of individuals. Content distributed via the Stanford Digital Repository may be subject to additional license and use restrictions applied by the depositor.
License
This work is licensed under a Creative Commons Attribution Non Commercial 3.0 Unported license (CC BY-NC).

Preferred citation

Preferred Citation
Yu, Dhara. (2021). Toward Few-Shot Response Generation for Conversational Agents. Stanford Digital Repository. Available at: https://purl.stanford.edu/sc630xy7254

Collection

Undergraduate Honors Theses, Symbolic Systems Program, Stanford University

View other items in this collection in SearchWorks

Contact information

Also listed in

Loading usage metrics...