Toward Few-Shot Response Generation for Conversational Agents
Abstract/Contents
- Abstract
One of the major impediments to the deployment of task-oriented response generation models in real-world scenarios is a lack of labeled in-domain training data. In this thesis, we explore a multi-task learning approach in which we pre-train a model with unlabeled dialogues, and then fine-tuned in a few-shot setting to generate factually accurate and natural agent responses.
We formulate our primary task as utterance paraphrasing, where a template-based agent utterance in the context of the preceding dialogue is converted into a more natural-sounding one. We adapt unlabeled dialogues as pre-training data for this task by using automatically generated paraphrases of the last agent utterance as templates to predict the utterance in the context of the preceding dialogue turns. Since the real templates lack the contextual information embedded in the paraphrases, we introduce an auxiliary task, next sentence prediction, where the last turn of a dialogue is masked in the input and used as the predicted gold output.
Evaluation across several conversational datasets shows that our pre-training strategy improves the quality of responses in an in-domain context, indicating that leveraging unlabeled datasets can be an effective method for improving the quality of responses.
Description
Type of resource | text |
---|---|
Date created | June 2021 |
Creators/Contributors
Author | Yu, Dhara |
---|---|
Degree granting institution | Stanford University, Program in Symbolic Systems |
Primary advisor | Lam, Monica |
Advisor | Potts, Christopher |
Subjects
Subject | stanford |
---|---|
Subject | symbolic systems |
Subject | response generation |
Subject | conversational agent |
Subject | virtual assistant |
Subject | natural language processing |
Subject | task oriented dialogue |
Genre | Thesis |
Bibliographic information
Access conditions
- Use and reproduction
- User agrees that, where applicable, content will not be used to identify or to otherwise infringe the privacy or confidentiality rights of individuals. Content distributed via the Stanford Digital Repository may be subject to additional license and use restrictions applied by the depositor.
- License
- This work is licensed under a Creative Commons Attribution Non Commercial 3.0 Unported license (CC BY-NC).
Preferred citation
- Preferred Citation
- Yu, Dhara. (2021). Toward Few-Shot Response Generation for Conversational Agents. Stanford Digital Repository. Available at: https://purl.stanford.edu/sc630xy7254
Collection
Undergraduate Honors Theses, Symbolic Systems Program, Stanford University
View other items in this collection in SearchWorksContact information
- Contact
- dharakyu@stanford.edu
Also listed in
Loading usage metrics...