Toward Few-Shot Response Generation for Conversational Agents

Yu, Dhara

Toward Few-Shot Response Generation for Conversational Agents

<a href="https://embed.stanford.edu/iframe/?url=https%3A%2F%2Fpurl.stanford.edu%2Fsc630xy7254" class="su-underline">Show Content</a>

Abstract/Contents

Abstract

One of the major impediments to the deployment of task-oriented response generation models in real-world scenarios is a lack of labeled in-domain training data. In this thesis, we explore a multi-task learning approach in which we pre-train a model with unlabeled dialogues, and then fine-tuned in a few-shot setting to generate factually accurate and natural agent responses.

We formulate our primary task as utterance paraphrasing, where a template-based agent utterance in the context of the preceding dialogue is converted into a more natural-sounding one. We adapt unlabeled dialogues as pre-training data for this task by using automatically generated paraphrases of the last agent utterance as templates to predict the utterance in the context of the preceding dialogue turns. Since the real templates lack the contextual information embedded in the paraphrases, we introduce an auxiliary task, next sentence prediction, where the last turn of a dialogue is masked in the input and used as the predicted gold output.

Evaluation across several conversational datasets shows that our pre-training strategy improves the quality of responses in an in-domain context, indicating that leveraging unlabeled datasets can be an effective method for improving the quality of responses.

Description

Type of resource	text
Date created	June 2021

Creators/Contributors

Author	Yu, Dhara
Degree granting institution	Stanford University, Program in Symbolic Systems
Primary advisor	Lam, Monica
Advisor	Potts, Christopher

Subjects

Subject	stanford
Subject	symbolic systems
Subject	response generation
Subject	conversational agent
Subject	virtual assistant
Subject	natural language processing
Subject	task oriented dialogue
Genre	Thesis

Bibliographic information

Location	https://purl.stanford.edu/sc630xy7254

Access conditions

Use and reproduction: User agrees that, where applicable, content will not be used to identify or to otherwise infringe the privacy or confidentiality rights of individuals. Content distributed via the Stanford Digital Repository may be subject to additional license and use restrictions applied by the depositor.
License: This work is licensed under a Creative Commons Attribution Non Commercial 3.0 Unported license (CC BY-NC).

Preferred citation

Preferred Citation: Yu, Dhara. (2021). Toward Few-Shot Response Generation for Conversational Agents. Stanford Digital Repository. Available at: https://purl.stanford.edu/sc630xy7254

Collection

Undergraduate Honors Theses, Symbolic Systems Program, Stanford University

View other items in this collection in SearchWorks

Contact information

Contact: dharakyu@stanford.edu

Also listed in

View in SearchWorks

Loading usage metrics...