Body Language Animation Synthesis from Prosody

Placeholder Show Content

Abstract/Contents

Abstract
Human communication involves not only speech, but also a wide variety of gestures and body motions. Interactions in virtual environments often lack this multi-modal aspect of communication. This thesis presents a method for automatically synthesizing body language animations directly from the participants’ speech signals, without the need for additional input. The proposed system generates appropriate body language animations in real time from live speech by selecting segments from motion capture data of real people in conversation. The selection is driven by a hidden Markov model and uses prosody-based features extracted from speech. The training phase is fully automatic and does not require hand-labeling of input data, and the synthesis phase is efficient enough to run in real time on live microphone input. The results of a user study confirm that the proposed method is able to produce realistic and compelling body language.

Description

Type of resource text
Date created 2009-05

Creators/Contributors

Author Levine, Sergey V.
Advisor Koltun, Vladlen
Advisor Theobalt, Christian
Department Stanford University. Department of Computer Science.

Subjects

Subject Human-computer interaction
Subject Computer animation
Subject Firestone Medal for Excellence in Undergraduate Research
Subject Ben Wegbreit Prize for Best Undergraduate Honors Thesis in Computer Science
Genre Thesis

Bibliographic information

Access conditions

Use and reproduction
User agrees that, where applicable, content will not be used to identify or to otherwise infringe the privacy or confidentiality rights of individuals. Content distributed via the Stanford Digital Repository may be subject to additional license and use restrictions applied by the depositor.
License
This work is licensed under a Creative Commons Attribution Non Commercial 3.0 Unported license (CC BY-NC).

Preferred citation

Preferred Citation
Levine, Sergey (2009). Body Language Animation Synthesis from Prosody. Stanford Digital Repository. Available at http://purl.stanford.edu/vh359gs8861

Collection

Undergraduate Theses, School of Engineering

View other items in this collection in SearchWorks

Contact information

Also listed in

Loading usage metrics...