Visual content creation and editing via structural and functional hierarchies

Placeholder Show Content

Abstract/Contents

Abstract
Starting from low-level perceptual input, humans can automatically infer higher-level semantically meaningful structures in the visual world. In this thesis, I present my work on systems that extract and leverage such hierarchies for visual content creation and editing. First, I present Motion Programs, a hierarchical motion representation for human actions that captures both low-level motion and its high-level description as motion concepts. This representation enables human motion description, interactive editing, and controlled synthesis of novel video sequences within a single framework. Next, I present ProGRIP, a novel shape program representation that uses implicit functions to represent parts. ProGRIP outperforms existing structured representations in shape reconstruction fidelity and semantic segmentation and enables interactive editing. Finally, I present a model for realistically inserting people into scenes. Our model can infer the set of realistic poses given the scene context, re-pose the reference person, and harmonize the composition. We train a large-scale diffusion model to perform this task. At inference time, our model can also be prompted differently to perform several different auxiliary tasks, such as person hallucination, scene hallucination, and cloth swapping. Quantitative and qualitative evaluations show that our method synthesizes more realistic human appearance and more natural human-scene interactions than prior work.

Description

Type of resource text
Form electronic resource; remote; computer; online resource
Extent 1 online resource.
Place California
Place [Stanford, California]
Publisher [Stanford University]
Copyright date 2023; ©2023
Publication date 2023; 2023
Issuance monographic
Language English

Creators/Contributors

Author Kulal, Sumith
Degree supervisor Aiken, Alexander
Degree supervisor Wu, Jiajun, (Computer scientist)
Thesis advisor Aiken, Alexander
Thesis advisor Wu, Jiajun, (Computer scientist)
Thesis advisor Niebles Duque, Juan Carlos, 1980-
Degree committee member Niebles Duque, Juan Carlos, 1980-
Associated with Stanford University, School of Engineering
Associated with Stanford University, Computer Science Department

Subjects

Genre Theses
Genre Text

Bibliographic information

Statement of responsibility Sumith Kulal.
Note Submitted to the Computer Science Department.
Thesis Thesis Ph.D. Stanford University 2023.
Location https://purl.stanford.edu/gk508gg0730

Access conditions

Copyright
© 2023 by Sumith Kulal
License
This work is licensed under a Creative Commons Attribution Non Commercial 3.0 Unported license (CC BY-NC).

Also listed in

Loading usage metrics...