Data from "Artistic Vision: Providing Contextual Guidance for Capture-Time Decisions"
To learn photography is to become more intentional about the creative choices you make with your camera. Many of these creative choices happen in real time during the capture process, as the photographer takes in the scene around them and navigates a space of so many possibilities and uncertainties. However, today's resources for learning photography, such as books, classes, and example photos, are largely disconnected from the capture process. Photographers are therefore faced with the task of navigating, in real-time, a seemingly infinite space of possible creative choices while relying on a disconnected space of learning resources that can feel both inaccessible and overwhelming in the moment. The primary insight of my research is that real-time contextual guidance, embedded directly in the camera, can make accessing relevant parts of this wealth of information more approachable and actionable. The feedback assists in cutting through the noise of endless possibilities and focuses photographers' attention on targeted, meaningful creative choices.
My dissertation presents a set of capture-time interfaces that provide real-time contextual guidance. This guidance takes the form of light touch cues presented as automatically generated visual overlays, where each overlay is designed to focus on a specific photographic concept. Each interface's goal is to understand what an expert might be noticing in considering the targeted photographic concept and to, via an annotation overlay, direct a novice user's awareness in a similar manner. In designing this real-time contextual guidance, I take inspiration from photographers' current practice of directing attention through manually drawing annotations onto photos. Today, this practice is mostly restricted to post-hoc feedback used to point out specific decisions or potential mistakes that the artist made. I develop algorithmic approaches designed to understand conceptually relevant aspects of the scene that the photographer is viewing. These algorithms generate annotations that are displayed in the camera in real time. The annotations can move beyond explaining why a specific decision was made, towards helping the photographer become aware of artistic choices that could be made, providing guidance while encouraging creativity and exploration. Through the overlays, we hope to help novices train their eye to see in the way that experts do.
Specifically, I present in-camera guidance interfaces tackling three important photographic concepts: portrait lighting, composition, and decluttering. The portrait lighting tool helps users be more aware of the available lighting styles and reorient their subject to best achieve the lighting style of their choice. The composition guidance tool makes users more aware of the current composition by highlighting lines in a composition grid that are most relevant to the camera view. The decluttering tool increases users' awareness of clutter that would draw attention away from the main story of the image by abstracting the camera view to outline edges around the subject(s) or the image borders. For each interface, I describe my process for designing a novice-interpretable visualization and how it captures context relevant to the target concept. I then evaluate each interface by asking novice photographers to take photos with these tools while focusing on their target concept.
Together, these tools and their evaluations demonstrate that such awareness-based visual guidance camera interfaces can help people be more intentional about their artistic choices. By making users more aware of possible options and mistakes, the interfaces introduced in this dissertation encourage users to explore the space in a more informed manner. In this way, the tools presented in my dissertation help users become more confident in their ability to achieve their artistic goals.
|Type of resource
|[ca. 2014 - 2021]
|E, Jane L
|Landay, James A
- Use and reproduction
- User agrees that, where applicable, content will not be used to identify or to otherwise infringe the privacy or confidentiality rights of individuals. Content distributed via the Stanford Digital Repository may be subject to additional license and use restrictions applied by the depositor.
- This work is licensed under a Creative Commons Attribution Non Commercial 3.0 Unported license (CC BY-NC).
- Preferred Citation
- E, Jane L and Landay, James A and Hanrahan, Pat. Data from "Artistic Vision: Providing Contextual Guidance for Capture-Time Decisions". Stanford Digital Repository. Available at: https://purl.stanford.edu/dg109bb1678
Stanford Research DataView other items in this collection in SearchWorks
Also listed in
Loading usage metrics...