SoundCam: A Dataset for Tasks in Tracking and Identifying Humans from Real Room Acoustics

Placeholder Show Content

Abstract/Contents

Abstract
A room's acoustic properties are a product of the room's geometry, as well as the objects within the room and their specific positions. A room’s acoustic properties can be characterized by its impulse response (RIR) between a source and listener location, or inferred roughly from recordings of natural signals present in the room. We present SoundCam, the largest dataset of unique RIRs from in-the-wild rooms released to date publicly. It includes 5,000 10-channel real-world measurements of room impulse responses and 2,000 10-channel recordings of music in three different rooms, including a controlled acoustic lab, an in-the-wild living room, and a conference room, with different humans in positions throughout each room. We show that these measurements can be used for interesting tasks, such as detecting and identifying the human, and tracking their position. See linked project page for more details.

Description

Type of resource Dataset, three dimensional object, still image, text, sound recording
Date created May 5, 2023 - June 3, 2023
Publication date August 22, 2023; August 21, 2023

Creators/Contributors

Author Wang, Mason
Author Clarke, Samuel
Author Wang, Jui-Hsien
Author Gao, Ruohan
Author Wu, Jiajun

Subjects

Subject audio learning
Subject acoustics
Subject tracking
Genre Data
Genre 3d model
Genre Image
Genre Code
Genre Sound recording
Genre Data sets
Genre Dataset
Genre Three-dimensional scan
Genre Computer program

Bibliographic information

Access conditions

Use and reproduction
User agrees that, where applicable, content will not be used to identify or to otherwise infringe the privacy or confidentiality rights of individuals. Content distributed via the Stanford Digital Repository may be subject to additional license and use restrictions applied by the depositor.
License
This work is licensed under an MIT License.

Preferred citation

Preferred citation
Wang, M., Clarke, S., Wang, J., Gao, R., and Wu, J. (2023). SoundCam: A Dataset for Tasks in Tracking and Identifying Humans from Real Room Acoustics. Stanford Digital Repository. Available at https://purl.stanford.edu/xq364hd5023. https://doi.org/10.25740/xq364hd5023.

Collection

Contact information

Also listed in

Loading usage metrics...