SoundCam: A Dataset for Tasks in Tracking and Identifying Humans from Real Room Acoustics
Abstract/Contents
- Abstract
- A room's acoustic properties are a product of the room's geometry, as well as the objects within the room and their specific positions. A room’s acoustic properties can be characterized by its impulse response (RIR) between a source and listener location, or inferred roughly from recordings of natural signals present in the room. We present SoundCam, the largest dataset of unique RIRs from in-the-wild rooms released to date publicly. It includes 5,000 10-channel real-world measurements of room impulse responses and 2,000 10-channel recordings of music in three different rooms, including a controlled acoustic lab, an in-the-wild living room, and a conference room, with different humans in positions throughout each room. We show that these measurements can be used for interesting tasks, such as detecting and identifying the human, and tracking their position. See linked project page for more details.
Description
Type of resource | Dataset, three dimensional object, still image, text, sound recording |
---|---|
Date created | May 5, 2023 - June 3, 2023 |
Publication date | August 22, 2023; August 21, 2023 |
Creators/Contributors
Author | Wang, Mason | |
---|---|---|
Author | Clarke, Samuel | |
Author | Wang, Jui-Hsien | |
Author | Gao, Ruohan | |
Author | Wu, Jiajun |
Subjects
Subject | audio learning |
---|---|
Subject | acoustics |
Subject | tracking |
Genre | Data |
Genre | 3d model |
Genre | Image |
Genre | Code |
Genre | Sound recording |
Genre | Data sets |
Genre | Dataset |
Genre | Three-dimensional scan |
Genre | Computer program |
Bibliographic information
Related item |
|
---|---|
DOI | https://doi.org/10.25740/xq364hd5023 |
Location | https://purl.stanford.edu/xq364hd5023 |
Access conditions
- Use and reproduction
- User agrees that, where applicable, content will not be used to identify or to otherwise infringe the privacy or confidentiality rights of individuals. Content distributed via the Stanford Digital Repository may be subject to additional license and use restrictions applied by the depositor.
- License
- This work is licensed under an MIT License.
Preferred citation
- Preferred citation
- Wang, M., Clarke, S., Wang, J., Gao, R., and Wu, J. (2023). SoundCam: A Dataset for Tasks in Tracking and Identifying Humans from Real Room Acoustics. Stanford Digital Repository. Available at https://purl.stanford.edu/xq364hd5023. https://doi.org/10.25740/xq364hd5023.
Collection
Stanford Research Data
View other items in this collection in SearchWorksContact information
Also listed in
Loading usage metrics...