SoundCam: A Dataset for Tasks in Tracking and Identifying Humans from Real Room Acoustics
- A room's acoustic properties are a product of the room's geometry, as well as the objects within the room and their specific positions. A room’s acoustic properties can be characterized by its impulse response (RIR) between a source and listener location, or inferred roughly from recordings of natural signals present in the room. We present SoundCam, the largest dataset of unique RIRs from in-the-wild rooms released to date publicly. It includes 5,000 10-channel real-world measurements of room impulse responses and 2,000 10-channel recordings of music in three different rooms, including a controlled acoustic lab, an in-the-wild living room, and a conference room, with different humans in positions throughout each room. We show that these measurements can be used for interesting tasks, such as detecting and identifying the human, and tracking their position. See linked project page for more details.
|Type of resource
|Dataset, three dimensional object, still image, text, sound recording
|May 5, 2023 - June 3, 2023
|August 22, 2023; August 21, 2023
- Use and reproduction
- User agrees that, where applicable, content will not be used to identify or to otherwise infringe the privacy or confidentiality rights of individuals. Content distributed via the Stanford Digital Repository may be subject to additional license and use restrictions applied by the depositor.
- This work is licensed under an MIT License.
Stanford Research DataView other items in this collection in SearchWorks
Also listed in
Loading usage metrics...