Dataset: Stanford Streaming Mobile Augmented Reality Dataset

Makar, Mina; Araujo, Andre; Chen, David

Dataset: Stanford Streaming Mobile Augmented Reality Dataset

<a href="https://embed.stanford.edu/iframe/?url=https%3A%2F%2Fpurl.stanford.edu%2Fph459zk5920" class="su-underline">Show Content</a>

Abstract/Contents

Abstract

We introduce the Stanford Streaming MAR dataset. The dataset contains 23 different objects of interest, divided to four categories: Books, CD covers, DVD covers and Common Objects. We first record one video for each object where the object is in a static position while the camera is moving. These videos are recorded with a hand-held mobile phone with different amounts of camera motion, glare, blur, zoom, rotation and perspective changes. Each video is 100 frames long, recorded at 30 fps with resolution 640 x 480. For each video, we provide a clean database image (no background noise) for the corresponding object of interest.
We also provide 5 more videos for moving objects recorded with a moving camera. These videos help to study the effect of background clutter when there is a relative motion between the object and the background. Finally, we record 4 videos that contain multiple objects from the dataset. Each video is 200 frames long and contains 3 objects of interest where the camera captures them one after the other.
We provide the ground-truth localization information for 14 videos, where we manually define a bounding quadrilateral around the object of interest in each video frame. This localization information is used in the calculation of the Jaccard index.

1. Static single object:
1.a. Books: Automata Theory, Computer Architecture, OpenCV, Wang Book.
1.b. CD Covers: Barry White, Chris Brown, Janet Jackson, Rascal Flatts, Sheryl Crow.
1.c. DVD Covers: Finding Nemo, Monsters Inc, Mummy Returns, Private Ryan, Rush Hour, Shrek, Titanic, Toy Story.
1.d. Common Objects: Bleach, Glade, Oreo, Polish, Tide, Tuna.

2. Moving object, moving camera:
Barry White Moving, Chris Brown Moving, Titanic Moving, Titanic Moving - Second, Toy Story Moving.

3. Multiple objects:
3.a. Multiple Objects 1: Polish, Wang Book, Monsters Inc.
3.b. Multiple Objects 2: OpenCV, Barry White, Titanic.
3.c. Multiple Objects 3: Monsters Inc, Toy Story, Titanic.
3.d. Multiple Objects 4: Wang Book, Barry White, OpenCV.

Description

Type of resource	software, multimedia
Date created	August 2013

Creators/Contributors

Author	Makar, Mina
Author	Araujo, Andre
Author	Chen, David

Subjects

Subject	mobile augmented reality
Subject	image retrieval
Subject	object tracking
Subject	temporally coherent keypoint detection
Subject	feature descriptors
Subject	canonical patches
Genre	Dataset

Bibliographic information

Related Publication	Mina Makar, Sam Tsai, Vijay Chandrasekhar, David Chen and Bernd Girod, "Interframe Coding of Canonical Patches for Low Bit-Rate Mobile Augmented Reality," Special Issue of the International Journal of Semantic Computing, vol. 7, no. 1, pp. 5-24, March 2013. http://dx.doi.org/10.1142/S1793351X13400011
Related Publication	Mina Makar, Sam Tsai, Vijay Chandrasekhar, David Chen and Bernd Girod, "Interframe Coding of Canonical Patches for Mobile Augmented Reality," Proc. IEEE International Symposium on Multimedia (ISM 2012), Irvine, CA, USA, December 2012.
Related Publication	Mina Makar, Vijay Chandrasekhar, Sam Tsai, David Chen, and Bernd Girod, "Interframe Coding of Feature Descriptors for Mobile Augmented Reality," IEEE Transactions on Image Processing. (in preparation).
Related Publication	Mina Makar, "Interframe Compression of Visual Feature Descriptors for Mobile Augmented Reality," Ph.D. Dissertation, EE Dept., Stanford University. (in preparation).
Location	https://purl.stanford.edu/ph459zk5920

Access conditions

Use and reproduction: User agrees that, where applicable, content will not be used to identify or to otherwise infringe the privacy or confidentiality rights of individuals. Content distributed via the Stanford Digital Repository may be subject to additional license and use restrictions applied by the depositor.

Preferred citation

Preferred Citation: Mina Makar, Sam Tsai, Vijay Chandrasekhar, David Chen and Bernd Girod, "Interframe Coding of Canonical Patches for Low Bit-Rate Mobile Augmented Reality," Special Issue of the International Journal of Semantic Computing, vol. 7, no. 1, pp. 5-24, March 2013. http://dx.doi.org/10.1142/S1793351X13400011. Data available at http://purl.stanford.edu/ph459zk5920.

Collection

Research Datasets for Image, Video, and Multimedia Systems Group at Stanford

View other items in this collection in SearchWorks

Contact information

Contact: mamakar@stanford.edu

Also listed in

View in SearchWorks

Loading usage metrics...