Incorporating uncertainty in data management and integration

Placeholder Show Content

Abstract/Contents

Abstract
Modern-day applications like information extraction on the web, data integration, entity resolution, scientific data management, and sensor data management are all required to cope with uncertainty in data. Motivated by this observation, recent years have witnessed a surge of research in the field of uncertain databases. The basic goal of this research is to abstract the common challenges and develop principled, general, and efficient techniques for dealing with uncertainty in the context of data management systems. This thesis makes advances in the field of uncertain data management by presenting efficient techniques for managing and integrating uncertain data. Specifically, the contributions may be classified under three areas: (1) Generalizing: We generalize uncertain databases to incorporate continuous probability distributions and incomplete information; (2) Integration: We establish foundations for integration of uncertain data sources; (3) Efficiency: We develop efficient algorithms for joins and indexing over uncertain data.

Description

Type of resource text
Form electronic; electronic resource; remote
Extent 1 online resource.
Publication date 2012
Issuance monographic
Language English

Creators/Contributors

Associated with Agrawal, Parag
Associated with Stanford University, Computer Science Department
Primary advisor Widom, Jennifer
Thesis advisor Widom, Jennifer
Thesis advisor Haas, Peter
Thesis advisor Ullman, Jeffrey D, 1942-
Advisor Haas, Peter
Advisor Ullman, Jeffrey D, 1942-

Subjects

Genre Theses

Bibliographic information

Statement of responsibility Parag Agrawal.
Note Submitted to the Department of Computer Science.
Thesis Thesis (Ph.D.)--Stanford University, 2012.
Location electronic resource

Access conditions

Copyright
© 2012 by Parag Agrawal
License
This work is licensed under a Creative Commons Attribution Non Commercial 3.0 Unported license (CC BY-NC).

Also listed in

Loading usage metrics...