Pagewise preview ]

CategoryValue
Available viahttp://dbpubs.stanford.edu/pub/2008-16
Submitted on 5th of June 2008
Author Das Sarma, Anish; Agrawal, Parag; Nabar, Shubha; Widom, Jennifer
Title Towards Special-Purpose Indexes and Statistics for Uncertain Data
Date of publication 2008
Citation Das Sarma, Anish; Agrawal, Parag; Nabar, Shubha; Widom, Jennifer. Towards Special-Purpose Indexes and Statistics for Uncertain Data,
Number of pages 15
Language English
Project Stanford InfoLab
Type Other
Subject group Miscellaneous
Abstract The Trio project at Stanford for managing data, uncertainty, and lineage is developed on top of a conventional DBMS. Uncertain data with lineage is encoded in relational tables, and Trio queries are translated to SQL queries on the encoding. Such a layered approach reaps significant benefits in terms of architectural simplicity, and the ability to use an off-the-shelf query processing engine. In this paper, we present special-purpose indexes and statistics that complement the layered approach to further enhance its performance. First, we identify a well-defined structure of Trio queries, relations, and their encoding that can be exploited by the underlying query optimizer to improve the performance using Trio's layered approach. We propose several mechanisms for indexing Trio's uncertain relations and study when these indexes are useful. We then present an interesting order, and an associated operator, which are especially useful to consider when composing query plans. The decision of which query plan to use for a Trio query is dictated by various statistical properties of the input data. We identify the statistical data that can guide the underlying optimizer, and design histograms that enable estimating the statistics accurately.
Contact address anish@cs.stanford.edu
Fulltext source
  • Postscript (ps, ps.gz, ps.zip)
  • PDF (pdf, pdf.gz, pdf.zip)
  • Plain text (text, text.gz, text.zip)
  • Management of the document bysiroker@db.stanford.edu

    Pagewise preview ]


    Stanford InfoLab Publication Server