Pagewise preview ]

CategoryValue
Available viahttp://dbpubs.stanford.edu/pub/2002-1
Previous version2001-25
Submitted on 12th of November 2001
Author Melnik, Sergey; Garcia-Molina, Hector; Rahm, Erhard
Title Similarity Flooding: A Versatile Graph Matching Algorithm and its Application to Schema Matching
Date of publication 2002
Published in Proc. 18th ICDE Conf. (Best Student Paper award)
Citation Melnik, Sergey; Garcia-Molina, Hector; Rahm, Erhard. Similarity Flooding: A Versatile Graph Matching Algorithm and its Application to Schema Matching, Proc. 18th ICDE Conf., 2002
Number of pages 12
Language English
Project Digital Libraries; OntoAgents/DAML
Type Conference or Journal Paper
Subject group Data Integration and Mediation; Semistructured data
Abstract Matching elements of two data schemas or two data instances plays a key role in data warehousing, e-business, or even biochemical applications. In this paper we present a matching algorithm based on a fixpoint computation that is usable across different scenarios. The algorithm takes two graphs (schemas, catalogs, or other data structures) as input, and produces as output a mapping between corresponding nodes of the graphs. Depending on the matching goal, a subset of the mapping is chosen using filters. After our algorithm runs, we expect a human to check and if necessary adjust the results. As a matter of fact, we evaluate the `accuracy' of the algorithm by counting the number of needed adjustments. We conducted a user study, in which our accuracy metric was used to estimate the labor savings that the users could obtain by utilizing our algorithm to obtain an initial matching. Finally, we illustrate how our matching algorithm is deployed as one of several high-level operators in an implemented testbed for managing information models and mappings.
Keywords Matching, Model Management, Heterogeneous Databases, Semistructured Data
Fulltext source
  • Postscript (ps, ps.gz, ps.zip)
  • PDF (pdf, pdf.gz, pdf.zip)
  • Plain text (text, text.gz, text.zip)
  • Management of the document bypubs@db.stanford.edu

    Pagewise preview ]


    Stanford InfoLab Publication Server