[Brown CS Talks] Brown CS Seminar: Alin Deutsch in Lubrano on 4/4/2002 at noon
talks-admin@list.cs.brown.edu
Wed, 27 Mar 2002 16:42:42 -0500
CS Seminar
The Department of Computer Science
BROWN UNIVERSITY
presents
Alin Deutsch
University of Pennsylvania
Thursday, April 4, 2002 at noon
Lubrano Conference Room (CIT 4th floor)
Refreshments will be served at 11:45 am
Querying XML from Mixed and Redundant Storage
Abstract
XML is widely accepted as the standard for data exchange between
businesses on the Internet. However, most corporations publish only
selected portions of their proprietary business data as XML documents,
and even then only virtually, that is, by exposing a schema (interface)
against which XML queries can be formulated. In order to be answered,
such XML queries must then be *reformulated* as queries on the actual
proprietary data.
Our work concentrates on this query reformulation problem. We solve
the problem in a very general setting that allows mixed (XML,
relational, LDAP, etc.) storage for the proprietary data and takes
advantage of redundancies (materialized views, indexes and caches)
that can enhance performance. Moreover, we are able to give a
theoretical guarantee that our algorithm will always find an optimal
reformulation if one exists. We discuss the MARS system that
implements this technique and we present a suite of experiments that
validate it.
Our general approach to query reformulation is also applicable in
contexts other than XML publishing, such as information integration,
evolution of schema correspondences, distributed data caching,
adaptive distributed query optimization, and data security.
Host: Professor Steve Reiss