[Brown CS Talks] Brown CS Seminar: Alin Deutsch in Lubrano on 4/4/2002 at noon

talks-admin@list.cs.brown.edu talks-admin@list.cs.brown.edu
Wed, 27 Mar 2002 16:42:42 -0500


			      CS Seminar
		  
		  The Department of Computer Science
			   BROWN UNIVERSITY

			      
			       presents

			     Alin Deutsch

		      University of Pennsylvania
				 
		   Thursday, April 4, 2002 at noon
	       Lubrano Conference Room (CIT 4th floor)
	       Refreshments will be served at 11:45 am
			       

	    Querying XML from Mixed and Redundant Storage

 
			       Abstract


XML is widely accepted as the standard for data exchange between
businesses on the Internet. However, most corporations publish only
selected portions of their proprietary business data as XML documents,
and even then only virtually, that is by exposing a schema (interface)
against which XML queries can be formulated.  In order to be answered,
such XML queries must then be *reformulated* as queries on the actual
proprietary data.

Our work concentrates on this query reformulation problem. We solve
the problem in a very general setting that allows mixed (XML,
relational, LDAP, etc.) storage for the proprietary data and takes
advantage of redundancies (materialized views, indexes and caches)
that can enhance performance. Moreover, we are able to give a
theoretical guarantee that our algorithm will always find an optimal
reformulation if one exists. We discuss the MARS system that
implements this technique and we present a suite of experiments that
validate it.

Our general approach to query reformulation is also applicable in
contexts other than XML publishing, such as information integration,
evolution of schema correspondences, distributed data caching,
adaptive distributed query optimization, and data security.


		     Host:  Professor Steve Reiss