[Brown CS Talks] Brown CS Seminar: Wang-Chiew Tan in Lubrano on 3/20/02 at noon

talks-admin@list.cs.brown.edu talks-admin@list.cs.brown.edu
Wed, 06 Mar 2002 08:55:59 -0500


			      CS Seminar
		  
		  The Department of Computer Science
			   BROWN UNIVERSITY

			      
			       presents

			    Wang-Chiew Tan

		      University of Pennsylvania

		  Wednesday, March 20, 2002 at noon
	       Lubrano Conference Room (CIT 4th floor)
	       Refreshments will be served at 11:45 am
			       

   Where Did My Data Come From? Annotating and Archiving Databases


			       Abstract

Publishing data on the Web has revolutionized the way much scientific
research is conducted.  However, it also brings new problems and new
opportunities. Among the problems is that it is often difficult to
trace a piece of data to its source, since it may have moved through
several databases being transformed and edited on its journey from the
source.  Worse, the source may no longer exist!  Knowing the source
and provenance is essential for its scientific credibility.  Among the
opportunities is that scientists now want to annotate a data element
and to have their annotations spread to other people who look at the
same element.  This is related to provenance because the annotation
should ``spread'' back to the source and forward to other databases and
users.

This talk deals with two issues concerned with provenance.  I will
first examine the problem of propagating annotations through queries
and show a dichotomy of complexity for this problem.  In the second
part of the talk, I will describe a new technique for archiving data
that allows all versions of an evolving scientific database to be
stored and retrieved with very small overhead.


		     Host:  Professor Steve Reiss