[CI] Initial pass at new intro
Samuel Madden
madden at csail.mit.edu
Sun Jun 15 18:39:31 EDT 2008
All --
I checked in a new intro which I think addresses the concerns about
related work a little more clearly. Please look it over and see what
you think -- Alex, I believe I've captured all the ideas you guys
mentioned in your mail, but I might have missed on or two.
I'm sure we will need to iterate on this, so pls read and comment!
There are a number of comments throughout the paper where I note
things that need to fix in light of related work, etc. Everyone
should read the BHUNT and CORDS papers and understand the basic
approaches since they appear to be the state of the art and are
clearly pretty close to what we have done.
I also made two other changes:
1) A new title, making the scheme sound less like a new index and more
like a way to exploit soft function dependencies at query execution time
2) Changed CI to CT (correlation table), so that it doesn't seem as
much like we are proposing a new index (which we aren't.)
I think we need to clean up our story re: unclustered B+trees a
little more -- they aren't really a competitive technique, just
another way to exploit correlations (rather than predicate
introduction) once correlations have been found. Problem is that one
could use an existing tool like CORDS and then create indices on the
unclustered attributes which might be a win.
We need to say if there is anything about c_per_u / our estimation of
c_per_u that is different / better than CORDS. If not we should cut
down sections 3 and 4 and say we use the same method as CORDS so that
we can dedicate more space to bucketing and multicolumn search in
section 5.
Also, there are some references to CT in the figures that I couldn't
fix -- if one of you could take care of that, that'd be great.
-Sam
More information about the CI
mailing list