[Brown CS Talks] Brown CS Seminar: Frank McSherry in Lubrano on 3/13/02 at noon
talks-admin@list.cs.brown.edu
Mon, 11 Mar 2002 16:51:34 -0500
CS Seminar
The Department of Computer Science
BROWN UNIVERSITY
presents
Frank McSherry
University of Washington
Wednesday, March 13, 2002 at noon
Lubrano Conference Room (CIT 4th floor)
Refreshments will be served at 11:45 am
Data Mining via Spectral Analysis
Abstract
Much attention has recently been paid to the analysis of data by
casting the data set as a matrix, and considering its eigenvectors.
This spectral analysis of data has been applied successfully in many
domains; examples include Google's PageRank algorithm, Latent Semantic
Analysis, and Kernel PCA. However, the success of spectral analysis
is largely empirical, with little analytic understanding of why this
approach works so well.
In this talk I will present a general framework for data mining
problems and show how this framework justifies spectral analysis.
Specifically, we will see that the problems of collaborative
filtering, web search, and data clustering fall into the framework, and
we will give rigorous bounds on the performance of spectral analysis on
each problem. Furthermore, we will see how we can immediately translate
ideas and understanding developed through this framework into new
algorithms that address problems in graph theory and numerical
analysis.
This research is joint work with Yossi Azar and Amos Fiat at Tel Aviv
University, Dimitris Achlioptas at Microsoft Research, and Anna Karlin
and Jared Saia at the University of Washington.
Host: Professor Steve Reiss