Last modified: Tue Sep 27 12:16:59 EDT 2005 by Dean Foster

Statistical Data mining: High dimensions (part 2)

Admistrivia

Lecture overview

Model

Model: Truth is a linear function. I.e. Y = X beta.

2-d example

2-d heuristic for d-dimensional problem

How close are nearest neighbors?

Intuition via Johnson-Lindenstrauss lemma

Readings:


dean@foster.net