|   | CMU-CS-97-167 Computer Science Department
 School of Computer Science, Carnegie Mellon University
 
    
     
 CMU-CS-97-167
 
Geometric Tools for Algorithms 
Santosh Vempala 
August 1997  
Ph.D. Thesis 
CMU-CS-97-167.psCMU-CS-97-167.pdf
 Keywords: Geometric algorithms, randomization, outliers, sampling,
information retrieval, machine learning
 Our thesis is that a geometric perspective yields insights into the
structure of fundamental problems, and thereby suggests efficient algorithms
for them. As evidence, we develop new geometric models and general-purpose
tools for removing outliers from numeric data, reducing
dimensionality, and counting combinatorial sets.  Then we apply
these techniques to a set of old problems to obtain polynomial-time algorithms.
These include: (1) learning noisy linear-threshold functions (half-spaces),
(2) learning the intersection of half-spaces, (3) clustering text corpora, 
and (4) counting lattice points in a convex body.
 
We supplement some of our theorems with experimental studies. 
86 pages 
 |