CMU-ML-07-104
Machine Learning Department
School of Computer Science, Carnegie Mellon University



CMU-ML-07-104

Measure Concentration of
Strongly Mixing Processes with Applications

Leonid Kontorovich

May 2007

Ph.D. Thesis

CMU-ML-07-104.pdf


Keywords: Concentration of measure, Lipschitz function, stochastic process, mixing

The concentration of measure phenomenon was first discovered in the 1930's by Paul Lévy and has been investigated since then, with increasing intensity in recent decades. The probability-theoretic results have been gradually percolating throughout the mathematical community, finding applications in Banach space geometry, analysis of algorithms, statistics and machine learning.

There are several approaches to proving concentration of measure results; we shall offer a brief survey of these. The principal contribution of this thesis is a functional norm inequality, which immediately implies a concentration inequality for nonproduct measures. The inequality is proved by elementary means, yet enables one, with minimal effort, to recover and generalize the best current results for Markov chains, as well as to obtain new results for hidden Markov chains and Markov trees.

As an application of our inequalities, we give a strong law of large numbers for a broad class of non-independent processes. In particular, this allows one to analyze the convergence of inhomogeneous Markov Chain Monte Carlo algorithms. We also give some partial results on extending the Rademacher-type generalization bounds to processes with arbitrary dependence.

We end the thesis with some conjectures and open problems.

89 pages


SCS Technical Report Collection
School of Computer Science homepage

This page maintained by reports@cs.cmu.edu