CMU-CS-06-160
Computer Science Department
School of Computer Science, Carnegie Mellon University



CMU-CS-06-160

Free Energy Estimates of All-atom Protein Structures
Using Generalized Belief Propagation

Hetunandan Kamisetty, Eric P. Xing*, Christopher James Langmead*

November 2006

CMU-CS-06-160.pdf


Keywords: Protein structure, decoy detection, free energy, probabilistic graphic models

We present a technique for approximating the free energy of protein structures using Generalized Belief Propagation (GBP). The accuracy and utility of these estimates are then demonstrated in two different application domains. First, we show that the entropy component of our free energy estimates can be used to distinguish native protein structures from decoys — structures with similar internal energy to that of the native structure, but otherwise incorrect. Our method is able to correctly identify the native fold from among a set of decoys with 87.5% accuracy over a total of 48 different immunoglobin folds. The remaining 12.5% of native structures are ranked among the top 4 of all structures. Second, we show that our estimates of ΔΔG upon mutation for three different data sets have linear correlations between 0.64-0.69 with experimental values and statistically significant p-values. Together, these results suggests that GBP is an effective means for computing free energy in all-atom models of protein structures. GBP is also efficient, taking a few minutes to run on a typical sized protein, further suggesting that GBP may be an attractive alternative to more costly molecular dynamic simulations for some tasks.

18 pages

*Machine Learning Department


Return to: SCS Technical Report Collection
School of Computer Science

This page maintained by reports@cs.cmu.edu