Proceedings of the Institute of Statistical Mathematics Vol. 54, No. 2, 211-222 (2006)

Prediction of Membrane Protein Structures
by Generalized-ensemble Algorithms

Hironori Kokubo

(Department of Chemistry, University of Houston)

Yuko Okamoto

(Department of Physics, Nagoya University)

The Markov-chain Monte Carlo method is a computer simulation algorithm that reproduces statistical ensembles. It is usually based on the Boltzmann weight factor and realizes a canonical ensemble at a fixed temperature. However, when the system has a large number of degrees of freedom, there exist a huge number of local-minimum-energy states separated by high energy barriers. As a result, the simulation tends to get trapped in these local-minimum-energy states, which makes it very difficult to reproduce an accurate canonical ensemble at low temperatures. "Generalized-ensemble algorithm" is a generic term for methods that are based on non-Boltzmann weight factors and overcome this difficulty by realizing a one-dimensional random walk in energy space. We review one of the generalized-ensemble algorithms, namely the replica-exchange method, and its extensions. As an example of its application, we present the results of a replica-exchange Monte Carlo simulation applied to the prediction of membrane protein structures.

Key words: Generalized-ensemble algorithm, replica-exchange method, membrane protein, transmembrane helix, protein tertiary structure prediction.
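
The replica-exchange idea summarized in the abstract can be illustrated with a minimal sketch. The code below is not the authors' membrane protein simulation: the one-dimensional double-well potential, the temperature ladder, the step size, and all function names are assumptions chosen only for illustration. Each replica is updated by ordinary Metropolis Monte Carlo at its own temperature, and neighbouring replicas are exchanged with the standard Metropolis criterion min(1, exp(-Delta)), where Delta = (beta_m - beta_n)(E_j - E_i).

```python
import math
import random


def energy(x):
    """Toy double-well potential (an assumption): minima at x = -1 and x = +1,
    separated by a barrier of height 8 at x = 0."""
    return 8.0 * (x * x - 1.0) ** 2


def swap_probability(beta_m, beta_n, e_i, e_j):
    """Acceptance probability for exchanging two replicas.

    Replica i (energy e_i) currently sits at inverse temperature beta_m,
    replica j (energy e_j) at beta_n.
    """
    delta = (beta_m - beta_n) * (e_j - e_i)
    return 1.0 if delta <= 0.0 else math.exp(-delta)


def replica_exchange(temperatures, n_sweeps=20000, step=0.5, seed=0):
    """Run replica-exchange Monte Carlo; return samples at temperatures[0]."""
    rng = random.Random(seed)
    betas = [1.0 / t for t in temperatures]
    xs = [-1.0] * len(betas)  # every replica starts in the left well
    cold_samples = []
    for sweep in range(n_sweeps):
        # 1) Canonical Metropolis update of each replica at its own temperature.
        for k, beta in enumerate(betas):
            trial = xs[k] + rng.uniform(-step, step)
            d_e = energy(trial) - energy(xs[k])
            if d_e <= 0.0 or rng.random() < math.exp(-beta * d_e):
                xs[k] = trial
        # 2) Exchange attempts between neighbouring temperatures,
        #    alternating between even and odd pairs on successive sweeps.
        for k in range(sweep % 2, len(betas) - 1, 2):
            p = swap_probability(betas[k], betas[k + 1],
                                 energy(xs[k]), energy(xs[k + 1]))
            if rng.random() < p:
                xs[k], xs[k + 1] = xs[k + 1], xs[k]
        cold_samples.append(xs[0])
    return cold_samples


if __name__ == "__main__":
    samples = replica_exchange([0.2, 0.5, 1.25, 3.0])
    right = sum(1 for x in samples if x > 0.0) / len(samples)
    print(f"fraction of cold-replica samples in the right well: {right:.2f}")
```

At the lowest temperature in this ladder a plain Metropolis walk would be expected to stay in the well where it started, because the barrier is large compared with the thermal energy. With exchanges, configurations that crossed the barrier at high temperature can migrate down the ladder, so the cold replica samples both wells.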

Prediction of Membrane Protein Structures by Generalized-ensemble Algorithms

Recent Progress in Ocean Data Assimilation on Climate in Tropical Pacific

Data Assimilation Model for Japan Sea Circulation

A Spatiotemporal Response Model for Chlorophyll-a Distributions Based on Some Oceanographic Factors

Forecasting Locations of Future Large Earthquakes, Using Pattern Informatics Method: A Review

Injury Surveillance System for Preventing Children's Injury that Circulates Reusable Knowledge

Graph Mining and Its Application to Statistical Modeling

Utilizing Heterogeneous Genomic Data to Estimate Gene Networks

Estimating Protein Network from Multiple Genomic Data by Kernel Methods

Prediction and Discovery: Towards Novel Methodology for Genome Data Analysis

Prediction and Estimation from Gene Expression Data Analysis: What We Want to and be Able to Conclude from Data

Quantitative Analysis Method of Network Traffic by Component Decomposition Using Bayesian Time Series Model

A Duration Analysis of Hair Salon Consumers’ Behavior and Prediction of Revisit Rates

Prediction of Infectious Disease Outbreak with Particular Emphasis on the Statistical Issues Using Transmission Model

On Model-selection Problems in Terms of Prediction Mean Squared Error

Randomness from Computational View Points
