- Shopping Bag ( 0 items )
Recent advances in ligand design methods, current issues in de novo molecular design, etc.
"…this volume continues the traditions and standards of this series as a prime resource for anyone with an interest in theoretical and computational chemistry…a welcome addition to any library collection." (Journal of the American Chemical Society, March 9, 2005)
This chapter is written for the reader who would like to learn how
Monte Carlo methods are used to calculate thermodynamic properties of systems
at the atomic level, or to determine which advanced Monte Carlo methods
might work best in their particular application. There are a number of
excellent books and review articles on Monte Carlo methods, which are generally
focused on condensed phases, biomolecules or electronic structure theory.
The purpose of this chapter is to explain and illustrate some of the
special techniques that we and our colleagues have found to be particularly
well suited for simulations of nanodimensional atomic and molecular clusters.
We want to help scientists and engineers who are doing their first work in this
area to get off on the right foot, and also provide a pedagogical chapter for
those who are doing experimental work. By including examples of simulations
of some simple, yet representative systems, we provide the reader with some
data for direct comparison when writing their own code from scratch.
Although a number of Monte Carlo methods in current use will be
reviewed, this chapter is not meant to be comprehensive in scope.Monte Carlo
is a remarkably flexible class of numerical methods. So many versions of the
basic algorithms have arisen that we believe a comprehensive review would be
of limited pedagogical value. Instead, we intend to provide our readers with
enough information and background to allow them to navigate successfully
through the many different Monte Carlo techniques in the literature. This
should help our readers use existing Monte Carlo codes knowledgably, adapt
existing codes to their own purposes, or even write their own programs. We
also provide a few general recommendations and guidelines for those who are
just getting started with Monte Carlo methods in teaching or in research.
This chapter has been written with the goal of describing methods that
are generally useful. However, many of our discussions focus on applications
to atomic and molecular clusters (nanodimensional aggregates of a finite number
of atoms and/or molecules). We do this for two reasons:
1. A great deal of our own research has focused on such systems,
particularly the phase transitions and other structural transformations induced by
changes in a cluster's temperature and size, keeping an eye on how various
properties approach their bulk limits. The precise determination of thermodynamic
properties (such as the heat capacity) of a cluster type as a function of
temperature and size presents challenges that must be addressed when using
Monte Carlo methods to study virtually any system. For example, analogous
structural transitions can also occur in phenomena as disparate as the denaturation
of proteins. The modeling of these transitions presents similar
computational challenges to those encountered in cluster studies.
2. Although cluster systems can present some unique challenges, their
study is unencumbered by many of the technical issues regarding periodic
boundary conditions that arise when solids, liquids, surface adsorbates, and
solvated biomolecules and polymers are studied. These issues are addressed
well elsewhere, and can be thoroughly appreciated and mastered once
a general background in Monte Carlo methods is obtained from this chapter.
It should be noted that "Monte Carlo" is a term used in many fields of
science, engineering, statistics, and mathematics to mean entirely different
things. The one (and only) thing that all Monte Carlo methods have in common
is that they all use random numbers to help calculate something. What we
mean by "Monte Carlo" in this chapter is the use of random-walk processes
to draw samples from a desired probability function, thereby allowing one to
calculate integrals of the form [integral] dqf(q) [rho](q). The quantity [rho](q) is a normalized
probability density function that spans the space of a many-dimensional
variable q, and f(q) is a function whose average is of thermodynamic importance
and interest. This integral, as well as all other integrals in this chapter,
should be understood to be a definite integral that spans the entire domain of
q. Finally, we note that the inclusion of quantum effects through path-integral
Monte Carlo methods is not discussed in this chapter. The reader interested in
including quantum effects in Monte Carlo thermodynamic calculations is
METROPOLIS MONTE CARLO
Monte Carlo simulations are widely used in the fields of chemistry, biology,
physics, and engineering in order to determine the structural and thermodynamic
properties of complex systems at the atomic level. Thermodynamic
averages of molecular properties can be determined from Monte Carlo methods,
as can minimum-energy structures. Let (f) represent the average value
of some coordinate-dependent property f(x), with x representing the 3N
Cartesian coordinates needed to locate all of the N atoms. In the canonical
ensemble (fixed N, V and T, with V the volume and T the absolute temperature),
averages of molecular properties are given by an average of f(x) over the
(f) = [integral] dxf(x)exp[-[beta]U(x)]/[integral]dx exp[-[beta]U(x)] 
where U(x) is the potential energy of the system, [beta] = 1/[k.sub.B]T, and [k.sub.B] is the
Boltzmann constant. If one can compute the thermodynamic average of
f(x) it is then possible to calculate various thermodynamic properties. In
the canonical ensemble it is most common to calculate E, the internal energy,
and [C.sub.v] , the constant-volume heat capacity (although other properties can be
calculated as well). For example, if we average U(x) over all possible configurations
according to Eq. , then E and [C.sub.v] are given by
E = 3N[k.sub.B]T/2 + (U) 
[C.sub.v] = 3N[k.sub.B]/2 + ([U.sup.2]) - [(U).sup.2]/([k.sub.B][T.sup.2]) 
The first term in each equation represents the contribution of kinetic energy,
which is analytically integrable. In the harmonic (low-temperature) limit, E
given by Eq.  will be a linear function of temperature and [C.sub.v] from Eq. 
will be constant, in accordance with the Equipartition Theorem. For a small
cluster of, say, 6 atoms, the integrals implicit in the calculation of Eqs. 
and  are already of such high dimension that they cannot be effectively computed
using Simpson's rule or other basic quadrature methods. For
larger clusters, liquids, polymers or biological molecules the dimensionality
is obviously much higher, and one typically resorts to either Monte Carlo,
molecular dynamics, or other related algorithms.
To calculate the desired thermodynamic averages, it is necessary to have
some method available for computation of the potential energy, either explicitly
(in the form of a function representing the interaction potential as in
molecular mechanics) or implicitly (in the form of direct quantum-mechanical
calculations). Throughout this chapter we shall assume that U is known or can
be computed as needed, although this computation is typically the most computationally
expensive part of the procedure (because U may need to be computed
many, many times). For this reason, all possible measures should be
taken to assure the maximum efficiency of the method used in the computation
Also, it should be noted that constraining potentials (which keep the
cluster components from straying too far from a cluster's center of mass)
are sometimes used. At finite temperature, clusters have finite vapor pressures,
and particular cluster sizes are typically unstable to evaporation. Introducing
a constraining potential enables one to define clusters of desired sizes.
Because the constraining potential is artificial, the dependence of calculated
thermodynamic properties on the form and the radius of the constraining
potential must be investigated on a case-by-case basis. Rather than diverting
the discussion from our main focus (Monte Carlo methods), we refer the
interested reader elsewhere for more details and references on the use of con-straining
Random-Number Generation: A Few Notes
Because generalized Metropolis Monte Carlo methods are based on
"random" sampling from probability distribution functions, it is necessary
to use a high-quality random-number generator algorithm to obtain reliable
results. A review of such methods is beyond the scope of this chapter,
but a few general considerations merit discussion.
Random-number generators do not actually produce random numbers.
Rather, they use an integer "seed" to initialize a particular "pseudorandom"
sequence of real numbers that, taken as a group, have properties that leave
them nearly indistinguishable from truly random numbers. These are conventionally
floating-point numbers, distributed uniformly on the interval (0,1). If
there is a correlation between seeds, a correlation may be introduced between
the pseudorandom numbers produced by a particular generator. Thus, the
generator should ideally be initialized only once (at the beginning of the random
walk), and not re-initialized during the course of the walk. The seed
should be supplied either by the user or generated arbitrarily by the program
4 Strategies for Monte Carlo Thermodynamic Calculations
using, say, the number of seconds since midnight (or some other arcane formula).
One should be cautious about using the "built-in" random-number
generator functions that come with a compiler for Monte Carlo integration
work because some of them are known to be of very poor quality. The
reader should always be sure to consult the appropriate literature and obtain
(and test) a high-quality random-number generator before attempting to write
and debug a Monte Carlo program.
The Generalized Metropolis Monte Carlo Algorithm
The Metropolis Monte Carlo (MMC) algorithm is the single most widely
used method for computing thermodynamic averages. It was originally developed
by Metropolis et al. and used by them to simulate the freezing transition
for a two-dimensional hard-sphere fluid. However, Monte Carlo methods can
be used to estimate the values of multidimensional integrals in whatever context
they may arise. Although Metropolis et al. did not present their algorithm
as a general-utility method for numerical integration, it soon became
apparent that it could be generalized and applied to a variety of situations.
The core of the MMC algorithm is the way in which it draws samples from
a desired probability distribution function. The basic strategies used in
MMC can be generalized so as to apply to many kinds of probability functions
and in combination with many kinds of sampling strategies. Some authors
refer to the generalized MMC algorithm simply as "Metropolis sampling,"
while others have referred to it as the M(RT) method in honor of the five
authors of the original paper (Metropolis, the Rosenbluths, and the Tellers).
We choose to call this the generalized Metropolis Monte Carlo (gMMC) method,
and we will always use the term MMC to refer strictly to the combination
of methods originally presented by Metropolis et al.
In the literature of numerical analysis, gMMC is classified as an importance
sampling technique. Importance sampling methods generate configurations
that are distributed according to a desired probability function rather
than simply picking them at random from a uniform distribution. The probability
function is chosen so as to obtain improved convergence of the properties
of interest. gMMC is a special type of importance sampling method which
asymptotically (i.e., in the limit that the number of configurations becomes
large) generates states of a system according to the desired probability distribution.
This probability function is usually (but not always) the actual
probability distribution function for the physical system of interest. Nearly
all statistical-mechanical applications of Monte Carlo techniques require the
use of importance sampling, whether gMMC or another method is used (alternatively,
"stratified sampling" is sometimes an effective approach).
gMMC is certainly the most widely used importance sampling method.
In the gMMC algorithm successive configurations of the system are
generated to build up a special kind of random walk called a Markov
chain. The random walk visits successive configurations, where each
configuration's location depends on the configuration immediately preceding
it in the chain. The gMMC algorithm establishes how this can be done so as
to asymptotically generate a distribution of configurations corresponding to
the probability density function of interest, which we denote as [rho](q).
We define K([q.sub.j] -> [q.sub.j) to be the conditional probability that a
configuration at [q.sub.i] will be brought to [q.sub.j] in the next step of the random walk. This conditional
probability is sometimes called the "transition rate." The probability
of moving from q to [q'] (where q and [q'] are arbitrarily chosen configurations
somewhere in the available domain) is therefore given by P(q -> [q'):
P(q -> [q') = K(q -> [q')[rho](q) 
For the system to evolve toward a unique limiting distribution, we must place
a constraint on P(q -> [q']). The gMMC algorithm achieves the desired limiting
behavior by requiring that, on the average, a point is just as likely to move
from q to [q'] as it is to move in the reverse direction, namely, that
P(q -> [q') = P([q'] -> q). This likelihood can be achieved only if the walk is
ergodic (an ergodic walk eventually visits all configurations when started
from any given configuration) and if it is aperiodic (a situation in which no
single number of steps will generate a return to the initial configuration).
This latter requirement is known as the "detailed balance" or the "microscopic
K(q -> [q'])[rho](q) = K([q'] -> q)[rho]([q']) 
Satisfying the detailed balance condition ensures that the configurations
generated by the gMMC algorithm will asymptotically be distributed according
The transition rate may be written as a product of a trial probability
and an acceptance probability A
K([q.sub.i] -> [q.sub.j]) = [PI]([q.sub.i] -> [q.sub.j])A([q.sub.i] -> [q.sub.j]) 
where [PI] can be taken to be any normalized distribution that asymptotically
spans the space of all possible configurations, and A is constructed so that
Eq.  is satisfied for a particular choice of [PI].
Excerpted from Reviews in Computational Chemistry, Volume 19
Copyright © 2003 by Kenny B. Lipkowitz, Raima Larter, Thomas R. Cundari.
Excerpted by permission.
All rights reserved. No part of this excerpt may be reproduced or reprinted without permission in writing from the publisher.
Excerpts are provided by Dial-A-Book Inc. solely for the personal use of visitors to this web site.
|1||Protein structure classification||1|
|2||Comparative protein modeling||57|
|3||Simulations of protein folding||169|
|4||The simulation of ionic charge transport in biological ion channels : an introduction to numerical methods||229|
|5||Wavelets in chemistry and chemoinformatics||295|