# Coarse-grained modeling

Coarse-grained modeling, coarse-grained models, aim at simulating the behaviour of complex systems using their coarse-grained (simplified) representation. Coarse-grained models are widely used for molecular modeling of biomolecules[1][2] at various granularity levels.

A wide range of coarse-grained models have been proposed. They are usually dedicated to computational modeling of specific molecules: proteins,[1][2] nucleic acids,[3][4] lipid membranes,[2][5] carbohydrates[6] or water.[7] In these models, molecules are represented not by individual atoms, but by "pseudo-atoms" approximating groups of atoms, such as whole amino acid residue. By decreasing the degrees of freedom much longer simulation times can be studied at the expense of molecular detail. Coarse-grained models have found practical applications in molecular dynamics simulations.[1] Another case of interest is the simplification of a given discrete-state system, as very often descriptions of the same system at different levels of detail are possible.[8][9] An example is given by the chemomechanical dynamics of a molecular machine, such as Kinesin.[8][10]

The coarse-grained modeling originates from work by Michael Levitt and Ariel Warshel in 1970s.[11][12][13] Coarse-grained models are presently often used as components of multiscale modeling protocols in combination with reconstruction tools[14] (from coarse-grained to atomistic representation) and atomistic resolution models.[1] Atomistic resolution models alone are presently not efficient enough to handle large system sizes and simulation timescales.[1][2]

Coarse graining and fine graining in statistical mechanics addresses the subject of entropy ${\displaystyle S}$, and thus the second law of thermodynamics. One has to realise that the concept of temperature ${\displaystyle T}$ cannot be attributed to an arbitrarily microscopic particle since this does not radiate thermally like a macroscopic or black body´´. However, one can attribute a nonzero entropy ${\displaystyle S}$ to an object with as few as two states like a bit´´ (and nothing else). The entropies of the two cases are called thermal entropy and von Neumann entropy respectively.[15] They are also distinguished by the terms coarse grained and fine grained respectively. This latter distinction is related to the aspect spelled out above and is elaborated on below.

The Liouville theorem (sometimes also called Liouville equation)

${\displaystyle {\frac {d}{dt}}(\Delta q\Delta p)=0}$

states that a phase space volume ${\displaystyle \Gamma }$ (spanned by ${\displaystyle q}$ and ${\displaystyle p}$, here in one spatial dimension) remains constant in the course of time, no matter where the point ${\displaystyle q,p}$ contained in ${\displaystyle \Delta q\Delta p}$ moves. This is a consideration in classical mechanics. In order to relate this view to macroscopic physics one surrounds each point ${\displaystyle q,p}$ e.g. with a sphere of some fixed volume - a procedure called coarse graining which lumps together points or states of similar behaviour. The trajectory of this sphere in phase space then covers also other points and hence its volume in phase space grows. The entropy ${\displaystyle S}$ associated with this consideration, whether zero or not, is called coarse grained entropy or thermal entropy. A large number of such systems, i.e. the one under consideration together with many copies, is called an ensemble. If these systems do not interact with each other or anything else, and each has the same energy ${\displaystyle E}$, the ensemble is called a microcanonical ensemble. Each replica system appears with the same probability, and temperature does not enter.

Now suppose we define a probability density ${\displaystyle \rho (q_{i},p_{i},t)}$ describing the motion of the point ${\displaystyle q_{i},p_{i}}$ with phase space element ${\displaystyle \Delta q_{i}\Delta p_{i}}$. In the case of equilibrium or steady motion the equation of continuity implies that the probability density ${\displaystyle \rho }$ is independent of time ${\displaystyle t}$. We take ${\displaystyle \rho _{i}=\rho (q_{i},p_{i})}$ as nonzero only inside the phase space volume ${\displaystyle V_{\Gamma }}$. One then defines the entropy ${\displaystyle S}$ by the relation

${\displaystyle S=-\Sigma _{i}\rho _{i}\ln \rho _{i},\;\;}$ where ${\displaystyle \;\;\Sigma _{i}\rho _{i}=1.}$

Then,by maximisation for a given energy ${\displaystyle E}$, i.e. linking ${\displaystyle \delta S=0}$ with ${\displaystyle \delta }$ of the other sum equal to zero via a Lagrange multiplier ${\displaystyle \lambda }$, one obtains (as in the case of a lattice of spins or with a bit at each lattice point)

${\displaystyle V_{\Gamma }=e^{(\lambda +1)}={\frac {1}{\rho }}}$ ${\displaystyle \;\;\;}$ and ${\displaystyle \;\;\;}$ ${\displaystyle S=\ln V_{\Gamma }}$,

the volume of ${\displaystyle \Gamma }$ being proportional to the exponential of S. This is again a consideration in classical mechanics.

In quantum mechanics the phase space becomes a space of states, and the probability density ${\displaystyle \rho }$ an operator with a subspace of states ${\displaystyle \Gamma }$ of dimension or number of states ${\displaystyle N_{\Gamma }}$ specified by a projection operator ${\displaystyle P_{\Gamma }}$. Then the entropy ${\displaystyle S}$ is (obtained as above)

${\displaystyle S=-Tr\rho \ln \rho =\ln N_{\Gamma },}$

and is described as fine grained or von Neumann entropy. If ${\displaystyle N_{\Gamma }=1}$, the entropy vanishes and the system is said to be in a pure state. Here the exponential of S is proportional to the number of states. The microcanonical ensemble is again a large number of noninteracting copies of the given system and ${\displaystyle S}$, energy ${\displaystyle E}$ etc. become ensemble averages.

Now consider interaction of a given system with another one - or in ensemble terminology - the given system and the large number of replicas all immersed in a big one called a heat bath characterised by ${\displaystyle \rho }$. Since the systems interact only via the heat bath, the individual systems of the ensemble can have different energies ${\displaystyle E_{i},E_{j},...}$ depending on which energy state ${\displaystyle E_{i},E_{j},...}$ they are in. This interaction is described as entanglement and the ensemble as canonical ensemble (the macrocanonical ensemble permits also exchange of particles).

The interaction of the ensemble elements via the heat bath leads to temperature ${\displaystyle T}$, as we now show.[16] Considering two elements with energies ${\displaystyle E_{i},E_{j}}$, the probability of finding these in the heat bath is proportional to ${\displaystyle \rho (E_{i})\rho (E_{j})}$, and this is proportional to ${\displaystyle \rho (E_{i}+E_{j})}$ if we consider the binary system as a system in the same heat bath defined by the function ${\displaystyle \rho }$. It follows that ${\displaystyle \rho (E)\propto e^{-\mu E}}$ (the only way to satisfy the proportionality), where ${\displaystyle \mu }$ is a constant. Normalisation then implies

${\displaystyle \rho (E_{i})={\frac {e^{-\mu E_{i}}}{\Sigma _{j}e^{-\mu E_{j}}}},\Sigma _{i}\rho (E_{i})=1.}$

Then in terms of ensemble averages

${\displaystyle {\overline {S}}=-{\overline {\ln \rho }}}$, and ${\displaystyle \mu \equiv {\frac {1}{T}},\;k_{B}=1,}$

or by comparison with the second law of thermodynamics. ${\displaystyle {\overline {S}}}$ is now the entanglement entropy or fine grained von Neumann entropy. This is zero if the system is in a pure state, and is nonzero when in a mixed (entangled) state.

Above we considered a system immersed in another huge one called heat bath with the possibility of allowing heat exchange between them. Frequently one considers a different situation, i.e. two systems A and B with a small hole in the partition between them. Suppose B is originally empty but A contains an explosive device which fills A instantaneously with photons. Originally A and B have energies ${\displaystyle E_{A}}$ and ${\displaystyle E_{B}}$ respectively, and there is no interaction. Hence originally both are in pure quantum states and have zero fine grained entropies. Immediately after explosion A is filled with photons, the energy still being ${\displaystyle E_{A}}$ and that of B also ${\displaystyle E_{B}}$ (no photon has yet escaped). Since A is filled with photons, these obey a Planck distribution law and hence the coarse grained thermal entropy of A is nonzero (recall: lots of configurations of the photons in A, lots of states with one maximal), although the fine grained quantum mechanical entropy is still zero (same energy state), as also that of B. Now allow photons to leak slowly (i.e. with no disturbance of the equilibrium) from A to B. With fewer photons in A, its coarse grained entropy diminishes but that of B increases. This entanglement of A and B implies they are now quantum mechanically in mixed states, and so their fine grained entropies are no longer zero. Finally when all photons are in B, the coarse grained entropy of A as well as its fine grained entropy vanish and A is again in a pure state but with new energy. On the other hand B now has an increased thermal entropy, but since the entanglement is over it is quantum mechanically again in a pure state, its ground state, and that has zero fine grained von Neumann entropy. Consider B: In the course of the entanglement with A its fine grained or entanglement entropy started and ended in pure states (thus with zero entropies). Its coarse grained entropy, however, rose from zero to its final nonzero value. Roughly half way through the procedure the entanglement entropy of B reaches a maximum and then decreases to zero at the end.

The classical coarse grained thermal entropy of the second law of thermodynamics is not the same as the (mostly smaller) quantum mechanical fine grained entropy. The difference is called information. As may be deduced from the foregoing arguments, this difference is roughly zero before the entanglement entropy (which is the same for A and B) attains its maximum. An example of coarse graining is provided by Brownian motion.[17]

## Software packages

• Large-scale Atomic/Molecular Massively Parallel Simulator (LAMMPS)
• Extensible Simulation Package for Research on Soft Matter ESPResSo (external link)

## References

1. Kmiecik S, Gront D, Kolinski M, Wieteska L, Dawid AE, Kolinski A (July 2016). "Coarse-Grained Protein Models and Their Applications". Chemical Reviews. 116 (14): 7898–936. doi:10.1021/acs.chemrev.6b00163. PMID 27333362.
2. ^ a b c d Ingólfsson HI, Lopez CA, Uusitalo JJ, de Jong DH, Gopal SM, Periole X, Marrink SJ (May 2014). "The power of coarse graining in biomolecular simulations". Wiley Interdisciplinary Reviews. Computational Molecular Science. 4 (3): 225–248. doi:10.1002/wcms.1169. PMC 4171755. PMID 25309628.
3. ^ Boniecki MJ, Lach G, Dawson WK, Tomala K, Lukasz P, Soltysinski T, et al. (April 2016). "SimRNA: a coarse-grained method for RNA folding simulations and 3D structure prediction". Nucleic Acids Research. 44 (7): e63. doi:10.1093/nar/gkv1479. PMC 4838351. PMID 26687716.
4. ^ Potoyan DA, Savelyev A, Papoian GA (2013-01-01). "Recent successes in coarse-grained modeling of DNA". Wiley Interdisciplinary Reviews: Computational Molecular Science. 3 (1): 69–83. doi:10.1002/wcms.1114. ISSN 1759-0884. S2CID 12043343.
5. ^ Baron R, Trzesniak D, de Vries AH, Elsener A, Marrink SJ, van Gunsteren WF (February 2007). "Comparison of thermodynamic properties of coarse-grained and atomic-level simulation models" (PDF). ChemPhysChem. 8 (3): 452–61. doi:10.1002/cphc.200600658. PMID 17290360.
6. ^ López CA, Rzepiela AJ, de Vries AH, Dijkhuizen L, Hünenberger PH, Marrink SJ (December 2009). "Martini Coarse-Grained Force Field: Extension to Carbohydrates". Journal of Chemical Theory and Computation. 5 (12): 3195–210. doi:10.1021/ct900313w. PMID 26602504.
7. ^ Hadley KR, McCabe C (July 2012). "Coarse-Grained Molecular Models of Water: A Review". Molecular Simulation. 38 (8–9): 671–681. doi:10.1080/08927022.2012.671942. PMC 3420348. PMID 22904601.
8. ^ a b Seiferth D, Sollich P, Klumpp S (December 2020). "Coarse graining of biochemical systems described by discrete stochastic dynamics". Physical Review E. 102 (6–1): 062149. arXiv:2102.13394. Bibcode:2020PhRvE.102f2149S. doi:10.1103/PhysRevE.102.062149. PMID 33466014. S2CID 231652939.
9. ^ Hummer G, Szabo A (July 2015). "Optimal Dimensionality Reduction of Multistate Kinetic and Markov-State Models". The Journal of Physical Chemistry B. 119 (29): 9029–37. doi:10.1021/jp508375q. PMC 4516310. PMID 25296279.
10. ^ Liepelt S, Lipowsky R (June 2007). "Kinesin's network of chemomechanical motor cycles". Physical Review Letters. 98 (25): 258102. Bibcode:2007PhRvL..98y8102L. doi:10.1103/PhysRevLett.98.258102. PMID 17678059.
11. ^ Levitt M, Warshel A (February 1975). "Computer simulation of protein folding". Nature. 253 (5494): 694–8. Bibcode:1975Natur.253..694L. doi:10.1038/253694a0. PMID 1167625. S2CID 4211714.
12. ^ Warshel A, Levitt M (May 1976). "Theoretical studies of enzymic reactions: dielectric, electrostatic and steric stabilization of the carbonium ion in the reaction of lysozyme". Journal of Molecular Biology. 103 (2): 227–49. doi:10.1016/0022-2836(76)90311-9. PMID 985660.
13. ^ Levitt M (September 2014). "Birth and future of multiscale modeling for macromolecular systems (Nobel Lecture)". Angewandte Chemie. 53 (38): 10006–18. doi:10.1002/anie.201403691. PMID 25100216. S2CID 3680673.
14. ^ Badaczewska-Dawid AE, Kolinski A, Kmiecik S (2020). "Computational reconstruction of atomistic protein structures from coarse-grained models". Computational and Structural Biotechnology Journal. 18: 162–176. doi:10.1016/j.csbj.2019.12.007. PMC 6961067. PMID 31969975.
15. ^ Susskind L, Lindesay J (2005). Black Holes, Information and the String Theory Revolution. World Scientific. pp. 69–77. ISBN 981-256-131-5.
16. ^ Müller-Kirsten HJ (2013). Basics of Statistical Physics (2nd ed.). World Scientific. pp. 28–31, 152–167. ISBN 978-981-4449-53-3.
17. ^ Muntean A, Rademacher JD, Zagaris A (2016). Macroscopic and Large Scale Phenomena: Coarse Graining, Mean Field Limits and Ergodicity. Springer. ISBN 978-3-319-26883-5.