Perspective: Identification of collective variables and metastable states of protein dynamics
Author(s) -
Florian Sittel,
Gerhard Stock
Publication year - 2018
Publication title -
the journal of chemical physics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.071
H-Index - 357
eISSN - 1089-7690
pISSN - 0021-9606
DOI - 10.1063/1.5049637
Subject(s) - statistical physics , principal component analysis , markov chain , dimensionality reduction , curse of dimensionality , computer science , metastability , energy landscape , ergodicity , mathematics , physics , artificial intelligence , machine learning , statistics , quantum mechanics , thermodynamics
The statistical analysis of molecular dynamics simulations requires dimensionality reduction techniques, which yield a low-dimensional set of collective variables (CVs) { } = that in some sense describe the essential dynamics of the system. Considering the distribution ( ) of the CVs, the primal goal of a statistical analysis is to detect the characteristic features of ( ), in particular, its maxima and their connection paths. This is because these features characterize the low-energy regions and the energy barriers of the corresponding free energy landscape Δ( ) = - ln ( ), and therefore amount to the metastable states and transition regions of the system. In this perspective, we outline a systematic strategy to identify CVs and metastable states, which subsequently can be employed to construct a Langevin or a Markov state model of the dynamics. In particular, we account for the still limited sampling typically achieved by molecular dynamics simulations, which in practice seriously limits the applicability of theories (e.g., assuming ergodicity) and black-box software tools (e.g., using redundant input coordinates). We show that it is essential to use internal (rather than Cartesian) input coordinates, employ dimensionality reduction methods that avoid rescaling errors (such as principal component analysis), and perform density based (rather than -means-type) clustering. Finally, we briefly discuss a machine learning approach to dimensionality reduction, which highlights the essential internal coordinates of a system and may reveal hidden reaction mechanisms.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom