scispace - formally typeset
Open AccessJournal ArticleDOI

Discovering Collective Variables of Molecular Transitions via Genetic Algorithms and Neural Networks

Reads0
Chats0
TLDR
In this article, a framework was proposed to find an optimal set of CVs from a pool of candidates using a combination of artificial neural networks and genetic algorithms, and the successful retrieval of optimal CVs by this framework is illustrated at the hand of two case studies: the well-known conformational change in the alanine dipeptide molecule and the more intricate transition of a base pair in B-DNA.
Abstract
With the continual improvement of computing hardware and algorithms, simulations have become a powerful tool for understanding all sorts of (bio)molecular processes. To handle the large simulation data sets and to accelerate slow, activated transitions, a condensed set of descriptors, or collective variables (CVs), is needed to discern the relevant dynamics that describes the molecular process of interest. However, proposing an adequate set of CVs that can capture the intrinsic reaction coordinate of the molecular transition is often extremely difficult. Here, we present a framework to find an optimal set of CVs from a pool of candidates using a combination of artificial neural networks and genetic algorithms. The approach effectively replaces the encoder of an autoencoder network with genes to represent the latent space, i.e., the CVs. Given a selection of CVs as input, the network is trained to recover the atom coordinates underlying the CV values at points along the transition. The network performance is used as an estimator of the fitness of the input CVs. Two genetic algorithms optimize the CV selection and the neural network architecture. The successful retrieval of optimal CVs by this framework is illustrated at the hand of two case studies: the well-known conformational change in the alanine dipeptide molecule and the more intricate transition of a base pair in B-DNA from the classic Watson-Crick pairing to the alternative Hoogsteen pairing. Key advantages of our framework include the following: optimal interpretable CVs, avoiding costly calculation of committor or time-correlation functions, and automatic hyperparameter optimization. In addition, we show that applying a time-delay between the network input and output allows for enhanced selection of slow variables. Moreover, the network can also be used to generate molecular configurations of unexplored microstates, for example, for augmentation of the simulation data.

read more

Citations
More filters
Journal ArticleDOI

Reaction coordinates in complex systems-a perspective

TL;DR: A number of methods have been developed to reduce the vast amount of high-dimensional data to a small number of essential degrees of freedom representing the reaction coordinate, and if the reaction coordinates is known, a variety of approaches have been proposed to enhance the sampling along the important degree of freedom as mentioned in this paper.
Journal ArticleDOI

Explaining reaction coordinates of alanine dipeptide isomerization obtained from deep neural networks using Explainable Artificial Intelligence (XAI).

TL;DR: The present study offers an AI-aided framework to explain the appropriate reaction coordinates, which acquires considerable significance when the number of degrees of freedom increases.
Journal ArticleDOI

DeepCV: A Deep Learning Framework for Blind Search of Collective Variables in Expanded Configurational Space

TL;DR: DeepCV as mentioned in this paper is an open-source software written in Python/C++ object-oriented languages, based on the TensorFlow framework, which can be used to calculate molecular features, train models, generate CVs, validate rare events from sampling, and analyze a trajectory for chemical reactions of interest.
Journal ArticleDOI

Computational strategies for protein conformational ensemble detection.

TL;DR: The ability to detect and juggle protein conformations supplemented by a physics-based understanding has implications for not only in vivo problems but also for resistance impeding drug discovery and bionano-sensor design as discussed by the authors.
Journal ArticleDOI

Computational strategies for protein conformational ensemble detection

TL;DR: The ability to detect and juggle protein conformations supplemented by a physics-based understanding has implications for not only in vivo problems but also for resistance impeding drug discovery and bionano-sensor design as discussed by the authors .
References
More filters
Journal ArticleDOI

A smooth particle mesh Ewald method

TL;DR: It is demonstrated that arbitrary accuracy can be achieved, independent of system size N, at a cost that scales as N log(N), which is comparable to that of a simple truncation method of 10 A or less.
Journal ArticleDOI

The perceptron: a probabilistic model for information storage and organization in the brain.

TL;DR: This article will be concerned primarily with the second and third questions, which are still subject to a vast amount of speculation, and where the few relevant facts currently supplied by neurophysiology have not yet been integrated into an acceptable theory.
Journal ArticleDOI

Nonphysical sampling distributions in Monte Carlo free-energy estimation: Umbrella sampling

TL;DR: In this paper, the authors describe the use of arbitrary sampling distributions chosen to facilitate the estimate of the free energy difference between a model system and some reference system, but the conventional Monte Carlo methods of obtaining such averages are inadequate for the free-energy case.
Journal ArticleDOI

Improved side‐chain torsion potentials for the Amber ff99SB protein force field

TL;DR: A new force field, which is termed Amber ff99SB‐ILDN, exhibits considerably better agreement with the NMR data and is validated against a large set of experimental NMR measurements that directly probe side‐chain conformations.
Journal ArticleDOI

Escaping free-energy minima

TL;DR: A powerful method for exploring the properties of the multidimensional free energy surfaces of complex many-body systems by means of coarse-grained non-Markovian dynamics in the space defined by a few collective coordinates is introduced.
Related Papers (5)