scispace - formally typeset
Open AccessPosted Content

Chemistrees: data driven identification of reaction pathways via machine learning

TLDR
In this paper, a supervised machine learning algorithm, decision trees, is proposed to analyze molecular dynamics output to identify the predominant geometric features which correlate with trajectories that transition between two arbitrarily defined states.
Abstract
We propose a supervised machine learning algorithm, decision trees, to analyze molecular dynamics output. The approach aims to identify the predominant geometric features which correlate with trajectories that transition between two arbitrarily defined states. The data-based algorithm aims to identify such features in an approach which is unbiased by human "chemical intuition". We demonstrate the method by analyzing proton exchange reactions in formic acid (FA) solvated in small water clusters. The simulations were performed with ab initio molecular dynamics combined with a method for generating rare events, specifically path sampling. Our machine learning analysis identified mechanistic descriptions of the proton transfer reaction for the different water clusters.

read more

References
More filters
Journal Article

Scikit-learn: Machine Learning in Python

TL;DR: Scikit-learn is a Python module integrating a wide range of state-of-the-art machine learning algorithms for medium-scale supervised and unsupervised problems, focusing on bringing machine learning to non-specialists using a general-purpose high-level language.
Journal ArticleDOI

A consistent and accurate ab initio parametrization of density functional dispersion correction (DFT-D) for the 94 elements H-Pu

TL;DR: The revised DFT-D method is proposed as a general tool for the computation of the dispersion energy in molecules and solids of any kind with DFT and related (low-cost) electronic structure methods for large systems.
Journal ArticleDOI

QUICKSTEP: Fast and accurate density functional calculations using a mixed Gaussian and plane waves approach

TL;DR: It is shown how derivatives of the GPW energy functional, namely ionic forces and the Kohn–Sham matrix, can be computed in a consistent way and the computational cost is scaling linearly with the system size, even for condensed phase systems of just a few tens of atoms.
Journal ArticleDOI

A novel path sampling method for the calculation of rate constants

TL;DR: In this paper, the rate constant of transitions between stable states separated by high free energy barriers in a complex environment within the framework of transition path sampling has been derived based on directly and simultaneously measuring the fluxes through many phase space interfaces.
Related Papers (5)