Journal ArticleDOI
Towards Data-Driven Design of Asymmetric Hydrogenation of Olefins: Database and Hierarchical Learning
TLDR
In this article, the authors developed a hierarchical learning approach to achieve predictive machine leaning model using only dozens of enantioselectivity data with the target olefin, which offers a useful solution for the few-shot learning problem.Abstract:
Asymmetric hydrogenation of olefins is one of the most powerful asymmetric transformations in molecular synthesis. Although several privileged catalyst scaffolds are available, the catalyst development for asymmetric hydrogenation is still a time- and resource-consuming process due to the lack of predictive catalyst design strategy. Targeting the data-driven design of asymmetric catalysis, we herein report the development of a standardized database that contains the detailed information of over 12000 literature asymmetric hydrogenations of olefins. This database provides a valuable platform for the machine learning applications in asymmetric catalysis. Based on this database, we developed a hierarchical learning approach to achieve predictive machine leaning model using only dozens of enantioselectivity data with the target olefin, which offers a useful solution for the few-shot learning problem and will facilitate the reaction optimization with new olefin substrate in catalysis screening.read more
Citations
More filters
Journal ArticleDOI
Genetic Optimization of Homogeneous Catalysts
TL;DR: NaviCatGA as mentioned in this paper is a genetic algorithm capable of optimizing molecular catalyst structures using well-suited fitness functions to achieve a set of targeted properties, such as activity/selectivity.
Journal ArticleDOI
An Ensemble Structure and Physiochemical (SPOC) Descriptor for Machine-Learning Prediction of Chemical Reaction and Molecular Properties.
TL;DR: Inspired by the chemist's vision on molecules, an ensemble descriptor, SPOC, curated on the principles of physical organic chemistry that integrates Structure and Physicochemical property (SPOC) of a molecule is presented.
Journal ArticleDOI
Machine Learning Applications for Chemical Reactions
TL;DR: In this article , the authors summarized recent achievements of ML studies on two different problems; predicting reaction properties and synthetic routes, and the predictions of reactivity, self-optimization of reaction, and designing retrosynthetic reaction paths are also tackled by ML approaches.
Journal ArticleDOI
When machine learning meets molecular synthesis
João C. A. Oliveira,J Urban Frey,Shuo-Qing Zhang,Li-Cheng Xu,Xin Li,Shu Wen Li,Xin Hong,Lutz Ackermann +7 more
TL;DR: In this paper , the authors highlight the key concepts and approaches in ML and their major potential towards molecular synthesis with emphasis in catalysis, pointing out additionally the most successful cases in the field.
Journal ArticleDOI
Mechanistic Inference from Statistical Models at Different Data-Size Regimes
Danilo M. Lustosa,Anat Milo +1 more
TL;DR: In this article , the integration of statistical principles into homogeneous catalysis can streamline not only reaction optimization protocols but also mechanistic investigation procedures, highlighting how different aspects of molecular modeling, data set design, data visualization, and nuanced data restructuring can contribute to improving chemical reactivity and selectivity, while furthering our understanding of reaction mechanisms.
References
More filters
Journal ArticleDOI
Extremely randomized trees
TL;DR: A new tree-based ensemble method for supervised classification and regression problems that consists of randomizing strongly both attribute and cut-point choice while splitting a tree node and builds totally randomized trees whose structures are independent of the output values of the learning sample.
Journal ArticleDOI
SMILES, a chemical language and information system. 1. introduction to methodology and encoding rules
TL;DR: This chapter discusses the construction of Benzenoid and Coronoid Hydrocarbons through the stages of enumeration, classification, and topological properties in a number of computers used for this purpose.
Journal ArticleDOI
Extended-Connectivity Fingerprints
David Rogers,Mathew Hahn +1 more
TL;DR: A description of their implementation has not previously been presented in the literature, and ECFPs can be very rapidly calculated and can represent an essentially infinite number of different molecular features.
Journal ArticleDOI
Asymmetric Catalysis: Science and Opportunities (Nobel Lecture)
TL;DR: Asymmetric catalysis, in its infancy in the 1960s, has dramatically changed the procedures of chemical synthesis, and resulted in an impressive progression to a level that technically approximates or sometimes even exceeds that of natural biological processes as discussed by the authors.
Journal ArticleDOI
Atom-centered symmetry functions for constructing high-dimensional neural network potentials
TL;DR: Neural networks offer an unbiased and numerically very accurate approach to represent high-dimensional ab initio potential-energy surfaces and a transformation to symmetry functions is required to enable molecular dynamics simulations of large systems.