scispace - formally typeset
Journal ArticleDOI

Towards Data-Driven Design of Asymmetric Hydrogenation of Olefins: Database and Hierarchical Learning

TLDR
In this article, the authors developed a hierarchical learning approach to achieve predictive machine leaning model using only dozens of enantioselectivity data with the target olefin, which offers a useful solution for the few-shot learning problem.
Abstract
Asymmetric hydrogenation of olefins is one of the most powerful asymmetric transformations in molecular synthesis. Although several privileged catalyst scaffolds are available, the catalyst development for asymmetric hydrogenation is still a time- and resource-consuming process due to the lack of predictive catalyst design strategy. Targeting the data-driven design of asymmetric catalysis, we herein report the development of a standardized database that contains the detailed information of over 12000 literature asymmetric hydrogenations of olefins. This database provides a valuable platform for the machine learning applications in asymmetric catalysis. Based on this database, we developed a hierarchical learning approach to achieve predictive machine leaning model using only dozens of enantioselectivity data with the target olefin, which offers a useful solution for the few-shot learning problem and will facilitate the reaction optimization with new olefin substrate in catalysis screening.

read more

Citations
More filters
Journal ArticleDOI

Genetic Optimization of Homogeneous Catalysts

TL;DR: NaviCatGA as mentioned in this paper is a genetic algorithm capable of optimizing molecular catalyst structures using well-suited fitness functions to achieve a set of targeted properties, such as activity/selectivity.
Journal ArticleDOI

An Ensemble Structure and Physiochemical (SPOC) Descriptor for Machine-Learning Prediction of Chemical Reaction and Molecular Properties.

TL;DR: Inspired by the chemist's vision on molecules, an ensemble descriptor, SPOC, curated on the principles of physical organic chemistry that integrates Structure and Physicochemical property (SPOC) of a molecule is presented.
Journal ArticleDOI

Machine Learning Applications for Chemical Reactions

TL;DR: In this article , the authors summarized recent achievements of ML studies on two different problems; predicting reaction properties and synthetic routes, and the predictions of reactivity, self-optimization of reaction, and designing retrosynthetic reaction paths are also tackled by ML approaches.
Journal ArticleDOI

When machine learning meets molecular synthesis

TL;DR: In this paper , the authors highlight the key concepts and approaches in ML and their major potential towards molecular synthesis with emphasis in catalysis, pointing out additionally the most successful cases in the field.
Journal ArticleDOI

Mechanistic Inference from Statistical Models at Different Data-Size Regimes

Danilo M. Lustosa, +1 more
- 17 Jun 2022 - 
TL;DR: In this article , the integration of statistical principles into homogeneous catalysis can streamline not only reaction optimization protocols but also mechanistic investigation procedures, highlighting how different aspects of molecular modeling, data set design, data visualization, and nuanced data restructuring can contribute to improving chemical reactivity and selectivity, while furthering our understanding of reaction mechanisms.
References
More filters
Journal ArticleDOI

Extremely randomized trees

TL;DR: A new tree-based ensemble method for supervised classification and regression problems that consists of randomizing strongly both attribute and cut-point choice while splitting a tree node and builds totally randomized trees whose structures are independent of the output values of the learning sample.
Journal ArticleDOI

SMILES, a chemical language and information system. 1. introduction to methodology and encoding rules

TL;DR: This chapter discusses the construction of Benzenoid and Coronoid Hydrocarbons through the stages of enumeration, classification, and topological properties in a number of computers used for this purpose.
Journal ArticleDOI

Extended-Connectivity Fingerprints

TL;DR: A description of their implementation has not previously been presented in the literature, and ECFPs can be very rapidly calculated and can represent an essentially infinite number of different molecular features.
Journal ArticleDOI

Asymmetric Catalysis: Science and Opportunities (Nobel Lecture)

TL;DR: Asymmetric catalysis, in its infancy in the 1960s, has dramatically changed the procedures of chemical synthesis, and resulted in an impressive progression to a level that technically approximates or sometimes even exceeds that of natural biological processes as discussed by the authors.
Journal ArticleDOI

Atom-centered symmetry functions for constructing high-dimensional neural network potentials

TL;DR: Neural networks offer an unbiased and numerically very accurate approach to represent high-dimensional ab initio potential-energy surfaces and a transformation to symmetry functions is required to enable molecular dynamics simulations of large systems.
Related Papers (5)