Open Access Journal Article

Quantum chemistry structures and properties of 134 kilo molecules

TLDR
This data set provides quantum chemical properties for a relevant, consistent, and comprehensive chemical space of small organic molecules that may serve the benchmarking of existing methods, development of new methods, such as hybrid quantum mechanics/machine learning, and systematic identification of structure-property relationships.
Abstract
Computational de novo design of new drugs and materials requires rigorous and unbiased exploration of chemical compound space. However, large uncharted territories persist due to its size scaling combinatorially with molecular size. We report computed geometric, energetic, electronic, and thermodynamic properties for 134k stable small organic molecules made up of CHONF. These molecules correspond to the subset of all 133,885 species with up to nine heavy atoms (CONF) out of the GDB-17 chemical universe of 166 billion organic molecules. We report geometries minimal in energy, corresponding harmonic frequencies, dipole moments, polarizabilities, along with energies, enthalpies, and free energies of atomization. All properties were calculated at the B3LYP/6-31G(2df,p) level of quantum chemistry. Furthermore, for the predominant stoichiometry, C7H10O2, there are 6,095 constitutional isomers among the 134k molecules. We report energies, enthalpies, and free energies of atomization at the more accurate G4MP2 level of theory for all of them. As such, this data set provides quantum chemical properties for a relevant, consistent, and comprehensive chemical space of small organic molecules. This database may serve the benchmarking of existing methods, development of new methods, such as hybrid quantum mechanics/machine learning, and systematic identification of structure-property relationships.
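The scope stated in the abstract (elements C, H, O, N, F and at most nine heavy atoms) can be illustrated with a short sketch. This is not the authors' code: the function names `parse_formula`, `heavy_atom_count`, and `within_dataset_scope` are hypothetical, and the check works on molecular formulas only, so it says nothing about stability or membership in GDB-17.

```python
import re
from typing import Dict

# Heavy (non-hydrogen) elements named in the abstract.
HEAVY_ELEMENTS = {"C", "O", "N", "F"}


def parse_formula(formula: str) -> Dict[str, int]:
    """Parse a simple formula such as 'C7H10O2' into element counts.

    Assumes a flat formula without parentheses or charges.
    """
    counts: Dict[str, int] = {}
    for symbol, number in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        counts[symbol] = counts.get(symbol, 0) + (int(number) if number else 1)
    return counts


def heavy_atom_count(formula: str) -> int:
    """Count atoms of the heavy elements C, O, N, and F."""
    counts = parse_formula(formula)
    return sum(n for element, n in counts.items() if element in HEAVY_ELEMENTS)


def within_dataset_scope(formula: str, max_heavy: int = 9) -> bool:
    """Hypothetical check: only CHONF elements and at most `max_heavy` heavy atoms."""
    counts = parse_formula(formula)
    allowed = HEAVY_ELEMENTS | {"H"}
    return set(counts) <= allowed and heavy_atom_count(formula) <= max_heavy


if __name__ == "__main__":
    # C7H10O2 is the predominant stoichiometry named in the abstract.
    print(heavy_atom_count("C7H10O2"), within_dataset_scope("C7H10O2"))
```

For the predominant stoichiometry C7H10O2, the count is seven carbons plus two oxygens, which is exactly the nine-heavy-atom limit.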


Citations
Journal Article

A Comprehensive Survey on Graph Neural Networks

TL;DR: This article provides a comprehensive overview of graph neural networks (GNNs) in data mining and machine learning fields and proposes a new taxonomy to divide the state-of-the-art GNNs into four categories, namely, recurrent GNNs, convolutional GNNs, graph autoencoders, and spatial–temporal GNNs.
Posted Content

Fast Graph Representation Learning with PyTorch Geometric

Matthias Fey et al., 06 Mar 2019
TL;DR: PyTorch Geometric is introduced, a library for deep learning on irregularly structured input data such as graphs, point clouds and manifolds, built upon PyTorch, and a comprehensive comparative study of the implemented methods in homogeneous evaluation scenarios is performed.
Posted Content

Neural Message Passing for Quantum Chemistry

TL;DR: Using MPNNs, state-of-the-art results are demonstrated on an important molecular property prediction benchmark, and the authors suggest that future work should focus on datasets with larger molecules or more accurate ground truth labels.
Journal Article

Automatic Chemical Design Using a Data-Driven Continuous Representation of Molecules

TL;DR: In this article, a deep neural network was trained on hundreds of thousands of existing chemical structures to construct three coupled functions: an encoder, a decoder, and a predictor, which can generate new molecules for efficient exploration and optimization through open-ended spaces of chemical compounds.
Journal Article

Machine learning and the physical sciences

TL;DR: This article selectively reviews recent research on the interface between machine learning and the physical sciences, including conceptual developments in ML motivated by physical insights, applications of machine learning techniques to several domains in physics, and cross-fertilization between the two fields.
References
Journal Article

Open Babel: An open chemical toolbox

TL;DR: The implementation of Open Babel is detailed, key advances in the 2.3 release are described, and a variety of uses are outlined both in terms of software products and scientific research, including applications far beyond simple format interconversion.
Journal Article

SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules

TL;DR: SMILES (Simplified Molecular Input Line Entry System) is introduced as a chemical notation system for chemical information processing, and its methodology and encoding rules are described.
Book

A Chemist's Guide to Density Functional Theory

TL;DR: A Chemist's Guide to Density Functional Theory should be an invaluable source of insight and knowledge for many chemists using DFT approaches to solve chemical problems.
Journal Article

Towards the computational design of solid catalysts

TL;DR: The first steps towards using computational methods to design new catalysts are reviewed, together with a discussion of how such methods may in the future be used to engineer the electronic structure of the active surface by changing its composition and structure.
Journal Article

A complete basis set model chemistry. VI. Use of density functional geometries and frequencies

TL;DR: The CBS-Q model chemistry is modified to use B3LYP hybrid density functional geometries and frequencies, which give both improved reliability (maximum error for the G2 test set reduced from 3.9 to 2.8 kcal/mol) and increased accuracy.