Home
/
Authors
/
Chenru Duan

Author

Chenru Duan

Other affiliations: Singapore–MIT alliance, Zhejiang University

Bio: Chenru Duan is an academic researcher from Massachusetts Institute of Technology. The author has contributed to research in topics: Computer science & Medicine. The author has an hindex of 14, co-authored 33 publications receiving 627 citations. Previous affiliations of Chenru Duan include Singapore–MIT alliance & Zhejiang University.

Topics: Computer science, Medicine, Physics, Chemical space, Artificial neural network ...read more

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017

Papers

PDF

Open Access

More filters

Journal Article•DOI•

A quantitative uncertainty metric controls error in neural network-driven chemical discovery

[...]

Jon Paul Janet¹, Chenru Duan¹, Tzuhsiung Yang¹, Aditya Nandy¹, Heather J. Kulik¹ - Show less +1 more•Institutions (1)

Massachusetts Institute of Technology¹

28 Aug 2019-Chemical Science

TL;DR: In this paper, the authors introduce the distance to available data in the latent space of a neural network ML model as a low-cost, quantitative uncertainty metric that works for both inorganic and organic chemistry.

...read moreread less

Abstract: Machine learning (ML) models, such as artificial neural networks, have emerged as a complement to high-throughput screening, enabling characterization of new compounds in seconds instead of hours. The promise of ML models to enable large-scale chemical space exploration can only be realized if it is straightforward to identify when molecules and materials are outside the model's domain of applicability. Established uncertainty metrics for neural network models are either costly to obtain (e.g., ensemble models) or rely on feature engineering (e.g., feature space distances), and each has limitations in estimating prediction errors for chemical space exploration. We introduce the distance to available data in the latent space of a neural network ML model as a low-cost, quantitative uncertainty metric that works for both inorganic and organic chemistry. The calibrated performance of this approach exceeds widely used uncertainty metrics and is readily applied to models of increasing complexity at no additional cost. Tightening latent distance cutoffs systematically drives down predicted model errors below training errors, thus enabling predictive error control in chemical discovery or identification of useful data points for active learning.

...read moreread less

146 citations

Journal Article•DOI•

Accurate Multiobjective Design in a Space of Millions of Transition Metal Complexes with Neural-Network-Driven Efficient Global Optimization.

[...]

Jon Paul Janet¹, Sahasrajit Ramesh¹, Chenru Duan¹, Heather J. Kulik¹•Institutions (1)

Massachusetts Institute of Technology¹

11 Mar 2020-ACS central science

TL;DR: The ANN-driven EI approach achieves at least 500-fold acceleration over random search, identifying a Pareto-optimal design in around 5 weeks instead of 50 years, and shows that a multitask ANN with latent-distance-based UQ surpasses the generalization performance of a GP in this space.

...read moreread less

Abstract: The accelerated discovery of materials for real world applications requires the achievement of multiple design objectives. The multidimensional nature of the search necessitates exploration of mult...

...read moreread less

104 citations

Journal Article•DOI•

Strategies and Software for Machine Learning Accelerated Discovery in Transition Metal Chemistry

[...]

Aditya Nandy¹, Chenru Duan¹, Jon Paul Janet¹, Stefan Gugler¹, Stefan Gugler², Heather J. Kulik¹ - Show less +2 more•Institutions (2)

Massachusetts Institute of Technology¹, ETH Zurich²

24 Sep 2018-Industrial & Engineering Chemistry Research

TL;DR: In this paper, the authors compare the performance of LASSO, kernel ridge regression (KRR), and artificial neural network (ANN) models using heuristic, topological revised autocorrelation (RAC) descriptors.

...read moreread less

Abstract: Machine learning the electronic structure of open shell transition metal complexes presents unique challenges, including robust and automated data set generation. Here, we introduce tools that simplify data acquisition from density functional theory (DFT) and validation of trained machine learning models using the molSimplify automatic design (mAD) workflow. We demonstrate this workflow by training and comparing the performance of LASSO, kernel ridge regression (KRR), and artificial neural network (ANN) models using heuristic, topological revised autocorrelation (RAC) descriptors we have recently introduced for machine learning inorganic chemistry. On a series of open shell transition metal complexes, we evaluate set aside test errors of these models for predicting the HOMO level and HOMO–LUMO gap. The best performing models are ANNs, which show 0.15 and 0.25 eV test set mean absolute errors on the HOMO level and HOMO–LUMO gap, respectively. Poor performing KRR models using the full 153-feature RAC set ar...

...read moreread less

99 citations

Journal Article•DOI•

Zero-temperature localization in a sub-Ohmic spin-boson model investigated by an extended hierarchy equation of motion

[...]

Chenru Duan¹, Zhoufei Tang¹, Jianshu Cao², Jianlan Wu¹•Institutions (2)

Zhejiang University¹, Massachusetts Institute of Technology²

28 Jun 2017-Physical Review B

TL;DR: In this article, the hierarchy equation of motion (HEOM) is extended to the zero-temperature sub-Ohmic spin-boson model, providing a numerically accurate prediction of quantum dynamics.

...read moreread less

Abstract: With a decomposition scheme for the bath correlation function, the hierarchy equation of motion (HEOM) is extended to the zero-temperature sub-Ohmic spin-boson model, providing a numerically accurate prediction of quantum dynamics. As a dynamic approach, the extended HEOM determines the delocalized-localized (DL) phase transition from the extracted rate kernel and the coherent-incoherent dynamic transition from the short-time oscillation. As the bosonic bath approaches from the strong to weak sub-Ohmic regimes, a crossover behavior is identified for the critical Kondo parameter of the DL transition, accompanied by the transition from the coherent to incoherent dynamics in the localization.

...read moreread less

81 citations

Journal Article•DOI•

Designing in the Face of Uncertainty: Exploiting Electronic Structure and Machine Learning Models for Discovery in Inorganic Chemistry.

[...]

Jon Paul Janet¹, Fang Liu¹, Aditya Nandy¹, Chenru Duan¹, Tzuhsiung Yang¹, Sean Lin¹, Heather J. Kulik¹ - Show less +3 more•Institutions (1)

Massachusetts Institute of Technology¹

05 Mar 2019-Inorganic Chemistry

TL;DR: Five key mandates for realizing computationally driven accelerated discovery in inorganic chemistry are outlined, including fully automated simulation of new compounds, knowledge of prediction sensitivity or accuracy, faster-than-fast property prediction methods, and maps for rapid chemical space traversal.

...read moreread less

Abstract: Recent transformative advances in computing power and algorithms have made computational chemistry central to the discovery and design of new molecules and materials. First-principles simulations are increasingly accurate and applicable to large systems with the speed needed for high-throughput computational screening. Despite these strides, the combinatorial challenges associated with the vastness of chemical space mean that more than just fast and accurate computational tools are needed for accelerated chemical discovery. In transition-metal chemistry and catalysis, unique challenges arise. The variable spin, oxidation state, and coordination environments favored by elements with well-localized d or f electrons provide great opportunity for tailoring properties in catalytic or functional (e.g., magnetic) materials but also add layers of uncertainty to any design strategy. We outline five key mandates for realizing computationally driven accelerated discovery in inorganic chemistry: (i) fully automated simulation of new compounds, (ii) knowledge of prediction sensitivity or accuracy, (iii) faster-than-fast property prediction methods, (iv) maps for rapid chemical space traversal, and (v) a means to reveal design rules on the kilocompound scale. Through case studies in open-shell transition-metal chemistry, we describe how advances in methodology and software in each of these areas bring about new chemical insights. We conclude with our outlook on the next steps in this process toward realizing fully autonomous discovery in inorganic chemistry using computational chemistry.

...read moreread less

78 citations

1
2
3
4
…
5
6
7
8
9
10
11

Collapse

Cited by

PDF

Open Access

More filters

Ab initio calculation of vibrational absorption and circular dichroism spectra using density functional force fields

[...]

Philip J. Stephens, Frank J. Devlin, Cary F. Chabalowski, Michael J. Frisch

01 Feb 1995

TL;DR: In this paper, the unpolarized absorption and circular dichroism spectra of the fundamental vibrational transitions of the chiral molecule, 4-methyl-2-oxetanone, are calculated ab initio using DFT, MP2, and SCF methodologies and a 5S4P2D/3S2P (TZ2P) basis set.

...read moreread less

Abstract: : The unpolarized absorption and circular dichroism spectra of the fundamental vibrational transitions of the chiral molecule, 4-methyl-2-oxetanone, are calculated ab initio. Harmonic force fields are obtained using Density Functional Theory (DFT), MP2, and SCF methodologies and a 5S4P2D/3S2P (TZ2P) basis set. DFT calculations use the Local Spin Density Approximation (LSDA), BLYP, and Becke3LYP (B3LYP) density functionals. Mid-IR spectra predicted using LSDA, BLYP, and B3LYP force fields are of significantly different quality, the B3LYP force field yielding spectra in clearly superior, and overall excellent, agreement with experiment. The MP2 force field yields spectra in slightly worse agreement with experiment than the B3LYP force field. The SCF force field yields spectra in poor agreement with experiment.The basis set dependence of B3LYP force fields is also explored: the 6-31G* and TZ2P basis sets give very similar results while the 3-21G basis set yields spectra in substantially worse agreements with experiment. jg

...read moreread less

1,652 citations

Journal Article•

Big Data of Materials Science -- Critical Role of the Descriptor

[...]

Luca M. Ghiringhelli, Jan Vybíral, Sergey V. Levchenko, Claudia Draxl, Matthias Scheffler - Show less +1 more

02 Mar 2015-Bulletin of the American Physical Society

TL;DR: A trustful prediction of new promising materials, identification of anomalies, and scientific advancement are doubtful when the scientific connection between the descriptor and the actuating mechanisms is unclear.

...read moreread less

Abstract: Statistical learning of materials properties or functions so far starts with a largely silent, nonchallenged step: the choice of the set of descriptive parameters (termed descriptor). However, when the scientific connection between the descriptor and the actuating mechanisms is unclear, the causality of the learned descriptor-property relation is uncertain. Thus, a trustful prediction of new promising materials, identification of anomalies, and scientific advancement are doubtful. We analyze this issue and define requirements for a suitable descriptor. For a classic example, the energy difference of zinc blende or wurtzite and rocksalt semiconductors, we demonstrate how a meaningful descriptor can be found systematically.

...read moreread less

455 citations

Journal Article•DOI•

The promise of artificial intelligence in chemical engineering: Is it here, finally?

[...]

Venkat Venkatasubramanian¹•Institutions (1)

Columbia University¹

01 Feb 2019-Aiche Journal

358 citations

Journal Article•DOI•

Machine Learning for Catalysis Informatics: Recent Applications and Prospects

[...]

Takashi Toyao¹, Takashi Toyao², Zen Maeno², Satoru Takakusagi², Takashi Kamachi¹, Takashi Kamachi³, Ichigaku Takigawa², Ken-ichi Shimizu¹, Ken-ichi Shimizu² - Show less +5 more•Institutions (3)

Kyoto University¹, Hokkaido University², Fukuoka Institute of Technology³

07 Feb 2020-ACS Catalysis

TL;DR: The discovery and development of catalysts and catalytic processes are essential components to maintaining an ecological balance in the future as mentioned in this paper, and recent revolutions made in data science could have a...

...read moreread less

Abstract: The discovery and development of catalysts and catalytic processes are essential components to maintaining an ecological balance in the future. Recent revolutions made in data science could have a ...

...read moreread less

272 citations

Journal Article•DOI•

Drug discovery with explainable artificial intelligence

[...]

José Jiménez-Luna¹, Francesca Grisoni¹, Gisbert Schneider¹•Institutions (1)

ETH Zurich¹

01 Oct 2020-Nature Machine Intelligence

TL;DR: A review of the most prominent algorithmic concepts of explainable artificial intelligence, and forecasts future opportunities, potential applications as well as several remaining challenges is provided in this article. But, the review is limited to the use of deep learning for drug discovery.

...read moreread less

Abstract: Deep learning bears promise for drug discovery, including advanced image analysis, prediction of molecular structure and function, and automated generation of innovative chemical entities with bespoke properties. Despite the growing number of successful prospective applications, the underlying mathematical models often remain elusive to interpretation by the human mind. There is a demand for ‘explainable’ deep learning methods to address the need for a new narrative of the machine language of the molecular sciences. This Review summarizes the most prominent algorithmic concepts of explainable artificial intelligence, and forecasts future opportunities, potential applications as well as several remaining challenges. We also hope it encourages additional efforts towards the development and acceptance of explainable artificial intelligence techniques. Drug discovery has recently profited greatly from the use of deep learning models. However, these models can be notoriously hard to interpret. In this Review, Jimenez-Luna and colleagues summarize recent approaches to use explainable artificial intelligence techniques in drug discovery.

...read moreread less

270 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127

Collapse