Home
/
Authors
/
Jorge Aguilera-Iparraguirre

Author

Jorge Aguilera-Iparraguirre

Other affiliations: Ruhr University Bochum, Karlsruhe Institute of Technology, Massachusetts Institute of Technology

Bio: Jorge Aguilera-Iparraguirre is an academic researcher from Harvard University. The author has contributed to research in topics: OLED & Encoder. The author has an hindex of 16, co-authored 22 publications receiving 5041 citations. Previous affiliations of Jorge Aguilera-Iparraguirre include Ruhr University Bochum & Karlsruhe Institute of Technology.

Topics: OLED, Encoder, Radical, Feature extraction, Convolutional neural network ...read more

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Automatic Chemical Design Using a Data-Driven Continuous Representation of Molecules

[...]

Rafael Gómez-Bombarelli, Jennifer N. Wei¹, David Duvenaud², José Miguel Hernández-Lobato³, Benjamin Sanchez-Lengeling¹, Dennis Sheberla¹, Jorge Aguilera-Iparraguirre, Timothy D. Hirzel, Ryan P. Adams⁴, Alán Aspuru-Guzik¹, Alán Aspuru-Guzik⁵ - Show less +7 more•Institutions (5)

Harvard University¹, University of Toronto², University of Cambridge³, Google⁴, Canadian Institute for Advanced Research⁵

12 Jan 2018-ACS central science

TL;DR: In this article, a deep neural network was trained on hundreds of thousands of existing chemical structures to construct three coupled functions: an encoder, a decoder, and a predictor, which can generate new molecules for efficient exploration and optimization through open-ended spaces of chemical compounds.

...read moreread less

Abstract: We report a method to convert discrete representations of molecules to and from a multidimensional continuous representation. This model allows us to generate new molecules for efficient exploration and optimization through open-ended spaces of chemical compounds. A deep neural network was trained on hundreds of thousands of existing chemical structures to construct three coupled functions: an encoder, a decoder, and a predictor. The encoder converts the discrete representation of a molecule into a real-valued continuous vector, and the decoder converts these continuous vectors back to discrete molecular representations. The predictor estimates chemical properties from the latent continuous vector representation of the molecule. Continuous representations of molecules allow us to automatically generate novel chemical structures by performing simple operations in the latent space, such as decoding random vectors, perturbing known chemical structures, or interpolating between molecules. Continuous represent...

...read moreread less

1,884 citations

Proceedings Article•

Convolutional networks on graphs for learning molecular fingerprints

[...]

David Duvenaud¹, Dougal Maclaurin¹, Jorge Aguilera-Iparraguirre¹, Rafael Gómez-Bombarelli¹, Timothy D. Hirzel¹, Alán Aspuru-Guzik¹, Ryan P. Adams¹ - Show less +3 more•Institutions (1)

Harvard University¹

07 Dec 2015

TL;DR: In this paper, a convolutional neural network that operates directly on graphs is proposed to learn end-to-end learning of prediction pipelines whose inputs are graphs of arbitrary size and shape.

...read moreread less

Abstract: We introduce a convolutional neural network that operates directly on graphs. These networks allow end-to-end learning of prediction pipelines whose inputs are graphs of arbitrary size and shape. The architecture we present generalizes standard molecular feature extraction methods based on circular fingerprints. We show that these data-driven features are more interpretable, and have better predictive performance on a variety of tasks.

...read moreread less

1,857 citations

Journal Article•DOI•

Automatic chemical design using a data-driven continuous representation of molecules

[...]

Rafael Gómez-Bombarelli, Jennifer N. Wei¹, David Duvenaud², José Miguel Hernández-Lobato³, Benjamin Sanchez-Lengeling¹, Dennis Sheberla¹, Jorge Aguilera-Iparraguirre, Timothy D. Hirzel, Ryan P. Adams⁴, Alán Aspuru-Guzik⁵, Alán Aspuru-Guzik¹ - Show less +7 more•Institutions (5)

Harvard University¹, University of Toronto², University of Cambridge³, Google⁴, Canadian Institute for Advanced Research⁵

07 Oct 2016-arXiv: Learning

TL;DR: A method to convert discrete representations of molecules to and from a multidimensional continuous representation that allows us to generate new molecules for efficient exploration and optimization through open-ended spaces of chemical compounds is reported.

...read moreread less

Abstract: We report a method to convert discrete representations of molecules to and from a multidimensional continuous representation. This model allows us to generate new molecules for efficient exploration and optimization through open-ended spaces of chemical compounds. A deep neural network was trained on hundreds of thousands of existing chemical structures to construct three coupled functions: an encoder, a decoder and a predictor. The encoder converts the discrete representation of a molecule into a real-valued continuous vector, and the decoder converts these continuous vectors back to discrete molecular representations. The predictor estimates chemical properties from the latent continuous vector representation of the molecule. Continuous representations allow us to automatically generate novel chemical structures by performing simple operations in the latent space, such as decoding random vectors, perturbing known chemical structures, or interpolating between molecules. Continuous representations also allow the use of powerful gradient-based optimization to efficiently guide the search for optimized functional compounds. We demonstrate our method in the domain of drug-like molecules and also in the set of molecules with fewer that nine heavy atoms.

...read moreread less

1,462 citations

Journal Article•DOI•

Design of efficient molecular organic light-emitting diodes by a high-throughput virtual screening and experimental approach

[...]

Rafael Gómez-Bombarelli¹, Jorge Aguilera-Iparraguirre¹, Timothy D. Hirzel¹, David Duvenaud¹, Dougal Maclaurin¹, Martin A. Blood-Forsythe¹, Hyun Sik Chae², Markus Einzinger³, Dong-Gwang Ha³, Tony C. Wu³, Georgios Markopoulos³, Soon Ok Jeon², Ho-Suk Kang², Hiroshi Miyazaki², Numata Masaki², Sunghan Kim², Wenliang Huang³, Seongik Hong², Marc A. Baldo³, Ryan P. Adams¹, Alán Aspuru-Guzik¹ - Show less +17 more•Institutions (3)

Harvard University¹, Samsung², Massachusetts Institute of Technology³

01 Oct 2016-Nature Materials

TL;DR: An integrated organic functional material design process that incorporates theoretical insight, quantum chemistry, cheminformatics, machine learning, industrial expertise, organic synthesis, molecular characterization, device fabrication and optoelectronic testing is reported.

...read moreread less

Abstract: A high-throughput virtual screening approach is used to select molecules with efficient, thermally activated delayed fluorescence. The good performance of several selected emitters in organic LED applications has also been confirmed experimentally.

...read moreread less

711 citations

Posted Content•

Convolutional Networks on Graphs for Learning Molecular Fingerprints

[...]

David Duvenaud¹, Dougal Maclaurin¹, Jorge Aguilera-Iparraguirre¹, Rafael Gómez-Bombarelli¹, Timothy D. Hirzel¹, Alán Aspuru-Guzik¹, Ryan P. Adams¹ - Show less +3 more•Institutions (1)

Harvard University¹

30 Sep 2015-arXiv: Learning

TL;DR: A convolutional neural network that operates directly on graphs that allows end-to-end learning of prediction pipelines whose inputs are graphs of arbitrary size and shape is introduced.

...read moreread less

369 citations

1
2
3
4
…
5

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

A Comprehensive Survey on Graph Neural Networks

[...]

Zonghan Wu¹, Shirui Pan², Fengwen Chen¹, Guodong Long¹, Chengqi Zhang¹, Philip S. Yu³ - Show less +2 more•Institutions (3)

University of Technology, Sydney¹, Monash University, Clayton campus², University of Illinois at Chicago³

01 Jan 2021-IEEE Transactions on Neural Networks

TL;DR: This article provides a comprehensive overview of graph neural networks (GNNs) in data mining and machine learning fields and proposes a new taxonomy to divide the state-of-the-art GNNs into four categories, namely, recurrent GNNS, convolutional GNN’s, graph autoencoders, and spatial–temporal Gnns.

...read moreread less

Abstract: Deep learning has revolutionized many machine learning tasks in recent years, ranging from image classification and video processing to speech recognition and natural language understanding. The data in these tasks are typically represented in the Euclidean space. However, there is an increasing number of applications, where data are generated from non-Euclidean domains and are represented as graphs with complex relationships and interdependency between objects. The complexity of graph data has imposed significant challenges on the existing machine learning algorithms. Recently, many studies on extending deep learning approaches for graph data have emerged. In this article, we provide a comprehensive overview of graph neural networks (GNNs) in data mining and machine learning fields. We propose a new taxonomy to divide the state-of-the-art GNNs into four categories, namely, recurrent GNNs, convolutional GNNs, graph autoencoders, and spatial–temporal GNNs. We further discuss the applications of GNNs across various domains and summarize the open-source codes, benchmark data sets, and model evaluation of GNNs. Finally, we propose potential research directions in this rapidly growing field.

...read moreread less

4,584 citations

Book Chapter•DOI•

Modeling Relational Data with Graph Convolutional Networks

[...]

Michael Sejr Schlichtkrull¹, Thomas Kipf¹, Peter Bloem², Rianne van den Berg¹, Ivan Titov¹, Ivan Titov³, Max Welling¹, Max Welling⁴ - Show less +4 more•Institutions (4)

University of Amsterdam¹, VU University Amsterdam², University of Edinburgh³, Canadian Institute for Advanced Research⁴

03 Jun 2018

TL;DR: It is shown that factorization models for link prediction such as DistMult can be significantly improved through the use of an R-GCN encoder model to accumulate evidence over multiple inference steps in the graph, demonstrating a large improvement of 29.8% on FB15k-237 over a decoder-only baseline.

...read moreread less

Abstract: Knowledge graphs enable a wide variety of applications, including question answering and information retrieval. Despite the great effort invested in their creation and maintenance, even the largest (e.g., Yago, DBPedia or Wikidata) remain incomplete. We introduce Relational Graph Convolutional Networks (R-GCNs) and apply them to two standard knowledge base completion tasks: Link prediction (recovery of missing facts, i.e. subject-predicate-object triples) and entity classification (recovery of missing entity attributes). R-GCNs are related to a recent class of neural networks operating on graphs, and are developed specifically to handle the highly multi-relational data characteristic of realistic knowledge bases. We demonstrate the effectiveness of R-GCNs as a stand-alone model for entity classification. We further show that factorization models for link prediction such as DistMult can be significantly improved through the use of an R-GCN encoder model to accumulate evidence over multiple inference steps in the graph, demonstrating a large improvement of 29.8% on FB15k-237 over a decoder-only baseline.

...read moreread less

3,168 citations

Proceedings Article•DOI•

Graph Convolutional Neural Networks for Web-Scale Recommender Systems

[...]

Rex Ying¹, Ruining He, Kaifeng Chen, Pong Eksombatchai, William L. Hamilton¹, Jure Leskovec¹ - Show less +2 more•Institutions (1)

Stanford University¹

19 Jul 2018

TL;DR: A novel method based on highly efficient random walks to structure the convolutions and a novel training strategy that relies on harder-and-harder training examples to improve robustness and convergence of the model are developed.

...read moreread less

Abstract: Recent advancements in deep neural networks for graph-structured data have led to state-of-the-art performance on recommender system benchmarks. However, making these methods practical and scalable to web-scale recommendation tasks with billions of items and hundreds of millions of users remains an unsolved challenge. Here we describe a large-scale deep recommendation engine that we developed and deployed at Pinterest. We develop a data-efficient Graph Convolutional Network (GCN) algorithm, which combines efficient random walks and graph convolutions to generate embeddings of nodes (i.e., items) that incorporate both graph structure as well as node feature information. Compared to prior GCN approaches, we develop a novel method based on highly efficient random walks to structure the convolutions and design a novel training strategy that relies on harder-and-harder training examples to improve robustness and convergence of the model. We also develop an efficient MapReduce model inference algorithm to generate embeddings using a trained model. Overall, we can train on and embed graphs that are four orders of magnitude larger than typical GCN implementations. We show how GCN embeddings can be used to make high-quality recommendations in various settings at Pinterest, which has a massive underlying graph with 3 billion nodes representing pins and boards, and 17 billion edges. According to offline metrics, user studies, as well as A/B tests, our approach generates higher-quality recommendations than comparable deep learning based systems. To our knowledge, this is by far the largest application of deep graph embeddings to date and paves the way for a new generation of web-scale recommender systems based on graph convolutional architectures.

...read moreread less

2,647 citations

Journal Article•DOI•

Geometric Deep Learning: Going beyond Euclidean data

[...]

Michael M. Bronstein¹, Joan Bruna, Yann LeCun², Arthur Szlam³, Pierre Vandergheynst⁴ - Show less +1 more•Institutions (4)

University of Lugano¹, New York University², Facebook³, École Polytechnique Fédérale de Lausanne⁴

11 Jul 2017-IEEE Signal Processing Magazine

TL;DR: In many applications, such geometric data are large and complex (in the case of social networks, on the scale of billions) and are natural targets for machine-learning techniques as mentioned in this paper.

...read moreread less

Abstract: Many scientific fields study data with an underlying structure that is non-Euclidean. Some examples include social networks in computational social sciences, sensor networks in communications, functional networks in brain imaging, regulatory networks in genetics, and meshed surfaces in computer graphics. In many applications, such geometric data are large and complex (in the case of social networks, on the scale of billions) and are natural targets for machine-learning techniques. In particular, we would like to use deep neural networks, which have recently proven to be powerful tools for a broad range of problems from computer vision, natural-language processing, and audio analysis. However, these tools have been most successful on data with an underlying Euclidean or grid-like structure and in cases where the invariances of these structures are built into networks used to model them.

...read moreread less

2,565 citations

Posted Content•

Graph Neural Networks: A Review of Methods and Applications

[...]

Jie Zhou¹, Ganqu Cui¹, Shengding Hu¹, Zhengyan Zhang¹, Cheng Yang², Zhiyuan Liu¹, Lifeng Wang³, Changcheng Li³, Maosong Sun¹ - Show less +5 more•Institutions (3)

Tsinghua University¹, Beijing University of Posts and Telecommunications², Tencent³

20 Dec 2018-arXiv: Learning

TL;DR: A detailed review over existing graph neural network models is provided, systematically categorize the applications, and four open problems for future research are proposed.

...read moreread less

Abstract: Lots of learning tasks require dealing with graph data which contains rich relation information among elements. Modeling physics systems, learning molecular fingerprints, predicting protein interface, and classifying diseases demand a model to learn from graph inputs. In other domains such as learning from non-structural data like texts and images, reasoning on extracted structures (like the dependency trees of sentences and the scene graphs of images) is an important research topic which also needs graph reasoning models. Graph neural networks (GNNs) are neural models that capture the dependence of graphs via message passing between the nodes of graphs. In recent years, variants of GNNs such as graph convolutional network (GCN), graph attention network (GAT), graph recurrent network (GRN) have demonstrated ground-breaking performances on many deep learning tasks. In this survey, we propose a general design pipeline for GNN models and discuss the variants of each component, systematically categorize the applications, and propose four open problems for future research.

...read moreread less

2,494 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse