Home
/
Authors
/
Robert Hoehndorf

Author

Robert Hoehndorf

King Abdullah University of Science and Technology

Other affiliations: Leipzig University, Max Planck Society, University of Cambridge ...read more

Bio: Robert Hoehndorf is an academic researcher from King Abdullah University of Science and Technology. The author has contributed to research in topics: Ontology (information science) & Open Biomedical Ontologies. The author has an hindex of 37, co-authored 212 publications receiving 4634 citations. Previous affiliations of Robert Hoehndorf include Leipzig University & Max Planck Society.

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006

Papers

PDF

Open Access

More filters

Journal Article•DOI•

DeepGO: predicting protein functions from sequence and interactions using a deep ontology-aware classifier.

[...]

Maxat Kulmanov¹, Mohammed Asif Khan¹, Robert Hoehndorf¹•Institutions (1)

King Abdullah University of Science and Technology¹

15 Feb 2018-Bioinformatics

TL;DR: This work has developed a novel method to predict protein function from sequence that uses deep learning to learn features from protein sequences as well as a cross-species protein–protein interaction network.

...read moreread less

Abstract: Motivation A large number of protein sequences are becoming available through the application of novel high-throughput sequencing technologies. Experimental functional characterization of these proteins is time-consuming and expensive, and is often only done rigorously for few selected model organisms. Computational function prediction approaches have been suggested to fill this gap. The functions of proteins are classified using the Gene Ontology (GO), which contains over 40 000 classes. Additionally, proteins have multiple functions, making function prediction a large-scale, multi-class, multi-label problem. Results We have developed a novel method to predict protein function from sequence. We use deep learning to learn features from protein sequences as well as a cross-species protein-protein interaction network. Our approach specifically outputs information in the structure of the GO and utilizes the dependencies between GO classes as background information to construct a deep learning model. We evaluate our method using the standards established by the Computational Assessment of Function Annotation (CAFA) and demonstrate a significant improvement over baseline methods such as BLAST, in particular for predicting cellular locations. Availability and implementation Web server: http://deepgo.bio2vec.net, Source code: https://github.com/bio-ontology-research-group/deepgo. Contact robert.hoehndorf@kaust.edu.sa. Supplementary information Supplementary data are available at Bioinformatics online.

...read moreread less

309 citations

Journal Article•DOI•

The role of ontologies in biological and biomedical research: a functional perspective

[...]

Robert Hoehndorf¹, Paul N. Schofield, Georgios V. Gkoutos•Institutions (1)

King Abdullah University of Science and Technology¹

01 Nov 2015-Briefings in Bioinformatics

TL;DR: A functional perspective on ontologies in biology and biomedicine is provided, focusing on what ontologies can do and describing how they can be used in support of integrative research.

...read moreread less

Abstract: Ontologies are widely used in biological and biomedical research. Their success lies in their combination of four main features present in almost all ontologies: provision of standard identifiers for classes and relations that represent the phenomena within a domain; provision of a vocabulary for a domain; provision of metadata that describes the intended meaning of the classes and relations in ontologies; and the provision of machine-readable axioms and definitions that enable computational access to some aspects of the meaning of classes and relations. While each of these features enables applications that facilitate data integration, data access and analysis, a great potential lies in the possibility of combining these four features to support integrative analysis and interpretation of multimodal data. Here, we provide a functional perspective on ontologies in biology and biomedicine, focusing on what ontologies can do and describing how they can be used in support of integrative research. We also outline perspectives for using ontologies in data-driven science, in particular their application in structured data mining and machine learning applications.

...read moreread less

240 citations

Journal Article•DOI•

The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens

[...]

Naihui Zhou¹, Yuxiang Jiang², Timothy Bergquist³, Alexandra J. Lee⁴ +185 more•Institutions (71)

19 Nov 2019-Genome Biology

TL;DR: The third CAFA challenge, CAFA3, that featured an expanded analysis over the previous CAFA rounds, both in terms of volume of data analyzed and the types of analysis performed, concluded that while predictions of the molecular function and biological process annotations have slightly improved over time, those of the cellular component have not.

...read moreread less

Abstract: The Critical Assessment of Functional Annotation (CAFA) is an ongoing, global, community-driven effort to evaluate and improve the computational annotation of protein function. Here, we report on the results of the third CAFA challenge, CAFA3, that featured an expanded analysis over the previous CAFA rounds, both in terms of volume of data analyzed and the types of analysis performed. In a novel and major new development, computational predictions and assessment goals drove some of the experimental assays, resulting in new functional annotations for more than 1000 genes. Specifically, we performed experimental whole-genome mutation screening in Candida albicans and Pseudomonas aureginosa genomes, which provided us with genome-wide experimental data for genes associated with biofilm formation and motility. We further performed targeted assays on selected genes in Drosophila melanogaster, which we suspected of being involved in long-term memory. We conclude that while predictions of the molecular function and biological process annotations have slightly improved over time, those of the cellular component have not. Term-centric prediction of experimental annotations remains equally challenging; although the performance of the top methods is significantly better than the expectations set by baseline methods in C. albicans and D. melanogaster, it leaves considerable room and need for improvement. Finally, we report that the CAFA community now involves a broad range of participants with expertise in bioinformatics, biological experimentation, biocuration, and bio-ontologies, working together to improve functional annotation, computational function prediction, and our ability to manage big data in the era of large experimental screens.

...read moreread less

227 citations

Journal Article•DOI•

PhenomeNET: a whole-phenome approach to disease gene discovery

[...]

Robert Hoehndorf¹, Paul N. Schofield¹, Georgios V. Gkoutos¹•Institutions (1)

University of Cambridge¹

01 Oct 2011-Nucleic Acids Research

TL;DR: It is demonstrated that PhenomeNET can identify orthologous genes, genes involved in the same pathway and gene–disease associations through the comparison of mutant phenotypes, and is applied to prioritize genes for rare and orphan diseases for which the molecular basis is unknown.

...read moreread less

Abstract: Phenotypes are investigated in model organisms to understand and reveal the molecular mechanisms underlying disease. Phenotype ontologies were developed to capture and compare phenotypes within the context of a single species. Recently, these ontologies were augmented with formal class definitions that may be utilized to integrate phenotypic data and enable the direct comparison of phenotypes between different species. We have developed a method to transform phenotype ontologies into a formal representation, combine phenotype ontologies with anatomy ontologies, and apply a measure of semantic similarity to construct the PhenomeNET cross-species phenotype network. We demonstrate that PhenomeNET can identify orthologous genes, genes involved in the same pathway and gene–disease associations through the comparison of mutant phenotypes. We provide evidence that the Adam19 and Fgf15 genes in mice are involved in the tetralogy of Fallot, and, using zebrafish phenotypes, propose the hypothesis that the mammalian homologs of Cx36.7 and Nkx2.5 lie in a pathway controlling cardiac morphogenesis and electrical conductivity which, when defective, cause the tetralogy of Fallot phenotype. Our method implements a whole-phenome approach toward disease gene discovery and can be applied to prioritize genes for rare and orphan diseases for which the molecular basis is unknown.

...read moreread less

221 citations

Journal Article•DOI•

Text-mining solutions for biomedical research: enabling integrative biology.

[...]

Dietrich Rebholz-Schuhmann¹, Anika Oellrich¹, Robert Hoehndorf²•Institutions (2)

European Bioinformatics Institute¹, University of Cambridge²

01 Dec 2012-Nature Reviews Genetics

TL;DR: The latest advancements in automated literature analysis are explored and its contribution to innovative research approaches are explored.

...read moreread less

Abstract: In response to the unbridled growth of information in literature and biomedical databases, researchers require efficient means of handling and extracting information. As well as providing background information for research, scientific publications can be processed to transform textual information into database content or complex networks and can be integrated with existing knowledge resources to suggest novel hypotheses. Information extraction and text data analysis can be particularly relevant and helpful in genetics and biomedical research, in which up-to-date information about complex processes involving genes, proteins and phenotypes is crucial. Here we explore the latest advancements in automated literature analysis and its contribution to innovative research approaches.

...read moreread less

215 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Harrison's Principles of Internal Medicine

[...]

JudyAnn Bigby

01 Feb 1988-Archives of Dermatology

TL;DR: The 11th edition of Harrison's Principles of Internal Medicine welcomes Anthony Fauci to its editorial staff, in addition to more than 85 new contributors.

...read moreread less

Abstract: The 11th edition of Harrison's Principles of Internal Medicine welcomes Anthony Fauci to its editorial staff, in addition to more than 85 new contributors. While the organization of the book is similar to previous editions, major emphasis has been placed on disorders that affect multiple organ systems. Important advances in genetics, immunology, and oncology are emphasized. Many chapters of the book have been rewritten and describe major advances in internal medicine. Subjects that received only a paragraph or two of attention in previous editions are now covered in entire chapters. Among the chapters that have been extensively revised are the chapters on infections in the compromised host, on skin rashes in infections, on many of the viral infections, including cytomegalovirus and Epstein-Barr virus, on sexually transmitted diseases, on diabetes mellitus, on disorders of bone and mineral metabolism, and on lymphadenopathy and splenomegaly. The major revisions in these chapters and many

...read moreread less

6,968 citations

Journal Article•DOI•

DisGeNET: a comprehensive platform integrating information on human disease-associated genes and variants

[...]

Janet Piñero¹, Àlex Bravo¹, Núria Queralt-Rosinach¹, Alba Gutiérrez-Sacristán¹, Jordi Deu-Pons¹, Emilio Centeno¹, Javier Garcia-Garcia¹, Ferran Sanz¹, Laura I. Furlong¹ - Show less +5 more•Institutions (1)

Pompeu Fabra University¹

04 Jan 2017-Nucleic Acids Research

TL;DR: DisGeNET is a versatile platform that can be used for different research purposes including the investigation of the molecular underpinnings of specific human diseases and their comorbidities, the analysis of the properties of disease genes, the generation of hypothesis on drug therapeutic action and drug adverse effects, the validation of computationally predicted disease genes and the evaluation of text-mining methods performance.

...read moreread less

Abstract: The information about the genetic basis of human diseases lies at the heart of precision medicine and drug discovery. However, to realize its full potential to support these goals, several problems, such as fragmentation, heterogeneity, availability and different conceptualization of the data must be overcome. To provide the community with a resource free of these hurdles, we have developed DisGeNET (http://www.disgenet.org), one of the largest available collections of genes and variants involved in human diseases. DisGeNET integrates data from expert curated repositories, GWAS catalogues, animal models and the scientific literature. DisGeNET data are homogeneously annotated with controlled vocabularies and community-driven ontologies. Additionally, several original metrics are provided to assist the prioritization of genotype-phenotype relationships. The information is accessible through a web interface, a Cytoscape App, an RDF SPARQL endpoint, scripts in several programming languages and an R package. DisGeNET is a versatile platform that can be used for different research purposes including the investigation of the molecular underpinnings of specific human diseases and their comorbidities, the analysis of the properties of disease genes, the generation of hypothesis on drug therapeutic action and drug adverse effects, the validation of computationally predicted disease genes and the evaluation of text-mining methods performance.

...read moreread less

1,718 citations

[서평]「The Unified Modeling Language User Guide」

[...]

강문설

01 Dec 1999

1,636 citations

Journal Article•DOI•

The ChEMBL database in 2017.

[...]

Anna Gaulton¹, Anne Hersey¹, Michal Nowotka¹, A. Patrícia Bento¹, Jon Chambers¹, David Mendez¹, Prudence Mutowo¹, Francis Atkinson¹, Louisa J. Bellis¹, Elena Cibrian-Uhalte¹, Mark Davies¹, Nathan Dedman¹, Anneli Karlsson¹, María Paula Magariños¹, John P. Overington¹, George Papadatos¹, Ines Smit¹, Andrew R. Leach¹ - Show less +14 more•Institutions (1)

European Bioinformatics Institute¹

04 Jan 2017-Nucleic Acids Research

TL;DR: ChEMBL is an open large-scale bioactivity database that includes the annotation of assays and targets using ontologies, the inclusion of targets and indications for clinical candidates, addition of metabolic pathways for drugs and calculation of structural alerts.

...read moreread less

Abstract: ChEMBL is an open large-scale bioactivity database (https://www.ebi.ac.uk/chembl), previously described in the 2012 and 2014 Nucleic Acids Research Database Issues. Since then, alongside the continued extraction of data from the medicinal chemistry literature, new sources of bioactivity data have also been added to the database. These include: deposited data sets from neglected disease screening; crop protection data; drug metabolism and disposition data and bioactivity data from patents. A number of improvements and new features have also been incorporated. These include the annotation of assays and targets using ontologies, the inclusion of targets and indications for clinical candidates, addition of metabolic pathways for drugs and calculation of structural alerts. The ChEMBL data can be accessed via a web-interface, RDF distribution, data downloads and RESTful web-services.

...read moreread less

1,601 citations

Journal Article•DOI•

The ChEMBL bioactivity database: an update

[...]

A. Patrícia Bento¹, Anna Gaulton¹, Anne Hersey¹, Louisa J. Bellis¹, Jon Chambers¹, Mark Davies¹, Felix A. Kruger¹, Yvonne Light¹, Lora Mak¹, Shaun McGlinchey¹, Michal Nowotka¹, George Papadatos¹, Rita Santos¹, John P. Overington¹ - Show less +10 more•Institutions (1)

European Bioinformatics Institute¹

01 Jan 2014-Nucleic Acids Research

TL;DR: More comprehensive tracking of compounds from research stages through clinical development to market is provided through the inclusion of data from United States Adopted Name applications and a new richer data model for representing drug targets has been developed.

...read moreread less

Abstract: ChEMBL is an open large-scale bioactivity database (https://www.ebi.ac.uk/chembl), previously described in the 2012 Nucleic Acids Research Database Issue. Since then, a variety of new data sources and improvements in functionality have contributed to the growth and utility of the resource. In particular, more comprehensive tracking of compounds from research stages through clinical development to market is provided through the inclusion of data from United States Adopted Name applications; a new richer data model for representing drug targets has been developed; and a number of methods have been put in place to allow users to more easily identify reliable data. Finally, access to ChEMBL is now available via a new Resource Description Framework format, in addition to the web-based interface, data downloads and web services.

...read moreread less

1,302 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse