Home
/
Authors
/
Christine A. Orengo

Author

Christine A. Orengo

Other affiliations: European Bioinformatics Institute, National Institute for Medical Research, Birkbeck, University of London

Bio: Christine A. Orengo is an academic researcher from University College London. The author has contributed to research in topics: Structural genomics & Protein domain. The author has an hindex of 78, co-authored 271 publications receiving 28200 citations. Previous affiliations of Christine A. Orengo include European Bioinformatics Institute & National Institute for Medical Research.

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1994
1993
1992
1990
1989
1981

Papers

PDF

Open Access

More filters

Journal Article•DOI•

CATH – a hierarchic classification of protein domain structures

[...]

Christine A. Orengo¹, A.D. Michie¹, Susan Jones¹, David T. Jones², Mark B. Swindells, Janet M. Thornton¹ - Show less +2 more•Institutions (2)

University College London¹, University of Warwick²

15 Aug 1997-Structure

TL;DR: Analysis of the structural families generated by CATH reveals the prominent features of protein structure space and a database of well-characterised protein structure families will facilitate the assignment of structure-function/evolution relationships to both known and newly determined protein structures.

...read moreread less

2,551 citations

Journal Article•DOI•

InterPro: the integrative protein signature database

[...]

Sarah Hunter¹, Rolf Apweiler, Teresa K. Attwood, Amos Marc Bairoch, Alex Bateman, David Binns, Peer Bork, Ujjwal Das, Louise C. Daugherty, Lauranne Duquenne, Robert D. Finn, Julian Gough, Daniel H. Haft, Nicolas Hulo, Daniel Kahn, Elizabeth Kelly, Aurélie Laugraud, Ivica Letunic, David M. Lonsdale, Rodrigo Lopez, Martin Madera, John Maslen, Craig McAnulla, Jennifer McDowall, Jaina Mistry, Alex L. Mitchell, Nicola Mulder, Darren A. Natale, Christine A. Orengo, Antony F. Quinn, Jeremy D. Selengut, Christian J. A. Sigrist, Manjula Thimma, Paul Thomas, Franck Valentin, Derek Wilson, Cathy H. Wu, Corin Yeats - Show less +34 more•Institutions (1)

European Bioinformatics Institute¹

01 Jan 2009-Nucleic Acids Research

TL;DR: The InterPro database integrates together predictive models or ‘signatures’ representing protein domains, families and functional sites from multiple, diverse source databases: Gene3D, PANTHER, Pfam, PIRSF, PRINTS, ProDom, PROSITE, SMART, SUPERFAMILY and TIGRFAMs.

...read moreread less

Abstract: The InterPro database (http://www.ebi.ac.uk/interpro/) integrates together predictive models or 'signatures' representing protein domains, families and functional sites from multiple, diverse source databases: Gene3D, PANTHER, Pfam, PIRSF, PRINTS, ProDom, PROSITE, SMART, SUPERFAMILY and TIGRFAMs. Integration is performed manually and approximately half of the total approximately 58,000 signatures available in the source databases belong to an InterPro entry. Recently, we have started to also display the remaining un-integrated signatures via our web interface. Other developments include the provision of non-signature data, such as structural data, in new XML files on our FTP site, as well as the inclusion of matchless UniProtKB proteins in the existing match XML files. The web interface has been extended and now links out to the ADAN predicted protein-protein interaction database and the SPICE and Dasty viewers. The latest public release (v18.0) covers 79.8% of UniProtKB (v14.1) and consists of 16 549 entries. InterPro data may be accessed either via the web address above, via web services, by downloading files by anonymous FTP or by using the InterProScan search software (http://www.ebi.ac.uk/Tools/InterProScan/).

...read moreread less

1,834 citations

Journal Article•DOI•

InterPro in 2017-beyond protein family and domain annotations

[...]

Robert D. Finn¹, Teresa K. Attwood², Patricia C. Babbitt³, Alex Bateman¹, Peer Bork, Alan Bridge⁴, Hsin-Yu Chang¹, Zsuzsanna Dosztányi⁵, Sara El-Gebali¹, Matthew Fraser¹, Julian Gough⁶, David R. Haft⁷, Gemma L. Holliday³, Hongzhan Huang⁸, Xiaosong Huang⁹, Ivica Letunic, Rodrigo Lopez¹, Shennan Lu¹⁰, Aron Marchler-Bauer¹⁰, Huaiyu Mi⁹, Jaina Mistry¹, Darren A. Natale¹¹, Marco Necci¹², Gift Nuka¹, Christine A. Orengo¹³, Youngmi Park¹, Sebastien Pesseat¹, Damiano Piovesan¹², Simon C. Potter¹, Neil D. Rawlings¹, Nicole Redaschi⁴, Lorna Richardson¹, Catherine Rivoire⁴, Amaia Sangrador-Vegas¹, Christian J. A. Sigrist⁴, Ian Sillitoe¹³, Ben Smithers⁶, Silvano Squizzato¹, Granger G. Sutton⁷, Narmada Thanki¹⁰, Paul Thomas⁹, Silvio C. E. Tosatto¹², Cathy H. Wu⁸, Ioannis Xenarios⁴, Lai-Su L. Yeh¹¹, Siew Yit Young¹, Alex L. Mitchell¹ - Show less +43 more•Institutions (13)

European Bioinformatics Institute¹, University of Manchester², University of California, San Francisco³, Swiss Institute of Bioinformatics⁴, Eötvös Loránd University⁵, University of Bristol⁶, J. Craig Venter Institute⁷, University of Delaware⁸, University of Southern California⁹, National Institutes of Health¹⁰, Georgetown University Medical Center¹¹, University of Padua¹², University College London¹³

04 Jan 2017-Nucleic Acids Research

TL;DR: Recent developments with InterPro are reported, including the addition of two new databases, and the functionality to include residue-level annotation and prediction of intrinsic disorder, which enrich the annotations provided by InterPro, increase the overall number of residues annotated and allow more specific functional inferences.

...read moreread less

Abstract: InterPro (http://www.ebi.ac.uk/interpro/) is a freely available database used to classify protein sequences into families and to predict the presence of important domains and sites. InterProScan is the underlying software that allows both protein and nucleic acid sequences to be searched against InterPro's predictive models, which are provided by its member databases. Here, we report recent developments with InterPro and its associated software, including the addition of two new databases (SFLD and CDD), and the functionality to include residue-level annotation and prediction of intrinsic disorder. These developments enrich the annotations provided by InterPro, increase the overall number of residues annotated and allow more specific functional inferences.

...read moreread less

1,246 citations

Journal Article•DOI•

The InterPro protein families database: the classification resource after 15 years

[...]

Alex L. Mitchell¹, Hsin-Yu Chang¹, Louise C. Daugherty¹, Matthew Fraser¹, Sarah Hunter¹, Rodrigo Lopez¹, Craig McAnulla¹, Conor McMenamin¹, Gift Nuka¹, Sebastien Pesseat¹, Amaia Sangrador-Vegas¹, Maxim Scheremetjew¹, Claudia Rato¹, Siew-Yit Yong¹, Alex Bateman¹, Marco Punta¹, Teresa K. Attwood², Christian J. A. Sigrist³, Nicole Redaschi³, Catherine Rivoire³, Ioannis Xenarios³, Daniel Kahn, Dominique Guyot, Peer Bork¹, Ivica Letunic¹, Julian Gough⁴, Matt E. Oates⁴, Daniel H. Haft⁵, Hongzhan Huang⁶, Darren A. Natale⁶, Cathy H. Wu⁶, Christine A. Orengo⁷, Ian Sillitoe⁷, Huaiyu Mi⁸, Paul Thomas⁸, Robert D. Finn¹ - Show less +32 more•Institutions (8)

European Bioinformatics Institute¹, University of Manchester², Swiss Institute of Bioinformatics³, University of Bristol⁴, J. Craig Venter Institute⁵, Georgetown University Medical Center⁶, University College London⁷, University of Southern California⁸

28 Jan 2015-Nucleic Acids Research

TL;DR: The new domain architecture search tool is described and the process of mapping of Gene Ontology terms to InterPro is outlined, and the challenges faced by the resource given the explosive growth in sequence data in recent years are discussed.

...read moreread less

Abstract: The InterPro database (http://www.ebi.ac.uk/interpro/) is a freely available resource that can be used to classify sequences into protein families and to predict the presence of important domains and sites. Central to the InterPro database are predictive models, known as signatures, from a range of different protein family databases that have different biological focuses and use different methodological approaches to classify protein families and domains. InterPro integrates these signatures, capitalizing on the respective strengths of the individual databases, to produce a powerful protein classification resource. Here, we report on the status of InterPro as it enters its 15th year of operation, and give an overview of new developments with the database and its associated Web interfaces and software. In particular, the new domain architecture search tool is described and the process of mapping of Gene Ontology terms to InterPro is outlined. We also discuss the challenges faced by the resource given the explosive growth in sequence data in recent years. InterPro (version 48.0) contains 36 766 member database signatures integrated into 26 238 InterPro entries, an increase of over 3993 entries (5081 signatures), since 2012.

...read moreread less

1,189 citations

Journal Article•DOI•

InterPro in 2019: improving coverage, classification and access to protein sequence annotations.

[...]

Alex L. Mitchell¹, Teresa K. Attwood², Patricia C. Babbitt³, Matthias Blum¹, Peer Bork, Alan Bridge⁴, Shoshana D. Brown³, Hsin-Yu Chang¹, Sara El-Gebali¹, Matthew Fraser¹, Julian Gough⁵, David R. Haft⁶, Hongzhan Huang⁷, Ivica Letunic, Rodrigo Lopez¹, Aurelien Luciani¹, Fábio Madeira¹, Aron Marchler-Bauer⁸, Huaiyu Mi⁹, Darren A. Natale¹⁰, Marco Necci¹¹, Marco Necci¹², Gift Nuka¹, Christine A. Orengo¹³, Arun Prasad Pandurangan⁵, Typhaine Paysan-Lafosse¹, Sebastien Pesseat¹, Simon C. Potter¹, Matloob Qureshi¹, Neil D. Rawlings¹, Nicole Redaschi⁴, Lorna Richardson¹, Catherine Rivoire⁴, Gustavo A. Salazar¹, Amaia Sangrador-Vegas¹, Christian J. A. Sigrist⁴, Ian Sillitoe¹³, Granger G. Sutton⁶, Narmada Thanki⁸, Paul Thomas⁹, Silvio C. E. Tosatto¹¹, Siew-Yit Yong¹, Robert D. Finn¹ - Show less +39 more•Institutions (13)

European Bioinformatics Institute¹, University of Manchester², University of California, San Francisco³, Swiss Institute of Bioinformatics⁴, Laboratory of Molecular Biology⁵, J. Craig Venter Institute⁶, University of Delaware⁷, National Institutes of Health⁸, University of Southern California⁹, Georgetown University Medical Center¹⁰, University of Padua¹¹, University of Udine¹², University College London¹³

08 Jan 2019-Nucleic Acids Research

TL;DR: Recent developments with InterPro (version 70.0) and its associated software are reported, including an 18% growth in the size of the database in terms on new InterPro entries, updates to content, the inclusion of an additional entry type, refined modelling of discontinuous domains, and the development of a new programmatic interface and website.

...read moreread less

Abstract: The InterPro database (http://www.ebi.ac.uk/interpro/) classifies protein sequences into families and predicts the presence of functionally important domains and sites. Here, we report recent developments with InterPro (version 70.0) and its associated software, including an 18% growth in the size of the database in terms on new InterPro entries, updates to content, the inclusion of an additional entry type, refined modelling of discontinuous domains, and the development of a new programmatic interface and website. These developments extend and enrich the information provided by InterPro, and provide greater flexibility in terms of data access. We also show that InterPro's sequence coverage has kept pace with the growth of UniProtKB, and discuss how our evaluation of residue coverage may help guide future curation activities.

...read moreread less

1,167 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

The Protein Data Bank

[...]

Helen M. Berman¹, John D. Westbrook, Zukang Feng, Gary L. Gilliland, Talapady N. Bhat, Helge Weissig, Ilya N. Shindyalov, Philip E. Bourne - Show less +4 more•Institutions (1)

Rutgers University¹

01 Jan 2000-Nucleic Acids Research

TL;DR: The goals of the PDB are described, the systems in place for data deposition and access, how to obtain further information and plans for the future development of the resource are described.

...read moreread less

Abstract: The Protein Data Bank (PDB; http://www.rcsb.org/pdb/ ) is the single worldwide archive of structural data of biological macromolecules. This paper describes the goals of the PDB, the systems in place for data deposition and access, how to obtain further information, and near-term plans for the future development of the resource.

...read moreread less

34,239 citations

Journal Article•DOI•

The Pfam protein families database

[...]

Marco Punta¹, Penny Coggill¹, Ruth Y. Eberhardt¹, Jaina Mistry¹, John Tate¹, Chris Boursnell¹, Ningze Pang¹, Kristoffer Forslund¹, Goran Ceric¹, Jody Clements¹, Andreas Heger¹, Liisa Holm¹, Erik L. L. Sonnhammer¹, Sean R. Eddy¹, Alex Bateman¹, Robert D. Finn¹ - Show less +12 more•Institutions (1)

Wellcome Trust Sanger Institute¹

01 Jan 2000-Nucleic Acids Research

TL;DR: The definition and use of family-specific, manually curated gathering thresholds are explained and some of the features of domains of unknown function (also known as DUFs) are discussed, which constitute a rapidly growing class of families within Pfam.

...read moreread less

Abstract: Pfam is a widely used database of protein families and domains. This article describes a set of major updates that we have implemented in the latest release (version 24.0). The most important change is that we now use HMMER3, the latest version of the popular profile hidden Markov model package. This software is approximately 100 times faster than HMMER2 and is more sensitive due to the routine use of the forward algorithm. The move to HMMER3 has necessitated numerous changes to Pfam that are described in detail. Pfam release 24.0 contains 11,912 families, of which a large number have been significantly updated during the past two years. Pfam is available via servers in the UK (http://pfam.sanger.ac.uk/), the USA (http://pfam.janelia.org/) and Sweden (http://pfam.sbc.su.se/).

...read moreread less

14,075 citations

Journal Article•DOI•

Pfam: the protein families database.

[...]

Robert D. Finn¹, Alex Bateman², Jody Clements¹, Penelope Coggill², Ruth Y. Eberhardt², Sean R. Eddy¹, Andreas Heger, Kirstie Hetherington³, Liisa Holm, Jaina Mistry², Erik L. L. Sonnhammer⁴, John Tate², Marco Punta² - Show less +9 more•Institutions (4)

Howard Hughes Medical Institute¹, European Bioinformatics Institute², Wellcome Trust Sanger Institute³, Stockholm University⁴

01 Jan 2014-Nucleic Acids Research

TL;DR: Pfam as discussed by the authors is a widely used database of protein families, containing 14 831 manually curated entries in the current version, version 27.0, and has been updated several times since 2012.

...read moreread less

Abstract: Pfam, available via servers in the UK (http://pfam.sanger.ac.uk/) and the USA (http://pfam.janelia.org/), is a widely used database of protein families, containing 14 831 manually curated entries in the current release, version 27.0. Since the last update article 2 years ago, we have generated 1182 new families and maintained sequence coverage of the UniProt Knowledgebase (UniProtKB) at nearly 80%, despite a 50% increase in the size of the underlying sequence database. Since our 2012 article describing Pfam, we have also undertaken a comprehensive review of the features that are provided by Pfam over and above the basic family data. For each feature, we determined the relevance, computational burden, usage statistics and the functionality of the feature in a website context. As a consequence of this review, we have removed some features, enhanced others and developed new ones to meet the changing demands of computational biology. Here, we describe the changes to Pfam content. Notably, we now provide family alignments based on four different representative proteome sequence data sets and a new interactive DNA search interface. We also discuss the mapping between Pfam and known 3D structures.

...read moreread less

9,415 citations

Journal Article•DOI•

STRING v10: protein–protein interaction networks, integrated over the tree of life

[...]

Damian Szklarczyk¹, Andrea Franceschini¹, Stefan Wyder¹, Kristoffer Forslund, Davide Heller¹, Jaime Huerta-Cepas, Milan Simonovic¹, Alexander Roth¹, Alberto Santos², Kalliopi Tsafou², Michael Kuhn³, Peer Bork, Lars Juhl Jensen², Christian von Mering¹ - Show less +10 more•Institutions (3)

Swiss Institute of Bioinformatics¹, University of Copenhagen², Dresden University of Technology³

28 Jan 2015-Nucleic Acids Research

TL;DR: H hierarchical and self-consistent orthology annotations are introduced for all interacting proteins, grouping the proteins into families at various levels of phylogenetic resolution in the STRING database.

...read moreread less

Abstract: The many functional partnerships and interactions that occur between proteins are at the core of cellular processing and their systematic characterization helps to provide context in molecular systems biology. However, known and predicted interactions are scattered over multiple resources, and the available data exhibit notable differences in terms of quality and completeness. The STRING database (http://string-db.org) aims to provide a critical assessment and integration of protein-protein interactions, including direct (physical) as well as indirect (functional) associations. The new version 10.0 of STRING covers more than 2000 organisms, which has necessitated novel, scalable algorithms for transferring interaction information between organisms. For this purpose, we have introduced hierarchical and self-consistent orthology annotations for all interacting proteins, grouping the proteins into families at various levels of phylogenetic resolution. Further improvements in version 10.0 include a completely redesigned prediction pipeline for inferring protein-protein associations from co-expression data, an API interface for the R computing environment and improved statistical analysis for enrichment tests in user-provided networks.

...read moreread less

8,224 citations

Journal Article•DOI•

J. Appl. Cryst.の発刊に際して

[...]

良二上田

10 Mar 1970

8,159 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse