Home
/
Authors
/
Daniel Barrell

Author

Daniel Barrell

Other affiliations: Wellcome Trust Sanger Institute, Wellcome Trust

Bio: Daniel Barrell is an academic researcher from European Bioinformatics Institute. The author has contributed to research in topics: Annotation & UniProt. The author has an hindex of 25, co-authored 29 publications receiving 18261 citations. Previous affiliations of Daniel Barrell include Wellcome Trust Sanger Institute & Wellcome Trust.

Topics: Annotation, UniProt, Open Biomedical Ontologies, Genome, Ensembl ...read more

Papers

PDF

Open Access

More filters

Journal Article•DOI•

GENCODE: The reference human genome annotation for The ENCODE Project

[...]

Jennifer Harrow¹, Adam Frankish¹, José M. González¹, Electra Tapanari¹, Mark Diekhans², Felix Kokocinski¹, Bronwen Aken¹, Daniel Barrell¹, Amonida Zadissa¹, Stephen M. J. Searle¹, If H. A. Barnes¹, Alexandra Bignell¹, Veronika Boychenko¹, Toby Hunt¹, M. Kay¹, Gaurab Mukherjee¹, Jeena Rajan¹, Gloria Despacio-Reyes¹, Gary Saunders¹, Charles A. Steward¹, Rachel A. Harte², Michael F. Lin³, Cédric Howald⁴, Andrea Tanzer, Thomas Derrien⁴, Jacqueline Chrast⁴, Nathalie Walters⁴, Suganthi Balasubramanian⁵, Baikang Pei⁵, Michael L. Tress, Jose Manuel Rodriguez, Iakes Ezkurdia, Jeltje Van Baren, Michael R. Brent, David Haussler², Manolis Kellis³, Alfonso Valencia, Alexandre Reymond⁴, Mark Gerstein⁵, Roderic Guigó, Tim Hubbard¹ - Show less +37 more•Institutions (5)

Wellcome Trust Sanger Institute¹, University of California, Santa Cruz², Massachusetts Institute of Technology³, University of Lausanne⁴, Yale University⁵

01 Sep 2012-Genome Research

TL;DR: This work has examined the completeness of the transcript annotation and found that 35% of transcriptional start sites are supported by CAGE clusters and 62% of protein-coding genes have annotated polyA sites, and over one-third of GENCODE protein-Coding genes aresupported by peptide hits derived from mass spectrometry spectra submitted to Peptide Atlas.

...read moreread less

Abstract: The GENCODE Consortium aims to identify all gene features in the human genome using a combination of computational analysis, manual annotation, and experimental validation. Since the first public release of this annotation data set, few new protein-coding loci have been added, yet the number of alternative splicing transcripts annotated has steadily increased. The GENCODE 7 release contains 20,687 protein-coding and 9640 long noncoding RNA loci and has 33,977 coding transcripts not represented in UCSC genes and RefSeq. It also has the most comprehensive annotation of long noncoding RNA (lncRNA) loci publicly available with the predominant transcript form consisting of two exons. We have examined the completeness of the transcript annotation and found that 35% of transcriptional start sites are supported by CAGE clusters and 62% of protein-coding genes have annotated polyA sites. Over one-third of GENCODE protein-coding genes are supported by peptide hits derived from mass spectrometry spectra submitted to Peptide Atlas. New models derived from the Illumina Body Map 2.0 RNA-seq data identify 3689 new loci not currently in GENCODE, of which 3127 consist of two exon models indicating that they are possibly unannotated long noncoding loci. GENCODE 7 is publicly available from gencodegenes.org and via the Ensembl and UCSC Genome Browsers.

...read moreread less

4,281 citations

Journal Article•DOI•

The Gene Ontology (GO) database and informatics resource.

[...]

Midori A. Harris, Jennifer I. Clark¹, Ireland A¹, Jane Lomax¹, Michael Ashburner², Michael Ashburner¹, R. Foulger¹, R. Foulger², Karen Eilbeck³, Karen Eilbeck¹, Suzanna E. Lewis¹, Suzanna E. Lewis³, B. Marshall¹, B. Marshall³, Christopher J. Mungall¹, Christopher J. Mungall³, J. Richter³, J. Richter¹, Gerald M. Rubin³, Gerald M. Rubin¹, Judith A. Blake¹, Carol J. Bult¹, Dolan M¹, Drabkin H¹, Janan T. Eppig¹, Hill Dp¹, L. Ni¹, Ringwald M¹, Rama Balakrishnan¹, Rama Balakrishnan⁴, J. M. Cherry¹, J. M. Cherry⁴, Karen R. Christie¹, Karen R. Christie⁴, Maria C. Costanzo¹, Maria C. Costanzo⁴, Selina S. Dwight¹, Selina S. Dwight⁴, Stacia R. Engel⁴, Stacia R. Engel¹, Dianna G. Fisk¹, Dianna G. Fisk⁴, Jodi E. Hirschman⁴, Jodi E. Hirschman¹, Eurie L. Hong⁴, Eurie L. Hong¹, Robert S. Nash¹, Robert S. Nash⁴, Anand Sethuraman¹, Anand Sethuraman⁴, Chandra L. Theesfeld⁴, Chandra L. Theesfeld¹, David Botstein¹, David Botstein⁵, Kara Dolinski⁵, Kara Dolinski¹, Becket Feierbach¹, Becket Feierbach⁵, Tanya Z. Berardini¹, Tanya Z. Berardini⁶, S. Mundodi⁶, S. Mundodi¹, Seung Y. Rhee¹, Seung Y. Rhee⁶, Rolf Apweiler¹, Daniel Barrell¹, Camon E¹, E. Dimmer¹, Lee¹, Rex L. Chisholm, Pascale Gaudet¹, Pascale Gaudet⁷, Warren A. Kibbe⁷, Warren A. Kibbe¹, Ranjana Kishore¹, Ranjana Kishore⁸, Erich M. Schwarz¹, Erich M. Schwarz⁸, Paul W. Sternberg⁸, Paul W. Sternberg¹, M. Gwinn¹, Hannick L¹, Wortman J¹, Matthew Berriman⁹, Matthew Berriman¹, Wood⁹, Wood¹, de la Cruz N¹, de la Cruz N¹⁰, Peter J. Tonellato¹⁰, Peter J. Tonellato¹, Pankaj Jaiswal¹, Pankaj Jaiswal¹¹, Seigfried T¹², Seigfried T¹, White R¹³, White R¹ - Show less +93 more•Institutions (13)

Wellcome Trust¹, University of Cambridge², University of California, Berkeley³, Stanford University⁴, Princeton University⁵, Carnegie Institution for Science⁶, Northwestern University⁷, California Institute of Technology⁸, Wellcome Trust Sanger Institute⁹, Medical College of Wisconsin¹⁰, Cornell University¹¹, Iowa State University¹², Incyte¹³

01 Jan 2004-Nucleic Acids Research

TL;DR: The Gene Ontology (GO) project as discussed by the authors provides structured, controlled vocabularies and classifications that cover several domains of molecular and cellular biology and are freely available for community use in the annotation of genes, gene products and sequences.

...read moreread less

Abstract: The Gene Ontology (GO) project (http://www.geneontology.org/) provides structured, controlled vocabularies and classifications that cover several domains of molecular and cellular biology and are freely available for community use in the annotation of genes, gene products and sequences. Many model organism databases and genome annotation groups use the GO and contribute their annotation sets to the GO resource. The GO database integrates the vocabularies and contributed annotations and provides full access to this information in several formats. Members of the GO Consortium continually work collectively, involving outside experts as needed, to expand and update the GO vocabularies. The GO Web resource also provides access to extensive documentation about the GO project and links to applications that use GO data for functional analyses.

...read moreread less

3,565 citations

An integrated encyclopedia of DNA elements in the human genome

[...]

Ian Dunham, Anshul Kundaje, Shelley Force Aldred, Patrick J. Collins +439 more

01 Sep 2012

TL;DR: The Encyclopedia of DNA Elements project provides new insights into the organization and regulation of the authors' genes and genome, and is an expansive resource of functional annotations for biomedical research.

...read moreread less

2,767 citations

The Universal Protein Resource (UniProt) in 2010

[...]

Rolf Apweiler, Maria Jesus Martin, Claire O'Donovan, Michele Magrane, Yasmin Alam-Faruque, Ricardo Antunes, Daniel Barrell, Benoit Bely, M Bingley, David Binns, Lynette Bower, Paul Browne, WM Chan, E. Dimmer, Ruth Y. Eberhardt, A. Fedotov, Rebecca E. Foulger, John S. Garavelli, Rachael P. Huntley, Julius O.B. Jacobsen, M. Kleen, Kati Laiho, Rasko Leinonen, Duncan Legge, Quan Lin, W Liu, Jie Luo, Sandra Orchard, Samuel Patient, Diego Poggioli, Manuela Pruess, Matthew Corbett, G di Martino, M Donnelly, P van Rensburg, Amos Marc Bairoch, Lydie Bougueleret, Ioannis Xenarios, S Altairac, Andrea H. Auchincloss, Ghislaine Argoud-Puy, Kristian B. Axelsen, Delphine Baratin, M. C. Blatter, Brigitte Boeckmann, Jerven Bolleman, L. Bollondi, Emmanuel Boutet, SB Quintaje, Lionel Breuza, Alan Bridge, E. Decastro, L Ciapina, D Coral, Elisabeth Coudert, Isabelle Cusin, G Delbard, M Doche, Dolnide Dornevil, Paula Duek Roggli, Séverine Duvaud, Anne Estreicher, L Famiglietti, M Feuermann, Sebastien Gehant, N. Farriol-Mathis, Serenella Ferro, Elisabeth Gasteiger, Alain Gateau, Gerritsen, Arnaud Gos, Nadine Gruaz-Gumowski, Ursula Hinz, Chantal Hulo, Nicolas Hulo, J. James, S. Jimenez, Florence Jungo, T. Kappler, Guillaume Keller, Corinne Lachaize, L Lane-Guermonprez, Petra S. Langendijk-Genevaux, Lara, P Lemercier, Damien Lieberherr, Tdo Lima, Mangold, Xavier D. Martin, Patrick Masson, M. Moinat, Anne Morgat, Anaïs Mottaz, Salvo Paesano, Ivo Pedruzzi, Sandrine Pilbout, Pillet, Sylvain Poux, Monica Pozzato, Nicole Redaschi, Catherine Rivoire, Bernd Roechert, Maria Victoria Schneider, Christian J. A. Sigrist, K Sonesson, S Staehli, Eleanor J Stanley, Andre Stutz, Shyamala Sundaram, Michael Tognolli, Laure Verbregue, A-L Veuthey, L Yip, L Zuletta, Cathy H. Wu, Cecilia N. Arighi, Leslie Arminski, Winona C. Barker, Chuming Chen, Yingfei Chen, Z-Z Hu, Hongzhan Huang, Raja Mazumder, Peter B. McGarvey, Darren A. Natale, Jules Nchoutmboube, Natalia V. Petrova, N Subramanian, Baris E. Suzek, U. Ugochukwu, Sona Vasudevan, C. R. Vinayaka, LS Yeh, Jian Zhang - Show less +130 more

01 Jan 2010

961 citations

Journal Article•DOI•

The Gene Ontology Annotation (GOA) Database: sharing knowledge in Uniprot with Gene Ontology

[...]

Evelyn Camon¹, Michele Magrane¹, Daniel Barrell¹, Vivian Lee¹, Emily Dimmer¹, John Maslen¹, David Binns¹, Nicola Harte¹, Rodrigo Lopez¹, Rolf Apweiler¹ - Show less +6 more•Institutions (1)

European Bioinformatics Institute¹

01 Jan 2004-Nucleic Acids Research

TL;DR: The Gene Ontology Annotation database aims to provide high-quality electronic and manual annotations to the UniProt Knowledgebase (Swiss-Prot, TrEMBL and PIR-PSD) using the standardized vocabulary of theGene Ontology (GO).

...read moreread less

Abstract: The Gene Ontology Annotation (GOA) database (http://www.ebi.ac.uk/GOA) aims to provide high-quality electronic and manual annotations to the UniProt Knowledgebase (Swiss-Prot, TrEMBL and PIR-PSD) using the standardized vocabulary of the Gene Ontology (GO). As a supplementary archive of GO annotation, GOA promotes a high level of integration of the knowledge represented in UniProt with other databases. This is achieved by converting UniProt annotation into a recognized computational format. GOA provides annotated entries for nearly 60,000 species (GOA-SPTr) and is the largest and most comprehensive open-source contributor of annotations to the GO Consortium annotation effort. By integrating GO annotations from other model organism groups, GOA consolidates specialized knowledge and expertise to ensure the data remain a key reference for up-to-date biological information. Furthermore, the GOA database fully endorses the Human Proteomics Initiative by prioritizing the annotation of proteins likely to benefit human health and disease. In addition to a non-redundant set of annotations to the human proteome (GOA-Human) and monthly releases of its GO annotation for all species (GOA-SPTr), a series of GO mapping files and specific cross-references in other databases are also regularly distributed. GOA can be queried through a simple user-friendly web interface or downloaded in a parsable format via the EBI and GO FTP websites. The GOA data set can be used to enhance the annotation of particular model organism or gene expression data sets, although increasingly it has been used to evaluate GO predictions generated from text mining or protein interaction experiments. In 2004, the GOA team will build on its success and will continue to supplement the functional annotation of UniProt and work towards enhancing the ability of scientists to access all available biological information. Researchers wishing to query or contribute to the GOA project are encouraged to email: goa@ebi.ac.uk.

...read moreread less

917 citations

1
2
3
4
…
5
6
7

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

STAR: ultrafast universal RNA-seq aligner

[...]

Alexander Dobin¹, Carrie A. Davis¹, Felix Schlesinger¹, Jorg Drenkow¹, Chris Zaleski¹, Sonali Jha¹, Philippe Batut¹, Mark Chaisson¹, Thomas R. Gingeras¹ - Show less +5 more•Institutions (1)

Cold Spring Harbor Laboratory¹

01 Jan 2013-Bioinformatics

TL;DR: The Spliced Transcripts Alignment to a Reference (STAR) software based on a previously undescribed RNA-seq alignment algorithm that uses sequential maximum mappable seed search in uncompressed suffix arrays followed by seed clustering and stitching procedure outperforms other aligners by a factor of >50 in mapping speed.

...read moreread less

Abstract: Motivation Accurate alignment of high-throughput RNA-seq data is a challenging and yet unsolved problem because of the non-contiguous transcript structure, relatively short read lengths and constantly increasing throughput of the sequencing technologies. Currently available RNA-seq aligners suffer from high mapping error rates, low mapping speed, read length limitation and mapping biases. Results To align our large (>80 billon reads) ENCODE Transcriptome RNA-seq dataset, we developed the Spliced Transcripts Alignment to a Reference (STAR) software based on a previously undescribed RNA-seq alignment algorithm that uses sequential maximum mappable seed search in uncompressed suffix arrays followed by seed clustering and stitching procedure. STAR outperforms other aligners by a factor of >50 in mapping speed, aligning to the human genome 550 million 2 × 76 bp paired-end reads per hour on a modest 12-core server, while at the same time improving alignment sensitivity and precision. In addition to unbiased de novo detection of canonical junctions, STAR can discover non-canonical splices and chimeric (fusion) transcripts, and is also capable of mapping full-length RNA sequences. Using Roche 454 sequencing of reverse transcription polymerase chain reaction amplicons, we experimentally validated 1960 novel intergenic splice junctions with an 80-90% success rate, corroborating the high precision of the STAR mapping strategy. Availability and implementation STAR is implemented as a standalone C++ code. STAR is free open source software distributed under GPLv3 license and can be downloaded from http://code.google.com/p/rna-star/.

...read moreread less

30,684 citations

Journal Article•DOI•

An integrated encyclopedia of DNA elements in the human genome

[...]

Principal investigators¹, Nhgri groups², Data production leads³, Lead analysts³•Institutions (3)

Wellcome Trust¹, University of Washington², Pennsylvania State University³

06 Sep 2012-Nature

...read moreread less

Abstract: The human genome encodes the blueprint of life, but the function of the vast majority of its nearly three billion bases is unknown. The Encyclopedia of DNA Elements (ENCODE) project has systematically mapped regions of transcription, transcription factor association, chromatin structure and histone modification. These data enabled us to assign biochemical functions for 80% of the genome, in particular outside of the well-studied protein-coding regions. Many discovered candidate regulatory elements are physically associated with one another and with expressed genes, providing new insights into the mechanisms of gene regulation. The newly identified elements also show a statistical correspondence to sequence variants linked to human disease, and can thereby guide interpretation of this variation. Overall, the project provides new insights into the organization and regulation of our genes and genome, and is an expansive resource of functional annotations for biomedical research.

...read moreread less

13,548 citations

Journal Article•DOI•

Tissue-based map of the human proteome

[...]

Mathias Uhlén¹, Mathias Uhlén², Linn Fagerberg¹, Björn M. Hallström¹, Cecilia Lindskog³, Per Oksvold¹, Adil Mardinoglu⁴, Åsa Sivertsson¹, Caroline Kampf³, Evelina Sjöstedt³, Evelina Sjöstedt¹, Anna Asplund³, IngMarie Olsson³, Karolina Edlund, Emma Lundberg¹, Sanjay Navani, Cristina Al-Khalili Szigyarto¹, Jacob Odeberg¹, Dijana Djureinovic³, Jenny Ottosson Takanen¹, Sophia Hober¹, Tove Alm¹, Per-Henrik Edqvist³, Holger Berling¹, Hanna Tegel¹, Jan Mulder³, Johan Rockberg¹, Peter Nilsson¹, Jochen M. Schwenk¹, Marica Hamsten¹, Kalle von Feilitzen¹, Mattias Forsberg¹, Lukas Persson¹, Fredric Johansson¹, Martin Zwahlen¹, Gunnar von Heijne⁵, Jens Nielsen⁴, Jens Nielsen², Fredrik Pontén³ - Show less +35 more•Institutions (5)

Royal Institute of Technology¹, Technical University of Denmark², Science for Life Laboratory³, Chalmers University of Technology⁴, Stockholm University⁵

23 Jan 2015-Science

TL;DR: In this paper, a map of the human tissue proteome based on an integrated omics approach that involves quantitative transcriptomics at the tissue and organ level, combined with tissue microarray-based immunohistochemistry, to achieve spatial localization of proteins down to the single-cell level.

...read moreread less

Abstract: Resolving the molecular details of proteome variation in the different tissues and organs of the human body will greatly increase our knowledge of human biology and disease. Here, we present a map of the human tissue proteome based on an integrated omics approach that involves quantitative transcriptomics at the tissue and organ level, combined with tissue microarray-based immunohistochemistry, to achieve spatial localization of proteins down to the single-cell level. Our tissue-based analysis detected more than 90% of the putative protein-coding genes. We used this approach to explore the human secretome, the membrane proteome, the druggable proteome, the cancer proteome, and the metabolic functions in 32 different tissues and organs. All the data are integrated in an interactive Web-based database that allows exploration of individual proteins, as well as navigation of global expression patterns, in all major tissues and organs in the human body.

...read moreread less

9,745 citations

Journal Article•

An integrated encyclopedia of DNA elements in the human genome.

[...]

ENCODEConsortium

01 Jan 2012-Nature

...read moreread less

8,106 citations

Journal Article•DOI•

UniProt: the Universal Protein knowledgebase

[...]

Rolf Apweiler¹, Amos Marc Bairoch, Cathy H. Wu, Winona C. Barker, Brigitte Boeckmann, Serenella Ferro, Elisabeth Gasteiger, Hongzhan Huang, Rodrigo Lopez, Michele Magrane, Maria Jesus Martin, Darren A. Natale, Claire O'Donovan, Nicole Redaschi, Lai-Su L. Yeh - Show less +11 more•Institutions (1)

European Bioinformatics Institute¹

01 Jan 2004-Nucleic Acids Research

TL;DR: The Swiss-Prot, TrEMBL and PIR protein database activities have united to form the Universal Protein Knowledgebase (UniProt), which is to provide a comprehensive, fully classified, richly and accurately annotated protein sequence knowledgebase, with extensive cross-references and query interfaces.

...read moreread less

Abstract: To provide the scientific community with a single, centralized, authoritative resource for protein sequences and functional information, the Swiss-Prot, TrEMBL and PIR protein database activities have united to form the Universal Protein Knowledgebase (UniProt) consortium. Our mission is to provide a comprehensive, fully classified, richly and accurately annotated protein sequence knowledgebase, with extensive cross-references and query interfaces. The central database will have two sections, corresponding to the familiar Swiss-Prot (fully manually curated entries) and TrEMBL (enriched with automated classification, annotation and extensive cross-references). For convenient sequence searches, UniProt also provides several non-redundant sequence databases. The UniProt NREF (UniRef) databases provide representative subsets of the knowledgebase suitable for efficient searching. The comprehensive UniProt Archive (UniParc) is updated daily from many public source databases. The UniProt databases can be accessed online (http://www.uniprot.org) or downloaded in several formats (ftp://ftp.uniprot.org/pub). The scientific community is encouraged to submit data for inclusion in UniProt.

...read moreread less

7,298 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse