Home
/
Authors
/
Peter J. Tonellato

Author

Peter J. Tonellato

Other affiliations: Harvard University, Brigham and Women's Hospital, Marquette University ...read more

Bio: Peter J. Tonellato is an academic researcher from University of Missouri. The author has contributed to research in topics: Genome & Cloud computing. The author has an hindex of 31, co-authored 85 publications receiving 10147 citations. Previous affiliations of Peter J. Tonellato include Harvard University & Brigham and Women's Hospital.

Topics: Genome, Cloud computing, Comparative genomics, Gene, Human genome ...read more

Papers published on a yearly basis

2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2008
2006
2005
2004
2003
2002
2001
2000
1999
1998
1996
1992
1991
1989
1984

Papers

PDF

Open Access

More filters

Journal Article•DOI•

The Gene Ontology (GO) database and informatics resource.

[...]

Midori A. Harris, Jennifer I. Clark¹, Ireland A¹, Jane Lomax¹, Michael Ashburner², Michael Ashburner¹, R. Foulger¹, R. Foulger², Karen Eilbeck¹, Karen Eilbeck³, Suzanna E. Lewis³, Suzanna E. Lewis¹, B. Marshall³, B. Marshall¹, Christopher J. Mungall³, Christopher J. Mungall¹, J. Richter³, J. Richter¹, Gerald M. Rubin¹, Gerald M. Rubin³, Judith A. Blake¹, Carol J. Bult¹, Dolan M¹, Drabkin H¹, Janan T. Eppig¹, Hill Dp¹, L. Ni¹, Ringwald M¹, Rama Balakrishnan⁴, Rama Balakrishnan¹, J. M. Cherry⁴, J. M. Cherry¹, Karen R. Christie⁴, Karen R. Christie¹, Maria C. Costanzo⁴, Maria C. Costanzo¹, Selina S. Dwight¹, Selina S. Dwight⁴, Stacia R. Engel¹, Stacia R. Engel⁴, Dianna G. Fisk¹, Dianna G. Fisk⁴, Jodi E. Hirschman¹, Jodi E. Hirschman⁴, Eurie L. Hong¹, Eurie L. Hong⁴, Robert S. Nash⁴, Robert S. Nash¹, Anand Sethuraman¹, Anand Sethuraman⁴, Chandra L. Theesfeld⁴, Chandra L. Theesfeld¹, David Botstein¹, David Botstein⁵, Kara Dolinski⁵, Kara Dolinski¹, Becket Feierbach¹, Becket Feierbach⁵, Tanya Z. Berardini⁶, Tanya Z. Berardini¹, S. Mundodi⁶, S. Mundodi¹, Seung Y. Rhee¹, Seung Y. Rhee⁶, Rolf Apweiler¹, Daniel Barrell¹, Camon E¹, E. Dimmer¹, Lee¹, Rex L. Chisholm, Pascale Gaudet⁷, Pascale Gaudet¹, Warren A. Kibbe¹, Warren A. Kibbe⁷, Ranjana Kishore¹, Ranjana Kishore⁸, Erich M. Schwarz⁸, Erich M. Schwarz¹, Paul W. Sternberg⁸, Paul W. Sternberg¹, M. Gwinn¹, Hannick L¹, Wortman J¹, Matthew Berriman¹, Matthew Berriman⁹, Wood¹, Wood⁹, de la Cruz N¹, de la Cruz N¹⁰, Peter J. Tonellato¹⁰, Peter J. Tonellato¹, Pankaj Jaiswal¹¹, Pankaj Jaiswal¹, Seigfried T¹, Seigfried T¹², White R¹³, White R¹ - Show less +93 more•Institutions (13)

Wellcome Trust¹, University of Cambridge², University of California, Berkeley³, Stanford University⁴, Princeton University⁵, Carnegie Institution for Science⁶, Northwestern University⁷, California Institute of Technology⁸, Wellcome Trust Sanger Institute⁹, Medical College of Wisconsin¹⁰, Cornell University¹¹, Iowa State University¹², Incyte¹³

01 Jan 2004-Nucleic Acids Research

TL;DR: The Gene Ontology (GO) project as discussed by the authors provides structured, controlled vocabularies and classifications that cover several domains of molecular and cellular biology and are freely available for community use in the annotation of genes, gene products and sequences.

...read moreread less

Abstract: The Gene Ontology (GO) project (http://www.geneontology.org/) provides structured, controlled vocabularies and classifications that cover several domains of molecular and cellular biology and are freely available for community use in the annotation of genes, gene products and sequences. Many model organism databases and genome annotation groups use the GO and contribute their annotation sets to the GO resource. The GO database integrates the vocabularies and contributed annotations and provides full access to this information in several formats. Members of the GO Consortium continually work collectively, involving outside experts as needed, to expand and update the GO vocabularies. The GO Web resource also provides access to extensive documentation about the GO project and links to applications that use GO data for functional analyses.

...read moreread less

3,565 citations

Journal Article•DOI•

Genome sequence of the Brown Norway rat yields insights into mammalian evolution

[...]

Richard A. Gibbs¹, George M. Weinstock¹, Michael L. Metzker¹, Donna M. Muzny¹ +239 more•Institutions (35)

01 Apr 2004-Nature

TL;DR: This first comprehensive analysis of the genome sequence of the Brown Norway (BN) rat strain is reported, which is the third complete mammalian genome to be deciphered, and three-way comparisons with the human and mouse genomes resolve details of mammalian evolution.

...read moreread less

Abstract: The laboratory rat (Rattus norvegicus) is an indispensable tool in experimental medicine and drug development, having made inestimable contributions to human health. We report here the genome sequence of the Brown Norway (BN) rat strain. The sequence represents a high-quality 'draft' covering over 90% of the genome. The BN rat sequence is the third complete mammalian genome to be deciphered, and three-way comparisons with the human and mouse genomes resolve details of mammalian evolution. This first comprehensive analysis includes genes and proteins and their relation to human disease, repeated sequences, comparative genome-wide studies of mammalian orthologous chromosomal regions and rearrangement breakpoints, reconstruction of ancestral karyotypes and the events leading to existing species, rates of variation, and lineage-specific and lineage-independent evolutionary events such as expansion of gene families, orthology relations and protein evolution.

...read moreread less

1,964 citations

Genome sequence of the Brown Norway rat yields insights into mammalian evolutionRat Genome Sequencing Project ConsortiumNature200442849352115057822

[...]

Richard A. Gibbs, George M. Weinstock, Michael L. Metzker, Donna M. Muzny +223 more

01 Jan 2004

Abstract: The laboratory rat (Rattus norvegicus) is an indispensable tool in experimental medicine and drug development, having made inestimable contributions to human health. We report here the genome sequence of the Brown Norway (BN) rat strain. The sequence represents a high-quality ‘draft’ covering over 90% of the genome. The BN rat sequence is the third complete mammalian genome to be deciphered, and three-way comparisons with the human and mouse genomes resolve details of mammalian evolution. This first comprehensive analysis includes genes and proteins and their relation to human disease, repeated sequences, comparative genome-wide studies of mammalian orthologous chromosomal regions and rearrangement breakpoints, reconstruction of ancestral karyotypes and the events leading to existing species, rates of variation, and lineage-specific and lineage-independent evolutionary events such as expansion of gene families, orthology relations and protein evolution.

...read moreread less

1,854 citations

Gene Ontology Consortium. The Gene Ontology (GO) database and informatics resource

[...]

Harris Ma, Jennifer I. Clark, Amelia Ireland, Jane Lomax, Michael Ashburner, R. Foulger, Karen Eilbeck, Suzanna E. Lewis, B. Marshall, Christopher J. Mungall, J. Richter, Gerald M. Rubin, Blake Ja, Carol J. Bult, Mary E. Dolan, H. Drabkin, Janan T. Eppig, David P. Hill, L. Ni, Martin Ringwald, Rama Balakrishnan, J. M. Cherry, Karen R. Christie, Maria C. Costanzo, Selina S. Dwight, Stacia R. Engel, Dianna G. Fisk, Jodi E. Hirschman, Eurie L. Hong, Robert S. Nash, Anand Sethuraman, Chandra L. Theesfeld, David Botstein, Kara Dolinski, Becket Feierbach, Tanya Z. Berardini, S. Mundodi, Seung Y. Rhee, Rolf Apweiler, Daniel Barrell, Evelyn Camon, E. Dimmer, V. Lee, Rex L. Chisholm, Pascale Gaudet, Warren A. Kibbe, Ranjana Kishore, Erich M. Schwarz, Paul W. Sternberg, M. Gwinn, Linda Hannick, Jennifer R. Wortman, Matthew Berriman, Valerie Wood, N. de sur la Cruz, Peter J. Tonellato, Pankaj Jaiswal, Trent E. Seigfried, Ra White - Show less +55 more

01 Jan 2004

TL;DR: The Gene Ontology (GO) project provides structured, controlled vocabularies and classifications that cover several domains of molecular and cellular biology and are freely available for community use in the annotation of genes, gene products and sequences.

...read moreread less

Abstract: The Gene Ontology (GO) project (http://www. geneontology.org/) provides structured, controlled vocabularies and classifications that cover several domains of molecular and cellular biology and are freely available for community use in the annotation of genes, gene products and sequences. Many model organism databases and genome annotation groups use the GO and contribute their annotation sets to the GO resource. The GO database integrates the vocabularies and contributed annotations and provides full access to this information in several formats. Members of the GO Consortium continually work collectively, involving outside experts as needed, to expand and update the GO vocabularies. The GO Web resource also provides access to extensive documentation about the GO project and links to applications that use GO data for functional analyses.

...read moreread less

559 citations

Journal Article•DOI•

Integrative Annotation of 21,037 Human Genes Validated by Full-Length cDNA Clones

[...]

Tadashi Imanishi¹, Takeshi Itoh¹, Yutaka Suzuki², Claire O'Donovan³ +164 more•Institutions (42)

20 Apr 2004-PLOS Biology

TL;DR: The H-InvDB as discussed by the authors is a database of 41,118 full-length cDNAs that capture the gene transcripts as complete functional cassettes, providing an unequivocal report of structural and functional diversity at the gene level.

...read moreread less

Abstract: The human genome sequence defines our inherent biological potential; the realization of the biology encoded therein requires knowledge of the function of each gene. Currently, our knowledge in this area is still limited. Several lines of investigation have been used to elucidate the structure and function of the genes in the human genome. Even so, gene prediction remains a difficult task, as the varieties of transcripts of a gene may vary to a great extent. We thus performed an exhaustive integrative characterization of 41,118 full-length cDNAs that capture the gene transcripts as complete functional cassettes, providing an unequivocal report of structural and functional diversity at the gene level. Our international collaboration has validated 21,037 human gene candidates by analysis of high-quality full-length cDNA clones through curation using unified criteria. This led to the identification of 5,155 new gene candidates. It also manifested the most reliable way to control the quality of the cDNA clones. We have developed a human gene database, called the H-Invitational Database (H-InvDB; http://www.h-invitational.jp/). It provides the following: integrative annotation of human genes, description of gene structures, details of novel alternative splicing isoforms, non-protein-coding RNAs, functional domains, subcellular localizations, metabolic pathways, predictions of protein three-dimensional structure, mapping of known single nucleotide polymorphisms (SNPs), identification of polymorphic microsatellite repeats within human genes, and comparative results with mouse full-length cDNAs. The H-InvDB analysis has shown that up to 4% of the human genome sequence (National Center for Biotechnology Information build 34 assembly) may contain misassembled or missing regions. We found that 6.5% of the human gene candidates (1,377 loci) did not have a good protein-coding open reading frame, of which 296 loci are strong candidates for non-protein-coding RNA genes. In addition, among 72,027 uniquely mapped SNPs and insertions/deletions localized within human genes, 13,215 nonsynonymous SNPs, 315 nonsense SNPs, and 452 indels occurred in coding regions. Together with 25 polymorphic microsatellite repeats present in coding regions, they may alter protein structure, causing phenotypic effects or resulting in disease. The H-InvDB platform represents a substantial contribution to resources needed for the exploration of human biology and pathology.

...read moreread less

341 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Conserved seed pairing, often flanked by adenosines, indicates that thousands of human genes are microRNA targets

[...]

Benjamin P. Lewis¹, Christopher B. Burge¹, David P. Bartel¹•Institutions (1)

Massachusetts Institute of Technology¹

14 Jan 2005-Cell

TL;DR: In a four-genome analysis of 3' UTRs, approximately 13,000 regulatory relationships were detected above the estimate of false-positive predictions, thereby implicating as miRNA targets more than 5300 human genes, which represented 30% of the gene set.

...read moreread less

11,624 citations

Journal Article•DOI•

Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project

[...]

Ewan Birney, John A. Stamatoyannopoulos¹, Anindya Dutta², Roderic Guigó³ +317 more•Institutions (44)

14 Jun 2007-Nature

TL;DR: Functional data from multiple, diverse experiments performed on a targeted 1% of the human genome as part of the pilot phase of the ENCODE Project are reported, providing convincing evidence that the genome is pervasively transcribed, such that the majority of its bases can be found in primary transcripts.

...read moreread less

Abstract: We report the generation and analysis of functional data from multiple, diverse experiments performed on a targeted 1% of the human genome as part of the pilot phase of the ENCODE Project. These data have been further integrated and augmented by a number of evolutionary and computational analyses. Together, our results advance the collective knowledge about human genome function in several major areas. First, our studies provide convincing evidence that the genome is pervasively transcribed, such that the majority of its bases can be found in primary transcripts, including non-protein-coding transcripts, and those that extensively overlap one another. Second, systematic examination of transcriptional regulation has yielded new understanding about transcription start sites, including their relationship to specific regulatory sequences and features of chromatin accessibility and histone modification. Third, a more sophisticated view of chromatin structure has emerged, including its inter-relationship with DNA replication and transcriptional regulation. Finally, integration of these new sources of information, in particular with respect to mammalian evolution based on inter- and intra-species sequence comparisons, has yielded new mechanistic and evolutionary insights concerning the functional landscape of the human genome. Together, these studies are defining a path for pursuit of a more comprehensive characterization of human genome function.

...read moreread less

5,091 citations

Journal Article•DOI•

BioGRID: a general repository for interaction datasets

[...]

Chris Stark¹, Bobby-Joe Breitkreutz, Teresa Reguly, Lorrie Boucher, Ashton Breitkreutz, Mike Tyers - Show less +2 more•Institutions (1)

Ontario Institute for Cancer Research¹

01 Jan 2006-Nucleic Acids Research

TL;DR: BioGRID is a freely accessible database of physical and genetic interactions that includes >116 000 interactions from Saccharomyces cerevisiae, Caenorhabditis elegans, Drosophila melanogaster and Homo sapiens.

...read moreread less

Abstract: Access to unified datasets of protein and genetic interactions is critical for interrogation of gene/protein function and analysis of global network properties. BioGRID is a freely accessible database of physical and genetic interactions available at http://www.thebiogrid.org. BioGRID release version 2.0 includes >116 000 interactions from Saccharomyces cerevisiae, Caenorhabditis elegans, Drosophila melanogaster and Homo sapiens. Over 30 000 interactions have recently been added from 5778 sources through exhaustive curation of the Saccharomyces cerevisiae primary literature. An internally hyper-linked web interface allows for rapid search and retrieval of interaction data. Full or user-defined datasets are freely downloadable as tab-delimited text files and PSI-MI XML. Pre-computed graphical layouts of interactions are available in a variety of file formats. User-customized graphs with embedded protein, gene and interaction attributes can be constructed with a visualization system called Osprey that is dynamically linked to the BioGRID.

...read moreread less

3,794 citations

Journal Article•DOI•

Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes

[...]

Adam Siepel¹, Gill Bejerano, Jakob Skou Pedersen², Angie S. Hinrichs, Minmei Hou, Kate R. Rosenbloom, Hiram Clawson, John Spieth, LaDeana W. Hillier, Stephen Richards, George M. Weinstock, Richard K. Wilson, Richard A. Gibbs, W. James Kent, Webb Miller, David Haussler - Show less +12 more•Institutions (2)

University of California, Santa Cruz¹, Aarhus University²

01 Aug 2005-Genome Research

TL;DR: A comprehensive search for conserved elements in vertebrate genomes is conducted, using genome-wide multiple alignments of five vertebrate species (human, mouse, rat, chicken, and Fugu rubripes), using a two-state phylogenetic hidden Markov model (phylo-HMM).

...read moreread less

Abstract: We have conducted a comprehensive search for conserved elements in vertebrate genomes, using genome-wide multiple alignments of five vertebrate species (human, mouse, rat, chicken, and Fugu rubripes). Parallel searches have been performed with multiple alignments of four insect species (three species of Drosophila and Anopheles gambiae), two species of Caenorhabditis, and seven species of Saccharomyces. Conserved elements were identified with a computer program called phastCons, which is based on a two-state phylogenetic hidden Markov model (phylo-HMM). PhastCons works by fitting a phylo-HMM to the data by maximum likelihood, subject to constraints designed to calibrate the model across species groups, and then predicting conserved elements based on this model. The predicted elements cover roughly 3%-8% of the human genome (depending on the details of the calibration procedure) and substantially higher fractions of the more compact Drosophila melanogaster (37%-53%), Caenorhabditis elegans (18%-37%), and Saccharaomyces cerevisiae (47%-68%) genomes. From yeasts to vertebrates, in order of increasing genome size and general biological complexity, increasing fractions of conserved bases are found to lie outside of the exons of known protein-coding genes. In all groups, the most highly conserved elements (HCEs), by log-odds score, are hundreds or thousands of bases long. These elements share certain properties with ultraconserved elements, but they tend to be longer and less perfectly conserved, and they overlap genes of somewhat different functional categories. In vertebrates, HCEs are associated with the 3' UTRs of regulatory genes, stable gene deserts, and megabase-sized regions rich in moderately conserved noncoding sequences. Noncoding HCEs also show strong statistical evidence of an enrichment for RNA secondary structure.

...read moreread less

3,719 citations

Journal Article•DOI•

The impact of microRNAs on protein output

[...]

Daehyun Baek¹, Judit Villén², Chanseok Shin¹, Fernando D. Camargo¹, Steven P. Gygi², David P. Bartel¹ - Show less +2 more•Institutions (2)

Massachusetts Institute of Technology¹, Harvard University²

04 Sep 2008-Nature

TL;DR: The impact of micro RNAs on the proteome indicated that for most interactions microRNAs act as rheostats to make fine-scale adjustments to protein output.

...read moreread less

Abstract: MicroRNAs are endogenous ∼23-nucleotide RNAs that can pair to sites in the messenger RNAs of protein-coding genes to downregulate the expression from these messages. MicroRNAs are known to influence the evolution and stability of many mRNAs, but their global impact on protein output had not been examined. Here we use quantitative mass spectrometry to measure the response of thousands of proteins after introducing microRNAs into cultured cells and after deleting mir-223 in mouse neutrophils. The identities of the responsive proteins indicate that targeting is primarily through seed-matched sites located within favourable predicted contexts in 3′ untranslated regions. Hundreds of genes were directly repressed, albeit each to a modest degree, by individual microRNAs. Although some targets were repressed without detectable changes in mRNA levels, those translationally repressed by more than a third also displayed detectable mRNA destabilization, and, for the more highly repressed targets, mRNA destabilization usually comprised the major component of repression. The impact of microRNAs on the proteome indicated that for most interactions microRNAs act as rheostats to make fine-scale adjustments to protein output. MicroRNAs can regulate gene expression by either inhibiting translation of a messenger RNA, or inducing its degradation. While previous studies have measured regulation at the mRNA level, it was unknown how much regulation occurred at the protein level. Now two groups led by David Bartel and Nikolaus Rajewsky have used variants of the technique known as SILAC (stable isotope labelling with amino acids in cell culture) to measure proteome-wide changes in protein level as a function of expression of endogenous and exogenous microRNAs. They find that while microRNAs can directly repress the translation of hundreds of genes, additional indirect effects result in changes in expression of thousands of genes. Many of the changes observed are less than twofold in magnitude, however, indicating either directly or indirectly, microRNAs can act as rheostats to fine-tune protein synthesis to match the needs of the cell at any given time. In one of two studies, a technique known as SILAC is used to measure, on a large scale, changes in protein level as a function of expression of endogenous and exogenous miRNAs. It is found that although miRNAs directly repress the translation of hundreds of genes, additional indirect effects result in changes in expression of thousands of genes.

...read moreread less

3,562 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse