Home
/
Authors
/
Manolis Kellis

Author

Manolis Kellis

Other affiliations: Broad Institute, Epigenomics AG, Harvard University ...read more

Bio: Manolis Kellis is an academic researcher from Massachusetts Institute of Technology. The author has contributed to research in topics: Genome & Gene. The author has an hindex of 128, co-authored 405 publications receiving 112181 citations. Previous affiliations of Manolis Kellis include Broad Institute & Epigenomics AG.

Topics: Genome, Gene, Chromatin, Genomics, Genome-wide association study ...read more

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2005
2004
2003

Papers

PDF

Open Access

More filters

Genome-wide probing of RNA structure reveals active unfolding of mRNA structures in vivo

[...]

Silvi Rouskin¹, Meghan Zubradt¹, Stefan Washietl², Manolis Kellis², Jonathan S. Weissman¹ - Show less +1 more•Institutions (2)

University of California, San Francisco¹, Massachusetts Institute of Technology²

01 Dec 2013

TL;DR: These studies broadly enable the functional analysis of physiological RNA structures and reveal that, in contrast to the Anfinsen view of protein folding whereby the structure formed is the most thermodynamically favourable, thermodynamics have an incomplete role in determining mRNA structure in vivo.

...read moreread less

Abstract: RNA has a dual role as an informational molecule and a direct effector of biological tasks. The latter function is enabled by RNA's ability to adopt complex secondary and tertiary folds and thus has motivated extensive computational and experimental efforts for determining RNA structures. Existing approaches for evaluating RNA structure have been largely limited to in vitro systems, yet the thermodynamic forces which drive RNA folding in vitro may not be sufficient to predict stable RNA structures in vivo. Indeed, the presence of RNA-binding proteins and ATP-dependent helicases can influence which structures are present inside cells. Here we present an approach for globally monitoring RNA structure in native conditions in vivo with single-nucleotide precision. This method is based on in vivo modification with dimethyl sulphate (DMS), which reacts with unpaired adenine and cytosine residues, followed by deep sequencing to monitor modifications. Our data from yeast and mammalian cells are in excellent agreement with known messenger RNA structures and with the high-resolution crystal structure of the Saccharomyces cerevisiae ribosome. Comparison between in vivo and in vitro data reveals that in rapidly dividing cells there are vastly fewer structured mRNA regions in vivo than in vitro. Even thermostable RNA structures are often denatured in cells, highlighting the importance of cellular processes in regulating RNA structure. Indeed, analysis of mRNA structure under ATP-depleted conditions in yeast shows that energy-dependent processes strongly contribute to the predominantly unfolded state of mRNAs inside cells. Our studies broadly enable the functional analysis of physiological RNA structures and reveal that, in contrast to the Anfinsen view of protein folding whereby the structure formed is the most thermodynamically favourable, thermodynamics have an incomplete role in determining mRNA structure in vivo.

...read moreread less

647 citations

Journal Article•DOI•

Multiple knockout mouse models reveal lincRNAs are required for life and brain development

[...]

Martin Sauvageau¹, Martin Sauvageau², Loyal A. Goff², Loyal A. Goff¹, Loyal A. Goff³, Simona Lodato¹, Simona Lodato², Boyan B. Bonev², Boyan B. Bonev¹, Abigail F. Groff², Abigail F. Groff¹, Chiara Gerhardinger¹, Chiara Gerhardinger², Diana B. Sanchez-Gomez², Ezgi Hacisuleyman¹, Ezgi Hacisuleyman², Eric Li², Matthew Spence², Stephen C. Liapis², Stephen C. Liapis¹, William Mallard², William Mallard¹, Michael A. Morse¹, Michael A. Morse², Mavis R. Swerdel⁴, Michael F D’Ecclessis⁴, Jennifer C. Moore⁴, Venus Lai⁵, Guochun Gong⁵, George D. Yancopoulos⁵, David Frendewey⁵, Manolis Kellis³, Manolis Kellis¹, Ronald P. Hart⁴, David M. Valenzuela⁵, Paola Arlotta¹, Paola Arlotta², John L. Rinn¹, John L. Rinn⁶, John L. Rinn² - Show less +36 more•Institutions (6)

Broad Institute¹, Harvard University², Massachusetts Institute of Technology³, Rutgers University⁴, Regeneron⁵, Beth Israel Deaconess Medical Center⁶

31 Dec 2013-eLife

TL;DR: It is demonstrated that lncRNAs play critical roles in vivo and provides a framework and impetus for future larger-scale functional investigation into the roles of lncRNA molecules.

...read moreread less

Abstract: The mammalian genome is comprised of DNA sequences that contain the templates for proteins, and other DNA sequences that do not code for proteins. The coding DNA sequences are transcribed to make messenger RNA molecules, which are then translated to make proteins. Researchers have known for many years that some of the noncoding DNA sequences are also transcribed to make other types of RNA molecules, such as transfer and ribosomal RNA. However, the true breadth and diversity of the roles played by these other RNA molecules have only recently begun to be fully appreciated. Mammalian genomes contain thousands of noncoding DNA sequences that are transcribed. Recent in vitro studies suggest that the resulting long noncoding RNA molecules can act as regulators of transcription, translation, and cell cycle. In vitro studies also suggest that these long noncoding RNA molecules may play a role in mammalian development and disease. Yet few in vivo studies have been performed to support or confirm such hypotheses. Now Sauvageau et al. have developed several lines of knockout mice to investigate a subset of noncoding RNA molecules known as long intergenic noncoding RNAs (lincRNAs). These experiments reveal that lincRNAs have a strong influence on the overall viability of mice, and also on a number of developmental processes, including the development of lungs and the cerebral cortex. Given that the vast majority of the human genome is transcribed, the mouse models developed by Sauvageau et al. represent an important step in determining the physiological relevance, on a genetic level, of the noncoding portion of the genome in vivo.

...read moreread less

645 citations

Journal Article•DOI•

Discovery of functional elements in 12 Drosophila genomes using evolutionary signatures

[...]

Alexander Stark¹, Michael F. Lin¹, Pouya Kheradpour¹, Jakob Skou Pedersen², Jakob Skou Pedersen³, Leopold Parts⁴, Leopold Parts⁵, Joseph W. Carlson⁶, Madeline A. Crosby⁷, Matthew D. Rasmussen¹, Sushmita Roy⁸, Ameya N. Deoras¹, J. Graham Ruby¹, Julius Brennecke⁹, Emily Hodges⁹, Angie S. Hinrichs³, Anat Caspi¹⁰, Benedict Paten³, Benedict Paten¹¹, Benedict Paten⁵, Seung-Won Park¹², Mira V. Han¹³, Morgan L. Maeder¹⁴, Benjamin J. Polansky¹⁴, Bryanne E. Robson¹⁴, Stein Aerts¹⁵, Jacques van Helden¹⁶, Bassem A. Hassan¹⁵, Donald G. Gilbert¹³, Deborah Eastman¹⁴, Michael D. Rice¹⁷, Michael P. Weir¹⁷, Matthew W. Hahn¹³, Yongkyu Park¹², Colin N. Dewey¹⁸, Lior Pachter¹⁹, W. James Kent³, David Haussler³, Eric C. Lai²⁰, David P. Bartel¹, Gregory J. Hannon⁹, Thomas C. Kaufman¹³, Michael B. Eisen¹⁹, Michael B. Eisen⁶, Andrew G. Clark²¹, Douglas Smith, Susan E. Celniker⁶, William M. Gelbart⁷, Manolis Kellis¹ - Show less +45 more•Institutions (21)

Massachusetts Institute of Technology¹, University of Copenhagen², University of California, Santa Cruz³, University of Tartu⁴, Wellcome Trust Sanger Institute⁵, Lawrence Berkeley National Laboratory⁶, Harvard University⁷, University of New Mexico⁸, Watson School of Biological Sciences⁹, University of California, San Francisco¹⁰, European Bioinformatics Institute¹¹, Rutgers University¹², Indiana University¹³, Connecticut College¹⁴, Katholieke Universiteit Leuven¹⁵, Université libre de Bruxelles¹⁶, Wesleyan University¹⁷, University of Wisconsin-Madison¹⁸, University of California, Berkeley¹⁹, Memorial Sloan Kettering Cancer Center²⁰, Cornell University²¹

08 Nov 2007-Nature

TL;DR: This work uses the genomes of 12 Drosophila species for the de novo discovery of functional elements in the fly, and identifies several classes of pre- and post-transcriptional regulatory motifs, and predicts individual motif instances with high confidence.

...read moreread less

Abstract: Sequencing of multiple related species followed by comparative genomics analysis constitutes a powerful approach for the systematic understanding of any genome. Here, we use the genomes of 12 Drosophila species for the de novo discovery of functional elements in the fly. Each type of functional element shows characteristic patterns of change, or 'evolutionary signatures', dictated by its precise selective constraints. Such signatures enable recognition of new protein-coding genes and exons, spurious and incorrect gene annotations, and numerous unusual gene structures, including abundant stop-codon readthrough. Similarly, we predict non-protein-coding RNA genes and structures, and new microRNA (miRNA) genes. We provide evidence of miRNA processing and functionality from both hairpin arms and both DNA strands. We identify several classes of pre- and post-transcriptional regulatory motifs, and predict individual motif instances with high confidence. We also study how discovery power scales with the divergence and number of species compared, and we provide general guidelines for comparative studies.

...read moreread less

636 citations

Journal Article•DOI•

Distinguishing protein-coding and noncoding genes in the human genome

[...]

Michele Clamp¹, Ben Fry, Mike Kamal, Xiaohui Xie, James Cuff, Michael F. Lin, Manolis Kellis, Kerstin Lindblad-Toh, Eric S. Lander - Show less +5 more•Institutions (1)

Massachusetts Institute of Technology¹

04 Dec 2007-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: It is shown that the vast majority of nonconserved ORFs present by chance in RNA transcripts are random occurrences, and the results indicate that there has been relatively little true innovation in mammalian protein-coding genes.

...read moreread less

Abstract: Although the Human Genome Project was completed 4 years ago, the catalog of human protein-coding genes remains a matter of controversy. Current catalogs list a total of ≈24,500 putative protein-coding genes. It is broadly suspected that a large fraction of these entries are functionally meaningless ORFs present by chance in RNA transcripts, because they show no evidence of evolutionary conservation with mouse or dog. However, there is currently no scientific justification for excluding ORFs simply because they fail to show evolutionary conservation: the alternative hypothesis is that most of these ORFs are actually valid human genes that reflect gene innovation in the primate lineage or gene loss in the other lineages. Here, we reject this hypothesis by carefully analyzing the nonconserved ORFs—specifically, their properties in other primates. We show that the vast majority of these ORFs are random occurrences. The analysis yields, as a by-product, a major revision of the current human catalogs, cutting the number of protein-coding genes to ≈20,500. Specifically, it suggests that nonconserved ORFs should be added to the human gene catalog only if there is clear evidence of an encoded protein. It also provides a principled methodology for evaluating future proposed additions to the human gene catalog. Finally, the results indicate that there has been relatively little true innovation in mammalian protein-coding genes.

...read moreread less

616 citations

Journal Article•DOI•

Evolution, biogenesis, expression, and target predictions of a substantially expanded set of Drosophila microRNAs

[...]

J. Graham Ruby¹, Alexander Stark, Wendy K. Johnston, Manolis Kellis, David P. Bartel, Eric C. Lai - Show less +2 more•Institutions (1)

Massachusetts Institute of Technology¹

01 Dec 2007-Genome Research

TL;DR: Predicted targets for the expanded set of microRNAs substantially increased and revised the miRNA-target relationships that appear conserved among the fly species and provided insights into their biogenesis and expression.

...read moreread less

Abstract: MicroRNA (miRNA) genes give rise to small regulatory RNAs in a wide variety of organisms We used computational methods to predict miRNAs conserved among Drosophila species and large-scale sequencing of small RNAs from Drosophila melanogaster to experimentally confirm and complement these predictions In addition to validating 20 of our top 45 predictions for novel miRNA loci, the large-scale sequencing identified many miRNAs that had not been predicted In total, 59 novel genes were identified, increasing our tally of confirmed fly miRNAs to 148 The large-scale sequencing also refined the identities of previously known miRNAs and provided insights into their biogenesis and expression Many miRNAs were expressed in particular developmental contexts, with a large cohort of miRNAs expressed primarily in imaginal discs Conserved miRNAs typically were expressed more broadly and robustly than were nonconserved miRNAs, and those conserved miRNAs with more restricted expression tended to have fewer predicted targets than those expressed more broadly Predicted targets for the expanded set of microRNAs substantially increased and revised the miRNA-target relationships that appear conserved among the fly species Insights were also provided into miRNA gene evolution, including evidence for emergent regulatory function deriving from the opposite arm of the miRNA hairpin, exemplified by mir-10, and even the opposite strand of the DNA, exemplified by mir-iab-4

...read moreread less

599 citations

1
2
3
4
5
6
7
8
…
9
10
11
12
13
14
15
…
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles

[...]

Aravind Subramanian¹, Pablo Tamayo¹, Vamsi K. Mootha², Sayan Mukherjee³, Benjamin L. Ebert², Michael A. Gillette², Amanda G. Paulovich⁴, Scott L. Pomeroy², Todd R. Golub², Eric S. Lander¹, Jill P. Mesirov¹ - Show less +7 more•Institutions (4)

Massachusetts Institute of Technology¹, Harvard University², Duke University³, Fred Hutchinson Cancer Research Center⁴

25 Oct 2005-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: The Gene Set Enrichment Analysis (GSEA) method as discussed by the authors focuses on gene sets, that is, groups of genes that share common biological function, chromosomal location, or regulation.

...read moreread less

Abstract: Although genomewide RNA expression analysis has become a routine tool in biomedical research, extracting biological insight from such information remains a major challenge. Here, we describe a powerful analytical method called Gene Set Enrichment Analysis (GSEA) for interpreting gene expression data. The method derives its power by focusing on gene sets, that is, groups of genes that share common biological function, chromosomal location, or regulation. We demonstrate how GSEA yields insights into several cancer-related data sets, including leukemia and lung cancer. Notably, where single-gene analysis finds little similarity between two independent studies of patient survival in lung cancer, GSEA reveals many biological pathways in common. The GSEA method is embodied in a freely available software package, together with an initial database of 1,325 biologically defined gene sets.

...read moreread less

34,830 citations

Journal Article•DOI•

STAR: ultrafast universal RNA-seq aligner

[...]

Alexander Dobin¹, Carrie A. Davis¹, Felix Schlesinger¹, Jorg Drenkow¹, Chris Zaleski¹, Sonali Jha¹, Philippe Batut¹, Mark Chaisson¹, Thomas R. Gingeras¹ - Show less +5 more•Institutions (1)

Cold Spring Harbor Laboratory¹

01 Jan 2013-Bioinformatics

TL;DR: The Spliced Transcripts Alignment to a Reference (STAR) software based on a previously undescribed RNA-seq alignment algorithm that uses sequential maximum mappable seed search in uncompressed suffix arrays followed by seed clustering and stitching procedure outperforms other aligners by a factor of >50 in mapping speed.

...read moreread less

Abstract: Motivation Accurate alignment of high-throughput RNA-seq data is a challenging and yet unsolved problem because of the non-contiguous transcript structure, relatively short read lengths and constantly increasing throughput of the sequencing technologies. Currently available RNA-seq aligners suffer from high mapping error rates, low mapping speed, read length limitation and mapping biases. Results To align our large (>80 billon reads) ENCODE Transcriptome RNA-seq dataset, we developed the Spliced Transcripts Alignment to a Reference (STAR) software based on a previously undescribed RNA-seq alignment algorithm that uses sequential maximum mappable seed search in uncompressed suffix arrays followed by seed clustering and stitching procedure. STAR outperforms other aligners by a factor of >50 in mapping speed, aligning to the human genome 550 million 2 × 76 bp paired-end reads per hour on a modest 12-core server, while at the same time improving alignment sensitivity and precision. In addition to unbiased de novo detection of canonical junctions, STAR can discover non-canonical splices and chimeric (fusion) transcripts, and is also capable of mapping full-length RNA sequences. Using Roche 454 sequencing of reverse transcription polymerase chain reaction amplicons, we experimentally validated 1960 novel intergenic splice junctions with an 80-90% success rate, corroborating the high precision of the STAR mapping strategy. Availability and implementation STAR is implemented as a standalone C++ code. STAR is free open source software distributed under GPLv3 license and can be downloaded from http://code.google.com/p/rna-star/.

...read moreread less

30,684 citations

疟原虫var基因转换速率变化导致抗原变异[英]／Paul H, Robert P, Christodoulou Z, et al//Proc Natl Acad Sci U S A

[...]

宁北芳, 朱淮民

28 Jul 2005

TL;DR: PfPMP1）与感染红细胞、树突状组胞以及胎盘的单个或多个受体作用，在黏附及免疫逃避中起关键的作�ly.

...read moreread less

Abstract: 抗原变异可使得多种致病微生物易于逃避宿主免疫应答。表达在感染红细胞表面的恶性疟原虫红细胞表面蛋白1（PfPMP1）与感染红细胞、内皮细胞、树突状细胞以及胎盘的单个或多个受体作用，在黏附及免疫逃避中起关键的作用。每个单倍体基因组var基因家族编码约60种成员，通过启动转录不同的var基因变异体为抗原变异提供了分子基础。

...read moreread less

18,940 citations

Journal Article•DOI•

MicroRNAs: Target Recognition and Regulatory Functions

[...]

David P. Bartel¹•Institutions (1)

Massachusetts Institute of Technology¹

23 Jan 2009-Cell

TL;DR: The current understanding of miRNA target recognition in animals is outlined and the widespread impact of miRNAs on both the expression and evolution of protein-coding genes is discussed.

...read moreread less

18,036 citations

Journal Article•DOI•

Full-length transcriptome assembly from RNA-Seq data without a reference genome.

[...]

Manfred Grabherr¹, Brian J. Haas¹, Moran Yassour¹, Moran Yassour², Joshua Z. Levin¹, Dawn Thompson¹, Ido Amit¹, Xian Adiconis¹, Lin Fan¹, Raktima Raychowdhury¹, Qiandong Zeng¹, Zehua Chen¹, Evan Mauceli¹, Nir Hacohen¹, Andreas Gnirke¹, Nicholas Rhind³, Federica Di Palma¹, Bruce W. Birren¹, Chad Nusbaum¹, Kerstin Lindblad-Toh¹, Kerstin Lindblad-Toh⁴, Nir Friedman², Aviv Regev¹ - Show less +19 more•Institutions (4)

Massachusetts Institute of Technology¹, Hebrew University of Jerusalem², University of Massachusetts Medical School³, Science for Life Laboratory⁴

01 Jul 2011-Nature Biotechnology

TL;DR: The Trinity method for de novo assembly of full-length transcripts and evaluate it on samples from fission yeast, mouse and whitefly, whose reference genome is not yet available, providing a unified solution for transcriptome reconstruction in any sample.

...read moreread less

Abstract: Massively parallel sequencing of cDNA has enabled deep and efficient probing of transcriptomes. Current approaches for transcript reconstruction from such data often rely on aligning reads to a reference genome, and are thus unsuitable for samples with a partial or missing reference genome. Here we present the Trinity method for de novo assembly of full-length transcripts and evaluate it on samples from fission yeast, mouse and whitefly, whose reference genome is not yet available. By efficiently constructing and analyzing sets of de Bruijn graphs, Trinity fully reconstructs a large fraction of transcripts, including alternatively spliced isoforms and transcripts from recently duplicated genes. Compared with other de novo transcriptome assemblers, Trinity recovers more full-length transcripts across a broad range of expression levels, with a sensitivity similar to methods that rely on genome alignments. Our approach provides a unified solution for transcriptome reconstruction in any sample, especially in the absence of a reference genome.

...read moreread less

15,665 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse