Manolis Kellis

Other affiliations: Broad Institute, Epigenomics AG, Harvard University

Bio: Manolis Kellis is an academic researcher at the Massachusetts Institute of Technology. The author has contributed to research on topics including the genome and genes. The author has an h-index of 128 and has co-authored 405 publications that have received 112,181 citations. Previous affiliations of Manolis Kellis include the Broad Institute and Epigenomics AG.

Topics: Genome, Gene, Chromatin, Genomics, Genome-wide association study

Papers published on a yearly basis: 2003–2005, 2007–2023

Papers

Journal Article

Chromatin Accessibility Impacts Transcriptional Reprogramming in Oocytes

Kei Miyamoto¹,², Khoi T. Nguyen³, George E. Allen¹, Jerome Jullien¹, Dinesh Kumar³, Tomoki Otani¹, Charles R. Bradshaw¹, Frederick J. Livesey¹, Manolis Kellis³,⁴, John B. Gurdon¹

Wellcome Trust/Cancer Research UK Gurdon Institute¹, Kindai University², Massachusetts Institute of Technology³, Broad Institute⁴

10 Jul 2018-Cell Reports

TL;DR: The authors investigated the relationship between chromatin opening and transcriptional activation and concluded that chromatin accessibility is a central factor for successful transcriptional reprogramming in oocytes.

40 citations

Journal Article

Evidence of reduced recombination rate in human regulatory domains

Yaping Liu¹,², Abhishek Sarkar¹,², Pouya Kheradpour¹,², Jason Ernst³, Manolis Kellis¹,²

Broad Institute¹, Massachusetts Institute of Technology², University of California, Los Angeles³

20 Oct 2017-Genome Biology

TL;DR: The results suggest the existence of a recombination rate valley at regulatory domains and provide a potential molecular mechanism to interpret the interplay between genetic and epigenetic variations.

Abstract: Recombination rate is non-uniformly distributed across the human genome. The variation of recombination rate at both fine and large scales cannot be fully explained by DNA sequences alone. Epigenetic factors, particularly DNA methylation, have recently been proposed to influence the variation in recombination rate. We study the relationship between recombination rate and gene regulatory domains, defined by a gene and its linked control elements. We define these links using expression quantitative trait loci (eQTLs), methylation quantitative trait loci (meQTLs), chromatin conformation from publicly available datasets (Hi-C and ChIA-PET), and correlated activity links that we infer across cell types. Each link type shows a “recombination rate valley” of significantly reduced recombination rate compared to matched control regions. This recombination rate valley is most pronounced for gene regulatory domains of early embryonic development genes, housekeeping genes, and constitutive regulatory elements, which are known to show increased evolutionary constraint across species. Recombination rate valleys show increased DNA methylation, reduced double-stranded break initiation, and increased repair efficiency, specifically in the lineage leading to the germ line. Moreover, by using only the overlap of functional links and DNA methylation in germ cells, we are able to predict the recombination rate with high accuracy. Our results suggest the existence of a recombination rate valley at regulatory domains and provide a potential molecular mechanism to interpret the interplay between genetic and epigenetic variations.

40 citations

Journal Article

Genetic drivers of m6A methylation in human brain, lung, heart and muscle.

Xushen Xiong¹,²,³, Lei Hou²,³, Yongjin Park²,³, Benoit Molinie³, Richard I. Gregory¹, Manolis Kellis²,³

Boston Children's Hospital¹, Massachusetts Institute of Technology², Broad Institute³

14 May 2021-Nature Genetics

TL;DR: This paper reports 129 transcriptome-wide N6-methyladenosine (m6A) profiles across four human tissues and identifies 8,843 tissue-specific and 469 tissue-shared m6A quantitative trait loci (QTLs), 184 of which colocalize with GWAS signals.

Abstract: The most prevalent post-transcriptional mRNA modification, N6-methyladenosine (m6A), plays diverse RNA-regulatory roles, but its genetic control in human tissues remains uncharted. Here we report 129 transcriptome-wide m6A profiles, covering 91 individuals and 4 tissues (brain, lung, muscle and heart) from GTEx/eGTEx. We integrate these with interindividual genetic and expression variation, revealing 8,843 tissue-specific and 469 tissue-shared m6A quantitative trait loci (QTLs), which are modestly enriched in, but mostly orthogonal to, expression QTLs. We integrate m6A QTLs with disease genetics, identifying 184 GWAS-colocalized m6A QTLs, including brain m6A QTLs underlying neuroticism, depression, schizophrenia and anxiety; lung m6A QTLs underlying expiratory flow and asthma; and muscle/heart m6A QTLs underlying coronary artery disease. Last, we predict novel m6A regulators that show preferential binding in m6A QTLs, protein interactions with known m6A regulators and expression correlation with the m6A levels of their targets. Our results provide important insights and resources for understanding both cis and trans regulation of epitranscriptomic modifications, their interindividual variation and their roles in human disease.

40 citations

Journal Article

Distinct and Predictive Histone Lysine Acetylation Patterns at Promoters, Enhancers and Gene Bodies

Nisha Rajagopal¹, Jason Ernst², Pradipta R. Ray³, Jie Wu⁴, Michael Q. Zhang³, Manolis Kellis⁵, Bing Ren¹,⁶

Ludwig Institute for Cancer Research¹, University of California, Los Angeles², University of Texas at Dallas³, Cold Spring Harbor Laboratory⁴, Massachusetts Institute of Technology⁵, University of California, San Diego⁶

01 Nov 2014-G3: Genes, Genomes, Genetics

TL;DR: This work used the previously developed RFECS algorithm to identify the histone modifications most predictive of promoters, enhancers, and gene bodies, and found that histone acetylation alone performs well in distinguishing these genomic regions.

Abstract: In eukaryotic cells, histone lysines are frequently acetylated. However, unlike modifications such as methylations, histone acetylation modifications are often considered redundant. As such, the functional roles of distinct histone acetylations are largely unexplored. We previously developed an algorithm RFECS to discover the most informative modifications associated with the classification or prediction of mammalian enhancers. Here, we used this tool to identify the modifications most predictive of promoters, enhancers, and gene bodies. Unexpectedly, we found that histone acetylation alone performs well in distinguishing these unique genomic regions. Further, we found the association of characteristic acetylation patterns with genic regions and association of chromatin state with splicing. Taken together, our work underscores the diverse functional roles of histone acetylation in gene regulation and provides several testable hypotheses to dissect these roles.

40 citations

Journal Article

Motif discovery in physiological datasets: A methodology for inferring predictive elements

Zeeshan Syed¹, Collin M. Stultz², Manolis Kellis², Piotr Indyk², John V. Guttag²

University of Michigan¹, Massachusetts Institute of Technology²

18 Jan 2010-ACM Transactions on Knowledge Discovery From Data

TL;DR: In this article, a two-stage process is proposed to identify predictive physiological patterns in the absence of prior knowledge, which is analogous to the discovery of regulatory motifs in genomic datasets.

Abstract: In this article, we propose a methodology for identifying predictive physiological patterns in the absence of prior knowledge. We use the principle of conservation to identify activity that consistently precedes an outcome in patients, and describe a two-stage process that allows us to efficiently search for such patterns in large datasets. This involves first transforming continuous physiological signals from patients into symbolic sequences, and then searching for patterns in these reduced representations that are strongly associated with an outcome. Our strategy of identifying conserved activity that is unlikely to have occurred purely by chance in symbolic data is analogous to the discovery of regulatory motifs in genomic datasets. We build upon existing work in this area, generalizing the notion of a regulatory motif and enhancing current techniques to operate robustly on non-genomic data. We also address two significant considerations associated with motif discovery in general: computational efficiency and robustness in the presence of degeneracy and noise. To deal with these issues, we introduce the concept of active regions and new subset-based techniques such as a two-layer Gibbs sampling algorithm. These extensions allow for a framework for information inference, where precursors are identified as approximately conserved activity of arbitrary complexity preceding multiple occurrences of an event. We evaluated our solution on a population of patients who experienced sudden cardiac death and attempted to discover electrocardiographic activity that may be associated with the endpoint of death. To assess the predictive patterns discovered, we compared likelihood scores for motifs in the sudden death population against control populations of normal individuals and those with non-fatal supraventricular arrhythmias. Our results suggest that predictive motif discovery may be able to identify clinically relevant information even in the absence of significant prior knowledge.
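The two-stage idea in the abstract above (symbolize a continuous signal, then look for patterns associated with an outcome) can be illustrated with a minimal sketch. This is not the paper's method: it swaps the active-region and two-layer Gibbs sampling techniques for plain exact k-mer counting, and the function names (`symbolize`, `enriched_patterns`), the fixed breakpoints, the add-one smoothing, and the `min_ratio` threshold are all assumptions made for illustration.

```python
from collections import Counter


def symbolize(signal, breakpoints, alphabet="abcd"):
    """Stage 1 (illustrative): map each sample to a symbol by amplitude bin.

    Assumes len(alphabet) == len(breakpoints) + 1 and sorted breakpoints.
    """
    symbols = []
    for x in signal:
        # Index of the bin = number of breakpoints the sample exceeds.
        idx = sum(x > b for b in breakpoints)
        symbols.append(alphabet[idx])
    return "".join(symbols)


def kmers(seq, k):
    """Set of all length-k substrings occurring in seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def enriched_patterns(case_seqs, control_seqs, k, min_ratio=2.0):
    """Stage 2 (illustrative): exact k-mers seen in a larger fraction of
    case sequences than control sequences.

    Returns {pattern: ratio}, where ratio compares the fraction of case
    sequences containing the pattern with an add-one-smoothed control fraction.
    """
    case_presence = Counter()
    for s in case_seqs:
        case_presence.update(kmers(s, k))
    control_presence = Counter()
    for s in control_seqs:
        control_presence.update(kmers(s, k))

    result = {}
    for pattern, n_case in case_presence.items():
        case_frac = n_case / len(case_seqs)
        # Add-one smoothing avoids division by zero for unseen patterns.
        control_frac = (control_presence.get(pattern, 0) + 1) / (len(control_seqs) + 1)
        ratio = case_frac / control_frac
        if ratio >= min_ratio:
            result[pattern] = ratio
    return result


# Hypothetical usage with made-up data:
print(symbolize([0.1, 0.6, 1.2, -0.5], breakpoints=[0.0, 0.5, 1.0]))
print(enriched_patterns(["abca", "zabc"], ["bbbb", "cccc"], k=3))
```

Exact matching is the simplest possible choice here; the abstract stresses robustness to degeneracy and noise, which this sketch does not attempt to handle.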

39 citations

Cited by

Journal Article

Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles

Aravind Subramanian¹, Pablo Tamayo¹, Vamsi K. Mootha², Sayan Mukherjee³, Benjamin L. Ebert², Michael A. Gillette², Amanda G. Paulovich⁴, Scott L. Pomeroy², Todd R. Golub², Eric S. Lander¹, Jill P. Mesirov¹

Massachusetts Institute of Technology¹, Harvard University², Duke University³, Fred Hutchinson Cancer Research Center⁴

25 Oct 2005-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: The Gene Set Enrichment Analysis (GSEA) method interprets gene expression data by focusing on gene sets, that is, groups of genes that share common biological function, chromosomal location, or regulation.

Abstract: Although genomewide RNA expression analysis has become a routine tool in biomedical research, extracting biological insight from such information remains a major challenge. Here, we describe a powerful analytical method called Gene Set Enrichment Analysis (GSEA) for interpreting gene expression data. The method derives its power by focusing on gene sets, that is, groups of genes that share common biological function, chromosomal location, or regulation. We demonstrate how GSEA yields insights into several cancer-related data sets, including leukemia and lung cancer. Notably, where single-gene analysis finds little similarity between two independent studies of patient survival in lung cancer, GSEA reveals many biological pathways in common. The GSEA method is embodied in a freely available software package, together with an initial database of 1,325 biologically defined gene sets.

34,830 citations

Journal Article

STAR: ultrafast universal RNA-seq aligner

Alexander Dobin¹, Carrie A. Davis¹, Felix Schlesinger¹, Jorg Drenkow¹, Chris Zaleski¹, Sonali Jha¹, Philippe Batut¹, Mark Chaisson¹, Thomas R. Gingeras¹

Cold Spring Harbor Laboratory¹

01 Jan 2013-Bioinformatics

TL;DR: The Spliced Transcripts Alignment to a Reference (STAR) software, based on a previously undescribed RNA-seq alignment algorithm that uses sequential maximum mappable seed search in uncompressed suffix arrays followed by a seed clustering and stitching procedure, outperforms other aligners by a factor of >50 in mapping speed.

Abstract: Motivation: Accurate alignment of high-throughput RNA-seq data is a challenging and yet unsolved problem because of the non-contiguous transcript structure, relatively short read lengths and constantly increasing throughput of the sequencing technologies. Currently available RNA-seq aligners suffer from high mapping error rates, low mapping speed, read length limitation and mapping biases. Results: To align our large (>80 billion reads) ENCODE Transcriptome RNA-seq dataset, we developed the Spliced Transcripts Alignment to a Reference (STAR) software based on a previously undescribed RNA-seq alignment algorithm that uses sequential maximum mappable seed search in uncompressed suffix arrays followed by seed clustering and stitching procedure. STAR outperforms other aligners by a factor of >50 in mapping speed, aligning to the human genome 550 million 2 × 76 bp paired-end reads per hour on a modest 12-core server, while at the same time improving alignment sensitivity and precision. In addition to unbiased de novo detection of canonical junctions, STAR can discover non-canonical splices and chimeric (fusion) transcripts, and is also capable of mapping full-length RNA sequences. Using Roche 454 sequencing of reverse transcription polymerase chain reaction amplicons, we experimentally validated 1960 novel intergenic splice junctions with an 80-90% success rate, corroborating the high precision of the STAR mapping strategy. Availability and implementation: STAR is implemented as a standalone C++ code. STAR is free open source software distributed under GPLv3 license and can be downloaded from http://code.google.com/p/rna-star/.

30,684 citations

Changes in Plasmodium var gene switching rates lead to antigenic variation [English] / Paul H, Robert P, Christodoulou Z, et al // Proc Natl Acad Sci U S A

宁北芳, 朱淮民

28 Jul 2005

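TL;DR: Plasmodium falciparum erythrocyte surface protein 1 (PfPMP1) interacts with one or more receptors on infected erythrocytes, dendritic cells, and the placenta, and plays a key role in adhesion and immune evasion.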

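Abstract: Antigenic variation allows many pathogenic microorganisms to evade the host immune response. Plasmodium falciparum erythrocyte surface protein 1 (PfPMP1), expressed on the surface of infected erythrocytes, interacts with one or more receptors on infected erythrocytes, endothelial cells, dendritic cells, and the placenta, and plays a key role in adhesion and immune evasion. The var gene family encodes about 60 members per haploid genome, and initiating transcription of different var gene variants provides the molecular basis for antigenic variation.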

18,940 citations

Journal Article

MicroRNAs: Target Recognition and Regulatory Functions

David P. Bartel¹

Massachusetts Institute of Technology¹

23 Jan 2009-Cell

TL;DR: The current understanding of miRNA target recognition in animals is outlined and the widespread impact of miRNAs on both the expression and evolution of protein-coding genes is discussed.

18,036 citations

Journal Article

Full-length transcriptome assembly from RNA-Seq data without a reference genome.

Manfred Grabherr¹, Brian J. Haas¹, Moran Yassour¹,², Joshua Z. Levin¹, Dawn Thompson¹, Ido Amit¹, Xian Adiconis¹, Lin Fan¹, Raktima Raychowdhury¹, Qiandong Zeng¹, Zehua Chen¹, Evan Mauceli¹, Nir Hacohen¹, Andreas Gnirke¹, Nicholas Rhind³, Federica Di Palma¹, Bruce W. Birren¹, Chad Nusbaum¹, Kerstin Lindblad-Toh¹,⁴, Nir Friedman², Aviv Regev¹

Massachusetts Institute of Technology¹, Hebrew University of Jerusalem², University of Massachusetts Medical School³, Science for Life Laboratory⁴

01 Jul 2011-Nature Biotechnology

TL;DR: This paper presents the Trinity method for de novo assembly of full-length transcripts and evaluates it on samples from fission yeast, mouse and whitefly (whose reference genome is not yet available), providing a unified solution for transcriptome reconstruction in any sample.

Abstract: Massively parallel sequencing of cDNA has enabled deep and efficient probing of transcriptomes. Current approaches for transcript reconstruction from such data often rely on aligning reads to a reference genome, and are thus unsuitable for samples with a partial or missing reference genome. Here we present the Trinity method for de novo assembly of full-length transcripts and evaluate it on samples from fission yeast, mouse and whitefly, whose reference genome is not yet available. By efficiently constructing and analyzing sets of de Bruijn graphs, Trinity fully reconstructs a large fraction of transcripts, including alternatively spliced isoforms and transcripts from recently duplicated genes. Compared with other de novo transcriptome assemblers, Trinity recovers more full-length transcripts across a broad range of expression levels, with a sensitivity similar to methods that rely on genome alignments. Our approach provides a unified solution for transcriptome reconstruction in any sample, especially in the absence of a reference genome.

15,665 citations
