Home
/
Authors
/
Pablo Alvarez

Author

Pablo Alvarez

Bio: Pablo Alvarez is an academic researcher from Broad Institute. The author has contributed to research in topics: Genome & Molecular evolution. The author has an hindex of 8, co-authored 9 publications receiving 9832 citations. Previous affiliations of Pablo Alvarez include Akamai Technologies.

Topics: Genome, Molecular evolution, Genomics, Induced pluripotent stem cell, ChIP-exo ...read more

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Genome-wide maps of chromatin state in pluripotent and lineage-committed cells

[...]

Tarjei S. Mikkelsen¹, Manching Ku², Manching Ku¹, David B. Jaffe¹, Biju Issac¹, Biju Issac², Erez Lieberman Aiden³, Erez Lieberman Aiden¹, Georgia Giannoukos¹, Pablo Alvarez¹, William Brockman¹, Tae Kyung Kim⁴, Richard Koche², Richard Koche¹, Richard Koche³, William Lee¹, Eric M. Mendenhall², Eric M. Mendenhall¹, Aisling O'Donovan², Aviva Presser¹, Carsten Russ¹, Xiaohui Xie¹, Alexander Meissner³, Marius Wernig³, Rudolf Jaenisch³, Chad Nusbaum¹, Eric S. Lander¹, Eric S. Lander³, Bradley E. Bernstein², Bradley E. Bernstein¹ - Show less +26 more•Institutions (4)

Broad Institute¹, Harvard University², Massachusetts Institute of Technology³, Boston Children's Hospital⁴

02 Aug 2007-Nature

TL;DR: The application of single-molecule-based sequencing technology for high-throughput profiling of histone modifications in mammalian cells is reported and it is shown that chromatin state can be read in an allele-specific manner by using single nucleotide polymorphisms.

...read moreread less

Abstract: We report the application of single-molecule-based sequencing technology for high-throughput profiling of histone modifications in mammalian cells By obtaining over four billion bases of sequence from chromatin immunoprecipitated DNA, we generated genome-wide chromatin-state maps of mouse embryonic stem cells, neural progenitor cells and embryonic fibroblasts We find that lysine 4 and lysine 27 trimethylation effectively discriminates genes that are expressed, poised for expression, or stably repressed, and therefore reflect cell state and lineage potential Lysine 36 trimethylation marks primary coding and non-coding transcripts, facilitating gene annotation Trimethylation of lysine 9 and lysine 20 is detected at satellite, telomeric and active long-terminal repeats, and can spread into proximal unique sequences Lysine 4 and lysine 9 trimethylation marks imprinting control regions Finally, we show that chromatin state can be read in an allele-specific manner by using single nucleotide polymorphisms This study provides a framework for the application of comprehensive chromatin profiling towards characterization of diverse mammalian cell populations

...read moreread less

4,166 citations

Journal Article•DOI•

Genome sequence, comparative analysis and haplotype structure of the domestic dog

[...]

Kerstin Lindblad-Toh¹, Claire M. Wade¹, Claire M. Wade², Tarjei S. Mikkelsen¹ +238 more•Institutions (11)

08 Dec 2005-Nature

TL;DR: A high-quality draft genome sequence of the domestic dog is reported, together with a dense map of single nucleotide polymorphisms (SNPs) across breeds, to shed light on the structure and evolution of genomes and genes.

...read moreread less

Abstract: Here we report a high-quality draft genome sequence of the domestic dog (Canis familiaris), together with a dense map of single nucleotide polymorphisms (SNPs) across breeds. The dog is of particular interest because it provides important evolutionary information and because existing breeds show great phenotypic diversity for morphological, physiological and behavioural traits. We use sequence comparison with the primate and rodent lineages to shed light on the structure and evolution of genomes and genes. Notably, the majority of the most highly conserved non-coding sequences in mammalian genomes are clustered near a small subset of genes with important roles in development. Analysis of SNPs reveals long-range haplotypes across the entire dog genome, and defines the nature of genetic diversity within and across breeds. The current SNP map now makes it possible for genome-wide association studies to identify genes responsible for diseases and traits, with important consequences for human and companion animal health.

...read moreread less

2,431 citations

Journal Article•DOI•

Evolution of genes and genomes on the Drosophila phylogeny.

[...]

Andrew G. Clark¹, Michael B. Eisen², Michael B. Eisen³, Douglas Smith +426 more•Institutions (70)

08 Nov 2007-Nature

TL;DR: These genome sequences augment the formidable genetic tools that have made Drosophila melanogaster a pre-eminent model for animal genetics, and will further catalyse fundamental research on mechanisms of development, cell biology, genetics, disease, neurobiology, behaviour, physiology and evolution.

...read moreread less

Abstract: Comparative analysis of multiple genomes in a phylogenetic framework dramatically improves the precision and sensitivity of evolutionary inference, producing more robust results than single-genome analyses can provide. The genomes of 12 Drosophila species, ten of which are presented here for the first time (sechellia, simulans, yakuba, erecta, ananassae, persimilis, willistoni, mojavensis, virilis and grimshawi), illustrate how rates and patterns of sequence divergence across taxa can illuminate evolutionary processes on a genomic scale. These genome sequences augment the formidable genetic tools that have made Drosophila melanogaster a pre-eminent model for animal genetics, and will further catalyse fundamental research on mechanisms of development, cell biology, genetics, disease, neurobiology, behaviour, physiology and evolution. Despite remarkable similarities among these Drosophila species, we identified many putatively non-neutral changes in protein-coding genes, non-coding RNA genes, and cis-regulatory regions. These may prove to underlie differences in the ecology and behaviour of these diverse species.

...read moreread less

2,057 citations

Journal Article•DOI•

Genome of the marsupial Monodelphis domestica reveals innovation in non-coding sequences

[...]

Tarjei S. Mikkelsen¹, Tarjei S. Mikkelsen², Matthew Wakefield³, Bronwen Aken⁴ +235 more•Institutions (21)

10 May 2007-Nature

TL;DR: A high-quality draft of the genome sequence of the grey, short-tailed opossum is reported, indicating a strong influence of biased gene conversion on nucleotide sequence composition, and a relationship between chromosomal characteristics and X chromosome inactivation.

...read moreread less

Abstract: We report a high-quality draft of the genome sequence of the grey, short-tailed opossum (Monodelphis domestica). As the first metatherian ('marsupial') species to be sequenced, the opossum provides a unique perspective on the organization and evolution of mammalian genomes. Distinctive features of the opossum chromosomes provide support for recent theories about genome evolution and function, including a strong influence of biased gene conversion on nucleotide sequence composition, and a relationship between chromosomal characteristics and X chromosome inactivation. Comparison of opossum and eutherian genomes also reveals a sharp difference in evolutionary innovation between protein-coding and non-coding functional elements. True innovation in protein-coding genes seems to be relatively rare, with lineage-specific differences being largely due to diversification and rapid turnover in gene families involved in environmental interactions. In contrast, about 20% of eutherian conserved non-coding elements (CNEs) are recent inventions that postdate the divergence of Eutheria and Metatheria. A substantial proportion of these eutherian-specific CNEs arose from sequence inserted by transposable elements, pointing to transposons as a major creative force in the evolution of mammalian gene regulation.

...read moreread less

724 citations

Journal Article•DOI•

Sensitive mutation detection in heterogeneous cancer specimens by massively parallel picoliter reactor sequencing

[...]

Roman K. Thomas¹, Elizabeth Nickerson, Jan Fredrik Simons², Pasi A. Jänne³, Torstein Tengs⁴, Torstein Tengs³, Yuki Yuza³, Levi A. Garraway³, Levi A. Garraway⁴, Thomas LaFramboise³, Thomas LaFramboise⁴, Jeffrey C. Lee³, Jeffrey C. Lee⁴, Kinjal Shah³, Kinjal Shah⁴, Keith O'Neill⁴, Hidefumi Sasaki⁵, Neal I. Lindeman⁶, Kwok-Kin Wong³, Ana M. Borras³, Edward J. Gutmann⁷, Konstantin H. Dragnev⁷, Ralph M. Debiasi³, Ralph M. Debiasi⁴, Tzu Hsiu Chen³, Tzu Hsiu Chen⁴, Karen A. Glatt³, Heidi Greulich⁴, Heidi Greulich³, Brian Desany, Christine Lubeski, William Brockman⁴, Pablo Alvarez⁴, Stephen K. Hutchison, John H. Leamon, Michael T. Ronan, Gregory S. Turenchalk, Michael Egholm, William R. Sellers³, William R. Sellers⁴, Jonathan M. Rothberg, Matthew Meyerson - Show less +38 more•Institutions (7)

Max Planck Society¹, Dana Corporation², Harvard University³, Broad Institute⁴, Nagoya City University⁵, Brigham and Women's Hospital⁶, Dartmouth–Hitchcock Medical Center⁷

25 Jun 2006-Nature Medicine

TL;DR: It is shown that microreactor-based pyrosequencing can detect rare cancer-associated sequence variations by independent and parallel sampling of multiple representatives of a given DNA fragment and can thereby facilitate accurate molecular diagnosis of heterogeneous cancer specimens and enable patient selection for targeted cancer therapies.

...read moreread less

Abstract: The sensitivity of conventional DNA sequencing in tumor biopsies is limited by stromal contamination and by genetic heterogeneity within the cancer. Here, we show that microreactor-based pyrosequencing can detect rare cancer-associated sequence variations by independent and parallel sampling of multiple representatives of a given DNA fragment. This technology can thereby facilitate accurate molecular diagnosis of heterogeneous cancer specimens and enable patient selection for targeted cancer therapies.

...read moreread less

376 citations

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Model-based Analysis of ChIP-Seq (MACS)

[...]

Yong Zhang¹, Tao Liu¹, Clifford A. Meyer¹, Jérôme Eeckhoute², David S. Johnson, Bradley E. Bernstein³, Bradley E. Bernstein¹, Chad Nusbaum³, Richard M. Myers⁴, Myles Brown², Wei Li⁵, X. Shirley Liu¹ - Show less +8 more•Institutions (5)

Harvard University¹, Brigham and Women's Hospital², Broad Institute³, Stanford University⁴, Baylor College of Medicine⁵

17 Sep 2008-Genome Biology

TL;DR: This work presents Model-based Analysis of ChIP-Seq data, MACS, which analyzes data generated by short read sequencers such as Solexa's Genome Analyzer, and uses a dynamic Poisson distribution to effectively capture local biases in the genome, allowing for more robust predictions.

...read moreread less

Abstract: We present Model-based Analysis of ChIP-Seq data, MACS, which analyzes data generated by short read sequencers such as Solexa's Genome Analyzer. MACS empirically models the shift size of ChIP-Seq tags, and uses it to improve the spatial resolution of predicted binding sites. MACS also uses a dynamic Poisson distribution to effectively capture local biases in the genome, allowing for more robust predictions. MACS compares favorably to existing ChIP-Seq peak-finding algorithms, and is freely available.

...read moreread less

13,008 citations

Journal Article•DOI•

A framework for variation discovery and genotyping using next-generation DNA sequencing data

[...]

Mark A. DePristo¹, Eric Banks¹, Ryan Poplin¹, Kiran V. Garimella¹, Jared Maguire¹, Christopher Hartl¹, Anthony A. Philippakis¹, Anthony A. Philippakis², Anthony A. Philippakis³, Guillermo del Angel¹, Manuel A. Rivas², Manuel A. Rivas¹, Matt Hanna¹, Aaron McKenna¹, Timothy Fennell¹, Andrew Kernytsky¹, Andrey Sivachenko¹, Kristian Cibulskis¹, Stacey Gabriel¹, David Altshuler¹, David Altshuler², Mark J. Daly¹, Mark J. Daly² - Show less +19 more•Institutions (3)

Broad Institute¹, Harvard University², Brigham and Women's Hospital³

01 May 2011-Nature Genetics

TL;DR: A unified analytic framework to discover and genotype variation among multiple samples simultaneously that achieves sensitive and specific results across five sequencing technologies and three distinct, canonical experimental designs is presented.

...read moreread less

Abstract: Recent advances in sequencing technology make it possible to comprehensively catalogue genetic variation in population samples, creating a foundation for understanding human disease, ancestry and evolution. The amounts of raw data produced are prodigious and many computational steps are required to translate this output into high-quality variant calls. We present a unified analytic framework to discover and genotype variation among multiple samples simultaneously that achieves sensitive and specific results across five sequencing technologies and three distinct, canonical experimental designs. Our process includes (1) initial read mapping; (2) local realignment around indels; (3) base quality score recalibration; (4) SNP discovery and genotyping to find all potential variants; and (5) machine learning to separate true segregating variation from machine artifacts common to next-generation sequencing technologies. We discuss the application of these tools, instantiated in the Genome Analysis Toolkit (GATK), to deep whole-genome, whole-exome capture, and multi-sample low-pass (~4×) 1000 Genomes Project datasets.

...read moreread less

10,056 citations

Journal Article•DOI•

Simple Combinations of Lineage-Determining Transcription Factors Prime cis-Regulatory Elements Required for Macrophage and B Cell Identities

[...]

Sven Heinz¹, Christopher Benner¹, Nathanael J. Spann¹, Eric Bertolino², Yin C. Lin¹, Peter Laslo³, Jason X. Cheng², Cornelis Murre¹, Harinder Singh⁴, Harinder Singh², Christopher K. Glass¹ - Show less +7 more•Institutions (4)

University of California, San Diego¹, University of Chicago², University of Leeds³, Genentech⁴

28 May 2010-Molecular Cell

TL;DR: It is demonstrated in macrophages and B cells that collaborative interactions of the common factor PU.1 with small sets of macrophage- or B cell lineage-determining transcription factors establish cell-specific binding sites that are associated with the majority of promoter-distal H3K4me1-marked genomic regions.

...read moreread less

9,620 citations

Journal Article•DOI•

Comprehensive mapping of long-range interactions reveals folding principles of the human genome.

[...]

Erez Lieberman Aiden¹, Nynke L. van Berkum², Louise Williams¹, Maxim Imakaev¹, Tobias Ragoczy³, Tobias Ragoczy⁴, Agnes Telling³, Agnes Telling⁴, Ido Amit¹, Bryan R. Lajoie², Peter J. Sabo³, Michael O. Dorschner³, Richard Sandstrom³, Bradley E. Bernstein⁵, Bradley E. Bernstein¹, Michaël Bender³, Mark Groudine⁴, Mark Groudine³, Andreas Gnirke¹, John A. Stamatoyannopoulos³, Leonid A. Mirny¹, Eric S. Lander⁵, Eric S. Lander¹, Job Dekker² - Show less +20 more•Institutions (5)

Massachusetts Institute of Technology¹, University of Massachusetts Medical School², University of Washington³, Fred Hutchinson Cancer Research Center⁴, Harvard University⁵

09 Oct 2009-Science

TL;DR: Hi-C is described, a method that probes the three-dimensional architecture of whole genomes by coupling proximity-based ligation with massively parallel sequencing and demonstrates the power of Hi-C to map the dynamic conformations of entire genomes.

...read moreread less

Abstract: We describe Hi-C, a method that probes the three-dimensional architecture of whole genomes by coupling proximity-based ligation with massively parallel sequencing. We constructed spatial proximity maps of the human genome with Hi-C at a resolution of 1 megabase. These maps confirm the presence of chromosome territories and the spatial proximity of small, gene-rich chromosomes. We identified an additional level of genome organization that is characterized by the spatial segregation of open and closed chromatin to form two genome-wide compartments. At the megabase scale, the chromatin conformation is consistent with a fractal globule, a knot-free, polymer conformation that enables maximally dense packing while preserving the ability to easily fold and unfold any genomic locus. The fractal globule is distinct from the more commonly used globular equilibrium model. Our results demonstrate the power of Hi-C to map the dynamic conformations of whole genomes.

...read moreread less

7,180 citations

Journal Article•DOI•

Topological domains in mammalian genomes identified by analysis of chromatin interactions

[...]

Jesse R. Dixon¹, Siddarth Selvaraj², Siddarth Selvaraj¹, Feng Yue¹, Audrey Kim¹, Yan-Yan Li¹, Yin-Zhong Shen¹, Ming Hu³, Jun Liu³, Bing Ren², Bing Ren¹ - Show less +7 more•Institutions (3)

Ludwig Institute for Cancer Research¹, University of California, San Diego², Harvard University³

17 May 2012-Nature

TL;DR: It is found that the boundaries of topological domains are enriched for the insulator binding protein CTCF, housekeeping genes, transfer RNAs and short interspersed element (SINE) retrotransposons, indicating that these factors may have a role in establishing the topological domain structure of the genome.

...read moreread less

Abstract: The spatial organization of the genome is intimately linked to its biological function, yet our understanding of higher order genomic structure is coarse, fragmented and incomplete. In the nucleus of eukaryotic cells, interphase chromosomes occupy distinct chromosome territories, and numerous models have been proposed for how chromosomes fold within chromosome territories. These models, however, provide only few mechanistic details about the relationship between higher order chromatin structure and genome function. Recent advances in genomic technologies have led to rapid advances in the study of three-dimensional genome organization. In particular, Hi-C has been introduced as a method for identifying higher order chromatin interactions genome wide. Here we investigate the three-dimensional organization of the human and mouse genomes in embryonic stem cells and terminally differentiated cell types at unprecedented resolution. We identify large, megabase-sized local chromatin interaction domains, which we term 'topological domains', as a pervasive structural feature of the genome organization. These domains correlate with regions of the genome that constrain the spread of heterochromatin. The domains are stable across different cell types and highly conserved across species, indicating that topological domains are an inherent property of mammalian genomes. Finally, we find that the boundaries of topological domains are enriched for the insulator binding protein CTCF, housekeeping genes, transfer RNAs and short interspersed element (SINE) retrotransposons, indicating that these factors may have a role in establishing the topological domain structure of the genome.

...read moreread less

5,774 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse