Home
/
Authors
/
Chad Nusbaum

Author

Chad Nusbaum

Bio: Chad Nusbaum is an academic researcher from Broad Institute. The author has contributed to research in topics: Genome & Genomics. The author has an hindex of 52, co-authored 79 publications receiving 49039 citations.

Topics: Genome, Genomics, Gene, Population, Sequence assembly ...read more

Papers published on a yearly basis

2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Model-based Analysis of ChIP-Seq (MACS)

[...]

Yong Zhang¹, Tao Liu¹, Clifford A. Meyer¹, Jérôme Eeckhoute², David S. Johnson, Bradley E. Bernstein³, Bradley E. Bernstein¹, Chad Nusbaum³, Richard M. Myers⁴, Myles Brown², Wei Li⁵, X. Shirley Liu¹ - Show less +8 more•Institutions (5)

Harvard University¹, Brigham and Women's Hospital², Broad Institute³, Stanford University⁴, Baylor College of Medicine⁵

17 Sep 2008-Genome Biology

TL;DR: This work presents Model-based Analysis of ChIP-Seq data, MACS, which analyzes data generated by short read sequencers such as Solexa's Genome Analyzer, and uses a dynamic Poisson distribution to effectively capture local biases in the genome, allowing for more robust predictions.

...read moreread less

Abstract: We present Model-based Analysis of ChIP-Seq data, MACS, which analyzes data generated by short read sequencers such as Solexa's Genome Analyzer. MACS empirically models the shift size of ChIP-Seq tags, and uses it to improve the spatial resolution of predicted binding sites. MACS also uses a dynamic Poisson distribution to effectively capture local biases in the genome, allowing for more robust predictions. MACS compares favorably to existing ChIP-Seq peak-finding algorithms, and is freely available.

...read moreread less

13,008 citations

Journal Article•DOI•

Structure, function and diversity of the healthy human microbiome

[...]

Curtis Huttenhower¹, Curtis Huttenhower², Dirk Gevers², Rob Knight³ +250 more•Institutions (42)

14 Jun 2012-Nature

TL;DR: The Human Microbiome Project Consortium reported the first results of their analysis of microbial communities from distinct, clinically relevant body habitats in a human cohort; the insights into the microbial communities of a healthy population lay foundations for future exploration of the epidemiology, ecology and translational applications of the human microbiome as discussed by the authors.

...read moreread less

Abstract: The Human Microbiome Project Consortium reports the first results of their analysis of microbial communities from distinct, clinically relevant body habitats in a human cohort; the insights into the microbial communities of a healthy population lay foundations for future exploration of the epidemiology, ecology and translational applications of the human microbiome.

...read moreread less

8,410 citations

Journal Article•DOI•

Genome-wide maps of chromatin state in pluripotent and lineage-committed cells

[...]

Tarjei S. Mikkelsen¹, Manching Ku¹, Manching Ku², David B. Jaffe¹, Biju Issac¹, Biju Issac², Erez Lieberman Aiden³, Erez Lieberman Aiden¹, Georgia Giannoukos¹, Pablo Alvarez¹, William Brockman¹, Tae Kyung Kim⁴, Richard Koche³, Richard Koche¹, Richard Koche², William Lee¹, Eric M. Mendenhall², Eric M. Mendenhall¹, Aisling O'Donovan², Aviva Presser¹, Carsten Russ¹, Xiaohui Xie¹, Alexander Meissner³, Marius Wernig³, Rudolf Jaenisch³, Chad Nusbaum¹, Eric S. Lander¹, Eric S. Lander³, Bradley E. Bernstein¹, Bradley E. Bernstein² - Show less +26 more•Institutions (4)

Broad Institute¹, Harvard University², Massachusetts Institute of Technology³, Boston Children's Hospital⁴

02 Aug 2007-Nature

TL;DR: The application of single-molecule-based sequencing technology for high-throughput profiling of histone modifications in mammalian cells is reported and it is shown that chromatin state can be read in an allele-specific manner by using single nucleotide polymorphisms.

...read moreread less

Abstract: We report the application of single-molecule-based sequencing technology for high-throughput profiling of histone modifications in mammalian cells By obtaining over four billion bases of sequence from chromatin immunoprecipitated DNA, we generated genome-wide chromatin-state maps of mouse embryonic stem cells, neural progenitor cells and embryonic fibroblasts We find that lysine 4 and lysine 27 trimethylation effectively discriminates genes that are expressed, poised for expression, or stably repressed, and therefore reflect cell state and lineage potential Lysine 36 trimethylation marks primary coding and non-coding transcripts, facilitating gene annotation Trimethylation of lysine 9 and lysine 20 is detected at satellite, telomeric and active long-terminal repeats, and can spread into proximal unique sequences Lysine 4 and lysine 9 trimethylation marks imprinting control regions Finally, we show that chromatin state can be read in an allele-specific manner by using single nucleotide polymorphisms This study provides a framework for the application of comprehensive chromatin profiling towards characterization of diverse mammalian cell populations

...read moreread less

4,166 citations

Journal Article•DOI•

A Draft Sequence of the Neandertal Genome

[...]

Richard E. Green¹, Johannes Krause¹, Adrian W. Briggs¹, Tomislav Maricic¹, Udo Stenzel¹, Martin Kircher¹, Nick Patterson², Heng Li², Weiwei Zhai³, Markus Hsi-Yang Fritz⁴, Nancy F. Hansen⁵, Eric Durand³, Anna-Sapfo Malaspinas³, Jeffrey D. Jensen⁶, Tomas Marques-Bonet⁷, Tomas Marques-Bonet⁸, Can Alkan⁸, Kay Prüfer¹, Matthias Meyer¹, Hernán A. Burbano¹, Jeffrey M. Good⁹, Jeffrey M. Good¹, Rigo Schultz¹, Ayinuer Aximu-Petri¹, Anne Butthof¹, Barbara Höber¹, Barbara Höffner¹, Madien Siegemund¹, Antje Weihmann¹, Chad Nusbaum², Eric S. Lander², Carsten Russ², Nathaniel Novod², Jason P. Affourtit, Michael Egholm, Christine Verna¹, Pavao Rudan¹⁰, Dejana Brajković¹⁰, Željko Kućan¹⁰, Ivan Gušić¹⁰, Vladimir B. Doronichev, Liubov V. Golovanova, Carles Lalueza-Fox⁷, Marco de la Rasilla¹¹, Javier Fortea¹¹, Antonio Rosas⁷, Ralf Schmitz¹², Philip L. F. Johnson¹³, Evan E. Eichler⁸, Daniel Falush¹⁴, Ewan Birney⁴, James C. Mullikin⁵, Montgomery Slatkin³, Rasmus Nielsen³, Janet Kelso¹, Michael Lachmann¹, David Reich¹⁵, David Reich², Svante Pääbo¹ - Show less +55 more•Institutions (15)

Max Planck Society¹, Broad Institute², University of California, Berkeley³, European Bioinformatics Institute⁴, National Institutes of Health⁵, University of Massachusetts Medical School⁶, Spanish National Research Council⁷, University of Washington⁸, University of Montana⁹, Croatian Academy of Sciences and Arts¹⁰, University of Oviedo¹¹, University of Bonn¹², Emory University¹³, University College Cork¹⁴, Harvard University¹⁵

07 May 2010-Science

TL;DR: The genomic data suggest that Neandertals mixed with modern human ancestors some 120,000 years ago, leaving traces of Ne andertal DNA in contemporary humans, suggesting that gene flow from Neand Bertals into the ancestors of non-Africans occurred before the divergence of Eurasian groups from each other.

...read moreread less

Abstract: Neandertals, the closest evolutionary relatives of present-day humans, lived in large parts of Europe and western Asia before disappearing 30,000 years ago. We present a draft sequence of the Neandertal genome composed of more than 4 billion nucleotides from three individuals. Comparisons of the Neandertal genome to the genomes of five present-day humans from different parts of the world identify a number of genomic regions that may have been affected by positive selection in ancestral modern humans, including genes involved in metabolism and in cognitive and skeletal development. We show that Neandertals shared more genetic variants with present-day humans in Eurasia than with present-day humans in sub-Saharan Africa, suggesting that gene flow from Neandertals into the ancestors of non-Africans occurred before the divergence of Eurasian groups from each other.

...read moreread less

3,575 citations

Journal Article•DOI•

Genome-scale DNA methylation maps of pluripotent and differentiated cells

[...]

Alexander Meissner¹, Tarjei S. Mikkelsen¹, Tarjei S. Mikkelsen², Hongcang Gu², Marius Wernig¹, Jacob H. Hanna¹, Andrey Sivachenko², Xiaolan Zhang², Bradley E. Bernstein³, Bradley E. Bernstein², Chad Nusbaum², David B. Jaffe², Andreas Gnirke², Rudolf Jaenisch¹, Eric S. Lander - Show less +11 more•Institutions (3)

Massachusetts Institute of Technology¹, Broad Institute², Harvard University³

07 Aug 2008-Nature

TL;DR: Low-throughput reduced representation bisulphite sequencing is established as a powerful technology for epigenetic profiling of cell populations relevant to developmental biology, cancer and regenerative medicine.

...read moreread less

Abstract: DNA methylation is essential for normal development and has been implicated in many pathologies including cancer. Our knowledge about the genome-wide distribution of DNA methylation, how it changes during cellular differentiation and how it relates to histone methylation and other chromatin modifications in mammals remains limited. Here we report the generation and analysis of genome-scale DNA methylation profiles at nucleotide resolution in mammalian cells. Using high-throughput reduced representation bisulphite sequencing and single-molecule-based sequencing, we generated DNA methylation maps covering most CpG islands, and a representative sampling of conserved non-coding elements, transposons and other genomic features, for mouse embryonic stem cells, embryonic-stem-cell-derived and primary neural cells, and eight other primary tissues. Several key findings emerge from the data. First, DNA methylation patterns are better correlated with histone methylation patterns than with the underlying genome sequence context. Second, methylation of CpGs are dynamic epigenetic marks that undergo extensive changes during cellular differentiation, particularly in regulatory regions outside of core promoters. Third, analysis of embryonic-stem-cell-derived and primary cells reveals that 'weak' CpG islands associated with a specific set of developmentally regulated genes undergo aberrant hypermethylation during extended proliferation in vitro, in a pattern reminiscent of that reported in some primary tumours. More generally, the results establish reduced representation bisulphite sequencing as a powerful technology for epigenetic profiling of cell populations relevant to developmental biology, cancer and regenerative medicine.

...read moreread less

2,482 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data

[...]

Aaron McKenna¹, Matthew Hanna, Eric Banks, Andrey Sivachenko, Kristian Cibulskis, Andrew Kernytsky, Kiran V. Garimella, David Altshuler, Stacey Gabriel, Mark J. Daly, Mark A. DePristo - Show less +7 more•Institutions (1)

Broad Institute¹

01 Sep 2010-Genome Research

TL;DR: The GATK programming framework enables developers and analysts to quickly and easily write efficient and robust NGS tools, many of which have already been incorporated into large-scale sequencing projects like the 1000 Genomes Project and The Cancer Genome Atlas.

...read moreread less

Abstract: Next-generation DNA sequencing (NGS) projects, such as the 1000 Genomes Project, are already revolutionizing our understanding of genetic variation among individuals. However, the massive data sets generated by NGS—the 1000 Genome pilot alone includes nearly five terabases—make writing feature-rich, efficient, and robust analysis tools difficult for even computationally sophisticated individuals. Indeed, many professionals are limited in the scope and the ease with which they can answer scientific questions by the complexity of accessing and manipulating the data produced by these machines. Here, we discuss our Genome Analysis Toolkit (GATK), a structured programming framework designed to ease the development of efficient and robust analysis tools for next-generation DNA sequencers using the functional programming philosophy of MapReduce. The GATK provides a small but rich set of data access patterns that encompass the majority of analysis tool needs. Separating specific analysis calculations from common data management infrastructure enables us to optimize the GATK framework for correctness, stability, and CPU and memory efficiency and to enable distributed and shared memory parallelization. We highlight the capabilities of the GATK by describing the implementation and application of robust, scale-tolerant tools like coverage calculators and single nucleotide polymorphism (SNP) calling. We conclude that the GATK programming framework enables developers and analysts to quickly and easily write efficient and robust NGS tools, many of which have already been incorporated into large-scale sequencing projects like the 1000 Genomes Project and The Cancer Genome Atlas.

...read moreread less

20,557 citations

疟原虫var基因转换速率变化导致抗原变异[英]／Paul H, Robert P, Christodoulou Z, et al//Proc Natl Acad Sci U S A

[...]

宁北芳, 朱淮民

28 Jul 2005

TL;DR: PfPMP1）与感染红细胞、树突状组胞以及胎盘的单个或多个受体作用，在黏附及免疫逃避中起关键的作�ly.

...read moreread less

Abstract: 抗原变异可使得多种致病微生物易于逃避宿主免疫应答。表达在感染红细胞表面的恶性疟原虫红细胞表面蛋白1（PfPMP1）与感染红细胞、内皮细胞、树突状细胞以及胎盘的单个或多个受体作用，在黏附及免疫逃避中起关键的作用。每个单倍体基因组var基因家族编码约60种成员，通过启动转录不同的var基因变异体为抗原变异提供了分子基础。

...read moreread less

18,940 citations

Journal Article•DOI•

MicroRNAs: Target Recognition and Regulatory Functions

[...]

David P. Bartel¹•Institutions (1)

Massachusetts Institute of Technology¹

23 Jan 2009-Cell

TL;DR: The current understanding of miRNA target recognition in animals is outlined and the widespread impact of miRNAs on both the expression and evolution of protein-coding genes is discussed.

...read moreread less

18,036 citations

Journal Article•DOI•

Full-length transcriptome assembly from RNA-Seq data without a reference genome.

[...]

Manfred Grabherr¹, Brian J. Haas¹, Moran Yassour², Moran Yassour¹, Joshua Z. Levin¹, Dawn Thompson¹, Ido Amit¹, Xian Adiconis¹, Lin Fan¹, Raktima Raychowdhury¹, Qiandong Zeng¹, Zehua Chen¹, Evan Mauceli¹, Nir Hacohen¹, Andreas Gnirke¹, Nicholas Rhind³, Federica Di Palma¹, Bruce W. Birren¹, Chad Nusbaum¹, Kerstin Lindblad-Toh⁴, Kerstin Lindblad-Toh¹, Nir Friedman², Aviv Regev¹ - Show less +19 more•Institutions (4)

Massachusetts Institute of Technology¹, Hebrew University of Jerusalem², University of Massachusetts Medical School³, Science for Life Laboratory⁴

01 Jul 2011-Nature Biotechnology

TL;DR: The Trinity method for de novo assembly of full-length transcripts and evaluate it on samples from fission yeast, mouse and whitefly, whose reference genome is not yet available, providing a unified solution for transcriptome reconstruction in any sample.

...read moreread less

Abstract: Massively parallel sequencing of cDNA has enabled deep and efficient probing of transcriptomes. Current approaches for transcript reconstruction from such data often rely on aligning reads to a reference genome, and are thus unsuitable for samples with a partial or missing reference genome. Here we present the Trinity method for de novo assembly of full-length transcripts and evaluate it on samples from fission yeast, mouse and whitefly, whose reference genome is not yet available. By efficiently constructing and analyzing sets of de Bruijn graphs, Trinity fully reconstructs a large fraction of transcripts, including alternatively spliced isoforms and transcripts from recently duplicated genes. Compared with other de novo transcriptome assemblers, Trinity recovers more full-length transcripts across a broad range of expression levels, with a sensitivity similar to methods that rely on genome alignments. Our approach provides a unified solution for transcriptome reconstruction in any sample, especially in the absence of a reference genome.

...read moreread less

15,665 citations

Journal Article•DOI•

RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome

[...]

Bo Li¹, Colin N. Dewey¹•Institutions (1)

University of Wisconsin-Madison¹

04 Aug 2011-BMC Bioinformatics

TL;DR: It is shown that accurate gene-level abundance estimates are best obtained with large numbers of short single-end reads, and estimates of the relative frequencies of isoforms within single genes may be improved through the use of paired- end reads, depending on the number of possible splice forms for each gene.

...read moreread less

Abstract: RNA-Seq is revolutionizing the way transcript abundances are measured. A key challenge in transcript quantification from RNA-Seq data is the handling of reads that map to multiple genes or isoforms. This issue is particularly important for quantification with de novo transcriptome assemblies in the absence of sequenced genomes, as it is difficult to determine which transcripts are isoforms of the same gene. A second significant issue is the design of RNA-Seq experiments, in terms of the number of reads, read length, and whether reads come from one or both ends of cDNA fragments. We present RSEM, an user-friendly software package for quantifying gene and isoform abundances from single-end or paired-end RNA-Seq data. RSEM outputs abundance estimates, 95% credibility intervals, and visualization files and can also simulate RNA-Seq data. In contrast to other existing tools, the software does not require a reference genome. Thus, in combination with a de novo transcriptome assembler, RSEM enables accurate transcript quantification for species without sequenced genomes. On simulated and real data sets, RSEM has superior or comparable performance to quantification methods that rely on a reference genome. Taking advantage of RSEM's ability to effectively use ambiguously-mapping reads, we show that accurate gene-level abundance estimates are best obtained with large numbers of short single-end reads. On the other hand, estimates of the relative frequencies of isoforms within single genes may be improved through the use of paired-end reads, depending on the number of possible splice forms for each gene. RSEM is an accurate and user-friendly software tool for quantifying transcript abundances from RNA-Seq data. As it does not rely on the existence of a reference genome, it is particularly useful for quantification with de novo transcriptome assemblies. In addition, RSEM has enabled valuable guidance for cost-efficient design of quantification experiments with RNA-Seq, which is currently relatively expensive.

...read moreread less

14,524 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse