Home
/
Authors
/
Bruce W. Birren

Author

Bruce W. Birren

Other affiliations: Massachusetts Institute of Technology, California Institute of Technology, Bio-Rad Laboratories

Bio: Bruce W. Birren is an academic researcher from Broad Institute. The author has contributed to research in topics: Genome & Gene. The author has an hindex of 103, co-authored 205 publications receiving 113491 citations. Previous affiliations of Bruce W. Birren include Massachusetts Institute of Technology & California Institute of Technology.

Topics: Genome, Gene, Genomics, Population, Human genome ...read more

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995
1994
1992
1991
1987
1986
1983

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Evaluation of 16s rDNA-based community profiling for human microbiome research

[...]

Doyle V. Ward¹, Dirk Gevers¹, Georgia Giannoukos¹, Ashlee M. Earl¹, Barbara A. Methé², Erica Sodergren³, Michael Feldgarden¹, Dawn Ciulla¹, Diana Tabbaa¹, Cesar Arze⁴, Elizabeth L. Appelbaum³, Leigh Aird¹, Scott Anderson¹, Tulin Ayvaz⁵, Edward A. Belter³, Monika Bihan², Toby Bloom¹, Jonathan Crabtree⁴, Laura Courtney³, Lynn K. Carmichael³, David J. Dooling³, Rachel L. Erlich¹, Candace N. Farmer³, Lucinda Fulton³, Robert S. Fulton³, Hongyu Gao³, John Gill², Brian J. Haas¹, Lisa Hemphill⁵, Otis Hall³, Susanna Hamilton¹, Theresa A. Hepburn¹, Niall J. Lennon¹, Vandita Joshi⁵, Cristyn Kells¹, Christie Kovar⁵, Divya Kalra⁵, Kelvin Li², Lora Lewis⁵, Shawn Leonard³, Donna M. Muzny⁵, Elaine R. Mardis³, Kathie A. Mihindukulasuriya³, Vincent Magrini³, Michelle O'Laughlin³, Craig Pohl³, Xiang Qin⁵, Keenan Ross¹, Matthew C. Ross⁵, Yu Hui A. Rogers², Navjeet Singh⁶, Yue Shang⁵, Katarzyna Wilczek-Boney⁵, Jennifer R. Wortman⁴, Kim C. Worley⁵, Bonnie P. Youmans, Shibu Yooseph², Yanjiao Zhou³, Patrick D. Schloss⁷, Richard K. Wilson³, Richard A. Gibbs⁵, Karen E. Nelson², George M. Weinstock³, Todd Z. DeSantis⁶, Joseph F. Petrosino⁵, Sarah K. Highlander⁵, Bruce W. Birren¹ - Show less +63 more•Institutions (7)

Broad Institute¹, J. Craig Venter Institute², Washington University in St. Louis³, University of Maryland, Baltimore⁴, Baylor College of Medicine⁵, Lawrence Berkeley National Laboratory⁶, University of Michigan⁷

13 Jun 2012-PLOS ONE

TL;DR: The data production protocols used for this work are those used by the participating centers to produce 16S rDNA sequence for the Human Microbiome Project, and these results can be informative for interpreting the large body of clinical 16s rDNA data produced for this project.

...read moreread less

Abstract: The Human Microbiome Project will establish a reference data set for analysis of the microbiome of healthy adults by surveying multiple body sites from 300 people and generating data from over 12,000 samples. To characterize these samples, the participating sequencing centers evaluated and adopted 16S rDNA community profiling protocols for ABI 3730 and 454 FLX Titanium sequencing. In the course of establishing protocols, we examined the performance and error characteristics of each technology, and the relationship of sequence error to the utility of 16S rDNA regions for classification- and OTU-based analysis of community structure. The data production protocols used for this work are those used by the participating centers to produce 16S rDNA sequence for the Human Microbiome Project. Thus, these results can be informative for interpreting the large body of clinical 16S rDNA data produced for this project.

...read moreread less

285 citations

Journal Article•DOI•

Whole genome amplification and de novo assembly of single bacterial cells.

[...]

Sébastien Rodrigue¹, Rex R. Malmstrom¹, Aaron M. Berlin², Bruce W. Birren², Matthew R. Henn², Sallie W. Chisholm¹ - Show less +2 more•Institutions (2)

Massachusetts Institute of Technology¹, Broad Institute²

02 Sep 2009-PLOS ONE

TL;DR: A pipeline that enables single-cell WGA on hundreds of cells at a time while virtually eliminating non-target DNA from the reactions is described and a post-amplification normalization procedure that mitigates extreme variations in sequencing coverage associated with multiple displacement amplification is developed.

...read moreread less

Abstract: Background: Single-cell genome sequencing has the potential to allow the in-depth exploration of the vast genetic diversity found in uncultured microbes. We used the marine cyanobacterium Prochlorococcus as a model system for addressing important challenges facing high-throughput whole genome amplification (WGA) and complete genome sequencing of individual cells. Methodology/Principal Findings: We describe a pipeline that enables single-cell WGA on hundreds of cells at a time while virtually eliminating non-target DNA from the reactions. We further developed a post-amplification normalization procedure that mitigates extreme variations in sequencing coverage associated with multiple displacement amplification (MDA), and demonstrated that the procedure increased sequencing efficiency and facilitated genome assembly. We report genome recovery as high as 99.6% with reference-guided assembly, and 95% with de novo assembly starting from a single cell. We also analyzed the impact of chimera formation during MDA on de novo assembly, and discuss strategies to minimize the presence of incorrectly joined regions in contigs. Conclusions/Significance: The methods describe in this paper will be useful for sequencing genomes of individual cells from a variety of samples.

...read moreread less

279 citations

Journal Article•DOI•

Genomic epidemiology of the Escherichia coli O104:H4 outbreaks in Europe, 2011

[...]

Yonatan H. Grad¹, Marc Lipsitch², Marc Lipsitch³, Michael Feldgarden², Harindra Arachchi², Gustavo C. Cerqueira², Michael Fitzgerald², Paul A. Godfrey², Brian J. Haas², Cheryl I. Murphy², Carsten Russ², Sean M. Sykes², Bruce J. Walker², Jennifer R. Wortman², Sarah Young², Qiandong Zeng², Amr Abouelleil², James Bochicchio², Sara Chauvin², Timothy DeSmet², Sharvari Gujja², Caryn McCowan², Anna Montmayeur², Scott Steelman², Jakob Frimodt-Møller⁴, Andreas Petersen⁵, Carsten Struve, Karen A. Krogfelt, Edouard Bingen⁶, François-Xavier Weill, Eric S. Lander³, Chad Nusbaum, Bruce W. Birren, Deborah T. Hung, William P. Hanage² - Show less +31 more•Institutions (6)

Brigham and Women's Hospital¹, Broad Institute², Harvard University³, Statens Serum Institut⁴, Hvidovre Hospital⁵, Sorbonne⁶

21 Feb 2012-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: The striking difference in diversity between the German and French outbreak samples is consistent with several hypotheses, including a bottleneck that purged diversity in the German isolates, variation in mutation rates in the two E. coli outbreak populations, or uneven distribution of Diversity in the seed populations that led to each outbreak.

...read moreread less

Abstract: The degree to which molecular epidemiology reveals information about the sources and transmission patterns of an outbreak depends on the resolution of the technology used and the samples studied. Isolates of Escherichia coli O104:H4 from the outbreak centered in Germany in May–July 2011, and the much smaller outbreak in southwest France in June 2011, were indistinguishable by standard tests. We report a molecular epidemiological analysis using multiplatform whole-genome sequencing and analysis of multiple isolates from the German and French outbreaks. Isolates from the German outbreak showed remarkably little diversity, with only two single nucleotide polymorphisms (SNPs) found in isolates from four individuals. Surprisingly, we found much greater diversity (19 SNPs) in isolates from seven individuals infected in the French outbreak. The German isolates form a clade within the more diverse French outbreak strains. Moreover, five isolates derived from a single infected individual from the French outbreak had extremely limited diversity. The striking difference in diversity between the German and French outbreak samples is consistent with several hypotheses, including a bottleneck that purged diversity in the German isolates, variation in mutation rates in the two E. coli outbreak populations, or uneven distribution of diversity in the seed populations that led to each outbreak.

...read moreread less

279 citations

Journal Article•DOI•

Stable propagation of cosmid sized human DNA inserts in an F factor based vector.

[...]

Ung Jin Kim¹, Hiroaki Shizuya, Pieter J. de Jong², Bruce W. Birren, Melvin I. Simon - Show less +1 more•Institutions (2)

California Institute of Technology¹, Lawrence Livermore National Laboratory²

11 Mar 1992-Nucleic Acids Research

TL;DR: It is found that the clones based on Fosmid vector undergo detectable changes at a greatly reduced frequency and sequences that undergo drastic rearrangements and deletions during propagation in a conventional vector were stably propagated when recloned as Fosmids.

...read moreread less

Abstract: Instability of complex mammalian genomic DNA inserts is commonplace in cosmid libraries constructed in conventional multicopy vectors. To develop a means to construct stable libraries, we have developed a low copy number cosmid vector based on the E.coli F factor replicon (Fosmid). We have tested relative stability of human DNA inserts in Fosmlds and in two conventional multicopy vectors (Lawrist 16 and Supercos) by comparing the frequency of changes In restriction patterns of the inserts after propagating randomly picked human genomic clones based on these vectors. We found that the clones based on Fosmid vector undergo detectable changes at a greatly reduced frequency. We also observed that sequences that undergo drastic rearrangements and deletions during propagation In a conventional vector were stably propagated when recloned as Fosmids. The results indicate that Fosmid system may be useful for constructing stable libraries from complex genomes.

...read moreread less

278 citations

Journal Article•DOI•

Dothideomycete-Plant Interactions Illuminated by Genome Sequencing and EST Analysis of the Wheat Pathogen Stagonospora nodorum

[...]

James K. Hane¹, Rohan G. T. Lowe¹, Peter S. Solomon¹, Kar-Chun Tan¹, Conrad L. Schoch², Joseph W. Spatafora², Pedro W. Crous³, Chinappa Kodira⁴, Bruce W. Birren⁴, James E. Galagan⁴, Stefano F.F. Torriani, Bruce A. McDonald, Richard P. Oliver¹ - Show less +9 more•Institutions (4)

Murdoch University¹, Oregon State University², Centraalbureau voor Schimmelcultures³, Broad Institute⁴

01 Nov 2007-The Plant Cell

TL;DR: Statistical analysis shows that transcripts encoding proteins involved in protein synthesis and in the production of extracellular proteases, cellulases, and xylanases predominate in the infection library, suggesting that the fungus is dependant on the degradation of wheat macromolecular constituents to provide the carbon skeletons and energy for the synthesis of proteins and other components destined for the developing pycnidiospores.

...read moreread less

Abstract: Stagonospora nodorum is a major necrotrophic fungal pathogen of wheat (Triticum aestivum) and a member of the Dothideomycetes, a large fungal taxon that includes many important plant pathogens affecting all major crop plant families. Here, we report the acquisition and initial analysis of a draft genome sequence for this fungus. The assembly comprises 37,164,227 bp of nuclear DNA contained in 107 scaffolds. The circular mitochondrial genome comprises 49,761 bp encoding 46 genes, including four that are intron encoded. The nuclear genome assembly contains 26 classes of repetitive DNA, comprising 4.5% of the genome. Some of the repeats show evidence of repeat-induced point mutations consistent with a frequent sexual cycle. ESTs and gene prediction models support a minimum of 10,762 nuclear genes. Extensive orthology was found between the polyketide synthase family in S. nodorum and Cochliobolus heterostrophus, suggesting an ancient origin and conserved functions for these genes. A striking feature of the gene catalog was the large number of genes predicted to encode secreted proteins; the majority has no meaningful similarity to any other known genes. It is likely that genes for host-specific toxins, in addition to ToxA, will be found among this group. ESTs obtained from axenic mycelium grown on oleate (chosen to mimic early infection) and late-stage lesions sporulating on wheat leaves were obtained. Statistical analysis shows that transcripts encoding proteins involved in protein synthesis and in the production of extracellular proteases, cellulases, and xylanases predominate in the infection library. This suggests that the fungus is dependant on the degradation of wheat macromolecular constituents to provide the carbon skeletons and energy for the synthesis of proteins and other components destined for the developing pycnidiospores.

...read moreread less

275 citations

1
2
3
4
5
6
7
…
8
9
10
11
12
13
14
…
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Initial sequencing and analysis of the human genome.

[...]

Eric S. Lander¹, Lauren Linton¹, Bruce W. Birren¹, Chad Nusbaum¹ +245 more•Institutions (29)

15 Feb 2001-Nature

TL;DR: The results of an international collaboration to produce and make freely available a draft sequence of the human genome are reported and an initial analysis is presented, describing some of the insights that can be gleaned from the sequence.

...read moreread less

Abstract: The human genome holds an extraordinary trove of information about human development, physiology, medicine and evolution. Here we report the results of an international collaboration to produce and make freely available a draft sequence of the human genome. We also present an initial analysis of the data, describing some of the insights that can be gleaned from the sequence.

...read moreread less

22,269 citations

Journal Article•DOI•

The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data

[...]

Aaron McKenna¹, Matthew Hanna, Eric Banks, Andrey Sivachenko, Kristian Cibulskis, Andrew Kernytsky, Kiran V. Garimella, David Altshuler, Stacey Gabriel, Mark J. Daly, Mark A. DePristo - Show less +7 more•Institutions (1)

Broad Institute¹

01 Sep 2010-Genome Research

TL;DR: The GATK programming framework enables developers and analysts to quickly and easily write efficient and robust NGS tools, many of which have already been incorporated into large-scale sequencing projects like the 1000 Genomes Project and The Cancer Genome Atlas.

...read moreread less

Abstract: Next-generation DNA sequencing (NGS) projects, such as the 1000 Genomes Project, are already revolutionizing our understanding of genetic variation among individuals. However, the massive data sets generated by NGS—the 1000 Genome pilot alone includes nearly five terabases—make writing feature-rich, efficient, and robust analysis tools difficult for even computationally sophisticated individuals. Indeed, many professionals are limited in the scope and the ease with which they can answer scientific questions by the complexity of accessing and manipulating the data produced by these machines. Here, we discuss our Genome Analysis Toolkit (GATK), a structured programming framework designed to ease the development of efficient and robust analysis tools for next-generation DNA sequencers using the functional programming philosophy of MapReduce. The GATK provides a small but rich set of data access patterns that encompass the majority of analysis tool needs. Separating specific analysis calculations from common data management infrastructure enables us to optimize the GATK framework for correctness, stability, and CPU and memory efficiency and to enable distributed and shared memory parallelization. We highlight the capabilities of the GATK by describing the implementation and application of robust, scale-tolerant tools like coverage calculators and single nucleotide polymorphism (SNP) calling. We conclude that the GATK programming framework enables developers and analysts to quickly and easily write efficient and robust NGS tools, many of which have already been incorporated into large-scale sequencing projects like the 1000 Genomes Project and The Cancer Genome Atlas.

...read moreread less

20,557 citations

疟原虫var基因转换速率变化导致抗原变异[英]／Paul H, Robert P, Christodoulou Z, et al//Proc Natl Acad Sci U S A

[...]

宁北芳, 朱淮民

28 Jul 2005

TL;DR: PfPMP1）与感染红细胞、树突状组胞以及胎盘的单个或多个受体作用，在黏附及免疫逃避中起关键的作�ly.

...read moreread less

Abstract: 抗原变异可使得多种致病微生物易于逃避宿主免疫应答。表达在感染红细胞表面的恶性疟原虫红细胞表面蛋白1（PfPMP1）与感染红细胞、内皮细胞、树突状细胞以及胎盘的单个或多个受体作用，在黏附及免疫逃避中起关键的作用。每个单倍体基因组var基因家族编码约60种成员，通过启动转录不同的var基因变异体为抗原变异提供了分子基础。

...read moreread less

18,940 citations

Journal Article•DOI•

SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing

[...]

Anton Bankevich¹, Sergey Nurk, Dmitry Antipov, Alexey Gurevich, Mikhail Dvorkin, Alexander S. Kulikov, Valery M. Lesin, Sergey I. Nikolenko, Son Pham, Andrey D. Prjibelski, Alexey V. Pyshkin, Alexander Sirotkin, Nikolay Vyahhi, Glenn Tesler, Max A. Alekseyev, Pavel A. Pevzner - Show less +12 more•Institutions (1)

Saint Petersburg Academic University¹

07 May 2012-Journal of Computational Biology

TL;DR: SPAdes generates single-cell assemblies, providing information about genomes of uncultivatable bacteria that vastly exceeds what may be obtained via traditional metagenomics studies.

...read moreread less

Abstract: The lion's share of bacteria in various environments cannot be cloned in the laboratory and thus cannot be sequenced using existing technologies. A major goal of single-cell genomics is to complement gene-centric metagenomic data with whole-genome assemblies of uncultivated organisms. Assembly of single-cell data is challenging because of highly non-uniform read coverage as well as elevated levels of sequencing errors and chimeric reads. We describe SPAdes, a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V−SC assembler (specialized for single-cell data) and on popular assemblers Velvet and SoapDeNovo (for multicell data). SPAdes generates single-cell assemblies, providing information about genomes of uncultivatable bacteria that vastly exceeds what may be obtained via traditional metagenomics studies. SPAdes is available online (http://bioinf.spbau.ru/spades). It is distributed as open source software.

...read moreread less

16,859 citations

Journal Article•DOI•

Full-length transcriptome assembly from RNA-Seq data without a reference genome.

[...]

Manfred Grabherr¹, Brian J. Haas¹, Moran Yassour², Moran Yassour¹, Joshua Z. Levin¹, Dawn Thompson¹, Ido Amit¹, Xian Adiconis¹, Lin Fan¹, Raktima Raychowdhury¹, Qiandong Zeng¹, Zehua Chen¹, Evan Mauceli¹, Nir Hacohen¹, Andreas Gnirke¹, Nicholas Rhind³, Federica Di Palma¹, Bruce W. Birren¹, Chad Nusbaum¹, Kerstin Lindblad-Toh¹, Kerstin Lindblad-Toh⁴, Nir Friedman², Aviv Regev¹ - Show less +19 more•Institutions (4)

Massachusetts Institute of Technology¹, Hebrew University of Jerusalem², University of Massachusetts Medical School³, Science for Life Laboratory⁴

01 Jul 2011-Nature Biotechnology

TL;DR: The Trinity method for de novo assembly of full-length transcripts and evaluate it on samples from fission yeast, mouse and whitefly, whose reference genome is not yet available, providing a unified solution for transcriptome reconstruction in any sample.

...read moreread less

Abstract: Massively parallel sequencing of cDNA has enabled deep and efficient probing of transcriptomes. Current approaches for transcript reconstruction from such data often rely on aligning reads to a reference genome, and are thus unsuitable for samples with a partial or missing reference genome. Here we present the Trinity method for de novo assembly of full-length transcripts and evaluate it on samples from fission yeast, mouse and whitefly, whose reference genome is not yet available. By efficiently constructing and analyzing sets of de Bruijn graphs, Trinity fully reconstructs a large fraction of transcripts, including alternatively spliced isoforms and transcripts from recently duplicated genes. Compared with other de novo transcriptome assemblers, Trinity recovers more full-length transcripts across a broad range of expression levels, with a sensitivity similar to methods that rely on genome alignments. Our approach provides a unified solution for transcriptome reconstruction in any sample, especially in the absence of a reference genome.

...read moreread less

15,665 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse