Home
/
Authors
/
Cheryl Heiner

Author

Cheryl Heiner

Bio: Cheryl Heiner is an academic researcher from Pacific Biosciences. The author has contributed to research in topics: Genome & Nucleic acid. The author has an hindex of 17, co-authored 41 publications receiving 7375 citations.

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data

[...]

Chen-Shan Chin¹, David Alexander¹, Patrick Marks¹, Aaron Klammer¹, James P Drake¹, Cheryl Heiner¹, Alicia Clum², Alex Copeland², John Huddleston³, Evan E. Eichler³, Stephen Turner¹, Jonas Korlach¹ - Show less +8 more•Institutions (3)

Pacific Biosciences¹, Joint Genome Institute², University of Washington³

01 Jun 2013-Nature Methods

TL;DR: This work presents a hierarchical genome-assembly process (HGAP) for high-quality de novo microbial genome assemblies using only a single, long-insert shotgun DNA library in conjunction with Single Molecule, Real-Time (SMRT) DNA sequencing.

...read moreread less

Abstract: We present a hierarchical genome-assembly process (HGAP) for high-quality de novo microbial genome assemblies using only a single, long-insert shotgun DNA library in conjunction with Single Molecule, Real-Time (SMRT) DNA sequencing. Our method uses the longest reads as seeds to recruit all other reads for construction of highly accurate preassembled reads through a directed acyclic graph-based consensus procedure, which we follow with assembly using off-the-shelf long-read assemblers. In contrast to hybrid approaches, HGAP does not require highly accurate raw reads for error correction. We demonstrate efficient genome assembly for several microorganisms using as few as three SMRT Cell zero-mode waveguide arrays of sequencing and for BACs using just one SMRT Cell. Long repeat regions can be successfully resolved with this workflow. We also describe a consensus algorithm that incorporates SMRT sequencing primary quality values to produce de novo genome sequence exceeding 99.999% accuracy.

...read moreread less

3,647 citations

Journal Article•DOI•

Real-Time DNA Sequencing from Single Polymerase Molecules

[...]

John Eid¹, Adrian Fehr¹, Jeremy Gray¹, Khai Luong¹, John Lyle¹, Geoff Otto¹, Paul Peluso¹, David R. Rank¹, Primo Baybayan¹, Brad Bettman¹, Arkadiusz Bibillo¹, Keith Bjornson¹, Bidhan Chaudhuri¹, Fred Christians¹, Ronald L. Cicero¹, Sonya Clark¹, Ravindra V. Dalal¹, Alex DeWinter¹, John Dixon¹, Mathieu Foquet¹, Alfred Gaertner¹, Paul Hardenbol¹, Cheryl Heiner¹, Kevin Hester¹, David P. Holden¹, Gregory J. Kearns¹, Xiangxu Kong¹, Ronald Kuse¹, Yves Lacroix¹, Steven Lin¹, Paul Lundquist¹, Congcong Ma¹, Patrick Marks¹, Mark Maxham¹, Devon Murphy¹, Insil Park¹, Thang Pham¹, Michael Phillips¹, Joy Roy¹, Robert Sebra¹, Gene Shen¹, Jon M. Sorenson¹, Austin B. Tomaney¹, Kevin Travers¹, Mark Trulson¹, John Vieceli¹, Jeffrey Wegener¹, Dawn Wu¹, Alicia Yang¹, Denis Zaccarin¹, Peter Zhao¹, Frank Zhong¹, Jonas Korlach¹, Stephen Turner¹ - Show less +50 more•Institutions (1)

Pacific Biosciences¹

02 Jan 2009-Science

TL;DR: Single-molecule, real-time sequencing data obtained from a DNA polymerase performing uninterrupted template-directed synthesis using four distinguishable fluorescently labeled deoxyribonucleoside triphosphates (dNTPs) are presented.

...read moreread less

Abstract: We present single-molecule, real-time sequencing data obtained from a DNA polymerase performing uninterrupted template-directed synthesis using four distinguishable fluorescently labeled deoxyribonucleoside triphosphates (dNTPs). We detected the temporal order of their enzymatic incorporation into a growing DNA strand with zero-mode waveguide nanostructure arrays, which provide optical observation volume confinement and enable parallel, simultaneous detection of thousands of single-molecule sequencing reactions. Conjugation of fluorophores to the terminal phosphate moiety of the dNTPs allows continuous observation of DNA synthesis over thousands of bases without steric hindrance. The data report directly on polymerase dynamics, revealing distinct polymerization states and pause sites corresponding to DNA secondary structure. Sequence data were aligned with the known reference sequence to assay biophysical parameters of polymerization for each template position. Consensus sequences were generated from the single-molecule reads at 15-fold coverage, showing a median accuracy of 99.3%, with no systematic error beyond fluorophore-dependent error rates.

...read moreread less

3,346 citations

Journal Article•DOI•

High-throughput amplicon sequencing of the full-length 16S rRNA gene with single-nucleotide resolution.

[...]

Benjamin J. Callahan¹, Joan Wong², Cheryl Heiner², Steve Oh², Casey M. Theriot¹, Ajay S. Gulati³, Sarah K. McGill³, Michael K. Dougherty³ - Show less +4 more•Institutions (3)

North Carolina State University¹, Pacific Biosciences², University of North Carolina at Chapel Hill³

10 Oct 2019-Nucleic Acids Research

TL;DR: A high-throughput amplicon sequencing methodology based on PacBio circular consensus sequencing and the DADA2 sample inference method that measures the full-length 16S rRNA gene with single-nucleotide resolution and a near-zero error rate is presented.

...read moreread less

Abstract: Targeted PCR amplification and high-throughput sequencing (amplicon sequencing) of 16S rRNA gene fragments is widely used to profile microbial communities. New long-read sequencing technologies can sequence the entire 16S rRNA gene, but higher error rates have limited their attractiveness when accuracy is important. Here we present a high-throughput amplicon sequencing methodology based on PacBio circular consensus sequencing and the DADA2 sample inference method that measures the full-length 16S rRNA gene with single-nucleotide resolution and a near-zero error rate. In two artificial communities of known composition, our method recovered the full complement of full-length 16S sequence variants from expected community members without residual errors. The measured abundances of intra-genomic sequence variants were in the integral ratios expected from the genuine allelic variants within a genome. The full-length 16S gene sequences recovered by our approach allowed Escherichia coli strains to be correctly classified to the O157:H7 and K12 sub-species clades. In human fecal samples, our method showed strong technical replication and was able to recover the full complement of 16S rRNA alleles in several E. coli strains. There are likely many applications beyond microbial profiling for which high-throughput amplicon sequencing of complete genes with single-nucleotide resolution will be of use.

...read moreread less

263 citations

Patent•

Compositions and methods for nucleic acid sequencing

[...]

Kevin Travers¹, Geoff Otto¹, Stephen Turner¹, Cheryl Heiner¹, Congcong Ma¹ - Show less +1 more•Institutions (1)

Pacific Biosciences¹

27 Mar 2009

TL;DR: In this paper, the use and preparation of template constructions for nucleic acid sequencing is described, as well as methods for their use in the development and preparation. But they do not discuss the use of these constructions in their application.

...read moreread less

Abstract: Compositions and methods for nucleic acid sequencing include template constructs that comprise double stranded portions in a partially or completely contiguous constructs, to provide for redundant sequence determination through one or both of sequencing sense and antisense strands, and iteratively sequencing the entire construct multiple times. Additional sequence components are also optionally included within such template constructs. Methods are also provided for the use and preparation of these constructs as well as sequencing compositions for their application.

...read moreread less

210 citations

Journal Article•DOI•

The Diversity, Structure, and Function of Heritable Adaptive Immunity Sequences in the Aedes aegypti Genome

[...]

Zachary J. Whitfield¹, Patrick T. Dolan¹, Mark Kunitomi¹, Michel Tassetto¹, Matthew Seetin², Steve Oh², Cheryl Heiner², Ellen E. Paxinos², Raul Andino¹ - Show less +5 more•Institutions (2)

University of California, San Francisco¹, Pacific Biosciences²

20 Nov 2017-Current Biology

TL;DR: It is proposed that comparisons of EVEs across mosquito populations may explain differences in vector competence, and further study of the structure and function of these elements in the genome of mosquitoes may lead to epidemiological interventions.

...read moreread less

145 citations

1
2
3
4
…
5
6
7
8
9

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Fast and accurate short read alignment with Burrows–Wheeler transform

[...]

Heng Li¹, Richard Durbin¹•Institutions (1)

Wellcome Trust Sanger Institute¹

01 Jul 2009-Bioinformatics

TL;DR: Burrows-Wheeler Alignment tool (BWA) is implemented, a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps.

...read moreread less

Abstract: Motivation: The enormous amount of short reads generated by the new DNA sequencing technologies call for the development of fast and accurate read alignment programs. A first generation of hash table-based methods has been developed, including MAQ, which is accurate, feature rich and fast enough to align short reads from a single individual. However, MAQ does not support gapped alignment for single-end reads, which makes it unsuitable for alignment of longer reads where indels may occur frequently. The speed of MAQ is also a concern when the alignment is scaled up to the resequencing of hundreds of individuals. Results: We implemented Burrows-Wheeler Alignment tool (BWA), a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps. BWA supports both base space reads, e.g. from Illumina sequencing machines, and color space reads from AB SOLiD machines. Evaluations on both simulated and real data suggest that BWA is ~10–20× faster than MAQ, while achieving similar accuracy. In addition, BWA outputs alignment in the new standard SAM (Sequence Alignment/Map) format. Variant calling and other downstream analyses after the alignment can be achieved with the open source SAMtools software package. Availability: http://maq.sourceforge.net Contact: [email protected]

...read moreread less

43,862 citations

Journal Article•DOI•

A global reference for human genetic variation.

[...]

Adam Auton¹, Gonçalo R. Abecasis², David Altshuler³, Richard Durbin⁴ +514 more•Institutions (90)

01 Oct 2015-Nature

TL;DR: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations, and has reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-generation sequencing, deep exome sequencing, and dense microarray genotyping.

...read moreread less

Abstract: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.

...read moreread less

12,661 citations

Journal Article•DOI•

Sequencing technologies-the next generation

[...]

Michael L. Metzker¹•Institutions (1)

Baylor College of Medicine¹

01 Jan 2010-Nature Reviews Genetics

TL;DR: A technical review of template preparation, sequencing and imaging, genome alignment and assembly approaches, and recent advances in current and near-term commercially available NGS instruments is presented.

...read moreread less

Abstract: Demand has never been greater for revolutionary technologies that deliver fast, inexpensive and accurate genome information. This challenge has catalysed the development of next-generation sequencing (NGS) technologies. The inexpensive production of large volumes of sequence data is the primary advantage over conventional methods. Here, I present a technical review of template preparation, sequencing and imaging, genome alignment and assembly approaches, and recent advances in current and near-term commercially available NGS instruments. I also outline the broad range of applications for NGS technologies, in addition to providing guidelines for platform selection to address biological questions of interest.

...read moreread less

7,023 citations

Journal Article•DOI•

De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis

[...]

Brian J. Haas¹, Alexie Papanicolaou², Moran Yassour³, Moran Yassour⁴, Manfred Grabherr⁵, Philip D. Blood⁶, Joshua C. Bowden², M. B. Couger⁷, David Eccles⁸, Bo Li⁹, Matthias Lieber¹⁰, Matthew D. MacManes¹¹, Michael Ott², Joshua Orvis, Nathalie Pochet⁴, Nathalie Pochet¹², Francesco Strozzi¹³, Nathan T. Weeks¹⁴, Rick Westerman¹⁵, Thomas William, Colin N. Dewey⁹, Robert Henschel¹⁶, Richard D. LeDuc¹⁶, Nir Friedman³, Aviv Regev⁴ - Show less +21 more•Institutions (16)

Broad Institute¹, Commonwealth Scientific and Industrial Research Organisation², Hebrew University of Jerusalem³, Massachusetts Institute of Technology⁴, Science for Life Laboratory⁵, Pittsburgh Supercomputing Center⁶, Oklahoma State University–Stillwater⁷, Griffith University⁸, University of Wisconsin-Madison⁹, Dresden University of Technology¹⁰, California Institute for Quantitative Biosciences¹¹, Flanders Institute for Biotechnology¹², Parco Tecnologico Padano¹³, United States Department of Agriculture¹⁴, Purdue University¹⁵, Indiana University¹⁶

01 Aug 2013-Nature Protocols

TL;DR: This protocol provides a workflow for genome-independent transcriptome analysis leveraging the Trinity platform and presents Trinity-supported companion utilities for downstream applications, including RSEM for transcript abundance estimation, R/Bioconductor packages for identifying differentially expressed transcripts across samples and approaches to identify protein-coding genes.

...read moreread less

Abstract: De novo assembly of RNA-seq data enables researchers to study transcriptomes without the need for a genome sequence; this approach can be usefully applied, for instance, in research on 'non-model organisms' of ecological and evolutionary importance, cancer samples or the microbiome. In this protocol we describe the use of the Trinity platform for de novo transcriptome assembly from RNA-seq data in non-model organisms. We also present Trinity-supported companion utilities for downstream applications, including RSEM for transcript abundance estimation, R/Bioconductor packages for identifying differentially expressed transcripts across samples and approaches to identify protein-coding genes. In the procedure, we provide a workflow for genome-independent transcriptome analysis leveraging the Trinity platform. The software, documentation and demonstrations are freely available from http://trinityrnaseq.sourceforge.net. The run time of this protocol is highly dependent on the size and complexity of data to be analyzed. The example data set analyzed in the procedure detailed herein can be processed in less than 5 h.

...read moreread less

6,369 citations

Journal Article•DOI•

Evaluation of general 16S ribosomal RNA gene PCR primers for classical and next-generation sequencing-based diversity studies

[...]

Anna Klindworth¹, Elmar Pruesse², Timmy Schweer², Jörg Peplies², Christian Quast², Matthias Horn², Frank Oliver Glöckner² - Show less +3 more•Institutions (2)

Max Planck Society¹, Jacobs University Bremen²

01 Jan 2013-Nucleic Acids Research

TL;DR: The results of this study may be used as a guideline for selecting primer pairs with the best overall coverage and phylum spectrum for specific applications, therefore reducing the bias in PCR-based microbial diversity studies.

...read moreread less

Abstract: 16S ribosomal RNA gene (rDNA) amplicon analysis remains the standard approach for the cultivation-independent investigation of microbial diversity. The accuracy of these analyses depends strongly on the choice of primers. The overall coverage and phylum spectrum of 175 primers and 512 primer pairs were evaluated in silico with respect to the SILVA 16S/18S rDNA non-redundant reference dataset (SSURef 108 NR). Based on this evaluation a selection of 'best available' primer pairs for Bacteria and Archaea for three amplicon size classes (100-400, 400-1000, ≥ 1000 bp) is provided. The most promising bacterial primer pair (S-D-Bact-0341-b-S-17/S-D-Bact-0785-a-A-21), with an amplicon size of 464 bp, was experimentally evaluated by comparing the taxonomic distribution of the 16S rDNA amplicons with 16S rDNA fragments from directly sequenced metagenomes. The results of this study may be used as a guideline for selecting primer pairs with the best overall coverage and phylum spectrum for specific applications, therefore reducing the bias in PCR-based microbial diversity studies.

...read moreread less

5,346 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse