Home
/
Authors
/
Paul Peluso

Author

Paul Peluso

Bio: Paul Peluso is an academic researcher from Pacific Biosciences. The author has contributed to research in topics: Genome & Sequence assembly. The author has an hindex of 36, co-authored 72 publications receiving 11023 citations.

Topics: Genome, Sequence assembly, Genomics, Reference genome, Contig ...read more

Papers published on a yearly basis

2023
2021
2020
2019
2018
2017
2016
2014
2013
2012
2011
2009
2008
2007
2006
2005

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Real-Time DNA Sequencing from Single Polymerase Molecules

[...]

John Eid¹, Adrian Fehr¹, Jeremy Gray¹, Khai Luong¹, John Lyle¹, Geoff Otto¹, Paul Peluso¹, David R. Rank¹, Primo Baybayan¹, Brad Bettman¹, Arkadiusz Bibillo¹, Keith Bjornson¹, Bidhan Chaudhuri¹, Fred Christians¹, Ronald L. Cicero¹, Sonya Clark¹, Ravindra V. Dalal¹, Alex DeWinter¹, John Dixon¹, Mathieu Foquet¹, Alfred Gaertner¹, Paul Hardenbol¹, Cheryl Heiner¹, Kevin Hester¹, David P. Holden¹, Gregory J. Kearns¹, Xiangxu Kong¹, Ronald Kuse¹, Yves Lacroix¹, Steven Lin¹, Paul Lundquist¹, Congcong Ma¹, Patrick Marks¹, Mark Maxham¹, Devon Murphy¹, Insil Park¹, Thang Pham¹, Michael Phillips¹, Joy Roy¹, Robert Sebra¹, Gene Shen¹, Jon M. Sorenson¹, Austin B. Tomaney¹, Kevin Travers¹, Mark Trulson¹, John Vieceli¹, Jeffrey Wegener¹, Dawn Wu¹, Alicia Yang¹, Denis Zaccarin¹, Peter Zhao¹, Frank Zhong¹, Jonas Korlach¹, Stephen Turner¹ - Show less +50 more•Institutions (1)

Pacific Biosciences¹

02 Jan 2009-Science

TL;DR: Single-molecule, real-time sequencing data obtained from a DNA polymerase performing uninterrupted template-directed synthesis using four distinguishable fluorescently labeled deoxyribonucleoside triphosphates (dNTPs) are presented.

...read moreread less

Abstract: We present single-molecule, real-time sequencing data obtained from a DNA polymerase performing uninterrupted template-directed synthesis using four distinguishable fluorescently labeled deoxyribonucleoside triphosphates (dNTPs). We detected the temporal order of their enzymatic incorporation into a growing DNA strand with zero-mode waveguide nanostructure arrays, which provide optical observation volume confinement and enable parallel, simultaneous detection of thousands of single-molecule sequencing reactions. Conjugation of fluorophores to the terminal phosphate moiety of the dNTPs allows continuous observation of DNA synthesis over thousands of bases without steric hindrance. The data report directly on polymerase dynamics, revealing distinct polymerization states and pause sites corresponding to DNA secondary structure. Sequence data were aligned with the known reference sequence to assay biophysical parameters of polymerization for each template position. Consensus sequences were generated from the single-molecule reads at 15-fold coverage, showing a median accuracy of 99.3%, with no systematic error beyond fluorophore-dependent error rates.

...read moreread less

3,346 citations

Journal Article•DOI•

Phased diploid genome assembly with single-molecule real-time sequencing

[...]

Chen-Shan Chin¹, Paul Peluso¹, Fritz J. Sedlazeck², Maria Nattestad³, Gregory T. Concepcion¹, Alicia Clum⁴, Christopher Dunn¹, Ronan C. O'Malley⁵, Rosa Figueroa-Balderas⁶, Abraham Morales-Cruz⁶, Grant R. Cramer⁷, Massimo Delledonne⁸, Chongyuan Luo⁵, Joseph R. Ecker⁵, Dario Cantu⁶, David R. Rank¹, Michael C. Schatz³, Michael C. Schatz² - Show less +14 more•Institutions (8)

Pacific Biosciences¹, Johns Hopkins University², Cold Spring Harbor Laboratory³, Joint Genome Institute⁴, Salk Institute for Biological Studies⁵, University of California, Davis⁶, University of Nevada, Reno⁷, University of Verona⁸

01 Dec 2016-Nature Methods

TL;DR: The open-source FALCON and FALcon-Unzip algorithms are introduced to assemble long-read sequencing data into highly accurate, contiguous, and correctly phased diploid genomes.

...read moreread less

Abstract: While genome assembly projects have been successful in many haploid and inbred species, the assembly of noninbred or rearranged heterozygous genomes remains a major challenge. To address this challenge, we introduce the open-source FALCON and FALCON-Unzip algorithms (https://github.com/PacificBiosciences/FALCON/) to assemble long-read sequencing data into highly accurate, contiguous, and correctly phased diploid genomes. We generate new reference sequences for heterozygous samples including an F1 hybrid of Arabidopsis thaliana, the widely cultivated Vitis vinifera cv. Cabernet Sauvignon, and the coral fungus Clavicorona pyxidata, samples that have challenged short-read assembly approaches. The FALCON-based assemblies are substantially more contiguous and complete than alternate short- or long-read approaches. The phased diploid assembly enabled the study of haplotype structure and heterozygosities between homologous chromosomes, including the identification of widespread heterozygous structural variation within coding sequences.

...read moreread less

1,490 citations

Journal Article•DOI•

Improved maize reference genome with single-molecule technologies

[...]

Yinping Jiao¹, Paul Peluso², Jinghua Shi, Tiffany Y. Liang, Michelle C. Stitzer³, Bo Wang¹, Michael S. Campbell¹, Joshua C. Stein¹, Xuehong Wei¹, Chen-Shan Chin², Katherine E. Guill⁴, Michael Regulski¹, Sunita Kumari¹, Andrew Olson¹, Jonathan I. Gent⁵, Kevin L. Schneider⁶, Thomas K. Wolfgruber⁶, Michael R. May³, Nathan M. Springer⁷, Eric Antoniou¹, W. Richard McCombie¹, Gernot G. Presting⁶, Michael D. McMullen⁴, Jeffrey Ross-Ibarra³, R. Kelly Dawe⁵, Alex Hastie, David R. Rank², Doreen Ware¹, Doreen Ware⁸ - Show less +25 more•Institutions (8)

Cold Spring Harbor Laboratory¹, Pacific Biosciences², University of California, Davis³, United States Department of Agriculture⁴, University of Georgia⁵, University of Hawaii at Manoa⁶, University of Minnesota⁷, Cornell University⁸

12 Jun 2017-Nature

TL;DR: The assembly and annotation of a reference genome of maize is reported, using single-molecule real-time sequencing and high-resolution optical mapping to identify transposable element lineage expansions that are unique to maize.

...read moreread less

Abstract: Complete and accurate reference genomes and annotations provide fundamental tools for characterization of genetic and functional variation. These resources facilitate the determination of biological processes and support translation of research findings into improved and sustainable agricultural technologies. Many reference genomes for crop plants have been generated over the past decade, but these genomes are often fragmented and missing complex repeat regions. Here we report the assembly and annotation of a reference genome of maize, a genetic and agricultural model species, using single-molecule real-time sequencing and high-resolution optical mapping. Relative to the previous reference genome, our assembly features a 52-fold increase in contig length and notable improvements in the assembly of intergenic spaces and centromeres. Characterization of the repetitive portion of the genome revealed more than 130,000 intact transposable elements, allowing us to identify transposable element lineage expansions that are unique to maize. Gene annotations were updated using 111,000 full-length transcripts obtained by single-molecule real-time sequencing. In addition, comparative optical mapping of two other inbred maize lines revealed a prevalence of deletions in regions of low gene density and maize lineage-specific genes.

...read moreread less

919 citations

Journal Article•DOI•

Accurate circular consensus long-read sequencing improves variant detection and assembly of a human genome.

[...]

Aaron M. Wenger¹, Paul Peluso¹, William J Rowell¹, Pi-Chuan Chang², Richard Hall¹, Gregory T. Concepcion¹, Jana Ebler³, Arkarachai Fungtammasan, Alexander Kolesnikov², Nathan D. Olson⁴, Armin Töpfer¹, Michael Alonge⁵, Medhat Mahmoud⁶, Yufeng Qian¹, Chen-Shan Chin, Adam M. Phillippy⁷, Michael C. Schatz⁵, Gene Myers⁸, Mark A. DePristo², Jue Ruan, Tobias Marschall³, Tobias Marschall⁸, Fritz J. Sedlazeck⁶, Justin M. Zook⁴, Heng Li⁹, Sergey Koren⁷, Andrew Carroll², David R. Rank¹, Michael W. Hunkapiller¹ - Show less +25 more•Institutions (9)

Pacific Biosciences¹, Google², Saarland University³, National Institute of Standards and Technology⁴, Johns Hopkins University⁵, Baylor College of Medicine⁶, National Institutes of Health⁷, Max Planck Society⁸, Harvard University⁹

12 Aug 2019-Nature Biotechnology

TL;DR: The optimization of circular consensus sequencing (CCS) is reported to improve the accuracy of single-molecule real-time (SMRT) sequencing (PacBio) and generate highly accurate (99.8%) long high-fidelity (HiFi) reads with an average length of 13.5 kilobases (kb).

...read moreread less

Abstract: The DNA sequencing technologies in use today produce either highly accurate short reads or less-accurate long reads. We report the optimization of circular consensus sequencing (CCS) to improve the accuracy of single-molecule real-time (SMRT) sequencing (PacBio) and generate highly accurate (99.8%) long high-fidelity (HiFi) reads with an average length of 13.5 kilobases (kb). We applied our approach to sequence the well-characterized human HG002/NA24385 genome and obtained precision and recall rates of at least 99.91% for single-nucleotide variants (SNVs), 95.98% for insertions and deletions 15 megabases (Mb) and concordance of 99.997%, substantially outperforming assembly with less-accurate long reads. High-fidelity reads improve variant detection and genome assembly on the PacBio platform.

...read moreread less

876 citations

Journal Article•DOI•

Origins of the E. coli Strain Causing an Outbreak of Hemolytic–Uremic Syndrome in Germany

[...]

David A. Rasko¹, Dale R. Webster², Jason W. Sahl¹, Ali Bashir², Nadia Boisen³, Flemming Scheutz³, Ellen E. Paxinos², Robert Sebra², Chen-Shan Chin², Dimitris Iliopoulos², Aaron Klammer², Paul Peluso², Lawrence Lee², Andrey Kislyuk², James H. Bullard², Andrew Kasarskis², Susanna Wang², John Eid², David R. Rank, Julia C. Redman¹, Susan R. Steyert¹, Jakob Frimodt-Møller⁴, Carsten Struve³, Andreas Petersen⁴, Karen A. Krogfelt³, James P. Nataro⁵, Eric E. Schadt², Matthew K. Waldor⁶ - Show less +24 more•Institutions (6)

University of Maryland, Baltimore¹, Pacific Biosciences², Statens Serum Institut³, University of Copenhagen⁴, University of Virginia⁵, Howard Hughes Medical Institute⁶

24 Aug 2011-The New England Journal of Medicine

TL;DR: The findings suggest that horizontal genetic exchange allowed for the emergence of the highly virulent Shiga-toxin-producing enteroaggregative E. coli O104:H4 strain that caused the German outbreak, and highlight the way in which the plasticity of bacterial genomes facilitates the emerged of new pathogens.

...read moreread less

Abstract: Background A large outbreak of diarrhea and the hemolytic–uremic syndrome caused by an unusual serotype of Shiga-toxin–producing Escherichia coli (O104:H4) began in Germany in May 2011. As of July 22, a large number of cases of diarrhea caused by Shiga-toxin–producing E. coli have been reported — 3167 without the hemolytic–uremic syndrome (16 deaths) and 908 with the hemolytic–uremic syndrome (34 deaths) — indicating that this strain is notably more virulent than most of the Shiga-toxin–producing E. coli strains. Preliminary genetic characterization of the outbreak strain suggested that, unlike most of these strains, it should be classified within the enteroaggregative pathotype of E. coli. Methods We used third-generation, single-molecule, real-time DNA sequencing to determine the complete genome sequence of the German outbreak strain, as well as the genome sequences of seven diarrhea-associated enteroaggregative E. coli serotype O104:H4 strains from Africa and four enteroaggregative E. coli reference st...

...read moreread less

840 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Fast and accurate short read alignment with Burrows–Wheeler transform

[...]

Heng Li¹, Richard Durbin¹•Institutions (1)

Wellcome Trust Sanger Institute¹

01 Jul 2009-Bioinformatics

TL;DR: Burrows-Wheeler Alignment tool (BWA) is implemented, a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps.

...read moreread less

Abstract: Motivation: The enormous amount of short reads generated by the new DNA sequencing technologies call for the development of fast and accurate read alignment programs. A first generation of hash table-based methods has been developed, including MAQ, which is accurate, feature rich and fast enough to align short reads from a single individual. However, MAQ does not support gapped alignment for single-end reads, which makes it unsuitable for alignment of longer reads where indels may occur frequently. The speed of MAQ is also a concern when the alignment is scaled up to the resequencing of hundreds of individuals. Results: We implemented Burrows-Wheeler Alignment tool (BWA), a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps. BWA supports both base space reads, e.g. from Illumina sequencing machines, and color space reads from AB SOLiD machines. Evaluations on both simulated and real data suggest that BWA is ~10–20× faster than MAQ, while achieving similar accuracy. In addition, BWA outputs alignment in the new standard SAM (Sequence Alignment/Map) format. Variant calling and other downstream analyses after the alignment can be achieved with the open source SAMtools software package. Availability: http://maq.sourceforge.net Contact: [email protected]

...read moreread less

43,862 citations

SPAdes, a new genome assembly algorithm and its applications to single-cell sequencing ( 7th Annual SFAF Meeting, 2012)

[...]

Glenn Tesler

01 Jun 2012

TL;DR: SPAdes as mentioned in this paper is a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler and on popular assemblers Velvet and SoapDeNovo (for multicell data).

...read moreread less

Abstract: The lion's share of bacteria in various environments cannot be cloned in the laboratory and thus cannot be sequenced using existing technologies. A major goal of single-cell genomics is to complement gene-centric metagenomic data with whole-genome assemblies of uncultivated organisms. Assembly of single-cell data is challenging because of highly non-uniform read coverage as well as elevated levels of sequencing errors and chimeric reads. We describe SPAdes, a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler (specialized for single-cell data) and on popular assemblers Velvet and SoapDeNovo (for multicell data). SPAdes generates single-cell assemblies, providing information about genomes of uncultivatable bacteria that vastly exceeds what may be obtained via traditional metagenomics studies. SPAdes is available online ( http://bioinf.spbau.ru/spades ). It is distributed as open source software.

...read moreread less

10,124 citations

Journal Article•DOI•

Sequencing technologies-the next generation

[...]

Michael L. Metzker¹•Institutions (1)

Baylor College of Medicine¹

01 Jan 2010-Nature Reviews Genetics

TL;DR: A technical review of template preparation, sequencing and imaging, genome alignment and assembly approaches, and recent advances in current and near-term commercially available NGS instruments is presented.

...read moreread less

Abstract: Demand has never been greater for revolutionary technologies that deliver fast, inexpensive and accurate genome information. This challenge has catalysed the development of next-generation sequencing (NGS) technologies. The inexpensive production of large volumes of sequence data is the primary advantage over conventional methods. Here, I present a technical review of template preparation, sequencing and imaging, genome alignment and assembly approaches, and recent advances in current and near-term commercially available NGS instruments. I also outline the broad range of applications for NGS technologies, in addition to providing guidelines for platform selection to address biological questions of interest.

...read moreread less

7,023 citations

Journal Article•DOI•

De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis

[...]

Brian J. Haas¹, Alexie Papanicolaou², Moran Yassour³, Moran Yassour⁴, Manfred Grabherr⁵, Philip D. Blood⁶, Joshua C. Bowden², M. B. Couger⁷, David Eccles⁸, Bo Li⁹, Matthias Lieber¹⁰, Matthew D. MacManes¹¹, Michael Ott², Joshua Orvis, Nathalie Pochet³, Nathalie Pochet¹², Francesco Strozzi¹³, Nathan T. Weeks¹⁴, Rick Westerman¹⁵, Thomas William, Colin N. Dewey⁹, Robert Henschel¹⁶, Richard D. LeDuc¹⁶, Nir Friedman⁴, Aviv Regev³ - Show less +21 more•Institutions (16)

Broad Institute¹, Commonwealth Scientific and Industrial Research Organisation², Massachusetts Institute of Technology³, Hebrew University of Jerusalem⁴, Science for Life Laboratory⁵, Pittsburgh Supercomputing Center⁶, Oklahoma State University–Stillwater⁷, Griffith University⁸, University of Wisconsin-Madison⁹, Dresden University of Technology¹⁰, California Institute for Quantitative Biosciences¹¹, Flanders Institute for Biotechnology¹², Parco Tecnologico Padano¹³, United States Department of Agriculture¹⁴, Purdue University¹⁵, Indiana University¹⁶

01 Aug 2013-Nature Protocols

TL;DR: This protocol provides a workflow for genome-independent transcriptome analysis leveraging the Trinity platform and presents Trinity-supported companion utilities for downstream applications, including RSEM for transcript abundance estimation, R/Bioconductor packages for identifying differentially expressed transcripts across samples and approaches to identify protein-coding genes.

...read moreread less

Abstract: De novo assembly of RNA-seq data enables researchers to study transcriptomes without the need for a genome sequence; this approach can be usefully applied, for instance, in research on 'non-model organisms' of ecological and evolutionary importance, cancer samples or the microbiome. In this protocol we describe the use of the Trinity platform for de novo transcriptome assembly from RNA-seq data in non-model organisms. We also present Trinity-supported companion utilities for downstream applications, including RSEM for transcript abundance estimation, R/Bioconductor packages for identifying differentially expressed transcripts across samples and approaches to identify protein-coding genes. In the procedure, we provide a workflow for genome-independent transcriptome analysis leveraging the Trinity platform. The software, documentation and demonstrations are freely available from http://trinityrnaseq.sourceforge.net. The run time of this protocol is highly dependent on the size and complexity of data to be analyzed. The example data set analyzed in the procedure detailed herein can be processed in less than 5 h.

...read moreread less

6,369 citations

Journal Article•DOI•

Evaluation of general 16S ribosomal RNA gene PCR primers for classical and next-generation sequencing-based diversity studies

[...]

Anna Klindworth¹, Elmar Pruesse², Timmy Schweer², Jörg Peplies², Christian Quast², Matthias Horn², Frank Oliver Glöckner² - Show less +3 more•Institutions (2)

Max Planck Society¹, Jacobs University Bremen²

01 Jan 2013-Nucleic Acids Research

TL;DR: The results of this study may be used as a guideline for selecting primer pairs with the best overall coverage and phylum spectrum for specific applications, therefore reducing the bias in PCR-based microbial diversity studies.

...read moreread less

Abstract: 16S ribosomal RNA gene (rDNA) amplicon analysis remains the standard approach for the cultivation-independent investigation of microbial diversity. The accuracy of these analyses depends strongly on the choice of primers. The overall coverage and phylum spectrum of 175 primers and 512 primer pairs were evaluated in silico with respect to the SILVA 16S/18S rDNA non-redundant reference dataset (SSURef 108 NR). Based on this evaluation a selection of 'best available' primer pairs for Bacteria and Archaea for three amplicon size classes (100-400, 400-1000, ≥ 1000 bp) is provided. The most promising bacterial primer pair (S-D-Bact-0341-b-S-17/S-D-Bact-0785-a-A-21), with an amplicon size of 464 bp, was experimentally evaluated by comparing the taxonomic distribution of the 16S rDNA amplicons with 16S rDNA fragments from directly sequenced metagenomes. The results of this study may be used as a guideline for selecting primer pairs with the best overall coverage and phylum spectrum for specific applications, therefore reducing the bias in PCR-based microbial diversity studies.

...read moreread less

5,346 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse