Home
/
Authors
/
John Nobile

Author

John Nobile

Bio: John Nobile is an academic researcher from Life Technologies. The author has contributed to research in topics: Fluidics & DNA sequencing. The author has an hindex of 11, co-authored 21 publications receiving 11144 citations.

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Genome sequencing in microfabricated high-density picolitre reactors

[...]

Marcel Margulies, Michael Egholm, William E. Altman, Said Attiya, Joel S. Bader, Lisa A. Bemben, Jan Berka, Michael S. Braverman, Yi-Ju Chen, Zhoutao Chen, Scott Dewell, Lei Du, J. M. Fierro, Xavier V. Gomes, Brian C. Godwin, Wen He, Scott Edward Helgesen, Chun Heen Ho, Gerard P. Irzyk, Szilveszter C. Jando, Maria L. I. Alenquer, Thomas P. Jarvie, Kshama B. Jirage, Jong-Bum Kim, James R. Knight, Janna R. Lanza, John H. Leamon, Steven Lefkowitz, Ming Lei, Jing Li, Kenton Lohman, Hong Lu, Vinod Makhijani, Keith Mcdade, Michael P. McKenna, Eugene W. Myers¹, Elizabeth Nickerson, John Nobile, Ramona Plant, Bernard P. Puc, Michael T. Ronan, George T. Roth, Gary J. Sarkis, Jan Fredrik Simons, John Simpson, Maithreyan Srinivasan, Karrie R. Tartaro, Alexander Tomasz², Kari A. Vogt, Greg A. Volkmer, Shally H. Wang, Yong Wang, Michael P. Weiner³, Pengguang Yu, Richard F. Begley, Jonathan M. Rothberg - Show less +52 more•Institutions (3)

University of California, Berkeley¹, Rockefeller University², Rothberg Institute For Childhood Diseases³

15 Sep 2005-Nature

TL;DR: A scalable, highly parallel sequencing system with raw throughput significantly greater than that of state-of-the-art capillary electrophoresis instruments with 96% coverage at 99.96% accuracy in one run of the machine is described.

...read moreread less

Abstract: The proliferation of large-scale DNA-sequencing projects in recent years has driven a search for alternative methods to reduce time and cost. Here we describe a scalable, highly parallel sequencing system with raw throughput significantly greater than that of state-of-the-art capillary electrophoresis instruments. The apparatus uses a novel fibre-optic slide of individual wells and is able to sequence 25 million bases, at 99% or better accuracy, in one four-hour run. To achieve an approximately 100-fold increase in throughput over current Sanger sequencing technology, we have developed an emulsion method for DNA amplification and an instrument for sequencing by synthesis using a pyrosequencing protocol optimized for solid support and picolitre-scale volumes. Here we show the utility, throughput, accuracy and robustness of this system by shotgun sequencing and de novo assembly of the Mycoplasma genitalium genome with 96% coverage at 99.96% accuracy in one run of the machine.

...read moreread less

8,434 citations

Journal Article•DOI•

An integrated semiconductor device enabling non-optical genome sequencing

[...]

Jonathan M. Rothberg¹, Wolfgang Hinz¹, Todd Rearick¹, Jonathan Schultz¹, William J. Mileski¹, Melville Davey¹, John H. Leamon¹, Kim L. Johnson¹, Mark James Milgrew¹, Matthew D. Edwards¹, Jeremy Hoon¹, Jan Fredrik Simons¹, David Marran¹, Jason W. Myers¹, John F. Davidson¹, Annika Branting¹, John Nobile¹, Bernard P. Puc¹, David Light¹, Travis A. Clark¹, Martin Huber¹, Jeffrey T. Branciforte¹, Isaac B. Stoner¹, Simon Cawley¹, Michael R. Lyons¹, Yutao Fu¹, Nils Homer¹, Marina Sedova¹, Xin Miao¹, Brian Reed¹, Jeffrey Sabina¹, Erika Feierstein¹, Michelle Schorn¹, Mohammad Alanjary¹, Eileen T. Dimalanta¹, Devin Dressman¹, Rachel Kasinskas¹, Tanya Sokolsky¹, Jacqueline A. Fidanza¹, Eugeni Namsaraev¹, Kevin McKernan¹, Alan Williams¹, G. Thomas Roth¹, James Bustillo¹ - Show less +40 more•Institutions (1)

Life Technologies¹

21 Jul 2011-Nature

TL;DR: A DNA sequencing technology in which scalable, low-cost semiconductor manufacturing techniques are used to make an integrated circuit able to directly perform non-optical DNA sequencing of genomes, showing its robustness and scalability by producing ion chips with up to 10 times as many sensors and sequencing a human genome.

...read moreread less

Abstract: The seminal importance of DNA sequencing to the life sciences, biotechnology and medicine has driven the search for more scalable and lower-cost solutions. Here we describe a DNA sequencing technology in which scalable, low-cost semiconductor manufacturing techniques are used to make an integrated circuit able to directly perform non-optical DNA sequencing of genomes. Sequence data are obtained by directly sensing the ions produced by template-directed DNA polymerase synthesis using all-natural nucleotides on this massively parallel semiconductor-sensing device or ion chip. The ion chip contains ion-sensitive, field-effect transistor-based sensors in perfect register with 1.2 million wells, which provide confinement and allow parallel, simultaneous detection of independent sequencing reactions. Use of the most widely used technology for constructing integrated circuits, the complementary metal-oxide semiconductor (CMOS) process, allows for low-cost, large-scale production and scaling of the device to higher densities and larger array sizes. We show the performance of the system by sequencing three bacterial genomes, its robustness and scalability by producing ion chips with up to 10 times as many sensors and sequencing a human genome.

...read moreread less

2,246 citations

Patent•

Apparatus and methods for performing electrochemical reactions

[...]

John Nobile¹, George Thomas Roth¹, Todd Rearick¹, Jonathan Schultz¹, Jonathan M. Rothberg¹, David Marran¹ - Show less +2 more•Institutions (1)

Life Technologies¹

24 May 2010

TL;DR: In this article, an array of electronic sensors integrated with a microwell array for confining analytes and/or particles for analytical reactions and a method for identifying microwells containing analytes or reaction byproducts is presented.

...read moreread less

Abstract: The invention is directed to apparatus and methods for delivering multiple reagents to, and monitoring, a plurality of analytical reactions carried out on a large-scale array of electronic sensors underminimal noise conditions. In one aspect, the invention provides method of improving signal-to-noise ratios of output signals from the electronic sensors sensing analytes or reaction byproducts by subtracting an average of output signals measured from neighboring sensors where analyte or reaction byproducts are absent. In other aspects, the invention provides an array of electronic sensors integrated with a microwell array for confining analytes and/or particles for analytical reactions and a method for identifying microwells containing analytes and/or particles by passing a sensor-active reagent over the array and correlating sensor response times to the presence or absence of analytes or particles. Such detection of analyte- or particle-containing microwells may be used as a step in additional noise reduction methods.

...read moreread less

253 citations

Patent•

Nucleic acid amplification with continuous flow emulsion

[...]

John Nobile, William Lun Lee, John H. Leamon

28 Jan 2005

TL;DR: In this article, a water-in-oil emulsion in a continuous flow was used to amplify a nucleic acid template by a polymerase chain reaction (PCR) with a single bead.

...read moreread less

Abstract: Embodiments of the present invention are directed to methods and devices/systems for amplifying genetic material and may include providing a water-in-oil emulsion in a continuous flow. The emulsion may include a plurality of water droplets comprising microreactors. Each of the plurality of microreactors may include a single bead capable of capturing a nucleic acid template, a single species nucleic acid template and sufficient reagents to amplify the copy number of the nucleic acid template. The method also includes flowing the emulsion across a first temperature zone and a second lower temperature zone to thermally process the microreactors to amplify the nucleic acid template by polymerase chain reaction.

...read moreread less

243 citations

Patent•

Fluidics interface systems and methods

[...]

Melville Davey¹, George Thomas Roth¹, David Marran¹, William J. Mileski¹, John Nobile¹ - Show less +1 more•Institutions (1)

Life Technologies¹

29 Dec 2011

TL;DR: In this paper, a system including a communication interface to communicatively couple to a sensor cartridge, a fluidic subsystem to exchange a reagent solution with the sensor cartridge and a computational circuitry communicative coupled to the communication interface and the fluid component is described.

...read moreread less

Abstract: A system including a communication interface to communicatively couple to a sensor cartridge, a fluidic subsystem to exchange a reagent solution with the sensor cartridge, and a computational circuitry communicatively coupled to the communication interface and the fluidic subsystem. The computation circuitry is to monitor a sensor signal of a sensor of the sensor cartridge, detect a leak based on the sensor signal, and control fluid flow of the fluidic subsystem in response to detecting.

...read moreread less

81 citations

1
2
3
4
…
5

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Fast gapped-read alignment with Bowtie 2

[...]

Ben Langmead¹, Steven L. Salzberg², Steven L. Salzberg³, Steven L. Salzberg¹•Institutions (3)

University of Maryland, College Park¹, Johns Hopkins University School of Medicine², Johns Hopkins University³

01 Apr 2012-Nature Methods

TL;DR: Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.

...read moreread less

Abstract: As the rate of sequencing increases, greater throughput is demanded from read aligners. The full-text minute index is often used to make alignment very fast and memory-efficient, but the approach is ill-suited to finding longer, gapped alignments. Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.

...read moreread less

37,898 citations

Journal Article•DOI•

STAR: ultrafast universal RNA-seq aligner

[...]

Alexander Dobin¹, Carrie A. Davis¹, Felix Schlesinger¹, Jorg Drenkow¹, Chris Zaleski¹, Sonali Jha¹, Philippe Batut¹, Mark Chaisson¹, Thomas R. Gingeras¹ - Show less +5 more•Institutions (1)

Cold Spring Harbor Laboratory¹

01 Jan 2013-Bioinformatics

TL;DR: The Spliced Transcripts Alignment to a Reference (STAR) software based on a previously undescribed RNA-seq alignment algorithm that uses sequential maximum mappable seed search in uncompressed suffix arrays followed by seed clustering and stitching procedure outperforms other aligners by a factor of >50 in mapping speed.

...read moreread less

Abstract: Motivation Accurate alignment of high-throughput RNA-seq data is a challenging and yet unsolved problem because of the non-contiguous transcript structure, relatively short read lengths and constantly increasing throughput of the sequencing technologies. Currently available RNA-seq aligners suffer from high mapping error rates, low mapping speed, read length limitation and mapping biases. Results To align our large (>80 billon reads) ENCODE Transcriptome RNA-seq dataset, we developed the Spliced Transcripts Alignment to a Reference (STAR) software based on a previously undescribed RNA-seq alignment algorithm that uses sequential maximum mappable seed search in uncompressed suffix arrays followed by seed clustering and stitching procedure. STAR outperforms other aligners by a factor of >50 in mapping speed, aligning to the human genome 550 million 2 × 76 bp paired-end reads per hour on a modest 12-core server, while at the same time improving alignment sensitivity and precision. In addition to unbiased de novo detection of canonical junctions, STAR can discover non-canonical splices and chimeric (fusion) transcripts, and is also capable of mapping full-length RNA sequences. Using Roche 454 sequencing of reverse transcription polymerase chain reaction amplicons, we experimentally validated 1960 novel intergenic splice junctions with an 80-90% success rate, corroborating the high precision of the STAR mapping strategy. Availability and implementation STAR is implemented as a standalone C++ code. STAR is free open source software distributed under GPLv3 license and can be downloaded from http://code.google.com/p/rna-star/.

...read moreread less

30,684 citations

Journal Article•DOI•

The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data

[...]

Aaron McKenna¹, Matthew Hanna, Eric Banks, Andrey Sivachenko, Kristian Cibulskis, Andrew Kernytsky, Kiran V. Garimella, David Altshuler, Stacey Gabriel, Mark J. Daly, Mark A. DePristo - Show less +7 more•Institutions (1)

Broad Institute¹

01 Sep 2010-Genome Research

TL;DR: The GATK programming framework enables developers and analysts to quickly and easily write efficient and robust NGS tools, many of which have already been incorporated into large-scale sequencing projects like the 1000 Genomes Project and The Cancer Genome Atlas.

...read moreread less

Abstract: Next-generation DNA sequencing (NGS) projects, such as the 1000 Genomes Project, are already revolutionizing our understanding of genetic variation among individuals. However, the massive data sets generated by NGS—the 1000 Genome pilot alone includes nearly five terabases—make writing feature-rich, efficient, and robust analysis tools difficult for even computationally sophisticated individuals. Indeed, many professionals are limited in the scope and the ease with which they can answer scientific questions by the complexity of accessing and manipulating the data produced by these machines. Here, we discuss our Genome Analysis Toolkit (GATK), a structured programming framework designed to ease the development of efficient and robust analysis tools for next-generation DNA sequencers using the functional programming philosophy of MapReduce. The GATK provides a small but rich set of data access patterns that encompass the majority of analysis tool needs. Separating specific analysis calculations from common data management infrastructure enables us to optimize the GATK framework for correctness, stability, and CPU and memory efficiency and to enable distributed and shared memory parallelization. We highlight the capabilities of the GATK by describing the implementation and application of robust, scale-tolerant tools like coverage calculators and single nucleotide polymorphism (SNP) calling. We conclude that the GATK programming framework enables developers and analysts to quickly and easily write efficient and robust NGS tools, many of which have already been incorporated into large-scale sequencing projects like the 1000 Genomes Project and The Cancer Genome Atlas.

...read moreread less

20,557 citations

Journal Article•DOI•

An obesity-associated gut microbiome with increased capacity for energy harvest

[...]

Peter J. Turnbaugh¹, Ruth E. Ley, Michael A. Mahowald, Vincent Magrini¹, Elaine R. Mardis¹, Jeffrey I. Gordon - Show less +2 more•Institutions (1)

Washington University in St. Louis¹

21 Dec 2006-Nature

TL;DR: It is demonstrated through metagenomic and biochemical analyses that changes in the relative abundance of the Bacteroidetes and Firmicutes affect the metabolic potential of the mouse gut microbiota and indicates that the obese microbiome has an increased capacity to harvest energy from the diet.

...read moreread less

Abstract: The worldwide obesity epidemic is stimulating efforts to identify host and environmental factors that affect energy balance. Comparisons of the distal gut microbiota of genetically obese mice and their lean littermates, as well as those of obese and lean human volunteers have revealed that obesity is associated with changes in the relative abundance of the two dominant bacterial divisions, the Bacteroidetes and the Firmicutes. Here we demonstrate through metagenomic and biochemical analyses that these changes affect the metabolic potential of the mouse gut microbiota. Our results indicate that the obese microbiome has an increased capacity to harvest energy from the diet. Furthermore, this trait is transmissible: colonization of germ-free mice with an 'obese microbiota' results in a significantly greater increase in total body fat than colonization with a 'lean microbiota'. These results identify the gut microbiota as an additional contributing factor to the pathophysiology of obesity.

...read moreread less

10,126 citations

Journal Article•DOI•

Velvet: Algorithms for de novo short read assembly using de Bruijn graphs

[...]

Daniel R. Zerbino¹, Ewan Birney¹•Institutions (1)

European Bioinformatics Institute¹

01 May 2008-Genome Research

TL;DR: Velvet represents a new approach to assembly that can leverage very short reads in combination with read pairs to produce useful assemblies and is in close agreement with simulated results without read-pair information.

...read moreread less

Abstract: We have developed a new set of algorithms, collectively called "Velvet," to manipulate de Bruijn graphs for genomic sequence assembly. A de Bruijn graph is a compact representation based on short words (k-mers) that is ideal for high coverage, very short read (25-50 bp) data sets. Applying Velvet to very short reads and paired-ends information only, one can produce contigs of significant length, up to 50-kb N50 length in simulations of prokaryotic data and 3-kb N50 on simulated mammalian BACs. When applied to real Solexa data sets without read pairs, Velvet generated contigs of approximately 8 kb in a prokaryote and 2 kb in a mammalian BAC, in close agreement with our simulated results without read-pair information. Velvet represents a new approach to assembly that can leverage very short reads in combination with read pairs to produce useful assemblies.

...read moreread less

9,389 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse