Home
/
Authors
/
Gary J. Sarkis

Author

Gary J. Sarkis

Bio: Gary J. Sarkis is an academic researcher. The author has contributed to research in topics: Nucleic acid & Hybrid genome assembly. The author has an hindex of 7, co-authored 10 publications receiving 9491 citations.

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Genome sequencing in microfabricated high-density picolitre reactors

[...]

Marcel Margulies, Michael Egholm, William E. Altman, Said Attiya, Joel S. Bader, Lisa A. Bemben, Jan Berka, Michael S. Braverman, Yi-Ju Chen, Zhoutao Chen, Scott Dewell, Lei Du, J. M. Fierro, Xavier V. Gomes, Brian C. Godwin, Wen He, Scott Edward Helgesen, Chun Heen Ho, Gerard P. Irzyk, Szilveszter C. Jando, Maria L. I. Alenquer, Thomas P. Jarvie, Kshama B. Jirage, Jong-Bum Kim, James R. Knight, Janna R. Lanza, John H. Leamon, Steven Lefkowitz, Ming Lei, Jing Li, Kenton Lohman, Hong Lu, Vinod Makhijani, Keith Mcdade, Michael P. McKenna, Eugene W. Myers¹, Elizabeth Nickerson, John Nobile, Ramona Plant, Bernard P. Puc, Michael T. Ronan, George T. Roth, Gary J. Sarkis, Jan Fredrik Simons, John Simpson, Maithreyan Srinivasan, Karrie R. Tartaro, Alexander Tomasz², Kari A. Vogt, Greg A. Volkmer, Shally H. Wang, Yong Wang, Michael P. Weiner³, Pengguang Yu, Richard F. Begley, Jonathan M. Rothberg - Show less +52 more•Institutions (3)

University of California, Berkeley¹, Rockefeller University², Rothberg Institute For Childhood Diseases³

15 Sep 2005-Nature

TL;DR: A scalable, highly parallel sequencing system with raw throughput significantly greater than that of state-of-the-art capillary electrophoresis instruments with 96% coverage at 99.96% accuracy in one run of the machine is described.

...read moreread less

Abstract: The proliferation of large-scale DNA-sequencing projects in recent years has driven a search for alternative methods to reduce time and cost. Here we describe a scalable, highly parallel sequencing system with raw throughput significantly greater than that of state-of-the-art capillary electrophoresis instruments. The apparatus uses a novel fibre-optic slide of individual wells and is able to sequence 25 million bases, at 99% or better accuracy, in one four-hour run. To achieve an approximately 100-fold increase in throughput over current Sanger sequencing technology, we have developed an emulsion method for DNA amplification and an instrument for sequencing by synthesis using a pyrosequencing protocol optimized for solid support and picolitre-scale volumes. Here we show the utility, throughput, accuracy and robustness of this system by shotgun sequencing and de novo assembly of the Mycoplasma genitalium genome with 96% coverage at 99.96% accuracy in one run of the machine.

...read moreread less

8,434 citations

Patent•

Bead emulsion nucleic acid amplification

[...]

Jan Berka, Yi-Ju Chen, John H. Leamon, Steven Lefkowitz, Kenton Lohman, Vinod Makhijani, Jonathan M. Rothberg, Gary J. Sarkis, Maithreyan Srinivasan, Michael P. Weiner - Show less +6 more

28 Jan 2004

TL;DR: In this paper, a method for nucleic acid amplification is described, in which the templates, beads, and amplification reaction solution are emulsified and the nucleic acids are amplified to provide clonal copies of the templates attached to the beads.

...read moreread less

Abstract: Disclosed are methods for nucleic acid amplification wherein nucleic acid templates, beads, and amplification reaction solution are emulsified and the nucleic acid templates are amplified to provide clonal copies of the nucleic acid templates attached to the beads. Also disclosed are kits and apparatuses for performing the methods of the invention.

...read moreread less

413 citations

Journal Article•DOI•

Assessment of whole genome amplification-induced bias through high-throughput, massively parallel whole genome sequencing

[...]

Robert Pinard, Alex de Winter, Gary J. Sarkis, Mark Gerstein¹, Karrie R. Tartaro, Ramona Plant, Michael Egholm, Jonathan M. Rothberg, John H. Leamon - Show less +5 more•Institutions (1)

Yale University¹

23 Aug 2006-BMC Genomics

TL;DR: Of the amplification methodologies examined in this paper, the multiple displacement amplification products generated the least bias, and produced significantly higher yields of amplified DNA.

...read moreread less

Abstract: Whole genome amplification is an increasingly common technique through which minute amounts of DNA can be multiplied to generate quantities suitable for genetic testing and analysis. Questions of amplification-induced error and template bias generated by these methods have previously been addressed through either small scale (SNPs) or large scale (CGH array, FISH) methodologies. Here we utilized whole genome sequencing to assess amplification-induced bias in both coding and non-coding regions of two bacterial genomes. Halobacterium species NRC-1 DNA and Campylobacter jejuni were amplified by several common, commercially available protocols: multiple displacement amplification, primer extension pre-amplification and degenerate oligonucleotide primed PCR. The amplification-induced bias of each method was assessed by sequencing both genomes in their entirety using the 454 Sequencing System technology and comparing the results with those obtained from unamplified controls. All amplification methodologies induced statistically significant bias relative to the unamplified control. For the Halobacterium species NRC-1 genome, assessed at 100 base resolution, the D-statistics from GenomiPhi-amplified material were 119 times greater than those from unamplified material, 164.0 times greater for Repli-G, 165.0 times greater for PEP-PCR and 252.0 times greater than the unamplified controls for DOP-PCR. For Campylobacter jejuni, also analyzed at 100 base resolution, the D-statistics from GenomiPhi-amplified material were 15 times greater than those from unamplified material, 19.8 times greater for Repli-G, 61.8 times greater for PEP-PCR and 220.5 times greater than the unamplified controls for DOP-PCR. Of the amplification methodologies examined in this paper, the multiple displacement amplification products generated the least bias, and produced significantly higher yields of amplified DNA.

...read moreread less

338 citations

Journal Article•DOI•

A massively parallel PicoTiterPlate based platform for discrete picoliter-scale polymerase chain reactions.

[...]

John H. Leamon, William Lun Lee, Karrie R. Tartaro, Janna R. Lanza, Gary J. Sarkis, Alex D. deWinter, Jan Berka, Kenton Lohman - Show less +4 more

01 Nov 2003-Electrophoresis

TL;DR: The PicoTiterPlate as discussed by the authors is a platform for simultaneous polymerase chain reaction (PCR) amplification of up to 300,000 discrete reactions in a novel platform, which can be performed in extremely small volumes: individual reactions volumes are as low as 39.5 pL, with a total 15.3 micro-L reaction volume for the entire platform.

...read moreread less

Abstract: We demonstrate successful, simultaneous polymerase chain reaction (PCR) amplification of up to 300 000 discrete reactions in a novel platform, the PicoTiterPlate. In addition to elevated throughput, the PicoTiterPlate based amplifications (PTPCR) can be performed in extremely small volumes: individual reactions volumes are as low as 39.5 pL, with a total 15.3 microL reaction volume for the entire PicoTiterPlate. The bulk PTPCR product can be recovered and assayed with real-time PCR, or discrete PTPCR products can be driven to solid supports, enabling downstream applications such as translation/transcription or sequencing.

...read moreread less

284 citations

Patent•

Paired end sequencing

[...]

Jan Berka, Zhoutao Chen, Michael Egholm, Brian C. Godwin, Stephen K. Hutchison, John H. Leamon, Gary J. Sarkis, Jan Fredrik Simons - Show less +4 more

06 Jun 2006

TL;DR: In this article, the authors proposed a method of preparing a target nucleic acid fragments to produce a smaller nucleic acids which comprises the two ends of the target nucleIC acid.

...read moreread less

Abstract: The present invention provides for a method of preparing a target nucleic acid fragments to produce a smaller nucleic acid which comprises the two ends of the target nucleic acid. Specifically, the invention provides cloning and DNA manipulation strategies to isolate the two ends of a large target nucleic acid into a single small DNA construct for rapid cloning, sequencing, or amplification.

...read moreread less

179 citations

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data

[...]

Aaron McKenna¹, Matthew Hanna, Eric Banks, Andrey Sivachenko, Kristian Cibulskis, Andrew Kernytsky, Kiran V. Garimella, David Altshuler, Stacey Gabriel, Mark J. Daly, Mark A. DePristo - Show less +7 more•Institutions (1)

Broad Institute¹

01 Sep 2010-Genome Research

TL;DR: The GATK programming framework enables developers and analysts to quickly and easily write efficient and robust NGS tools, many of which have already been incorporated into large-scale sequencing projects like the 1000 Genomes Project and The Cancer Genome Atlas.

...read moreread less

Abstract: Next-generation DNA sequencing (NGS) projects, such as the 1000 Genomes Project, are already revolutionizing our understanding of genetic variation among individuals. However, the massive data sets generated by NGS—the 1000 Genome pilot alone includes nearly five terabases—make writing feature-rich, efficient, and robust analysis tools difficult for even computationally sophisticated individuals. Indeed, many professionals are limited in the scope and the ease with which they can answer scientific questions by the complexity of accessing and manipulating the data produced by these machines. Here, we discuss our Genome Analysis Toolkit (GATK), a structured programming framework designed to ease the development of efficient and robust analysis tools for next-generation DNA sequencers using the functional programming philosophy of MapReduce. The GATK provides a small but rich set of data access patterns that encompass the majority of analysis tool needs. Separating specific analysis calculations from common data management infrastructure enables us to optimize the GATK framework for correctness, stability, and CPU and memory efficiency and to enable distributed and shared memory parallelization. We highlight the capabilities of the GATK by describing the implementation and application of robust, scale-tolerant tools like coverage calculators and single nucleotide polymorphism (SNP) calling. We conclude that the GATK programming framework enables developers and analysts to quickly and easily write efficient and robust NGS tools, many of which have already been incorporated into large-scale sequencing projects like the 1000 Genomes Project and The Cancer Genome Atlas.

...read moreread less

20,557 citations

Journal Article•DOI•

An obesity-associated gut microbiome with increased capacity for energy harvest

[...]

Peter J. Turnbaugh¹, Ruth E. Ley, Michael A. Mahowald, Vincent Magrini¹, Elaine R. Mardis¹, Jeffrey I. Gordon - Show less +2 more•Institutions (1)

Washington University in St. Louis¹

21 Dec 2006-Nature

TL;DR: It is demonstrated through metagenomic and biochemical analyses that changes in the relative abundance of the Bacteroidetes and Firmicutes affect the metabolic potential of the mouse gut microbiota and indicates that the obese microbiome has an increased capacity to harvest energy from the diet.

...read moreread less

Abstract: The worldwide obesity epidemic is stimulating efforts to identify host and environmental factors that affect energy balance. Comparisons of the distal gut microbiota of genetically obese mice and their lean littermates, as well as those of obese and lean human volunteers have revealed that obesity is associated with changes in the relative abundance of the two dominant bacterial divisions, the Bacteroidetes and the Firmicutes. Here we demonstrate through metagenomic and biochemical analyses that these changes affect the metabolic potential of the mouse gut microbiota. Our results indicate that the obese microbiome has an increased capacity to harvest energy from the diet. Furthermore, this trait is transmissible: colonization of germ-free mice with an 'obese microbiota' results in a significantly greater increase in total body fat than colonization with a 'lean microbiota'. These results identify the gut microbiota as an additional contributing factor to the pathophysiology of obesity.

...read moreread less

10,126 citations

Journal Article•DOI•

Velvet: Algorithms for de novo short read assembly using de Bruijn graphs

[...]

Daniel R. Zerbino¹, Ewan Birney¹•Institutions (1)

European Bioinformatics Institute¹

01 May 2008-Genome Research

TL;DR: Velvet represents a new approach to assembly that can leverage very short reads in combination with read pairs to produce useful assemblies and is in close agreement with simulated results without read-pair information.

...read moreread less

Abstract: We have developed a new set of algorithms, collectively called "Velvet," to manipulate de Bruijn graphs for genomic sequence assembly. A de Bruijn graph is a compact representation based on short words (k-mers) that is ideal for high coverage, very short read (25-50 bp) data sets. Applying Velvet to very short reads and paired-ends information only, one can produce contigs of significant length, up to 50-kb N50 length in simulations of prokaryotic data and 3-kb N50 on simulated mammalian BACs. When applied to real Solexa data sets without read pairs, Velvet generated contigs of approximately 8 kb in a prokaryote and 2 kb in a mammalian BAC, in close agreement with our simulated results without read-pair information. Velvet represents a new approach to assembly that can leverage very short reads in combination with read pairs to produce useful assemblies.

...read moreread less

9,389 citations

Journal Article•DOI•

Genome sequencing in microfabricated high-density picolitre reactors

[...]

University of California, Berkeley¹, Rockefeller University², Rothberg Institute For Childhood Diseases³

15 Sep 2005-Nature

...read moreread less

8,434 citations

Journal Article•DOI•

Sequencing technologies-the next generation

[...]

Michael L. Metzker¹•Institutions (1)

Baylor College of Medicine¹

01 Jan 2010-Nature Reviews Genetics

TL;DR: A technical review of template preparation, sequencing and imaging, genome alignment and assembly approaches, and recent advances in current and near-term commercially available NGS instruments is presented.

...read moreread less

Abstract: Demand has never been greater for revolutionary technologies that deliver fast, inexpensive and accurate genome information. This challenge has catalysed the development of next-generation sequencing (NGS) technologies. The inexpensive production of large volumes of sequence data is the primary advantage over conventional methods. Here, I present a technical review of template preparation, sequencing and imaging, genome alignment and assembly approaches, and recent advances in current and near-term commercially available NGS instruments. I also outline the broad range of applications for NGS technologies, in addition to providing guidelines for platform selection to address biological questions of interest.

...read moreread less

7,023 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse