Home
/
Authors
/
Stephen T. Sherry

Author

Stephen T. Sherry

Other affiliations: Louisiana State University, LSU Health Sciences Center New Orleans, University Medical Center New Orleans ...read more

Bio: Stephen T. Sherry is an academic researcher from National Institutes of Health. The author has contributed to research in topics: Population & Human genome. The author has an hindex of 43, co-authored 73 publications receiving 50628 citations. Previous affiliations of Stephen T. Sherry include Louisiana State University & LSU Health Sciences Center New Orleans.

Topics: Population, Human genome, Genomics, Genome, dbSNP ...read more

Papers published on a yearly basis

2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2007
2005
2004
2003
2002
2001
2000
1999
1998
1997
1995
1994
1993
1992
1991

Papers

PDF

Open Access

More filters

Journal Article•DOI•

A global reference for human genetic variation.

[...]

Adam Auton¹, Gonçalo R. Abecasis², David Altshuler³, Richard Durbin⁴ +514 more•Institutions (90)

01 Oct 2015-Nature

TL;DR: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations, and has reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-generation sequencing, deep exome sequencing, and dense microarray genotyping.

...read moreread less

Abstract: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.

...read moreread less

12,661 citations

Journal Article•DOI•

The variant call format and VCFtools

[...]

Petr Danecek¹, Adam Auton², Gonçalo R. Abecasis³, Cornelis A. Albers¹, Eric Banks⁴, Mark A. DePristo⁴, Robert E. Handsaker⁴, Gerton Lunter², Gabor T. Marth⁵, Stephen T. Sherry⁶, Gilean McVean², Richard Durbin¹ - Show less +8 more•Institutions (6)

Wellcome Trust¹, University of Oxford², University of Michigan³, Broad Institute⁴, Boston College⁵, National Institutes of Health⁶

01 Aug 2011-Bioinformatics

TL;DR: VCFtools is a software suite that implements various utilities for processing VCF files, including validation, merging, comparing and also provides a general Perl API.

...read moreread less

Abstract: Summary: The variant call format (VCF) is a generic format for storing DNA polymorphism data such as SNPs, insertions, deletions and structural variants, together with rich annotations. VCF is usually stored in a compressed manner and can be indexed for fast data retrieval of variants from a range of positions on the reference genome. The format was developed for the 1000 Genomes Project, and has also been adopted by other projects such as UK10K, dbSNP and the NHLBI Exome Project. VCFtools is a software suite that implements various utilities for processing VCF files, including validation, merging, comparing and also provides a general Perl API. Availability: http://vcftools.sourceforge.net Contact: [email protected]

...read moreread less

10,164 citations

Journal Article•DOI•

dbSNP: the NCBI database of genetic variation

[...]

Stephen T. Sherry¹, Minghong Ward, Michael Kholodov, Jonathan Baker, Lon Phan, Elizabeth M. Smigielski, Karl Sirotkin - Show less +3 more•Institutions (1)

National Institutes of Health¹

01 Jan 2001-Nucleic Acids Research

TL;DR: The dbSNP database is a general catalog of genome variation to address the large-scale sampling designs required by association studies, gene mapping and evolutionary biology, and is integrated with other sources of information at NCBI such as GenBank, PubMed, LocusLink and the Human Genome Project data.

...read moreread less

Abstract: In response to a need for a general catalog of genome variation to address the large-scale sampling designs required by association studies, gene mapping and evolutionary biology, the National Center for Biotechnology Information (NCBI) has established the dbSNP database [S.T.Sherry, M.Ward and K.Sirotkin (1999) Genome Res., 9, 677–679]. Submissions to dbSNP will be integrated with other sources of information at NCBI such as GenBank, PubMed, LocusLink and the Human Genome Project data. The complete contents of dbSNP are available to the public at website: http://www.ncbi.nlm.nih.gov/SNP. The complete contents of dbSNP can also be downloaded in multiple formats via anonymous FTP at ftp:// ncbi.nlm.nih.gov/snp/.

...read moreread less

6,449 citations

Journal Article•DOI•

The International HapMap Project

[...]

John W. Belmont¹, Paul Hardenbol, Thomas D. Willis, Fuli Yu¹, Huanming Yang², Lan Yang Ch'Ang, Wei Huang³, Bin Liu², Yan Shen³, Paul K.H. Tam⁴, Lap-Chee Tsui⁴, Mary M.Y. Waye⁵, Jeffrey Tze Fei Wong⁶, Changqing Zeng², Qingrun Zhang², Mark S. Chee⁷, Luana Galver⁷, Semyon Kruglyak⁷, Sarah S. Murray⁷, Arnold Oliphant⁷, Alexandre Montpetit⁸, Fanny Chagnon⁸, Vincent Ferretti⁸, Martin Leboeuf⁸, Michael S. Phillips⁸, Andrei Verner⁸, Shenghui Duan⁹, Denise L. Lind¹⁰, Raymond D. Miller⁹, John P. Rice⁹, Nancy L. Saccone⁹, Patricia Taillon-Miller⁹, Ming Xiao¹⁰, Akihiro Sekine, Koki Sorimachi, Yoichi Tanaka, Tatsuhiko Tsunoda, Eiji Yoshino, David R. Bentley¹¹, Sarah E. Hunt¹¹, Don Powell¹¹, Houcan Zhang¹², Ichiro Matsuda¹³, Yoshimitsu Fukushima¹⁴, Darryl Macer¹⁵, Eiko Suda¹⁵, Charles N. Rotimi¹⁶, Clement Adebamowo¹⁷, Toyin Aniagwu¹⁷, Patricia A. Marshall¹⁸, Olayemi Matthew¹⁷, Chibuzor Nkwodimmah¹⁷, Charmaine D.M. Royal¹⁶, Mark Leppert¹⁹, Missy Dixon¹⁹, Fiona Cunningham²⁰, Ardavan Kanani²⁰, Gudmundur A. Thorisson²⁰, Peter E. Chen²¹, David J. Cutler²¹, Carl S. Kashuk²¹, Peter Donnelly²², Jonathan Marchini²², Gilean McVean²², Simon Myers²², Lon R. Cardon²², Andrew P. Morris²², Bruce S. Weir²³, James C. Mullikin²⁴, Michael Feolo²⁴, Mark J. Daly²⁵, Renzong Qiu²⁶, Alastair Kent, Georgia M. Dunston¹⁶, Kazuto Kato²⁷, Norio Niikawa²⁸, Jessica Watkin²⁹, Richard A. Gibbs¹, Erica Sodergren¹, George M. Weinstock¹, Richard K. Wilson⁹, Lucinda Fulton⁹, Jane Rogers¹¹, Bruce W. Birren²⁵, Hua Han², Hongguang Wang, Martin Godbout³⁰, John C. Wallenburg⁸, Paul L'Archevêque, Guy Bellemare, Kazuo Todani, Takashi Fujita, Satoshi Tanaka, Arthur L. Holden, Francis S. Collins²⁴, Lisa D. Brooks²⁴, Jean E. McEwen²⁴, Mark S. Guyer²⁴, Elke Jordan³¹, Jane Peterson²⁴, Jack Spiegel²⁴, Lawrence M. Sung³², Lynn F. Zacharia²⁴, Karen Kennedy²⁹, Michael Dunn²⁹, Richard Seabrook²⁹, Mark Shillito, Barbara Skene²⁹, John Stewart²⁹, David Valle²¹, Ellen Wright Clayton³³, Lynn B. Jorde¹⁹, Aravinda Chakravarti²¹, Mildred K. Cho³⁴, Troy Duster³⁵, Troy Duster³⁶, Morris W. Foster³⁷, Maria Jasperse³⁸, Bartha Maria Knoppers³⁹, Pui-Yan Kwok¹⁰, Julio Licinio⁴⁰, Jeffrey C. Long⁴¹, Pilar N. Ossorio⁴², Vivian Ota Wang³³, Charles N. Rotimi¹⁶, Patricia Spallone²⁹, Patricia Spallone⁴³, Sharon F. Terry⁴⁴, Eric S. Lander²⁵, Eric H. Lai⁴⁵, Deborah A. Nickerson⁴⁶, Gonçalo R. Abecasis⁴¹, David Altshuler⁴⁷, Michael Boehnke⁴¹, Panos Deloukas¹¹, Julie A. Douglas⁴¹, Stacey Gabriel²⁵, Richard R. Hudson⁴⁸, Thomas J. Hudson⁸, Leonid Kruglyak⁴⁹, Yusuke Nakamura⁵⁰, Robert L. Nussbaum²⁴, Stephen F. Schaffner²⁵, Stephen T. Sherry²⁴, Lincoln Stein²⁰, Toshihiro Tanaka - Show less +142 more•Institutions (50)

Baylor College of Medicine¹, Chinese Academy of Sciences², Chinese National Human Genome Center³, University of Hong Kong⁴, The Chinese University of Hong Kong⁵, Hong Kong University of Science and Technology⁶, Illumina⁷, McGill University⁸, Washington University in St. Louis⁹, University of California, San Francisco¹⁰, Wellcome Trust Sanger Institute¹¹, Beijing Normal University¹², Health Sciences University of Hokkaido¹³, Shinshu University¹⁴, University of Tsukuba¹⁵, Howard University¹⁶, University of Ibadan¹⁷, Case Western Reserve University¹⁸, University of Utah¹⁹, Cold Spring Harbor Laboratory²⁰, Johns Hopkins University²¹, University of Oxford²², North Carolina State University²³, National Institutes of Health²⁴, Massachusetts Institute of Technology²⁵, Chinese Academy of Social Sciences²⁶, Kyoto University²⁷, Nagasaki University²⁸, Wellcome Trust²⁹, Genome Canada³⁰, Foundation for the National Institutes of Health³¹, University of Maryland, Baltimore³², Vanderbilt University³³, Stanford University³⁴, University of California, Berkeley³⁵, New York University³⁶, University of Oklahoma³⁷, University of New Mexico³⁸, Université de Montréal³⁹, University of California, Los Angeles⁴⁰, University of Michigan⁴¹, University of Wisconsin-Madison⁴², London School of Economics and Political Science⁴³, Genetic Alliance⁴⁴, GlaxoSmithKline⁴⁵, University of Washington⁴⁶, Harvard University⁴⁷, University of Chicago⁴⁸, Fred Hutchinson Cancer Research Center⁴⁹, University of Tokyo⁵⁰

18 Dec 2003-Nature

TL;DR: The HapMap will allow the discovery of sequence variants that affect common disease, will facilitate development of diagnostic tools, and will enhance the ability to choose targets for therapeutic intervention.

...read moreread less

Abstract: The goal of the International HapMap Project is to determine the common patterns of DNA sequence variation in the human genome and to make this information freely available in the public domain. An international consortium is developing a map of these patterns across the genome by determining the genotypes of one million or more sequence variants, their frequencies and the degree of association between them, in DNA samples from populations with ancestry from parts of Africa, Asia and Europe. The HapMap will allow the discovery of sequence variants that affect common disease, will facilitate development of diagnostic tools, and will enhance our ability to choose targets for therapeutic intervention.

...read moreread less

5,926 citations

Journal Article•DOI•

A haplotype map of the human genome

[...]

John W. Belmont¹, Andrew Boudreau, Suzanne M. Leal¹, Paul Hardenbol +229 more•Institutions (40)

27 Oct 2005

TL;DR: A public database of common variation in the human genome: more than one million single nucleotide polymorphisms for which accurate and complete genotypes have been obtained in 269 DNA samples from four populations, including ten 500-kilobase regions in which essentially all information about common DNA variation has been extracted.

...read moreread less

Abstract: Inherited genetic variation has a critical but as yet largely uncharacterized role in human disease. Here we report a public database of common variation in the human genome: more than one million single nucleotide polymorphisms (SNPs) for which accurate and complete genotypes have been obtained in 269 DNA samples from four populations, including ten 500-kilobase regions in which essentially all information about common DNA variation has been extracted. These data document the generality of recombination hotspots, a block-like structure of linkage disequilibrium and low haplotype diversity, leading to substantial correlations of SNPs with many of their neighbours. We show how the HapMap resource can guide the design and analysis of genetic association studies, shed light on structural variation and recombination, and identify loci that may have been subject to natural selection during human evolution.

...read moreread less

5,479 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Initial sequencing and analysis of the human genome.

[...]

Eric S. Lander¹, Lauren Linton¹, Bruce W. Birren¹, Chad Nusbaum¹ +245 more•Institutions (29)

15 Feb 2001-Nature

TL;DR: The results of an international collaboration to produce and make freely available a draft sequence of the human genome are reported and an initial analysis is presented, describing some of the insights that can be gleaned from the sequence.

...read moreread less

Abstract: The human genome holds an extraordinary trove of information about human development, physiology, medicine and evolution. Here we report the results of an international collaboration to produce and make freely available a draft sequence of the human genome. We also present an initial analysis of the data, describing some of the insights that can be gleaned from the sequence.

...read moreread less

22,269 citations

Journal Article•DOI•

The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data

[...]

Aaron McKenna¹, Matthew Hanna, Eric Banks, Andrey Sivachenko, Kristian Cibulskis, Andrew Kernytsky, Kiran V. Garimella, David Altshuler, Stacey Gabriel, Mark J. Daly, Mark A. DePristo - Show less +7 more•Institutions (1)

Broad Institute¹

01 Sep 2010-Genome Research

TL;DR: The GATK programming framework enables developers and analysts to quickly and easily write efficient and robust NGS tools, many of which have already been incorporated into large-scale sequencing projects like the 1000 Genomes Project and The Cancer Genome Atlas.

...read moreread less

Abstract: Next-generation DNA sequencing (NGS) projects, such as the 1000 Genomes Project, are already revolutionizing our understanding of genetic variation among individuals. However, the massive data sets generated by NGS—the 1000 Genome pilot alone includes nearly five terabases—make writing feature-rich, efficient, and robust analysis tools difficult for even computationally sophisticated individuals. Indeed, many professionals are limited in the scope and the ease with which they can answer scientific questions by the complexity of accessing and manipulating the data produced by these machines. Here, we discuss our Genome Analysis Toolkit (GATK), a structured programming framework designed to ease the development of efficient and robust analysis tools for next-generation DNA sequencers using the functional programming philosophy of MapReduce. The GATK provides a small but rich set of data access patterns that encompass the majority of analysis tool needs. Separating specific analysis calculations from common data management infrastructure enables us to optimize the GATK framework for correctness, stability, and CPU and memory efficiency and to enable distributed and shared memory parallelization. We highlight the capabilities of the GATK by describing the implementation and application of robust, scale-tolerant tools like coverage calculators and single nucleotide polymorphism (SNP) calling. We conclude that the GATK programming framework enables developers and analysts to quickly and easily write efficient and robust NGS tools, many of which have already been incorporated into large-scale sequencing projects like the 1000 Genomes Project and The Cancer Genome Atlas.

...read moreread less

20,557 citations

Journal Article•DOI•

Haploview: analysis and visualization of LD and haplotype maps

[...]

Jeffrey C. Barrett¹, Ben Fry¹, Julian Maller¹, Mark J. Daly¹•Institutions (1)

Massachusetts Institute of Technology¹

15 Jan 2005-Bioinformatics

TL;DR: Haploview is a software package that provides computation of linkage disequilibrium statistics and population haplotype patterns from primary genotype data in a visually appealing and interactive interface.

...read moreread less

Abstract: Summary: Research over the last few years has revealed significant haplotype structure in the human genome. The characterization of these patterns, particularly in the context of medical genetic association studies, is becoming a routine research activity. Haploview is a software package that provides computation of linkage disequilibrium statistics and population haplotype patterns from primary genotype data in a visually appealing and interactive interface. Availability: http://www.broad.mit.edu/mpg/haploview/ Contact: jcbarret@broad.mit.edu

...read moreread less

13,862 citations

Journal Article•DOI•

DnaSP v5

[...]

Pablo Librado¹, Julio Rozas¹•Institutions (1)

University of Barcelona¹

01 Jun 2009-Bioinformatics

TL;DR: Version 5 implements a number of new features and analytical methods allowing extensive DNA polymorphism analyses on large datasets, including visualizing sliding window results integrated with available genome annotations in the UCSC browser.

...read moreread less

Abstract: Motivation: DnaSP is a software package for a comprehensive analysis of DNA polymorphism data. Version 5 implements a number of new features and analytical methods allowing extensive DNA polymorphism analyses on large datasets. Among other features, the newly implemented methods allow for: (i) analyses on multiple data files; (ii) haplotype phasing; (iii) analyses on insertion/deletion polymorphism data; (iv) visualizing sliding window results integrated with available genome annotations in the UCSC browser. Availability: Freely available to academic users from: http://www.ub.edu/dnasp Contact: [email protected]

...read moreread less

13,511 citations

Journal Article•DOI•

A global reference for human genetic variation.

[...]

Adam Auton¹, Gonçalo R. Abecasis², David Altshuler³, Richard Durbin⁴ +514 more•Institutions (90)

01 Oct 2015-Nature

...read moreread less

12,661 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse