Home
/
Authors
/
Sharon R. Grossman

Author

Sharon R. Grossman

Other affiliations: Harvard University, Massachusetts Institute of Technology

Bio: Sharon R. Grossman is an academic researcher from Broad Institute. The author has contributed to research in topics: Enhancer & Gene. The author has an hindex of 18, co-authored 21 publications receiving 8613 citations. Previous affiliations of Sharon R. Grossman include Harvard University & Massachusetts Institute of Technology.

Topics: Enhancer, Gene, Promoter, Genome-wide association study, Copy-number variation ...read more

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Integrating common and rare genetic variation in diverse human populations

[...]

David Altshuler¹, Richard A. Gibbs², Leena Peltonen³, Emmanouil T. Dermitzakis⁴, Stephen F. Schaffner¹, Fuli Yu², Penelope E. Bonnen², de Bakker Piw.⁵, Panagiotis Deloukas⁵, Stacey Gabriel¹, R. Gwilliam⁵, Sarah E. Hunt⁵, Michael Inouye⁵, Xiaoming Jia¹, Aarno Palotie, Melissa Parkin¹, Pamela Whittaker⁵, Kyle Chang², Alicia Hawes², Lora Lewis², Yanru Ren², D Wheeler², Donna M. Muzny², Chris P. Barnes⁵, Katayoon Darvishi⁶, Matthew E. Hurles⁵, Joshua M. Korn¹, K. Kristiansson⁵, Charles Lee⁶, S A McCarrol¹, James Nemesh¹, Alon Keinan⁷, Stephen B. Montgomery⁴, Samuela Pollack¹, Alkes L. Price⁶, Nicole Soranzo⁵, Claudia Gonzaga-Jauregui², Verneri Anttila, Wendy Brodeur¹, Mark J. Daly⁶, Stephen Leslie⁸, Gil McVean⁸, Loukas Moutsianas⁸, Huy Nguyen¹, Qingrun Zhang⁵, Ghori Mjr.⁵, Ralph McGinnis⁵, William M. McLaren⁵, Fumihiko Takeuchi⁵, Sharon R. Grossman⁶, Ilya Shlyakhter¹, Elizabeth Hostetter⁶, Pardis C. Sabeti⁶, Clement Adebamowo⁹, Morris W. Foster¹⁰, Deborah R. Gordon¹¹, Julio Licinio¹², M C Manca, Patricia A. Marshall¹³, Ichiro Matsuda¹⁴, D Ngare¹⁵, Vivian Ota Wang¹⁶, D Reddy¹⁷, Charles N. Rotimi¹⁶, Charmaine D.M. Royal¹⁸, Richard R. Sharp¹⁹, Changqing Zeng²⁰, Lisa D. Brooks¹⁶, Jean E. McEwen¹⁶ - Show less +65 more•Institutions (20)

Broad Institute¹, Baylor College of Medicine², University of Helsinki³, University of Geneva⁴, Wellcome Trust Sanger Institute⁵, Harvard University⁶, Cornell University⁷, University of Oxford⁸, University of Maryland, Baltimore⁹, University of Oklahoma¹⁰, University of California, San Francisco¹¹, Australian National University¹², Case Western Reserve University¹³, Health Sciences University of Hokkaido¹⁴, Moi University¹⁵, National Institutes of Health¹⁶, University of Houston–Clear Lake¹⁷, Duke University¹⁸, Cleveland Clinic¹⁹, Chinese Academy of Sciences²⁰

02 Sep 2010-Nature

TL;DR: An expanded public resource of genome variants in global populations supports deeper interrogation of genomic variation and its role in human disease, and serves as a step towards a high-resolution map of the landscape of human genetic variation.

...read moreread less

Abstract: Despite great progress in identifying genetic variants that influence human disease, most inherited risk remains unexplained. A more complete understanding requires genome-wide studies that fully examine less common alleles in populations with a wide range of ancestry. To inform the design and interpretation of such studies, we genotyped 1.6 million common single nucleotide polymorphisms (SNPs) in 1,184 reference individuals from 11 global populations, and sequenced ten 100-kilobase regions in 692 of these individuals. This integrated data set of common and rare alleles, called 'HapMap 3', includes both SNPs and copy number polymorphisms (CNPs). We characterized population-specific differences among low-frequency variants, measured the improvement in imputation accuracy afforded by the larger reference panel, especially in imputing SNPs with a minor allele frequency of

...read moreread less

2,863 citations

Journal Article•DOI•

Detecting Novel Associations in Large Data Sets

[...]

David N. Reshef¹, David N. Reshef², David N. Reshef³, Yakir A. Reshef⁴, Yakir A. Reshef², Hilary K. Finucane⁵, Sharon R. Grossman², Sharon R. Grossman⁴, Gilean McVean⁶, Gilean McVean¹, Peter J. Turnbaugh⁴, Eric S. Lander⁴, Eric S. Lander², Eric S. Lander³, Michael Mitzenmacher⁴, Pardis C. Sabeti⁴, Pardis C. Sabeti² - Show less +13 more•Institutions (6)

University of Oxford¹, Broad Institute², Massachusetts Institute of Technology³, Harvard University⁴, Weizmann Institute of Science⁵, Wellcome Trust Centre for Human Genetics⁶

16 Dec 2011-Science

TL;DR: A measure of dependence for two-variable relationships: the maximal information coefficient (MIC), which captures a wide range of associations both functional and not, and for functional relationships provides a score that roughly equals the coefficient of determination of the data relative to the regression function.

...read moreread less

Abstract: Identifying interesting relationships between pairs of variables in large data sets is increasingly important. Here, we present a measure of dependence for two-variable relationships: the maximal information coefficient (MIC). MIC captures a wide range of associations both functional and not, and for functional relationships provides a score that roughly equals the coefficient of determination (R2) of the data relative to the regression function. MIC belongs to a larger class of maximal information-based nonparametric exploration (MINE) statistics for identifying and classifying relationships. We apply MIC and MINE to data sets in global health, gene expression, major-league baseball, and the human gut microbiota and identify known and novel relationships.

...read moreread less

2,414 citations

Journal Article•DOI•

Mapping copy number variation by population-scale genome sequencing

[...]

Ryan E. Mills¹, Klaudia Walter², Chip Stewart³, Robert E. Handsaker⁴ +371 more•Institutions (21)

03 Feb 2011-Nature

TL;DR: A map of unbalanced SVs is constructed based on whole genome DNA sequencing data from 185 human genomes, integrating evidence from complementary SV discovery approaches with extensive experimental validations, and serves as a resource for sequencing-based association studies.

...read moreread less

Abstract: Genomic structural variants (SVs) are abundant in humans, differing from other forms of variation in extent, origin and functional impact. Despite progress in SV characterization, the nucleotide resolution architecture of most SVs remains unknown. We constructed a map of unbalanced SVs (that is, copy number variants) based on whole genome DNA sequencing data from 185 human genomes, integrating evidence from complementary SV discovery approaches with extensive experimental validations. Our map encompassed 22,025 deletions and 6,000 additional SVs, including insertions and tandem duplications. Most SVs (53%) were mapped to nucleotide resolution, which facilitated analysing their origin and functional impact. We examined numerous whole and partial gene deletions with a genotyping approach and observed a depletion of gene disruptions amongst high frequency deletions. Furthermore, we observed differences in the size spectra of SVs originating from distinct formation mechanisms, and constructed a map of SV hotspots formed by common mechanisms. Our analytical framework and SV map serves as a resource for sequencing-based association studies.

...read moreread less

1,085 citations

A map of human genome variation from population-scale sequencing

[...]

Richard Durbin, David Altshuler, Gonçalo R. Abecasis, David R. Bentley +358 more

01 Oct 2010

TL;DR: The pilot phase of the 1000 Genomes Project is presented, designed to develop and compare different strategies for genome-wide sequencing with high-throughput platforms, and the location, allele frequency and local haplotype structure of approximately 15 million single nucleotide polymorphisms, 1 million short insertions and deletions, and 20,000 structural variants are described.

...read moreread less

599 citations

Journal Article•DOI•

Detecting natural selection in genomic data.

[...]

Joseph J. Vitti¹, Sharon R. Grossman, Pardis C. Sabeti•Institutions (1)

Harvard University¹

25 Nov 2013-Annual Review of Genetics

TL;DR: A comprehensive outline of evolutionary genomics methods is provided, highlighting the importance of functional follow-up studies to characterize putative selected alleles and the use of selection scans as hypothesis-generating tools for investigating evolutionary histories.

...read moreread less

Abstract: The past fifty years have seen the development and application of numerous statistical methods to identify genomic regions that appear to be shaped by natural selection. These methods have been used to investigate the macro- and microevolution of a broad range of organisms, including humans. Here, we provide a comprehensive outline of these methods, explaining their conceptual motivations and statistical interpretations. We highlight areas of recent and future development in evolutionary genomics methods and discuss ongoing challenges for researchers employing such tests. In particular, we emphasize the importance of functional follow-up studies to characterize putative selected alleles and the use of selection scans as hypothesis-generating tools for investigating evolutionary histories.

...read moreread less

589 citations

1
2
3
4
…
5

Cited by

PDF

Open Access

More filters

疟原虫var基因转换速率变化导致抗原变异[英]／Paul H, Robert P, Christodoulou Z, et al//Proc Natl Acad Sci U S A

[...]

宁北芳, 朱淮民

28 Jul 2005

TL;DR: PfPMP1）与感染红细胞、树突状组胞以及胎盘的单个或多个受体作用，在黏附及免疫逃避中起关键的作�ly.

...read moreread less

Abstract: 抗原变异可使得多种致病微生物易于逃避宿主免疫应答。表达在感染红细胞表面的恶性疟原虫红细胞表面蛋白1（PfPMP1）与感染红细胞、内皮细胞、树突状细胞以及胎盘的单个或多个受体作用，在黏附及免疫逃避中起关键的作用。每个单倍体基因组var基因家族编码约60种成员，通过启动转录不同的var基因变异体为抗原变异提供了分子基础。

...read moreread less

18,940 citations

Journal Article•

Statistical method for testing the neutral mutation hypothesis by DNA polymorphism.

[...]

Fumio Tajima¹•Institutions (1)

Kyushu University¹

30 Oct 1989-Genomics

TL;DR: It is suggested that the natural selection against large insertion/deletion is so weak that a large amount of variation is maintained in a population.

...read moreread less

11,521 citations

Journal Article•DOI•

An integrated map of genetic variation from 1,092 human genomes

[...]

Gonçalo R. Abecasis¹, Adam Auton², Lisa D. Brooks³, Mark A. DePristo⁴, Richard Durbin⁵, Robert E. Handsaker⁴, Robert E. Handsaker⁶, Hyun Min Kang¹, Gabor T. Marth⁷, Gil McVean⁸ - Show less +6 more•Institutions (8)

University of Michigan¹, Yeshiva University², National Institutes of Health³, Broad Institute⁴, Wellcome Trust Sanger Institute⁵, Harvard University⁶, Boston College⁷, University of Oxford⁸

01 Nov 2012-Nature

TL;DR: It is shown that evolutionary conservation and coding consequence are key determinants of the strength of purifying selection, that rare-variant load varies substantially across biological pathways, and that each individual contains hundreds of rare non-coding variants at conserved sites, such as motif-disrupting changes in transcription-factor-binding sites.

...read moreread less

Abstract: By characterizing the geographic and functional spectrum of human genetic variation, the 1000 Genomes Project aims to build a resource to help to understand the genetic contribution to disease. Here we describe the genomes of 1,092 individuals from 14 populations, constructed using a combination of low-coverage whole-genome and exome sequencing. By developing methods to integrate information across several algorithms and diverse data sources, we provide a validated haplotype map of 38 million single nucleotide polymorphisms, 1.4 million short insertions and deletions, and more than 14,000 larger deletions. We show that individuals from different populations carry different profiles of rare and common variants, and that low-frequency variants show substantial geographic differentiation, which is further increased by the action of purifying selection. We show that evolutionary conservation and coding consequence are key determinants of the strength of purifying selection, that rare-variant load varies substantially across biological pathways, and that each individual contains hundreds of rare non-coding variants at conserved sites, such as motif-disrupting changes in transcription-factor-binding sites. This resource, which captures up to 98% of accessible single nucleotide polymorphisms at a frequency of 1% in related populations, enables analysis of common and low-frequency variants in individuals from diverse, including admixed, populations.

...read moreread less

7,710 citations

Journal Article•DOI•

A Map of Human Genome Variation From Population-Scale Sequencing

[...]

Gonçalo R. Abecasis¹, David Altshuler², David Altshuler³, Adam Auton⁴, Lisa D Brooks⁵, Richard Durbin⁶, Richard A. Gibbs⁷, Matthew E. Hurles⁶, Gil McVean⁴ - Show less +5 more•Institutions (7)

University of Michigan¹, Harvard University², Broad Institute³, University of Oxford⁴, Johns Hopkins University⁵, Wellcome Trust Sanger Institute⁶, Baylor College of Medicine⁷

28 Oct 2010-Nature

TL;DR: The 1000 Genomes Project aims to provide a deep characterization of human genome sequence variation as a foundation for investigating the relationship between genotype and phenotype as mentioned in this paper, and the results of the pilot phase of the project, designed to develop and compare different strategies for genomewide sequencing with high-throughput platforms.

...read moreread less

Abstract: The 1000 Genomes Project aims to provide a deep characterization of human genome sequence variation as a foundation for investigating the relationship between genotype and phenotype. Here we present results of the pilot phase of the project, designed to develop and compare different strategies for genome-wide sequencing with high-throughput platforms. We undertook three projects: low-coverage whole-genome sequencing of 179 individuals from four populations; high-coverage sequencing of two mother-father-child trios; and exon-targeted sequencing of 697 individuals from seven populations. We describe the location, allele frequency and local haplotype structure of approximately 15 million single nucleotide polymorphisms, 1 million short insertions and deletions, and 20,000 structural variants, most of which were previously undescribed. We show that, because we have catalogued the vast majority of common variation, over 95% of the currently accessible variants found in any individual are present in this data set. On average, each person is found to carry approximately 250 to 300 loss-of-function variants in annotated genes and 50 to 100 variants previously implicated in inherited disorders. We demonstrate how these results can be used to inform association and functional studies. From the two trios, we directly estimate the rate of de novo germline base substitution mutations to be approximately 10(-8) per base pair per generation. We explore the data with regard to signatures of natural selection, and identify a marked reduction of genetic variation in the neighbourhood of genes, due to selection at linked sites. These methods and public data will support the next phase of human genetic research.

...read moreread less

7,538 citations

Journal Article•DOI•

From FastQ Data to High‐Confidence Variant Calls: The Genome Analysis Toolkit Best Practices Pipeline

[...]

Géraldine A. Van der Auwera¹, Mauricio O. Carneiro¹, Christopher Hartl¹, Ryan Poplin¹, Guillermo del Angel¹, Ami Levy-Moonshine¹, Tadeusz Jordan¹, Khalid Shakir¹, David Roazen¹, Joel Thibault¹, Eric Banks¹, Kiran V. Garimella², David Altshuler¹, Stacey Gabriel¹, Mark A. DePristo¹ - Show less +11 more•Institutions (2)

Broad Institute¹, Wellcome Trust Centre for Human Genetics²

15 Oct 2013-Current protocols in human genetics

TL;DR: This unit describes how to use BWA and the Genome Analysis Toolkit to map genome sequencing data to a reference and produce high‐quality variant calls that can be used in downstream analyses.

...read moreread less

Abstract: This unit describes how to use BWA and the Genome Analysis Toolkit (GATK) to map genome sequencing data to a reference and produce high-quality variant calls that can be used in downstream analyses. The complete workflow includes the core NGS data processing steps that are necessary to make the raw data suitable for analysis by the GATK, as well as the key methods involved in variant discovery using the GATK.

...read moreread less

5,150 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse