Author

Gareth Highnam

Other affiliations: Virginia Tech

Bio: Gareth Highnam is an academic researcher at the Virginia Bioinformatics Institute. The author has contributed to research on the topics of Population and Genetic variation. The author has an h-index of 12 and has co-authored 17 publications that have received 1349 citations. Previous affiliations of Gareth Highnam include Virginia Tech.

Topics: Population, Genetic variation, Genome, Tandem repeat, Gene, ...

Papers

Journal Article•DOI•

Natural variation in genome architecture among 205 Drosophila melanogaster Genetic Reference Panel lines

Wen Huang¹, Andreas Massouras²,³, Yutaka Inoue⁴, Jason A. Peiffer¹, Miquel Ràmia⁵, Aaron M. Tarone⁶, Lavanya Turlapati¹, Thomas Zichner⁷, Dianhui Zhu⁸, Richard F. Lyman¹, Michael M. Magwire¹, Kerstin P. Blankenburg⁸, Mary Anna Carbone¹, Kyle Chang⁸, Lisa L. Ellis⁶, Sonia Fernandez⁸, Yi Han⁸, Gareth Highnam⁹, Carl E. Hjelmen⁶, John Jack¹, Mehwish Javaid⁸, Joy Jayaseelan⁸, Divya Kalra⁸, Sandy Lee⁸, Lora Lewis⁸, Mala Munidasa⁸, Fiona Ongeri⁸, Shohba Patel⁸, Lora Perales⁸, Agapito Perez⁸, Ling-Ling Pu⁸, Stephanie M. Rollmann¹, Robert Ruth⁸, Nehad Saada⁸, Crystal B. Warner⁸, Aneisa Williams⁸, Yuanqing Wu⁸, Akihiko Yamamoto¹, Yiqing Zhang⁸, Yiming Zhu⁸, Robert R. H. Anholt¹, Jan O. Korbel⁷, David Mittelman⁹, Donna M. Muzny⁸, Richard A. Gibbs⁸, Antonio Barbadilla⁵, J. Spencer Johnston⁶, Eric A. Stone¹, Stephen Richards⁸, Bart Deplancke²,³, Trudy F. C. Mackay¹

Institutions: North Carolina State University¹, École Polytechnique Fédérale de Lausanne², Swiss Institute of Bioinformatics³, Osaka University⁴, Autonomous University of Barcelona⁵, Texas A&M University⁶, European Bioinformatics Institute⁷, Baylor College of Medicine⁸, Virginia Bioinformatics Institute⁹

01 Jul 2014-Genome Research

TL;DR: An integrated genotyping strategy identified 4,853,802 single nucleotide polymorphisms (SNPs) and 1,296,080 non-SNP variants in the DGRP, and cytogenetic analysis identified 16 polymorphic inversions; variation in genome size and in many quantitative traits is significantly associated with inversions.

Abstract: The Drosophila melanogaster Genetic Reference Panel (DGRP) is a community resource of 205 sequenced inbred lines, derived to improve our understanding of the effects of naturally occurring genetic variation on molecular and organismal phenotypes. We used an integrated genotyping strategy to identify 4,853,802 single nucleotide polymorphisms (SNPs) and 1,296,080 non-SNP variants. Our molecular population genomic analyses show higher deletion than insertion mutation rates and stronger purifying selection on deletions. Weaker selection on insertions than deletions is consistent with our observed distribution of genome size determined by flow cytometry, which is skewed toward larger genomes. Insertion/deletion and single nucleotide polymorphisms are positively correlated with each other and with local recombination, suggesting that their nonrandom distributions are due to hitchhiking and background selection. Our cytogenetic analysis identified 16 polymorphic inversions in the DGRP. Common inverted and standard karyotypes are genetically divergent and account for most of the variation in relatedness among the DGRP lines. Intriguingly, variation in genome size and many quantitative traits are significantly associated with inversions. Approximately 50% of the DGRP lines are infected with Wolbachia, and four lines have germline insertions of Wolbachia sequences, but effects of Wolbachia infection on quantitative traits are rarely significant. The DGRP complements ongoing efforts to functionally annotate the Drosophila genome. Indeed, 15% of all D. melanogaster genes segregate for potentially damaged proteins in the DGRP, and genome-wide analyses of quantitative traits identify novel candidate genes. The DGRP lines, sequence data, genotypes, quality scores, phenotypes, and analysis and visualization tools are publicly available.

569 citations

Journal Article•DOI•

The landscape of human STR variation

Thomas Willems¹, Melissa Gymrek¹, Gareth Highnam², David Mittelman², Yaniv Erlich¹

Institutions: Massachusetts Institute of Technology¹, Virginia Bioinformatics Institute²

18 Aug 2014-Genome Research

TL;DR: The largest-scale analysis of human STR variation to date is reported, using the call set collected in Phase 1 of the 1000 Genomes Project to analyze determinants of STR variation, assess the human reference genome's representation of STR alleles, find STR loci with common loss-of-function alleles, and obtain initial estimates of the linkage disequilibrium between STRs and common SNPs.

Abstract: Short tandem repeats are among the most polymorphic loci in the human genome. These loci play a role in the etiology of a range of genetic diseases and have been frequently utilized in forensics, population genetics, and genetic genealogy. Despite this plethora of applications, little is known about the variation of most STRs in the human population. Here, we report the largest-scale analysis of human STR variation to date. We collected information for nearly 700,000 STR loci across more than 1000 individuals in Phase 1 of the 1000 Genomes Project. Extensive quality controls show that reliable allelic spectra can be obtained for close to 90% of the STR loci in the genome. We utilize this call set to analyze determinants of STR variation, assess the human reference genome’s representation of STR alleles, find STR loci with common loss-of-function alleles, and obtain initial estimates of the linkage disequilibrium between STRs and common SNPs. Overall, these analyses further elucidate the scale of genetic variation beyond classical point mutations.

227 citations

Journal Article•DOI•

Accurate human microsatellite genotypes from high-throughput resequencing data using informed error profiles

Gareth Highnam¹, Christopher T. Franck¹, Andy Martin¹, Calvin Stephens¹, Ashwin Puthige¹, David Mittelman¹

Institutions: Virginia Bioinformatics Institute¹

01 Jan 2013-Nucleic Acids Research

TL;DR: A tool for genotyping microsatellite repeats called RepeatSeq is presented, which uses Bayesian model selection guided by an empirically derived error model that incorporates sequence and read properties.

Abstract: Repetitive sequences are biologically and clinically important because they can influence traits and disease, but repeats are challenging to analyse using short-read sequencing technology. We present a tool for genotyping microsatellite repeats called RepeatSeq, which uses Bayesian model selection guided by an empirically derived error model that incorporates sequence and read properties. Next, we apply RepeatSeq to high-coverage genomes from the 1000 Genomes Project to evaluate performance and accuracy. The software uses common formats, such as VCF, for compatibility with existing genome analysis pipelines. Source code and binaries are available at http://github.com/adaptivegenome/repeatseq.

157 citations

Journal Article•DOI•

Polymorphic tandem repeats within gene promoters act as modifiers of gene expression and DNA methylation in humans

Javier Quilez¹, Audrey Guilmatre¹, Paras Garg¹, Gareth Highnam², Melissa Gymrek³, Yaniv Erlich³, Ricky S. Joshi¹, David Mittelman², Andrew J. Sharp¹

Institutions: Icahn School of Medicine at Mount Sinai¹, Virginia Bioinformatics Institute², Massachusetts Institute of Technology³

05 May 2016-Nucleic Acids Research

TL;DR: It is suggested that a significant fraction of TR variations exert functional effects via alterations of local gene expression or epigenetics, and it is concluded that targeted studies that focus on genotyping TR variants are required to fully ascertain functional variation in the genome.

Abstract: Despite representing an important source of genetic variation, tandem repeats (TRs) remain poorly studied due to technical difficulties. We hypothesized that TRs can operate as expression (eQTLs) and methylation (mQTLs) quantitative trait loci. To test this we analyzed the effect of variation at 4849 promoter-associated TRs, genotyped in 120 individuals, on neighboring gene expression and DNA methylation. Polymorphic promoter TRs were associated with increased variance in local gene expression and DNA methylation, suggesting functional consequences related to TR variation. We identified >100 TRs associated with expression/methylation levels of adjacent genes. These potential eQTL/mQTL TRs were enriched for overlaps with transcription factor binding and DNaseI hypersensitivity sites, providing a rationale for their effects. Moreover, we showed that most TR variants are poorly tagged by nearby single nucleotide polymorphisms (SNPs) markers, indicating that many functional TR variants are not effectively assayed by SNP-based approaches. Our study assigns biological significance to TR variations in the human genome, and suggests that a significant fraction of TR variations exert functional effects via alterations of local gene expression or epigenetics. We conclude that targeted studies that focus on genotyping TR variants are required to fully ascertain functional variation in the genome.

117 citations

Journal Article•DOI•

TAF1 Variants Are Associated with Dysmorphic Features, Intellectual Disability, and Neurological Manifestations.

Jason O'Rawe¹,², Yiyang Wu¹,², Max Dorfel¹, Alan F. Rope³, P.Y. Billie Au⁴, Jillian S. Parboosingh⁴, Sungjin Moon⁵, Maria Kousi⁵, Konstantina Kosma⁶,⁷, Christopher Smith⁴, Maria Tzetis⁶,⁷, Jane L. Schuette⁸, Robert B. Hufnagel⁹,¹⁰, Carlos E. Prada¹⁰, Francisco Martínez¹¹, Carmen Orellana¹¹, Jonathan Crain¹, Alfonso Caro-Llopis¹¹, Silvestre Oltra¹¹, Sandra Monfort¹¹, Laura T. Jiménez-Barrón¹,¹², Jeffrey Swensen, Sara Ellingwood, Rosemarie Smith¹³, Han Fang¹, Sandra Ospina¹⁴, Sander Stegmann, Nicolette S. den Hollander¹⁵, David Mittelman, Gareth Highnam, Reid J. Robison¹⁶, Edward Yang⁶, Laurence Faivre, Agathe Roubertie¹⁷, Jean Baptiste Rivière, Kristin G. Monaghan¹⁸, Kai Wang¹⁹, Erica E. Davis⁵, Nicholas Katsanis⁵, Vera M. Kalscheuer²⁰, Edith H. Wang²¹, Kay Metcalfe²², Tjitske Kleefstra²³, A. Micheil Innes⁴, Sophia Kitsiou-Tzeli⁷, Mónica Roselló¹¹, Catherine E. Keegan⁸, Gholson J. Lyon¹,²,¹⁶

Institutions: Cold Spring Harbor Laboratory¹, Stony Brook University², Kaiser Permanente³, Alberta Children's Hospital⁴, Duke University⁵, Boston Children's Hospital⁶, National and Kapodistrian University of Athens⁷, University of Michigan⁸, National Institutes of Health⁹, Cincinnati Children's Hospital Medical Center¹⁰, Instituto Politécnico Nacional¹¹, National Autonomous University of Mexico¹², Maine Medical Center¹³, Del Rosario University¹⁴, Leiden University Medical Center¹⁵, Foundation for Biomedical Research¹⁶, University of Montpellier¹⁷, GeneDx¹⁸, University of Southern California¹⁹, Max Planck Society²⁰, University of Washington²¹, Central Manchester University Hospitals NHS Foundation Trust²², Radboud University Nijmegen²³

03 Dec 2015-American Journal of Human Genetics

TL;DR: It is suggested that mutations in TAF1 play a critical role in the development of this X-linked ID syndrome, and knockdown and mutant studies of this gene in zebrafish have shown a quantifiable effect on a neuronal phenotype.

Abstract: We describe an X-linked genetic syndrome associated with mutations in TAF1 and manifesting with global developmental delay, intellectual disability (ID), characteristic facial dysmorphology, generalized hypotonia, and variable neurologic features, all in male individuals. Simultaneous studies using diverse strategies led to the identification of nine families with overlapping clinical presentations and affected by de novo or maternally inherited single-nucleotide changes. Two additional families harboring large duplications involving TAF1 were also found to share phenotypic overlap with the probands harboring single-nucleotide changes, but they also demonstrated a severe neurodegeneration phenotype. Functional analysis with RNA-seq for one of the families suggested that the phenotype is associated with downregulation of a set of genes notably enriched with genes regulated by E-box proteins. In addition, knockdown and mutant studies of this gene in zebrafish have shown a quantifiable, albeit small, effect on a neuronal phenotype. Our results suggest that mutations in TAF1 play a critical role in the development of this X-linked ID syndrome.

96 citations

Cited by

Journal Article•DOI•

A global reference for human genetic variation.

Adam Auton, Gonçalo R. Abecasis, David Altshuler, Richard Durbin, and 514 more authors (90 institutions)

01 Oct 2015-Nature

TL;DR: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations, and has reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping.

Abstract: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.

12,661 citations

Journal Article•

Patterns of Somatic Mutation in Human Cancer Genomes

Michael R. Stratton¹

Institutions: Wellcome Trust Sanger Institute¹

15 Nov 2007-Clinical Cancer Research

TL;DR: In this paper, the coding exons of the family of 518 protein kinases were sequenced in 210 cancers of diverse histological types to explore the nature of the information that will be derived from cancer genome sequencing.

Abstract: AACR Centennial Conference: Translational Cancer Medicine, Nov 4-8, 2007; Singapore. PL02-05. All cancers are due to abnormalities in DNA. The availability of the human genome sequence has led to the proposal that resequencing of cancer genomes will reveal the full complement of somatic mutations and hence all the cancer genes. To explore the nature of the information that will be derived from cancer genome sequencing we have sequenced the coding exons of the family of 518 protein kinases, ~1.3Mb DNA per cancer sample, in 210 cancers of diverse histological types. Despite the screen being directed toward the coding regions of a gene family that has previously been strongly implicated in oncogenesis, the results indicate that the majority of somatic mutations detected are “passengers”. There is considerable variation in the number and pattern of these mutations between individual cancers, indicating substantial diversity of processes of molecular evolution between cancers. The imprints of exogenous mutagenic exposures, mutagenic treatment regimes and DNA repair defects can all be seen in the distinctive mutational signatures of individual cancers. This systematic mutation screen and others have previously yielded a number of cancer genes that are frequently mutated in one or more cancer types and which are now anticancer drug targets (for example BRAF, PIK3CA, and EGFR). However, detailed analyses of the data from our screen additionally suggest that there exist a large number of additional “driver” mutations which are distributed across a substantial number of genes. It therefore appears that cells may be able to utilise mutations in a large repertoire of potential cancer genes to acquire the neoplastic phenotype. However, many of these genes are employed only infrequently. These findings may have implications for future anticancer drug development.

2,737 citations

Journal Article•DOI•

Genomic insights into the origin of farming in the ancient Near East

Iosif Lazaridis¹, Dani Nadel, Gary O. Rollefson², Deborah C. Merrett³, Nadin Rohland, Swapan Mallick¹,⁴, Daniel Fernandes⁵,⁶, Mario Novak⁵, Beatriz Gamarra⁵, Kendra Sirak⁵,⁷, Sarah Connell⁵, Kristin Stewardson⁴, Eadaoin Harney⁴, Qiaomei Fu⁸, Gloria Gonzalez-Fortes⁹, Eppie R. Jones, Songül Alpaslan Roodenberg, György Lengyel¹⁰, Fanny Bocquentin, Boris Gasparian¹¹, Janet Monge¹², Michael Gregg¹², Vered Eshed, Ahuva Sivan Mizrahi, Christopher Meiklejohn¹³, Fokke Gerritsen, Luminita Bejenaru¹⁴, Matthias Blüher, Archie Campbell¹⁵, Gianpiero L. Cavalleri¹⁶, David Comas¹⁷, Philippe Froguel¹⁸, Edmund Gilbert¹⁶, Shona M. Kerr¹⁵, Peter Kovacs, Johannes Krause¹⁹, Darren McGettigan⁵, Michael Merrigan, D. Andrew Merriwether²⁰, Seamus O’Reilly, Martin B. Richards²¹, Ornella Semino²², Michel Shamoon-Pour²⁰, Gheorghe Stefanescu, Michael Stumvoll, Anke Tönjes, Antonio Torroni²², James F. Wilson, Loic Yengo, Nelli Hovhannisyan²³, Nick Patterson¹, Ron Pinhasi⁵, David Reich¹,⁴

Institutions: Broad Institute¹, Whitman College², Simon Fraser University³, Howard Hughes Medical Institute⁴, University College Dublin⁵, University of Coimbra⁶, Emory University⁷, Chinese Academy of Sciences⁸, University of Ferrara⁹, University of Miskolc¹⁰, Armenian National Academy of Sciences¹¹, University of Pennsylvania¹², University of Winnipeg¹³, Alexandru Ioan Cuza University¹⁴, University of Edinburgh¹⁵, Royal College of Surgeons in Ireland¹⁶, Spanish National Research Council¹⁷, Imperial College London¹⁸, Max Planck Society¹⁹, Binghamton University²⁰, University of Huddersfield²¹, University of Pavia²², Yerevan State University²³

25 Aug 2016-Nature

TL;DR: This paper reported genome-wide ancient DNA from 44 ancient Near Easterners ranging in time between ~12,000 and 1,400 bc, from Natufian hunter-gatherers to Bronze Age farmers, showing that the earliest populations of the Near East derived around half their ancestry from a 'Basal Eurasian' lineage that had little if any Neanderthal admixture and that separated from other non-African lineages before their separation from each other.

Abstract: We report genome-wide ancient DNA from 44 ancient Near Easterners ranging in time between ~12,000 and 1,400 bc, from Natufian hunter–gatherers to Bronze Age farmers. We show that the earliest populations of the Near East derived around half their ancestry from a ‘Basal Eurasian’ lineage that had little if any Neanderthal admixture and that separated from other non-African lineages before their separation from each other. The first farmers of the southern Levant (Israel and Jordan) and Zagros Mountains (Iran) were strongly genetically differentiated, and each descended from local hunter–gatherers. By the time of the Bronze Age, these two populations and Anatolian-related farmers had mixed with each other and with the hunter–gatherers of Europe to greatly reduce genetic differentiation. The impact of the Near Eastern farmers extended beyond the Near East: farmers related to those of Anatolia spread westward into Europe; farmers related to those of the Levant spread southward into East Africa; farmers related to those of Iran spread northward into the Eurasian steppe; and people related to both the early farmers of Iran and to the pastoralists of the Eurasian steppe spread eastward into South Asia.

695 citations

Journal Article•DOI•

Assembly and diploid architecture of an individual human genome via single-molecule technologies

Matthew Pendleton¹, Robert Sebra¹, Andy Wing Chun Pang, Ajay Ummat¹, Oscar Franzén¹, Tobias Rausch, Adrian M. Stütz, William Stedman, Thomas Anantharaman, Alex Hastie, Heng Dai, Markus Hsi-Yang Fritz, Han Cao, Ariella Cohain¹, Gintaras Deikus¹, Russell E. Durrett², Scott C. Blanchard², Roger B. Altman², Chen-Shan Chin³, Yan Guo³, Ellen E. Paxinos³, Jan O. Korbel⁴, Robert B. Darnell⁵, W. Richard McCombie⁶, Pui-Yan Kwok⁷, Christopher E. Mason², Eric E. Schadt¹, Ali Bashir¹

Institutions: Icahn School of Medicine at Mount Sinai¹, Cornell University², Pacific Biosciences³, European Bioinformatics Institute⁴, Howard Hughes Medical Institute⁵, Watson School of Biological Sciences⁶, University of California, San Francisco⁷

01 Aug 2015-Nature Methods

TL;DR: This work shows that it is now possible to integrate single-molecule and high-throughput sequence data to generate de novo assembled genomes that approach reference quality.

Abstract: We present the first comprehensive analysis of a diploid human genome that combines single-molecule sequencing with single-molecule genome maps. Our hybrid assembly markedly improves upon the contiguity observed from traditional shotgun sequencing approaches, with scaffold N50 values approaching 30 Mb, and we identified complex structural variants (SVs) missed by other high-throughput approaches. Furthermore, by combining Illumina short-read data with long reads, we phased both single-nucleotide variants and SVs, generating haplotypes with over 99% consistency with previous trio-based studies. Our work shows that it is now possible to integrate single-molecule and high-throughput sequence data to generate de novo assembled genomes that approach reference quality.

492 citations

Journal Article•DOI•

FlyBase: updates to the Drosophila melanogaster knowledge base.

Aoife Larkin¹, Steven J Marygold¹, Giulia Antonazzo¹, Helen Attrill¹, Gilberto dos Santos², Phani V. Garapati¹, Joshua L. Goodman³, L. Sian Gramates², Gillian Millburn¹, Victor B. Strelets³, Christopher J. Tabone², Jim Thurmond³

Institutions: University of Cambridge¹, Harvard University², Indiana University³

08 Jan 2021-Nucleic Acids Research

TL;DR: The introduction of several new features at FlyBase is described, including Pathway Reports, paralog information, disease models based on orthology, customizable tables within reports and overview displays of expression and disease data.

Abstract: FlyBase (flybase.org) is an essential online database for researchers using Drosophila melanogaster as a model organism, facilitating access to a diverse array of information that includes genetic, molecular, genomic and reagent resources. Here, we describe the introduction of several new features at FlyBase, including Pathway Reports, paralog information, disease models based on orthology, customizable tables within reports and overview displays ('ribbons') of expression and disease data. We also describe a variety of recent important updates, including incorporation of a developmental proteome, upgrades to the GAL4 search tab, additional Experimental Tool Reports, migration to JBrowse for genome browsing and improvements to batch queries/downloads and the Fast-Track Your Paper tool.

329 citations
