Home
/
Authors
/
Adam Auton

Author

Adam Auton

Other affiliations: Broad Institute, Cornell University, University of Oxford ...read more

Bio: Adam Auton is an academic researcher from Albert Einstein College of Medicine. The author has contributed to research in topics: Genome-wide association study & Population. The author has an hindex of 47, co-authored 94 publications receiving 51799 citations. Previous affiliations of Adam Auton include Broad Institute & Cornell University.

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006

Papers

PDF

Open Access

More filters

Posted Content•DOI•

Estimating heritability and its enrichment in tissue-specific gene sets in admixed populations

[...]

Yang Luo, Xinyi Li, Xin Wang, Steven Gazal¹, Steven Gazal², Josep M. Mercader¹, Josep M. Mercader², Benjamin M. Neale¹, Benjamin M. Neale², Jose C. Florez², Jose C. Florez¹, Adam Auton, Alkes L. Price², Alkes L. Price¹, Hilary K. Finucane¹, Soumya Raychaudhuri - Show less +12 more•Institutions (2)

Broad Institute¹, Harvard University²

25 May 2019-bioRxiv

TL;DR: Cov-LDSC is introduced, a method to accurately estimate genetic heritability and its enrichment in both homogenous and admixed populations with summary statistics and in-sample LD estimates, and develops a computationally efficient method to answer two specific questions.

...read moreread less

Abstract: The increasing size and diversity of genome-wide association studies provide an exciting opportunity to study how the genetics of complex traits vary among diverse populations. Here, we introduce covariate-adjusted LD score regression (cov-LDSC), a method to accurately estimate genetic heritability and its enrichment in both homogenous and admixed populations with summary statistics and in-sample LD estimates. In-sample LD can be estimated from a subset of the GWAS samples, allowing our method to be applied efficiently to very large cohorts. In simulations, we show that unadjusted LDSC underestimates by 10% − 60% in admixed populations; in contrast, cov-LDSC is robust to all simulation parameters. We apply cov-LDSC to genotyping data from approximately 170,000 Latino, 47,000 African American and 135,000 European individuals. We estimate and detect heritability enrichment in three quantitative and five dichotomous phenotypes respectively, making this, to our knowledge, the most comprehensive heritability-based analysis of admixed individuals. Our results show that most traits have high concordance of and consistent tissue-specific heritability enrichment among different populations. However, for age at menarche, we observe population-specific heritability estimates of . We observe consistent patterns of tissue-specific heritability enrichment across populations; for example, in the limbic system for BMI, the per-standardized-annotation effect size τ* is 0.16 ± 0.04, 0.28 ± 0.11 and 0.18 ± 0.03 in Latino, African American and European populations respectively. Our results demonstrate that our approach is a powerful way to analyze genetic data for complex traits from underrepresented populations. Author summary Admixed populations such as African Americans and Hispanic Americans bear a disproportionately high burden of disease but remain underrepresented in current genetic studies. It is important to extend current methodological advancements for understanding the genetic basis of complex traits in homogeneous populations to individuals with admixed genetic backgrounds. Here, we develop a computationally efficient method to answer two specific questions. First, does genetic variation contribute to the same amount of phenotypic variation (heritability) across diverse populations? Second, are the genetic mechanisms shared among different populations? To answer these questions, we use our novel method to conduct the first comprehensive heritability-based analysis of a large number of admixed individuals. We show that there is a high degree of concordance in total heritability and tissue-specific enrichment between different ancestral groups. However, traits such as age at menarche show a noticeable differences among populations. Our work provides a powerful way to analyze genetic data in admixed populations and may contribute to the applicability of genomic medicine to admixed population groups.

...read moreread less

12 citations

Posted Content•DOI•

Shared genetic background between children and adults with attention deficit/hyperactivity disorder

[...]

Paula Rovira¹, Ditte Demontis², Ditte Demontis³, Cristina Sánchez-Mora¹, Tetyana Zayats⁴, Tetyana Zayats⁵, Tetyana Zayats⁶, Marieke Klein⁷, Marieke Klein⁸, Nina Roth Mota⁸, Heike Weber⁹, Heike Weber¹⁰, Iris Garcia-Martínez, Mireia Pagerols¹, Laura Vilar¹, Lorena Arribas¹, Vanesa Richarte¹, Montserrat Corrales¹, Christian Fadeuilhe¹, Rosa Bosch¹, Gemma Martín¹, Peter Almos¹⁰, Alysa E. Doyle⁴, Eugenio H. Grevet¹¹, Oliver Grimm⁹, Anne Halmøy⁶, Anne Halmøy¹², Martine Hoogman⁸, Mara H. Hutz¹¹, Christian Jacob¹⁰, Sarah Kittel-Schneider⁹, Per M. Knappskog¹², Per M. Knappskog⁶, Astri J. Lundervold⁶, Olga Rivero¹⁰, Diego L. Rovaris¹¹, Angélica Salatino-Oliveira¹¹, Bruna Santos da Silva¹¹, Evgeniy Svirin¹³, Evgeniy Svirin¹⁰, Emma Sprooten⁸, Tatyana Strekalova¹⁴, Tatyana Strekalova¹³, Tatyana Strekalova¹⁰, Ole A. Andreassen¹⁵, Ole A. Andreassen¹⁶, Tobias Banaschewski, Mark A. Bellgrove¹⁷, Joseph Biederman⁴, Christie L. Burton, Jennifer Crosbie¹⁸, Søren Dalsgaard², Søren Dalsgaard³, Josephine Elia¹⁹, Josephine Elia²⁰, Hakon Hakonarson²¹, Hakon Hakonarson²², Catharina A. Hartman²³, Ziarih Hawi¹⁷, Johannes Hebebrand²⁴, Anke Hinney²⁴, Sandra K. Loo²⁵, James J. McGough²⁵, Benjamin M. Neale, Robert D. Oades²⁴, Ted Reichborn-Kjennerud²⁶, Aribert Rothenberger, Russell Schachar¹⁸, Irwin D. Waldman²⁷, Irwin D. Waldman⁵, Michelle Agee, Babak Alipanahi, Adam Auton, Robert K. Bell, Katarzyna Bryc, Sarah L. Elson, Pierre Fontanillas, Nicholas A. Furlotte, David A. Hinds, Karen E. Huber, Aaron Kleinman, Nadia K. Litterman, Jennifer C. McCreight, Matthew H. McIntyre, Joanna L. Mountain, Elizabeth S. Noblin, Carrie A.M. Northover, Steven J. Pitts, J. Fah Sathirapongsasuti, Olga V. Sazonova, Janie F. Shelton, Suyash Shringarpure, Chao Tian, Joyce Y. Tung, Vladimir Vacic, Xin Wang, Catherine H. Wilson, Alejandro Arias-Vasquez⁸, Edmund J.S. Sonuga-Barke²⁸, Edmund J.S. Sonuga-Barke³, Philip Asherson²⁸, Claiton H.D. Bau¹¹, Jan K. Buitelaar⁸, Bru Cormand, Stephen V. Faraone²⁹, Jan Haavik¹², Jan Haavik⁶, Stefan Johansson¹², Stefan Johansson⁶, Jonna Kuntsi²⁸, Henrik Larsson³⁰, Henrik Larsson³¹, Klaus-Peter Lesch¹⁰, Klaus-Peter Lesch¹⁴, Klaus-Peter Lesch¹³, Andreas Reif⁹, Luis Augusto Rohde¹¹, Miquel Casas, Anders D. Børglum², Anders D. Børglum³, Barbara Franke⁸, Josep Antoni Ramos-Quiroga¹, María Soler Artigas¹, Marta Ribasés¹ - Show less +120 more•Institutions (31)

Autonomous University of Barcelona¹, Lundbeck², Aarhus University³, Harvard University⁴, Broad Institute⁵, University of Bergen⁶, Utrecht University⁷, Radboud University Nijmegen⁸, Goethe University Frankfurt⁹, University of Würzburg¹⁰, Universidade Federal do Rio Grande do Sul¹¹, Haukeland University Hospital¹², I.M. Sechenov First Moscow State Medical University¹³, Maastricht University¹⁴, Oslo University Hospital¹⁵, University of Oslo¹⁶, Monash University¹⁷, University of Toronto¹⁸, Thomas Jefferson University¹⁹, Alfred I. duPont Hospital for Children²⁰, University of Pennsylvania²¹, Children's Hospital of Philadelphia²², University Medical Center Groningen²³, University of Duisburg-Essen²⁴, Semel Institute for Neuroscience and Human Behavior²⁵, Norwegian Institute of Public Health²⁶, Emory University²⁷, King's College London²⁸, State University of New York Upstate Medical University²⁹, Karolinska Institutet³⁰, Örebro University³¹

28 Mar 2019-bioRxiv

TL;DR: It is confirmed that persistent ADHD in adults is a neurodevelopmental disorder and the existing hypothesis of a shared genetic architecture underlying ADHD and different traits to a lifespan perspective is extended.

...read moreread less

Abstract: Attention deficit/hyperactivity disorder (ADHD) is a common neurodevelopmental disorder characterized by age-inappropriate symptoms of inattention, impulsivity and hyperactivity that persist into adulthood in the majority of the diagnosed children. Despite several risk factors during childhood predicting the persistence of ADHD symptoms into adulthood, the genetic architecture underlying the trajectory of ADHD over time is still unclear. We set out to study the contribution of common genetic variants to the risk for ADHD across the lifespan by conducting meta-analyses of genome-wide association studies on persistent ADHD in adults and ADHD in childhood separately and comparing the genetic background between them in a total sample of 17,149 cases and 32,411 controls. Our results show nine new independent genome-wide significant loci and support a shared contribution of common genetic variants to ADHD in children and adults. No subgroup heterogeneity was observed among children, while this group consists of future remitting and persistent individuals. We report similar patterns of genetic correlation of ADHD with other ADHD-related datasets and different traits and disorders among adults, children and when combining both groups. These findings confirm that persistent ADHD in adults is a neurodevelopmental disorder and extend the existing hypothesis of a shared genetic architecture underlying ADHD and different traits to a lifespan perspective.

...read moreread less

11 citations

Journal Article•DOI•

Genome-wide association study of REM sleep behavior disorder identifies polygenic risk and brain expression effects

[...]

Lynne Krohn, Karl Heilbron, Cornelis Blauwendraat, Regina H. Reynolds, Eric Yu, Konstantin Senkevich, Uladzislau Rudakou, Mehrdad Asghari Estiar, Emil K. Gustavsson, Kajsa Brolin, Jennifer A. Ruskey, Kathryn A. Freeman, Farnaz Asayesh, Ruth Chia, Isabelle Arnulf, Michele T.M. Hu, Jacques Montplaisir, J. F. Gagnon, Alex Desautels, Yves Dauvilliers, Gian Luigi Gigli, Mariarosaria Valente, Francesco Janes, Andrea Bernardini, Birgit Högl, Ambra Stefani, Abubaker Mohamed Ahmed Ibrahim, Karel Sonka, David Kemlink, W. Oertel, Annette Janzen, Giuseppe Plazzi, F. Biscarini, Elena Antelmi, Michela Figorilli, Monica Puligheddu, Brit Mollenhauer, Claudia Trenkwalder, Friederike Sixel-Döring, Valérie Cochen De Cock, Christelle Charley Monaca, Anna Heidbreder, Luigi Ferini-Strambi, Femke Dijkstra, Mineke K. Viaene, B. Abril, Bradley F. Boeve, Stella Aslibekyan, Adam Auton, Elizabeth Babalola, Robert K. Bell, Jessica Bielenberg, Katarzyna Bryc, Emily Bullis, Daniel L. Coker, Gabriel Cuellar-Partida, Devika Dhamija, Sayantan Das, Sarah L. Elson, Teresa J. Filshtein, Kipper Fletez-Brant, Pierre Fontanillas, Will Freyman, P. Gandhi, B. Hicks, David A. Hinds, Ethan M. Jewett, Yunxuan Jiang, Katelyn Kukar, Keng-Han Lin, Maya Lowe, J. McCreight, Matthew H. McIntyre, Steven J. Micheletti, Meghan E. Moreno, Joanna L. Mountain, Priyanka Nandakumar, Elizabeth S. Noblin, Jared O'Connell, A. Petrakovitz, G. David Poznik, Morgan Schumacher, Anjali J. Shastri, Janie F. Shelton, Jingchunzi Shi, Suyash Shringarpure, Vinh Tran, Joyce Y. Tung, Xin Wang, Wei Wang, Catherine H. Weldon, Peter Wilton, Alejandro Sanchez Hernandez, Corinna Wong, Christophe Toukam Tchakoute, Sonja W. Scholz, Mina Ryten, Sara Bandres-Ciga, Alastair J. Noyce, Paul Cannon, Lasse Pihlstrøm, Mike A. Nalls, Andrew B. Singleton, Guy A. Rouleau, Ronald B. Postuma, Ziv Gan-Or - Show less +102 more

01 Dec 2022-Nature Communications

TL;DR: This paper performed a genome-wide association study of RBD, identifying five RBD risk loci near SNCA, GBA, TMEM175, INPP5F, and SCARB2.

...read moreread less

Abstract: Abstract Rapid-eye movement (REM) sleep behavior disorder (RBD), enactment of dreams during REM sleep, is an early clinical symptom of alpha-synucleinopathies and defines a more severe subtype. The genetic background of RBD and its underlying mechanisms are not well understood. Here, we perform a genome-wide association study of RBD, identifying five RBD risk loci near SNCA, GBA, TMEM175, INPP5F, and SCARB2 . Expression analyses highlight SNCA-AS1 and potentially SCARB2 differential expression in different brain regions in RBD, with SNCA-AS1 further supported by colocalization analyses. Polygenic risk score, pathway analysis, and genetic correlations provide further insights into RBD genetics, highlighting RBD as a unique alpha-synucleinopathy subpopulation that will allow future early intervention.

...read moreread less

9 citations

Posted Content•DOI•

Discovery of 42 genome-wide significant loci associated with dyslexia

[...]

Catherine Doust¹, Pierre Fontanillas, Else Eising², Scott D. Gordon³, Zhengjun Wang⁴, Gökberk Alagöz², Barbara Molz², Beate St Pourcain², Clyde Francks², Riccardo E. Marioni¹, Jingjing Zhao⁴, Silvia Paracchini⁵, Joel B. Talcott, Anthony P. Monaco⁶, John F. Stein⁷, Jeffrey R. Gruen⁸, Richard K. Olson⁹, Erik G. Willcutt⁹, John C. DeFries⁹, Bruce F. Pennington¹⁰, Shelley D. Smith¹¹, Margaret J. Wright¹², Nicholas G. Martin³, Adam Auton, Timothy C. Bates¹, Simon E. Fisher², Michelle Luciano¹ - Show less +23 more•Institutions (12)

University of Edinburgh¹, Max Planck Society², QIMR Berghofer Medical Research Institute³, Shaanxi Normal University⁴, University of St Andrews⁵, Tufts University⁶, University of Oxford⁷, Yale University⁸, University of Colorado Boulder⁹, University of Denver¹⁰, University of Nebraska Medical Center¹¹, University of Queensland¹²

22 Aug 2021-medRxiv

TL;DR: This article found 42 independent genome-wide significant loci: 17 are in genes linked to or pleiotropic with cognitive ability/educational attainment; 25 are novel and may be more specifically associated with dyslexia.

...read moreread less

Abstract: Reading and writing are crucial for many aspects of modern life but up to 1 in 10 children are affected by dyslexia [1, 2], which can persist into adulthood. Family studies of dyslexia suggest heritability up to 70% [3, 4], yet no convincing genetic markers have been found due to limited study power [5]. Here, we present a genome-wide association study representing a 20-fold increase in sample size from prior work, with 51,800 adults self-reporting a dyslexia diagnosis and 1,087,070 controls. We identified 42 independent genome-wide significant loci: 17 are in genes linked to or pleiotropic with cognitive ability/educational attainment; 25 are novel and may be more specifically associated with dyslexia. Twenty-three loci (12 novel) were validated in independent cohorts of Chinese and European ancestry. We confirmed a similar genetic aetiology of dyslexia between sexes, and found genetic covariance with many traits, including ambidexterity, but not neuroanatomical measures of language-related circuitry. Causal analyses revealed a directional effect of dyslexia on attention deficit hyperactivity disorder and bidirectional effects on socio-educational traits but these relationships require further investigation. Dyslexia polygenic scores explained up to 6% of variance in reading traits in independent cohorts, and might in future enable earlier identification and remediation of dyslexia.

...read moreread less

9 citations

Posted Content•DOI•

Genetic predisposition to mosaic Y chromosome loss in blood is associated with genomic instability in other tissues and susceptibility to non-haematological cancers

[...]

Deborah Nunn¹, Giulio Genovese², Giulio Genovese³, Jonatan Halvardson⁴, Jacob C. Ulirsch², Jacob C. Ulirsch³, Daniel J Wright¹, Daniel J Wright⁵, Chikashi Terao⁶, Olafur B. Davidsson⁷, Felix R. Day¹, Patrick Sulem⁷, Y Jiang, Marcus Danielsson⁴, Hanna Davies⁴, Joe Dennis¹, Malcolm G. Dunlop⁸, Douglas F. Easton¹, VA Fisher, Florian Zink⁷, Richard S. Houlston⁹, Martin Ingelsson¹⁰, Siddhartha Kar¹, Nicola D. Kerrison¹, B Kinnersley⁷, Ragnar P. Kristjansson⁷, Philip J. Law⁹, Rong Li¹¹, Chey Loveday⁹, Jonas Mattisson⁴, Steve McCarroll², Steve McCarroll³, Yusuke Murakami¹², Anna Murray¹³, Paweł Olszewski¹⁴, Edyta Rychlicka-Buniowska⁴, Edyta Rychlicka-Buniowska¹⁴, Robert A. Scott¹, Unnur Thorsteinsdottir⁷, Unnur Thorsteinsdottir¹⁵, Ian Tomlinson¹⁶, B Torabi Moghadam¹, Clare Turnbull¹⁷, Clare Turnbull⁹, Nicholas J. Wareham¹, Daniel F. Gudbjartsson¹⁵, Daniel F. Gudbjartsson⁷, Intergral-Ilcco, Cimba, Yoichiro Kamatani¹⁸, Eva Hoffmann¹⁹, Stephen P. Jackson¹, Kari Stefansson⁷, Kari Stefansson¹⁵, Adam Auton, Ken K. Ong¹, Mitchell J. Machiela, Po-Ru Loh²⁰, Po-Ru Loh³, Jan P. Dumanski⁴, Jan P. Dumanski¹⁴, Stephen J. Chanock, Lars Forsberg¹⁰, Lars Forsberg⁴, John R. B. Perry¹ - Show less +61 more•Institutions (20)

University of Cambridge¹, Harvard University², Broad Institute³, Science for Life Laboratory⁴, Wellcome Trust Sanger Institute⁵, University of Shizuoka⁶, Amgen⁷, Western General Hospital⁸, Institute of Cancer Research⁹, Uppsala University¹⁰, Johns Hopkins University School of Medicine¹¹, University of Tokyo¹², Royal Devon and Exeter Hospital¹³, Gdańsk Medical University¹⁴, University of Iceland¹⁵, University of Birmingham¹⁶, Queen Mary University of London¹⁷, Kyoto University¹⁸, University of Copenhagen¹⁹, Brigham and Women's Hospital²⁰

06 Aug 2019-bioRxiv

TL;DR: This research has been conducted using the UK Biobank Resource under application 9905 and 19808 and was supported by the Medical Research Council [Unit Programme number MC_UU_12015/2].

...read moreread less

Abstract: This research has been conducted using the UK Biobank Resource under application 9905 and 19808. This work was supported by the Medical Research Council [Unit Programme number MC_UU_12015/2]. Full study-specific and individual acknowledgements can be found in the supplementary information.

...read moreread less

8 citations

1
2
3
4
5
6
7
8
9
10
11
12
…
13
14
15
16
17
18
19
…
20
21
22

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Fast gapped-read alignment with Bowtie 2

[...]

Ben Langmead¹, Steven L. Salzberg², Steven L. Salzberg¹, Steven L. Salzberg³•Institutions (3)

University of Maryland, College Park¹, Johns Hopkins University², Johns Hopkins University School of Medicine³

01 Apr 2012-Nature Methods

TL;DR: Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.

...read moreread less

Abstract: As the rate of sequencing increases, greater throughput is demanded from read aligners. The full-text minute index is often used to make alignment very fast and memory-efficient, but the approach is ill-suited to finding longer, gapped alignments. Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.

...read moreread less

37,898 citations

疟原虫var基因转换速率变化导致抗原变异[英]／Paul H, Robert P, Christodoulou Z, et al//Proc Natl Acad Sci U S A

[...]

宁北芳, 朱淮民

28 Jul 2005

TL;DR: PfPMP1）与感染红细胞、树突状组胞以及胎盘的单个或多个受体作用，在黏附及免疫逃避中起关键的作�ly.

...read moreread less

Abstract: 抗原变异可使得多种致病微生物易于逃避宿主免疫应答。表达在感染红细胞表面的恶性疟原虫红细胞表面蛋白1（PfPMP1）与感染红细胞、内皮细胞、树突状细胞以及胎盘的单个或多个受体作用，在黏附及免疫逃避中起关键的作用。每个单倍体基因组var基因家族编码约60种成员，通过启动转录不同的var基因变异体为抗原变异提供了分子基础。

...read moreread less

18,940 citations

Journal Article•DOI•

A global reference for human genetic variation.

[...]

Adam Auton¹, Gonçalo R. Abecasis², David Altshuler³, Richard Durbin⁴ +514 more•Institutions (90)

01 Oct 2015-Nature

TL;DR: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations, and has reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-generation sequencing, deep exome sequencing, and dense microarray genotyping.

...read moreread less

Abstract: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.

...read moreread less

12,661 citations

Journal Article•DOI•

Integrative genomics viewer

[...]

James T. Robinson¹, Helga Thorvaldsdottir¹, Wendy Winckler¹, Mitchell Guttman¹, Eric S. Lander², Eric S. Lander¹, Gad Getz¹, Jill P. Mesirov¹ - Show less +4 more•Institutions (2)

Massachusetts Institute of Technology¹, Harvard University²

10 Jan 2011-Nature Biotechnology

TL;DR: In this article, the authors present an approach for efficient and intuitive visualization tools able to scale to very large data sets and to flexibly integrate multiple data types, including clinical data.

...read moreread less

Abstract: Rapid improvements in sequencing and array-based platforms are resulting in a flood of diverse genome-wide data, including data from exome and whole-genome sequencing, epigenetic surveys, expression profiling of coding and noncoding RNAs, single nucleotide polymorphism (SNP) and copy number profiling, and functional assays. Analysis of these large, diverse data sets holds the promise of a more comprehensive understanding of the genome and its relation to human disease. Experienced and knowledgeable human review is an essential component of this process, complementing computational approaches. This calls for efficient and intuitive visualization tools able to scale to very large data sets and to flexibly integrate multiple data types, including clinical data. However, the sheer volume and scope of data pose a significant challenge to the development of such tools.

...read moreread less

10,798 citations

Journal Article•DOI•

The variant call format and VCFtools

[...]

Petr Danecek¹, Adam Auton², Gonçalo R. Abecasis³, Cornelis A. Albers¹, Eric Banks⁴, Mark A. DePristo⁴, Robert E. Handsaker⁴, Gerton Lunter², Gabor T. Marth⁵, Stephen T. Sherry⁶, Gilean McVean², Richard Durbin¹ - Show less +8 more•Institutions (6)

Wellcome Trust¹, University of Oxford², University of Michigan³, Broad Institute⁴, Boston College⁵, National Institutes of Health⁶

01 Aug 2011-Bioinformatics

TL;DR: VCFtools is a software suite that implements various utilities for processing VCF files, including validation, merging, comparing and also provides a general Perl API.

...read moreread less

Abstract: Summary: The variant call format (VCF) is a generic format for storing DNA polymorphism data such as SNPs, insertions, deletions and structural variants, together with rich annotations. VCF is usually stored in a compressed manner and can be indexed for fast data retrieval of variants from a range of positions on the reference genome. The format was developed for the 1000 Genomes Project, and has also been adopted by other projects such as UK10K, dbSNP and the NHLBI Exome Project. VCFtools is a software suite that implements various utilities for processing VCF files, including validation, merging, comparing and also provides a general Perl API. Availability: http://vcftools.sourceforge.net Contact: [email protected]

...read moreread less

10,164 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse