Institution

Wellcome Trust Centre for Human Genetics

Facility, Oxford, United Kingdom
About: Wellcome Trust Centre for Human Genetics is a facility based in Oxford, United Kingdom. It is known for research contributions in the topics Population and Genome-wide association study. The organization has 2122 authors who have published 4269 publications receiving 433899 citations.


Papers
Journal Article
TL;DR: A general protocol for conducting GWAMAs and carrying out QC to minimize errors and to guarantee maximum use of the data is presented, illustrated with real-world examples from the GIANT Consortium.
Abstract: Rigorous organization and quality control (QC) are necessary to facilitate successful genome-wide association meta-analyses (GWAMAs) of statistics aggregated across multiple genome-wide association studies. This protocol provides guidelines for (i) organizational aspects of GWAMAs, and for (ii) QC at the study file level, the meta-level across studies and the meta-analysis output level. Real-world examples highlight issues experienced and solutions developed by the GIANT Consortium that has conducted meta-analyses including data from 125 studies comprising more than 330,000 individuals. We provide a general protocol for conducting GWAMAs and carrying out QC to minimize errors and to guarantee maximum use of the data. We also include details for the use of a powerful and flexible software package called EasyQC. Precise timings will be greatly influenced by consortium size. For consortia of comparable size to the GIANT Consortium, this protocol takes a minimum of about 10 months to complete.

370 citations

Journal Article
TL;DR: Using a set of validation genotypes at SNPs and bi-allelic indels, it is shown that haplotypes estimated by phasing low-coverage sequence data onto a SNP-array scaffold have lower genotype discordance and improved imputation performance into downstream GWAS samples, especially at low-frequency variants.
Abstract: A major use of the 1000 Genomes Project (1000 GP) data is genotype imputation in genome-wide association studies (GWAS). Here we develop a method to estimate haplotypes from low-coverage sequencing data that can take advantage of single-nucleotide polymorphism (SNP) microarray genotypes on the same samples. First the SNP array data are phased to build a backbone (or 'scaffold') of haplotypes across each chromosome. We then phase the sequence data 'onto' this haplotype scaffold. This approach can take advantage of relatedness between sequenced and non-sequenced samples to improve accuracy. We use this method to create a new 1000 GP haplotype reference set for use by the human genetic community. Using a set of validation genotypes at SNP and bi-allelic indels we show that these haplotypes have lower genotype discordance and improved imputation performance into downstream GWAS samples, especially at low-frequency variants.

369 citations

Journal Article
TL;DR: The results indicate that AD is influenced by genes with general effects on dermal inflammation and immunity.
Abstract: We have carried out a genome screen for atopic dermatitis (AD) and have identified linkage to AD on chromosomes 1q21, 17q25 and 20p. These regions correspond closely with known psoriasis loci, as does a previously identified AD locus on chromosome 3q21. The results indicate that AD is influenced by genes with general effects on dermal inflammation and immunity.

368 citations

Journal Article
TL;DR: Novel genetic associations at viable ADHD candidate genes are identified and confirmatory evidence for associations at previous candidate genes is provided; replication is needed to confirm the proposed genetic variants for ADHD.
Abstract: Attention deficit hyperactivity disorder (ADHD) is a complex condition with environmental and genetic etiologies. Up to this point, research has identified genetic associations with candidate genes from known biological pathways. In order to identify novel ADHD susceptibility genes, 600,000 SNPs were genotyped in 958 ADHD proband-parent trios. After applying data cleaning procedures we examined 429,981 autosomal SNPs in 909 family trios. We generated six quantitative phenotypes from 18 ADHD symptoms to be used in genome-wide association analyses. With the PBAT screening algorithm, we identified 2 SNPs, rs6565113 and rs552655, that met the criteria for significance within a specified phenotype. These SNPs are located in intronic regions of genes CDH13 and GFOD1, respectively. CDH13 has been implicated previously in substance use disorders. We also evaluated the association of SNPs from a list of 37 ADHD candidate genes that was specified a priori. These findings, along with association P-values with a magnitude less than 10⁻⁵, are discussed in this manuscript. Seventeen of these candidate genes had association P-values lower than 0.01: SLC6A1, SLC9A9, HES1, ADRB2, HTR1E, DDC, ADRA1A, DBH, DRD2, BDNF, TPH2, HTR2A, SLC6A2, PER1, CHRNA4, SNAP25, and COMT. Among the candidate genes, SLC9A9 had the strongest overall associations with 58 association test P-values lower than 0.01 and multiple association P-values at a magnitude of 10⁻⁵ in this gene. In sum, these findings identify novel genetic associations at viable ADHD candidate genes and provide confirmatory evidence for associations at previous candidate genes. Replication of these results is necessary in order to confirm the proposed genetic variants for ADHD.

368 citations

Journal Article
TL;DR: Both PacBio and ONT sequencing are suitable for full-length single-molecule transcriptome analysis, and both can benefit from a combined Illumina strategy, as this first use of ONT reads in a Hybrid-Seq analysis has shown.
Abstract: Background: Given the demonstrated utility of Third Generation Sequencing [Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT)] long reads in many studies, a comprehensive analysis and comparison of their data quality and applications is in high demand. Methods: Based on the transcriptome sequencing data from human embryonic stem cells, we analyzed multiple data features of PacBio and ONT, including error pattern, length, mappability and technical improvements over previous platforms. We also evaluated their application to transcriptome analyses, such as isoform identification and quantification and characterization of transcriptome complexity, by comparing the performance of size-selected PacBio, non-size-selected ONT and their corresponding Hybrid-Seq strategies (PacBio+Illumina and ONT+Illumina). Results: PacBio shows overall better data quality, while ONT provides a higher yield. As with data quality, PacBio performs marginally better than ONT in most aspects for both long reads only and Hybrid-Seq strategies in transcriptome analysis. In addition, Hybrid-Seq shows superior performance over long reads only in most transcriptome analyses. Conclusions: Both PacBio and ONT sequencing are suitable for full-length single-molecule transcriptome analysis. As this first use of ONT reads in a Hybrid-Seq analysis has shown, both PacBio and ONT can benefit from a combined Illumina strategy. The tools and analytical methods developed here provide a resource for future applications and evaluations of these rapidly-changing technologies.

368 citations


Authors

Showing all 2127 results

Name                    H-index   Papers   Citations
Mark I. McCarthy        200       1028     187898
John P. A. Ioannidis    185       1311     193612
Gonçalo R. Abecasis     179       595      230323
Simon I. Hay            165       557      153307
Robert Plomin           151       1104     88588
Ashok Kumar             151       5654     164086
Julian Parkhill         149       759      104736
James F. Wilson         146       677      101883
Jeremy K. Nicholson     141       773      80275
Hugh Watkins            128       524      91317
Erik Ingelsson          124       538      85407
Claudia Langenberg      124       452      67326
Adrian V. S. Hill       122       589      64613
John A. Todd            121       515      67413
Elaine Holmes           119       560      58975

Network Information
Related Institutions (5)
Howard Hughes Medical Institute
34.6K papers, 5.2M citations

94% related

National Institutes of Health
297.8K papers, 21.3M citations

94% related

University of Massachusetts Medical School
31.8K papers, 1.9M citations

93% related

Laboratory of Molecular Biology
24.2K papers, 2.1M citations

93% related

Fred Hutchinson Cancer Research Center
30.9K papers, 2.2M citations

92% related

Performance
Metrics
No. of papers from the Institution in previous years
Year    Papers
2022    21
2021    83
2020    74
2019    134
2018    182
2017    323