EyeG2P: an automated variant filtering approach improves efficiency of diagnostic genomic testing for inherited ophthalmic disorders.
Summary (2 min read)
Introduction
- Inherited ophthalmic disorders are a major cause of blindness in children and working age adults.
- 1,2 Obtaining a genetic diagnosis in affected individuals can inform management, and clinical genomic testing is increasingly being used as a frontline diagnostic tool for these disorders.
- 3-8 Notably, the more widespread availability of gene-directed interventions including gene therapy and preimplantation genetic testing has increased both the value and risk of genomic testing.
- This places substantial demands on the delivery of testing in a timely and accurate manner.
- Here the authors describe and evaluate the diagnostic utility of EyeG2P, a publically available resource for the analysis of genomic variants identified in genes known as a cause of inherited ophthalmic conditions.
Methods
- The G2P web portal (https://www.ebi.ac.uk/gene2phenotype/)15 was used to develop and curate the ophthalmic disorders panel.
- New entries were initiated by selection of a relevant gene symbol from the list of preloaded genes (with their associated Ensembl identifiers).
- These connections were made after inspecting MEDLINE (through the PubMed interface); search terms included the gene name (HGNC) and the disease name (as a minimum).
- A disease mechanism was defined as both an allelic requirement (mode of inheritance, for example biallelic or monoallelic) and a mutation consequence (mode of pathogenicity, for example lossof-function).
- 15 Each locus-genotypemechanism-disease-evidence link was further characterized by assigning to it a set of phenotype terms (i.e. clinical signs and symptoms) from the Human Phenotype Ontology (HPO).
Sequencing and variant identification
- All genomic sequencing datasets were generated in a tertiary healthcare setting (North West Genomic Laboratory Hub, Manchester, UK; ISO 15189:2012; UKAS Medical reference 9865).
- All data collected is part of routine clinical care.
- Analyses to improve genomic diagnostic services for individuals with inherited ophthalmic conditions, as reported in this study, have been approved by the North West Research Ethics Committee (11/NW/0421 and 15/YH/0365).
- Raw sequencing reads were aligned to the GRCh37 reference genome using BWA-mem,18 with single nucleotide variants (SNVs) and indels identified using GATK.19 Larger and more complex indels were identified using Pindel, and copy number variants (CNVs) were identified using DeCON.20 Variants were filtered using quality and read depth thresholds as well as inhouse allele frequencies.
- The zygosity of CNVs were estimated based on their relative read depths.
Routine diagnostics
- Routine genomic analysis was performed utilizing the Congenica platform.
- This process involves filtering variants based on gene/location depending on the gene panel applied, population frequency and predicted molecular consequence.
- A complete list of presets for variant filtering are available in Supplementary Tables 1&2.
- After pre-filtering, variants were analysed by clinically accredited scientists and variants classified in accordance with the 2015 American College of Medical Genetics and Genomics (ACMG) best practice guidelines.
EyeG2P
- Merged VCF files containg SNVs, indels and CNVs were annotated using the G2P plugin for Ensembl Variant Effect Predictor.15,23.
- This plugin requires an input file which lists genes of interest and their allelic requirements; the authors utilized the EyeG2P dataset and an allele frequency cutoff of 0.001 for variants in monoallelic genes and 0.05 for variants in biallelic genes.
- Prospectively collected data from 83 individuals were also used for comparison.
- The authors assessed the sensitivity and the precision of EyeG2P in comparison to results from routine diagnostic analysis.
- All statistical analyses were performed in R and graphics created in R and BioRender.
Results
- Curation of the literature identified 667 genes for inclusion in EyeG2P Between April 2017 and June 2020, the authors interogated the biomedical literature for genes associated with highly penetrant genetic ophthalmic disorders.
- Within the 559 confirmed gene-disease pairs, the associated inheritance patterns were autosomal dominant in 155, autosomal recessive in 341 , X-linked in 31 , and other patterns (including both autosomal dominant and recessive) in 32 instances.
- The authors assessed the capability of EyeG2P to identify molecular diagnoses in 1234 individuals who had previously undergone diagnostic genetic testing at the North West Genomic Laboratory Hub (Manchester, UK).
- The 1267 variants prioritized by EyeG2P were identified in 166 distinct genes and had diverse predicted molecular consequences .
- For 31/33 individuals (94%), the confirmed molecular diagnosis was highlighted by EyeG2P; an average reduction of 7.4 variants for analysis was possible in each individual .
Discussion
- Characterising the genomic basis of inherited ophthalmic conditions has been shown to inform the management of individuals with these conditions.
- The expansion from single gene based testing methodologies to the routine use of large gene panels, exome and genome sequencing approaches requires that robust and accurate informatics filtering strategies are applied to the generated datasets.
- The authors have released these data as a freely available resource that can be dynamically filtered and revised to best aid the users requirements.
- The authors ability to detect disease-causing genomic variants from high-throughput sequencing datasets has expanded in recent years to include complex structural variants,26,27 exonic deletions and duplications,28,29 deeply intronic variants causing aberrant splicing,30-33 variants in regulatory regions34-36 and complex alleles comprised of combinations of genomic variants common in the general population.
- Moreover, EyeG2P can identify diverse genomic variants across the spectrum of genetically and clinically heterogeneous ophthalmic genetic conditions.
Did you find this useful? Give us your feedback
References
20,557Â citations
17,834Â citations
8,090Â citations
4,658Â citations
2,066Â citations
Related Papers (5)
Frequently Asked Questions (14)
Q2. What plugin was used to annotate variants?
21,22Merged VCF files containg SNVs, indels and CNVs were annotated using the G2P plugin for Ensembl Variant Effect Predictor.15,23Â
Q3. What is the definition of a disease mechanism?
A disease mechanism was defined as both an allelic requirement (mode of inheritance, for example biallelic or monoallelic) and a mutation consequence (mode of pathogenicity, for example lossof-function).Â
Q4. What is the importance of a robust and accurate informatics filtering strategy?
The expansion from single gene based testing methodologies to the routine use of large gene panels, exome and genome sequencing approaches requires that robust and accurate informatics filtering strategies are applied to the generated datasets.Â
Q5. How many variants were identified in the study?
Disease-causing variants were identified in 24 distinct genes; 10 cases had an autosomal dominant, 3 an X-linked and 18 an autosomal recessive disorder.Â
Q6. How many cases of ocular diseases were identified without a diagnosis?
In 10/52 cases without a confirmed diagnosis, variants of uncertain significance were identified in a disease-causing state through EyeG2P analysis, and no additional pathogenic variants were detected after routine analysis.Â
Q7. What is the genetic basis of ophthalmic conditions?
The genetic basis of ophthalmic conditions such as congenital cataract, inherited retinal disorders and optic neuropathies is diverse and includes genes encoded on autosomal, sex and mitochondrial chromosomes.Â
Q8. What is the purpose of this paper?
In conclusion, the authors demonstrate that EyeG2P can be effectively integrated with clinical diagnostic testing for inherited ophthalmic conditions to increase the efficiency of variant analysis.Â
Q9. What was the PPV of the EyeG2P analysis?
Following curation of over 1000 biomedical publications the authors identified 667 relevant genes and determined the associated modes of inheritance, mechanisms of disease causation and phenotypic features.Â
Q10. What are the sources of funding for EyeG2P?
The authors acknowledge funding from the Wellcome Trust Transforming Genomic Medicine Initiative (WT200990/Z/16/Z), the European Molecular Biology Laboratory and the Manchester NIHR Biomedical Research Centre (IS-BRC-1215-20007).Â
Q11. What is the common type of variant that is masked from the initial analysis?
Regions that are highly polymorphic and/or difficult to survey through shortread high-throughput techniques are masked from initial analysis, specifically RP1L1 exon 4, USH1C exon 18 and RPGRorf15.Â
Q12. What is the purpose of the paper?
The authors propose the application of EyeG2P as a firsttier analysis strategy for the diagnosis of inherited opthalmic conditions from high-throughput genomic datasets.Â
Q13. What is the plugin for Ensembl Variant Effect Predictor?
This plugin requires an input file which lists genes of interest and their allelic requirements; the authors utilized the EyeG2P dataset and an allele frequency cutoff of 0.001 for variants in monoallelic genes and 0.05 for variants in biallelic genes.Â
Q14. What is the difference between EyeG2P and the standard diagnostic testing methods?
The authors show that EyeG2P increases the precision and efficiency of genomic testing for inherited ophthalmic conditions over routine approaches for variant analysis, at little cost to overall diagnostic rates.Â