A unified analytic framework for prioritization of non-coding variants of uncertain significance in heritable breast and ovarian cancer
read more
Citations
Integrative Genomics Viewer
Prevalence and spectrum of germline rare variants in BRCA1/2 and PALB2 among breast cancer cases in Sarawak, Malaysia.
Assessment of the functional impact of germline BRCA1/2 variants located in non-coding regions in families with breast and/or ovarian cancer predisposition
Next step in molecular genetics of hereditary breast/ovarian cancer: Multigene panel testing in clinical actionably genes and prioritization algorithms in the study of variants of uncertain significance.
Prioritizing variants in complete Hereditary Breast and Ovarian Cancer (HBOC) genes in patients lacking known BRCA mutations
References
Estimates of worldwide burden of cancer in 2008: GLOBOCAN 2008.
The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data
Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology.
An integrated encyclopedia of DNA elements in the human genome
Mfold web server for nucleic acid folding and hybridization prediction
Related Papers (5)
Analysis of protein-coding genetic variation in 60,706 humans
Identification of pathogenic variant enriched regions across genes and gene families
Frequently Asked Questions (17)
Q2. How many antisense strand oligos were synthesized?
11,828 antisense strand oligos were synthesized (3497 ATM, 1591 BRCA1, 2395 BRCA2, 1860 CDH1, 883 CHEK2, 826 PALB2, and 776 TP53).
Q3. How many welldistributed loci were excluded as candidate deletion intervals?
The authors required that > 80 % of the control individuals be heterozyogous for at least two welldistributed loci within these intervals.
Q4. What was the important factor in determining potential hemizygosity?
Highly informative SNPs with a random genomic distribution in the controls (and other public databases) and which were nonpolymorphic in the individual with the suspected deletion were weighted more heavily in inferring potential hemizygosity.
Q5. What are the genes that have been reported to harbor mutations that increase HBOC risk?
The genes are: ATM, BRCA1, BRCA2, CDH1, CHEK2, PALB2, and TP53, and have been reported to harbor mutations that increase HBOC risk [54–76].
Q6. What is the impact of a single nucleotide change in a recognition sequence?
The impact of a single nucleotide change in a recognition sequence can range from insignificant to complete abolition of a protein binding site.
Q7. What is the reason why the NGS protocol is more sensitive?
BRCA coding variants were found in individuals who were previously screened for lesions in these genes, suggesting this NGS protocol is a more sensitive approach for detecting coding changes.
Q8. Why is the exon strength weaker than the natural one?
Although the cryptic exon is strengthened (final Ri,total = 6.9 bits, ΔRi = 14.7 bits), ASSEDA predicts the level of expression of this exon to be negligible, as it is weaker than the natural exon (Ri,total = 8.4 bits) due to the increased length of the predicted exon (+291 nt) [38].
Q9. What are the TFs that have evidence for binding to the promoters of the genes?
The authors identified 141 TFs with evidence for binding to the promoters of the genes the authors sequenced, including c-Myc, C/EBPβ, and Sp1, shown to transcriptionally regulate BRCA1, TP53, and ATM, respectively [98–100].
Q10. What is the role of the IT-based analysis in identifying splicing variants?
IT-based analysis of splicing variants has proven to be robust and accurate (as determined by functional assays for mRNA expression or binding assays) at analyzing splice site (SS) variants, including splicing regulatory factor binding sites (SRFBSs), and in distinguishing them from polymorphisms in both rare and common diseases [36–39].
Q11. What is the strategy to improve variant interpretation in patients?
One strategy to improve variant interpretation in patients is to reduce the full set of variants to a manageable list of potentially pathogenic variants.
Q12. What was the likely to alter stable 2° structures in mRNA?
Variants flagged by SNPfold with the highest probability of altering stable 2° structures in mRNA (where p-value < 0.1) were prioritized.
Q13. How many in silico programs evaluated the effects of the remaining variants?
The predicted effects on protein conservation and function of the remaining variants were evaluated by in silico tools: PolyPhen-2 [118], Mutation Assessor (release 2) [119, 120], and PROVEAN (v1.1.3) [121, 122].
Q14. What are the benefits of interpreting non-coding sequence variants?
The complexity of interpretation of non-coding sequence variants benefits from computational approaches [28] and direct functional analyses [29–33] that may each support evidence of pathogenicity.
Q15. What was the common consequence of false positive variant calls?
As previously reported [147], the authors noted that false positive variant calls within intronic and intergenic regions were the most common consequence of dephasing in low complexity, pyrimidine-enriched intervals.
Q16. What is the average number of variants per patient at each step?
The average number of variants per patient at each step is indicated in a table below each plot, along with the percent reduction in variants from one step to anotherThree prioritized variants have multiple predicted roles: ATM c.1538A >G in missense and SRFBS, CHEK2 c.190G >A in missense and UTR binding, and CHEK2 c.433C >
Q17. What were the likely to have a deleterious impact on protein activity?
Variants predicted by all four programs to be benign were less likely to have a deleterious impact on protein activity; however this did not exclude them from mRNA splicing analysis (described above in IT-Based Variant Analysis).