scispace - formally typeset
Open AccessJournal ArticleDOI

Predicting the functional, molecular, and phenotypic consequences of amino acid substitutions using hidden Markov models.

Reads0
Chats0
TLDR
The Functional Analysis Through Hidden Markov Models (FATHMM) software and server is described: a species‐independent method with optional species‐specific weightings for the prediction of the functional effects of protein missense variants, demonstrating that FATHMM can be efficiently applied to high‐throughput/large‐scale human and nonhuman genome sequencing projects with the added benefit of phenotypic outcome associations.
Abstract
The rate at which nonsynonymous single nucleotide polymorphisms (nsSNPs) are being identified in the human genome is increasing dramatically owing to advances in whole-genome/whole-exome sequencing technologies. Automated methods capable of accurately and reliably distinguishing between pathogenic and functionally neutral nsSNPs are therefore assuming ever-increasing importance. Here, we describe the Functional Analysis Through Hidden Markov Models (FATHMM) software and server: a species-independent method with optional species-specific weightings for the prediction of the functional effects of protein missense variants. Using a model weighted for human mutations, we obtained performance accuracies that outperformed traditional prediction methods (i.e., SIFT, PolyPhen, and PANTHER) on two separate benchmarks. Furthermore, in one benchmark, we achieve performance accuracies that outperform current state-of-the-art prediction methods (i.e., SNPs&GO and MutPred). We demonstrate that FATHMM can be efficiently applied to high-throughput/large-scale human and nonhuman genome sequencing projects with the added benefit of phenotypic outcome associations. To illustrate this, we evaluated nsSNPs in wheat (Triticum spp.) to identify some of the important genetic variants responsible for the phenotypic differences introduced by intense selection during domestication. A Web-based implementation of FATHMM, including a high-throughput batch facility and a downloadable standalone package, is available at http://fathmm.biocompute.org.uk.

read more

Citations
More filters
Posted ContentDOI

3Cnet: Pathogenicity prediction of human variants using knowledge transfer with deep recurrent neural networks

TL;DR: A pathogenicity predictor, named as 3Cnet, is developed using deep recurrent neural networks which analyzes the amino-acid context of a missense mutation to improve the diagnostic rate for patients and discover novel pathogenic variants with high probability.
Journal ArticleDOI

A case report of maturity-onset diabetes of the young (MODY12) in a Chinese Han patient with a novel ABCC8 gene mutation

TL;DR: A 30-year-old Chinese man with a heterozygous missense mutation c.3976G > A (p.Glu1326Lys) was diagnosed with MODY12, which was the new pathogenic mutation for the disease as discussed by the authors .
Posted ContentDOI

Identifying highly-penetrant disease causal mutations using next generation sequencing: Guide to whole process

TL;DR: The primary aim of this review is to document some of the key approaches and provide an analysis schema to make the analysis process more efficient and reliable in the context of discovering highly penetrant causal mutations/genes.
Journal ArticleDOI

Scrutiny of genome-wide somatic mutation profiles in centenarians identifies the key genomic regions for human longevity.

TL;DR: Wang et al. as mentioned in this paper found that centenarian genomes are characterized by a markedly skewed distribution of somatic mutations, with many genomic regions being specifically conserved but displaying a high function potential.
Journal ArticleDOI

Comprehensive bioinformatics analysis of structural and functional consequences of deleterious missense mutations in the human QDPR gene.

TL;DR: Sarma et al. as mentioned in this paper used sequence-based and structure-based tools to assess the protein's biological activity, with several computational tools identifying deleterious SNPs and finding that 10 mutations were harmful and linked to brain and central nervous system disorders.
References
More filters
Journal ArticleDOI

Basic Local Alignment Search Tool

TL;DR: A new approach to rapid sequence comparison, basic local alignment search tool (BLAST), directly approximates alignments that optimize a measure of local similarity, the maximal segment pair (MSP) score.
Journal ArticleDOI

Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

TL;DR: A new criterion for triggering the extension of word hits, combined with a new heuristic for generating gapped alignments, yields a gapped BLAST program that runs at approximately three times the speed of the original.
Journal ArticleDOI

Gene Ontology: tool for the unification of biology

TL;DR: The goal of the Gene Ontology Consortium is to produce a dynamic, controlled vocabulary that can be applied to all eukaryotes even as knowledge of gene and protein roles in cells is accumulating and changing.
Journal ArticleDOI

The Pfam protein families database

TL;DR: The definition and use of family-specific, manually curated gathering thresholds are explained and some of the features of domains of unknown function (also known as DUFs) are discussed, which constitute a rapidly growing class of families within Pfam.
Related Papers (5)