scispace - formally typeset
Open AccessJournal ArticleDOI

KK-DBP: A Multi-Feature Fusion Method for DNA-Binding Protein Identification Based on Random Forest

Yuran Jia, +2 more
- 29 Nov 2021 - 
- Vol. 12
Reads0
Chats0
TLDR
Wang et al. as mentioned in this paper developed a DNA-binding protein identification method called KK-DBP, which fuses multiple PSSM features to improve prediction accuracy and achieved a prediction accuracy of 81.22%.
Abstract
DNA-binding protein (DBP) is a protein with a special DNA binding domain that is associated with many important molecular biological mechanisms. Rapid development of computational methods has made it possible to predict DBP on a large scale; however, existing methods do not fully integrate DBP-related features, resulting in rough prediction results. In this article, we develop a DNA-binding protein identification method called KK-DBP. To improve prediction accuracy, we propose a feature extraction method that fuses multiple PSSM features. The experimental results show a prediction accuracy on the independent test dataset PDB186 of 81.22%, which is the highest of all existing methods.

read more

Citations
More filters
Journal ArticleDOI

DNAPred_Prot: Identification of DNA-Binding Proteins Using Composition- and Position-Based Features

TL;DR: This work proposes a methodology named “DNAPred_Prot”, which uses various position and frequency-dependent features from protein sequences for efficient and effective prediction of DNA-binding proteins, and it can be predicted that the suggested methodology performs better than other extant methods.
Journal ArticleDOI

PlDBPred: a novel computational model for discovery of DNA binding proteins in plants

TL;DR: Wang et al. as mentioned in this paper developed a comprehensive computational model for plant specific DNA-binding proteins (DBPs) identification, where five shallow learning and six deep learning models were initially used for prediction, where shallow learning methods outperformed deep learning algorithms.
Posted ContentDOI

NTpred: A robust and precise machine learning framework for in-silico identification of Tyrosine nitration sites in protein sequences

TL;DR: In this article , the authors proposed a method to identify Tyrosine nitration (NT) modification by extracting comprehensive features from raw protein sequences using four different sequence encoders.
Journal ArticleDOI

NTpred: a robust and precise machine learning framework for <i>in silico</i> identification of Tyrosine nitration sites in protein sequences

TL;DR: In this article , the authors presented the NTpred framework that is competent in extracting comprehensive features from raw protein sequences using four different sequence encoders and fusing different combinations of individual encodings.
Journal ArticleDOI

Improving DNA-Binding Protein Prediction Using Three-Part Sequence-Order Feature Extraction and a Deep Neural Network Algorithm

TL;DR: In this paper , a new three-part sequence-order feature extraction (TPSO) strategy is developed to extract more discriminative information from protein sequences for predicting the DNA-binding proteins.
References
More filters
Journal ArticleDOI

Basic Local Alignment Search Tool

TL;DR: A new approach to rapid sequence comparison, basic local alignment search tool (BLAST), directly approximates alignments that optimize a measure of local similarity, the maximal segment pair (MSP) score.
Journal ArticleDOI

Protein structure prediction and analysis using the Robetta server

TL;DR: The Robetta server provides automated tools for protein structure prediction and analysis and current capabilities include the prediction of the effects of mutations on protein-protein interactions using computational interface alanine scanning.
Journal ArticleDOI

ADMETlab 2.0: an integrated online platform for accurate and comprehensive predictions of ADMET properties

TL;DR: ADMETlab 2.0 as discussed by the authors is a completely redesigned version of the widely used AMDETlab web server for the predictions of pharmacokinetics and toxicity properties of chemicals, of which the supported ADMET-related endpoints are approximately twice the number of the endpoints in the previous version, including 17 physicochemical properties, 13 medicinal chemistry properties, 23 ADME properties, 27 toxicity endpoints and 8 toxicophore rules.
Journal ArticleDOI

DUNet: A deformable network for retinal vessel segmentation

TL;DR: Wang et al. as discussed by the authors proposed Deformable U-Net (DUNet), which exploits the retinal vessels' local features with a U-shape architecture, in an end-to-end manner for retinal vessel segmentation.
Journal ArticleDOI

A novel features ranking metric with application to scalable visual and bioinformatics data classification

TL;DR: A Max-Relevance-Max-Distance (MRMD) feature ranking method, which balances accuracy and stability of feature ranking and prediction task, and runs faster than other filtering and wrapping methods, such as mRMR and Information Gain.