scispace - formally typeset
Open AccessJournal ArticleDOI

Multivariate Information Fusion With Fast Kernel Learning to Kernel Ridge Regression in Predicting LncRNA-Protein Interactions.

TLDR
A novel method for identifying LPI by employing Kernel Ridge Regression, based on Fast Kernel Learning (LPI-FKLKRR), which has extraordinary performance compared with LPI prediction schemes.
Abstract
Long non-coding RNAs (lncRNAs) constitute a large class of transcribed RNA molecules. They have a characteristic length of more than 200 nucleotides which do not encode proteins. They play an important role in regulating gene expression by interacting with the homologous RNA-binding proteins. Due to the laborious and time-consuming nature of wet experimental methods, more researchers should pay great attention to computational approaches for the prediction of lncRNA-protein interaction (LPI). An in-depth literature review in the state-of-the-art in silico investigations, leads to the conclusion that there is still room for improving the accuracy and velocity. This paper propose a novel method for identifying LPI by employing Kernel Ridge Regression, based on Fast Kernel Learning (LPI-FKLKRR). This approach, uses four distinct similarity measures for lncRNA and protein space, respectively. It is remarkable, that we extract Gene Ontology (GO) with proteins, in order to improve the quality of information in protein space. The process of heterogeneous kernels integration, applies Fast Kernel Learning (FastKL) to deal with weight optimization. The extrapolation model is obtained by gaining the ultimate prediction associations, after using Kernel Ridge Regression (KRR). Experimental outcomes show that the ability of modeling with LPI-FKLKRR has extraordinary performance compared with LPI prediction schemes. On benchmark dataset, it has been observed that the best Area Under Precision Recall Curve (AUPR) of 0.6950 is obtained by our proposed model LPI-FKLKRR, which outperforms the integrated LPLNP (AUPR: 0.4584), RWR (AUPR: 0.2827), CF (AUPR: 0.2357), LPIHN (AUPR: 0.2299), and LPBNI (AUPR: 0.3302). Also, combined with the experimental results of a case study on a novel dataset, it is anticipated that LPI-FKLKRR will be a useful tool for LPI prediction.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Integrating Bipartite Network Projection and KATZ Measure to Identify Novel CircRNA-Disease Associations

TL;DR: A novel computational method, named IBNPKATZ, is developed, which integrates the bipartite network projection algorithm and KATZ measure and is expected to be a useful biomedical research tool for predicting potential circRNA-disease associations.
Journal ArticleDOI

An Ameliorated Prediction of Drug–Target Interactions Based on Multi-Scale Discrete Wavelet Transform and Network Features

TL;DR: A kind of drug–target interactions predictor adopting multi-scale discrete wavelet transform and network features (named as DAWN) in order to solve the DTIs prediction problem.
Journal ArticleDOI

Probing lncRNA-Protein Interactions: Data Repositories, Models, and Algorithms.

TL;DR: To detect the performance of computational methods, parts of LPI prediction models are compared on Leave-One-Out cross-validation (LOOCV) and fivefold cross- validation (AUC) and the results show that SFPEL-LPI obtained the best performance of AUC.
Journal ArticleDOI

Machine-learning and high-throughput studies for high-entropy materials

TL;DR: In this article , the authors review both the materials informatics and experimental approaches for the high-throughput (HT) approach, especially to identify the specified functions for the new HEMs development.
Journal ArticleDOI

A deep learning model for plant lncRNA-protein interaction prediction with graph attention

TL;DR: GPLPI, a graph representation learning method, is proposed to predict plant lncRNA-protein interaction (LPI) from sequence and structural information and consistently outperforms other state-of-the-art methods.
References
More filters
Journal ArticleDOI

Identification of common molecular subsequences.

TL;DR: This letter extends the heuristic homology algorithm of Needleman & Wunsch (1970) to find a pair of segments, one from each of two long sequences, such that there is no other Pair of segments with greater similarity (homology).
Journal ArticleDOI

Matrix Factorization Techniques for Recommender Systems

TL;DR: As the Netflix Prize competition has demonstrated, matrix factorization models are superior to classic nearest neighbor techniques for producing product recommendations, allowing the incorporation of additional information such as implicit feedback, temporal effects, and confidence levels.
Proceedings ArticleDOI

Item-based collaborative filtering recommendation algorithms

TL;DR: This paper analyzes item-based collaborative ltering techniques and suggests that item- based algorithms provide dramatically better performance than user-based algorithms, while at the same time providing better quality than the best available userbased algorithms.
Journal ArticleDOI

What is a support vector machine

TL;DR: Support vector machines are becoming popular in a wide variety of biological applications, but how do they work and what are their most promising applications in the life sciences?
Journal ArticleDOI

Modular regulatory principles of large non-coding RNAs

TL;DR: This work synthesizes studies to provide an emerging model whereby large ncRNAs might achieve regulatory specificity through modularity, assembling diverse combinations of proteins and possibly RNA and DNA interactions.
Related Papers (5)