Inferring MicroRNA-Disease Associations by Random Walk on a Heterogeneous Network with Multiple Data Sources (2017) | Yuansheng Liu

Citations

Journal Article

Predicting miRNA-disease association based on inductive matrix completion.

Xing Chen¹, Lei Wang¹, Jia Qu¹, Na-Na Guan², Jianqiang Li²

China University of Mining and Technology¹, Shenzhen University²

15 Dec 2018-Bioinformatics

TL;DR: A novel model of Inductive Matrix Completion for MiRNA‐Disease Association prediction (IMCMDA) to complete the missing miRNA‐disease association based on the known associations and the integrated miRNA similarity and disease similarity.

Abstract: Motivation: It has been shown that microRNAs (miRNAs) play key roles in a variety of biological processes associated with human diseases. In consideration of the cost and complexity of biological experiments, computational methods for predicting potential associations between miRNAs and diseases would be an effective complement. Results: This paper presents a novel model of Inductive Matrix Completion for MiRNA-Disease Association prediction (IMCMDA). The integrated miRNA similarity and disease similarity are calculated based on miRNA functional similarity, disease semantic similarity and Gaussian interaction profile kernel similarity. The main idea is to complete the missing miRNA-disease associations based on the known associations and the integrated miRNA similarity and disease similarity. IMCMDA achieves an AUC of 0.8034 based on leave-one-out cross-validation and improves on previous models. In addition, IMCMDA was applied to five common human diseases in three types of case studies. In the first type, 42, 44 and 45 out of the top 50 predicted miRNAs for Colon Neoplasms, Kidney Neoplasms and Lymphoma, respectively, were confirmed by experimental reports. In the second type of case study, for new diseases without any known miRNAs, we chose Breast Neoplasms as the test example by hiding the association information between the miRNAs and Breast Neoplasms. As a result, 50 out of the top 50 predicted Breast Neoplasms-related miRNAs were verified. In the third type of case study, IMCMDA was tested on HMDD V1.0 to assess its robustness; 49 out of the top 50 predicted Esophageal Neoplasms-related miRNAs were verified. Availability and implementation: The code and dataset of IMCMDA are freely available at https://github.com/IMCMDAsourcecode/IMCMDA. Supplementary information: Supplementary data are available at Bioinformatics online.
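The abstract names Gaussian interaction profile kernel similarity without defining it. The sketch below uses the commonly cited definition, K(i, j) = exp(-γ ‖IP(i) - IP(j)‖²) with γ = γ′ divided by the mean squared norm of the interaction profiles, and is not taken from the IMCMDA code. The function name `gip_kernel`, the parameter `gamma_prime`, and the toy matrix are hypothetical.

```python
import math

def gip_kernel(adjacency, gamma_prime=1.0):
    """Gaussian interaction profile kernel between the rows of a 0/1 matrix.

    Each row is one entity's interaction profile (e.g. a miRNA's known
    disease associations). gamma_prime=1.0 is an assumed default.
    """
    n = len(adjacency)
    # The bandwidth is normalized by the mean squared norm of the profiles.
    mean_sq_norm = sum(sum(v * v for v in row) for row in adjacency) / n
    if mean_sq_norm == 0:
        raise ValueError("adjacency has no known associations")
    gamma = gamma_prime / mean_sq_norm
    kernel = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            sq_dist = sum((a - b) ** 2 for a, b in zip(adjacency[i], adjacency[j]))
            kernel[i][j] = math.exp(-gamma * sq_dist)
    return kernel

# Hypothetical toy data: 3 miRNAs (rows) by 3 diseases (columns).
toy = [
    [1, 0, 1],
    [1, 0, 1],
    [0, 1, 0],
]
mirna_similarity = gip_kernel(toy)
```

Rows with identical profiles get similarity 1, and similarity decays as profiles diverge. Applying the same function to the transposed matrix would give the disease-side kernel.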

362 citations

Journal Article

A review on machine learning principles for multi-view biological data integration.

Yifeng Li¹, Fang-Xiang Wu², Alioune Ngom³

National Research Council¹, University of Saskatchewan², University of Windsor³

22 Dec 2016-Briefings in Bioinformatics

TL;DR: It is shown that Bayesian models are able to use prior information and model measurements with various distributions, and a range of deep neural networks can be integrated in multi-modal learning for capturing the complex mechanism of biological systems.

Abstract: Driven by high-throughput sequencing techniques, modern genomic and clinical studies are in a strong need of integrative machine learning models for better use of vast volumes of heterogeneous information in the deep understanding of biological systems and the development of predictive models. How data from multiple sources (called multi-view data) are incorporated in a learning system is a key step for successful analysis. In this article, we provide a comprehensive review on omics and clinical data integration techniques, from a machine learning perspective, for various analyses such as prediction, clustering, dimension reduction and association. We shall show that Bayesian models are able to use prior information and model measurements with various distributions; tree-based methods can either build a tree with all features or collectively make a final decision based on trees learned from each view; kernel methods fuse the similarity matrices learned from individual views together for a final similarity matrix or learning model; network-based fusion methods are capable of inferring direct and indirect associations in a heterogeneous network; matrix factorization models have potential to learn interactions among features from different views; and a range of deep neural networks can be integrated in multi-modal learning for capturing the complex mechanism of biological systems.

333 citations

Cites methods from "Inferring MicroRNA-Disease Associat..."

...Random walk methods have been applied on either two-relational heterogeneous networks (such as gene–phenotype associations [88], drug–target interactions [89] and miRNA–disease associations [90, 91]) or multi-relational heterogeneous networks (for example, drug–disease associations [92]) to infer novel candidate relations....
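To make the idea in this excerpt concrete, here is a minimal sketch of a random walk with restart on a small two-relational network. It is a generic illustration rather than the algorithm of any paper listed here: the update rule p ← (1 - r)·W·p + r·p₀ with a column-normalized W is the standard formulation, while the function name, the restart probability of 0.5, and the four-node toy network are assumptions.

```python
def random_walk_with_restart(adjacency, seeds, restart=0.5, tol=1e-10, max_iter=1000):
    """Steady-state visiting probabilities of a walker that restarts at the seeds."""
    n = len(adjacency)
    # Column-normalize so that column j is a distribution over next nodes from j.
    col_sums = [sum(adjacency[i][j] for i in range(n)) for j in range(n)]
    transition = [
        [adjacency[i][j] / col_sums[j] if col_sums[j] else 0.0 for j in range(n)]
        for i in range(n)
    ]
    # The initial distribution is uniform over the seed nodes.
    p0 = [0.0] * n
    for s in seeds:
        p0[s] = 1.0 / len(seeds)
    p = p0[:]
    for _ in range(max_iter):
        nxt = [
            (1 - restart) * sum(transition[i][j] * p[j] for j in range(n))
            + restart * p0[i]
            for i in range(n)
        ]
        # Stop once the L1 change between iterations is negligible.
        if sum(abs(a - b) for a, b in zip(nxt, p)) < tol:
            return nxt
        p = nxt
    return p

# Hypothetical toy network: nodes 0-1 are miRNAs, nodes 2-3 are diseases.
# Edges: miRNA0-miRNA1 (similarity), miRNA0-disease2 (known association),
# disease2-disease3 (similarity).
network = [
    [0, 1, 1, 0],
    [1, 0, 0, 0],
    [1, 0, 0, 1],
    [0, 0, 1, 0],
]
scores = random_walk_with_restart(network, seeds=[3])
```

Seeding the walk at disease 3, which has no known miRNA, ranks miRNA 0 above miRNA 1 because miRNA 0 is reachable through the similar disease 2. This is the sense in which such walks infer indirect candidate relations.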

Journal Article

Machine Learning for Integrating Data in Biology and Medicine: Principles, Practice, and Opportunities.

Marinka Zitnik¹, Francis Nguyen²,³, Bo Wang, Jure Leskovec¹, Anna Goldenberg³, Michael M. Hoffman

Stanford University¹, Princess Margaret Cancer Centre², University of Toronto³

01 Oct 2019-Information Fusion

TL;DR: In this paper, the authors describe the principles of data integration and discuss current methods and available implementations, as well as current challenges in biomedical integrative methods and their perspective on the future development of the field.

212 citations

Journal Article

Machine Learning for Integrating Data in Biology and Medicine: Principles, Practice, and Opportunities

Marinka Zitnik¹, Francis Nguyen²,³, Bo Wang, Jure Leskovec¹, Anna Goldenberg³, Michael M. Hoffman

Stanford University¹, Princess Margaret Cancer Centre², University of Toronto³

30 Jun 2018-arXiv: Quantitative Methods

TL;DR: The principles of data integration are described and current methods and available implementations are discussed and examples of successful data integration in biology and medicine are provided.

Abstract: New technologies have enabled the investigation of biology and human health at an unprecedented scale and in multiple dimensions. These dimensions include a myriad of properties describing genome, epigenome, transcriptome, microbiome, phenotype, and lifestyle. No single data type, however, can capture the complexity of all the factors relevant to understanding a phenomenon such as a disease. Integrative methods that combine data from multiple technologies have thus emerged as critical statistical and computational approaches. The key challenge in developing such approaches is the identification of effective models to provide a comprehensive and relevant systems view. An ideal method can answer a biological or medical question, identifying important features and predicting outcomes, by harnessing heterogeneous data across several dimensions of biological variation. In this Review, we describe the principles of data integration and discuss current methods and available implementations. We provide examples of successful data integration in biology and medicine. Finally, we discuss current challenges in biomedical integrative methods and our perspective on the future development of the field.

149 citations

Journal Article

M6APred-EL: A Sequence-Based Predictor for Identifying N6-methyladenosine Sites Using Ensemble Learning

Leyi Wei¹, Huangrong Chen¹, Ran Su¹,²

Tianjin University¹, Nankai University²

07 Sep 2018-Molecular therapy. Nucleic acids

TL;DR: A novel machine learning-based predictor called M6APred-EL, expected to be a practical and effective tool for the investigation of m6A functional mechanisms, is developed and compared with other state-of-the-art methods on benchmarking datasets.

Abstract: N6-methyladenosine (m6A) modification is the most abundant RNA methylation modification and is involved in various biological processes, such as RNA splicing and degradation. Recent studies have demonstrated the feasibility of identifying m6A peaks using high-throughput sequencing techniques. However, such techniques cannot accurately identify specific methylated sites, which is important for a better understanding of m6A functions. In this study, we develop a novel machine learning-based predictor called M6APred-EL for the identification of m6A sites. To predict m6A sites accurately within genomic sequences, we trained an ensemble of three support vector machine classifiers that explore the position-specific information and physical-chemical information from position-specific k-mer nucleotide propensity, physical-chemical properties, and ring-function-hydrogen-chemical properties. We examined and compared the performance of our predictor with other state-of-the-art methods on benchmarking datasets. Comparative results showed that the proposed M6APred-EL performed more accurately for m6A site identification. Moreover, a user-friendly web server that implements the proposed M6APred-EL has been established and is currently available at http://server.malab.cn/M6APred-EL/. It is expected to be a practical and effective tool for the investigation of m6A functional mechanisms.

144 citations
