scispace - formally typeset
Search or ask a question
Author

Yong Zhou

Bio: Yong Zhou is an academic researcher from China University of Mining and Technology. The author has contributed to research in topics: Community structure & Autoencoder. The author has an hindex of 11, co-authored 15 publications receiving 421 citations.

Papers
More filters
Journal ArticleDOI
TL;DR: A new computational method for predicting DTIs from drug molecular structure and protein sequence by using the stacked autoencoder of deep learning, which has the advantage that it can automatically mine the hidden information from protein sequences and generate highly representative features through iterations of multiple layers.
Abstract: Identifying the interaction between drugs and target proteins is an important area of drug research, which provides a broad prospect for low-risk and faster drug development. However, due to the limitations of traditional experiments when revealing drug–protein interactions (DTIs), the screening of targets not only takes a lot of time and money but also has high false-positive and false-negative rates. Therefore, it is imperative to develop effective automatic computational methods to accurately predict DTIs in the postgenome era. In this article, we propose a new computational method for predicting DTIs from drug molecular structure and protein sequence by using the stacked autoencoder of deep learning, which can adequately extract the raw data information. The proposed method has the advantage that it can automatically mine the hidden information from protein sequences and generate highly representative features through iterations of multiple layers. The feature descriptors are then constructed...

133 citations

Journal ArticleDOI
TL;DR: The experimental results suggest that the MLMDA model could serve as a useful tool guiding the future experimental validation for those promising miRNA biomarker candidates.
Abstract: Emerging evidences show that microRNA (miRNA) plays an important role in many human complex diseases. However, considering the inherent time-consuming and expensive of traditional in vitro experiments, more and more attention has been paid to the development of efficient and feasible computational methods to predict the potential associations between miRNA and disease. In this work, we present a machine learning-based model called MLMDA for predicting the association of miRNAs and diseases. More specifically, we first use the k-mer sparse matrix to extract miRNA sequence information, and combine it with miRNA functional similarity, disease semantic similarity and Gaussian interaction profile kernel similarity information. Then, more representative features are extracted from them through deep auto-encoder neural network (AE). Finally, the random forest classifier is used to effectively predict potential miRNA–disease associations. The experimental results show that the MLMDA model achieves promising performance under fivefold cross validations with AUC values of 0.9172, which is higher than the methods using different classifiers or different feature combination methods mentioned in this paper. In addition, to further evaluate the prediction performance of MLMDA model, case studies are carried out with three Human complex diseases including Lymphoma, Lung Neoplasm, and Esophageal Neoplasms. As a result, 39, 37 and 36 out of the top 40 predicted miRNAs are confirmed by other miRNA–disease association databases. These prominent experimental results suggest that the MLMDA model could serve as a useful tool guiding the future experimental validation for those promising miRNA biomarker candidates. The source code and datasets explored in this work are available at http://220.171.34.3:81/ .

102 citations

Journal ArticleDOI
TL;DR: A novel computational approach based on protein sequence, namely PDTPS (Predicting Drug Targets with Protein Sequence) to predict DTI is proposed, which has good prediction performance and is a useful tool and suitable for predicting DTI, as well as other bioinformatics tasks.
Abstract: Knowledge of drug-target interaction (DTI) plays an important role in discovering new drug candidates. Unfortunately, there are unavoidable shortcomings; including the time-consuming and expensive nature of the experimental method to predict DTI. Therefore, it motivates us to develop an effective computational method to predict DTI based on protein sequence. In the paper, we proposed a novel computational approach based on protein sequence, namely PDTPS (Predicting Drug Targets with Protein Sequence) to predict DTI. The PDTPS method combines Bi-gram probabilities (BIGP), Position Specific Scoring Matrix (PSSM), and Principal Component Analysis (PCA) with Relevance Vector Machine (RVM). In order to evaluate the prediction capacity of the PDTPS, the experiment was carried out on enzyme, ion channel, GPCR, and nuclear receptor datasets by using five-fold cross-validation tests. The proposed PDTPS method achieved average accuracy of 97.73%, 93.12%, 86.78%, and 87.78% on enzyme, ion channel, GPCR and nuclear receptor datasets, respectively. The experimental results showed that our method has good prediction performance. Furthermore, in order to further evaluate the prediction performance of the proposed PDTPS method, we compared it with the state-of-the-art support vector machine (SVM) classifier on enzyme and ion channel datasets, and other exiting methods on four datasets. The promising comparison results further demonstrate that the efficiency and robust of the proposed PDTPS method. This makes it a useful tool and suitable for predicting DTI, as well as other bioinformatics tasks.

64 citations

Journal ArticleDOI
TL;DR: A new computational method, named distance-based sequence similarity for miRNA-disease association prediction (DBMDA), that directly learns a mapping from miRNA sequence to a Euclidean space and consists of inferring global similarity from region distances that can be figured by chaos game representation algorithm based on the miRNA sequences.
Abstract: MicroRNAs (miRNAs) play a critical role in human diseases. Determining the association between miRNAs and disease contributes to elucidating the pathogenesis of liver diseases and seeking the effective treatment method. Despite great recent advances in the field of the associations between miRNAs and diseases, implementing association verification and recognition efficiently at scale presents serious challenges to biological experimental approaches. Thus, computational methods for predicting miRNA-disease association have become a research hotspot. In this paper, we present a new computational method, named distance-based sequence similarity for miRNA-disease association prediction (DBMDA), that directly learns a mapping from miRNA sequence to a Euclidean space. The notable feature of our approach consists of inferring global similarity from region distances that can be figured by chaos game representation algorithm based on the miRNA sequences. In the 5-fold cross-validation experiment, the area under the curve (AUC) obtained by DBMDA in predicting potential miRNA-disease associations reached 0.9129. To assess the effectiveness of DBMDA more effectively, we compared it with different classifiers and former prediction models. Besides, we constructed two case studies for prostate neoplasms and colon neoplasms. Results show that 39 and 39 out of the top 40 predicted miRNAs were confirmed by other databases, respectively. BDMDA has made new attempts in sequence similarity and achieved excellent results, while at the same time providing a new perspective for predicting the relationship between diseases and miRNAs. The source code and datasets explored in this work are available online from the University of Chinese Academy of Sciences (http://220.171.34.3:81/).

48 citations

Journal ArticleDOI
TL;DR: A novel method by combining ensemble Rotation Forest (RF) classifier and Discrete Cosine Transform (DCT) algorithm to predict the interactions among proteins, which achieves excellent results on Yeast and H. pylori data sets.

47 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: A deep learning based model that uses only sequence information of both targets and drugs to predict DT interaction binding affinities is proposed, outperforming the KronRLS algorithm and SimBoost, a state‐of‐the‐art method for DT binding affinity prediction.
Abstract: Motivation The identification of novel drug-target (DT) interactions is a substantial part of the drug discovery process. Most of the computational methods that have been proposed to predict DT interactions have focused on binary classification, where the goal is to determine whether a DT pair interacts or not. However, protein-ligand interactions assume a continuum of binding strength values, also called binding affinity and predicting this value still remains a challenge. The increase in the affinity data available in DT knowledge-bases allows the use of advanced learning techniques such as deep learning architectures in the prediction of binding affinities. In this study, we propose a deep-learning based model that uses only sequence information of both targets and drugs to predict DT interaction binding affinities. The few studies that focus on DT binding affinity prediction use either 3D structures of protein-ligand complexes or 2D features of compounds. One novel approach used in this work is the modeling of protein sequences and compound 1D representations with convolutional neural networks (CNNs). Results The results show that the proposed deep learning based model that uses the 1D representations of targets and drugs is an effective approach for drug target binding affinity prediction. The model in which high-level representations of a drug and a target are constructed via CNNs achieved the best Concordance Index (CI) performance in one of our larger benchmark datasets, outperforming the KronRLS algorithm and SimBoost, a state-of-the-art method for DT binding affinity prediction. Availability and implementation https://github.com/hkmztrk/DeepDTA. Supplementary information Supplementary data are available at Bioinformatics online.

634 citations

Journal ArticleDOI
TL;DR: The current knowledge concerning the roles of IFN-γ in the TME as a part of the complex immune response to cancer is discussed and the importance of identifying IFn-γ responsive patients to improve their sensitivity to immuno-therapies is highlighted.
Abstract: Interferon-γ (IFN-γ) plays a key role in activation of cellular immunity and subsequently, stimulation of antitumor immune-response. Based on its cytostatic, pro-apoptotic and antiproliferative functions, IFN-γ is considered potentially useful for adjuvant immunotherapy for different types of cancer. Moreover, it IFN-γ may inhibit angiogenesis in tumor tissue, induce regulatory T-cell apoptosis, and/or stimulate the activity of M1 proinflammatory macrophages to overcome tumor progression. However, the current understanding of the roles of IFN-γ in the tumor microenvironment (TME) may be misleading in terms of its clinical application. Some researchers believe it has anti-tumorigenic properties, while others suggest that it contributes to tumor growth and progression. In our recent work, we have shown that concentration of IFN-γ in the TME determines its function. Further, it was reported that tumors treated with low-dose IFN-γ acquired metastatic properties while those infused with high dose led to tumor regression. Pro-tumorigenic role may be described through IFN-γ signaling insensitivity, downregulation of major histocompatibility complexes, upregulation of indoleamine 2,3-dioxygenase, and checkpoint inhibitors such as programmed cell death ligand 1. Significant research efforts are required to decipher IFN-γ-dependent pro- and anti-tumorigenic effects. This review discusses the current knowledge concerning the roles of IFN-γ in the TME as a part of the complex immune response to cancer and highlights the importance of identifying IFN-γ responsive patients to improve their sensitivity to immuno-therapies.

360 citations

Journal ArticleDOI
TL;DR: A survey of the existing algorithms and approaches for the detection of communities in social networks is presented and some of the applications of community detection are discussed.
Abstract: The expansion of the web and emergence of a large number of social networking sites SNS have empowered users to easily interconnect on a shared platform. A social network can be represented by a graph consisting of a set of nodes and edges connecting these nodes. The nodes represent the individuals/entities, and the edges correspond to the interactions among them. The tendency of people with similar tastes, choices, and preferences to get associated in a social network leads to the formation of virtual clusters or communities. Detection of these communities can be beneficial for numerous applications such as finding a common research area in collaboration networks, finding a set of likeminded users for marketing and recommendations, and finding protein interaction networks in biological networks. A large number of community-detection algorithms have been proposed and applied to several domains in the literature. This paper presents a survey of the existing algorithms and approaches for the detection of communities in social networks. We also discuss some of the applications of community detection. WIREs Data Mining Knowl Discov 2016, 6:115-135. doi: 10.1002/widm.1178

247 citations

Journal ArticleDOI
TL;DR: This article describes the data used in computational DTI prediction efforts, and categorize and elaborate the state-of-the-art methods for predicting DTIs, discussing the advantages and disadvantages of each method.
Abstract: Computational prediction of drug-target interactions (DTIs) has become an essential task in the drug discovery process. It narrows down the search space for interactions by suggesting potential interaction candidates for validation via wet-lab experiments that are well known to be expensive and time-consuming. In this article, we aim to provide a comprehensive overview and empirical evaluation on the computational DTI prediction techniques, to act as a guide and reference for our fellow researchers. Specifically, we first describe the data used in such computational DTI prediction efforts. We then categorize and elaborate the state-of-the-art methods for predicting DTIs. Next, an empirical comparison is performed to demonstrate the prediction performance of some representative methods under different scenarios. We also present interesting findings from our evaluation study, discussing the advantages and disadvantages of each method. Finally, we highlight potential avenues for further enhancement of DTI prediction performance as well as related research directions.

201 citations

Journal ArticleDOI
TL;DR: The data required for the task of DTI prediction is described followed by a comprehensive catalog consisting of machine learning methods and databases, which have been proposed and utilized to predict DTIs.
Abstract: The task of predicting the interactions between drugs and targets plays a key role in the process of drug discovery. There is a need to develop novel and efficient prediction approaches in order to avoid costly and laborious yet not-always-deterministic experiments to determine drug-target interactions (DTIs) by experiments alone. These approaches should be capable of identifying the potential DTIs in a timely manner. In this article, we describe the data required for the task of DTI prediction followed by a comprehensive catalog consisting of machine learning methods and databases, which have been proposed and utilized to predict DTIs. The advantages and disadvantages of each set of methods are also briefly discussed. Lastly, the challenges one may face in prediction of DTI using machine learning approaches are highlighted and we conclude by shedding some lights on important future research directions.

192 citations