scispace - formally typeset
Open AccessJournal ArticleDOI

iATC-mISF: a multi-label classifier for predicting the classes of anatomical therapeutic chemicals.

Reads0
Chats0
TLDR
A multi‐label classifier, called iATC‐mISF, was developed by incorporating the information of chemical‐chemical interaction, the informationOf the structural similarity, and theInformation of the fingerprintal similarity, which showed that the proposed predictor achieved remarkably higher prediction quality than its cohorts for the same purpose.
Abstract
Motivation Given a compound, can we predict which anatomical therapeutic chemical (ATC) class/classes it belongs to? It is a challenging problem since the information thus obtained can be used to deduce its possible active ingredients, as well as its therapeutic, pharmacological and chemical properties. And hence the pace of drug development could be substantially expedited. But this problem is by no means an easy one. Particularly, some drugs or compounds may belong to two or more ATC classes. Results To address it, a multi-label classifier, called iATC-mISF, was developed by incorporating the information of chemical–chemical interaction, the information of the structural similarity, and the information of the fingerprintal similarity. Rigorous cross-validations showed that the proposed predictor achieved remarkably higher prediction quality than its cohorts for the same purpose, particularly in the absolute true rate, the most important and harsh metrics for the multi-label systems. Availability and implementation The web-server for iATC-mISF is accessible at http://www.jci-bioinfo.cn/iATC-mISF. Furthermore, to maximize the convenience for most experimental scientists, a step-by-step guide was provided, by which users can easily get their desired results without needing to go through the complicated mathematical equations. Their inclusion in this article is just for the integrity of the new method and stimulating more powerful methods to deal with various multi-label systems in biology. Contact xxiao@gordonlifescience.org Supplementary information Supplementary data are available at Bioinformatics online.

read more

Citations
More filters
Journal ArticleDOI

Predicting antimicrobial peptides with improved accuracy by incorporating the compositional, physico-chemical and structural features into Chou's general PseAAC.

TL;DR: This study made an attempt to develop a support vector machine (SVM) based computational approach for prediction of AMPs with improved accuracy, and achieved higher accuracy than several existing approaches, while compared using benchmark dataset.
Journal ArticleDOI

iDNA6mA-PseKNC: Identifying DNA N6-methyladenosine sites by incorporating nucleotide physicochemical properties into PseKNC.

TL;DR: A novel predictor called iDNA6mA-PseKNC is proposed that is established by incorporating nucleotide physicochemical properties into Pseudo K-tuple Nucleotide Composition (PSEKNC), and it has been observed via rigorous cross-validations that the predictor's sensitivity, specificity, accuracy, and stability are excellent.
Journal ArticleDOI

iPromoter-2L: a two-layer predictor for identifying promoters and their types by multi-window-based PseKNC.

TL;DR: A two-layer seamless predictor named as 'iPromoter-2 L', which serves to identify a query DNA sequence as a promoter or non-promoter, and the second layer to predict which of the following six types the identified promoter belongs to.
Journal ArticleDOI

iRNA-PseColl: Identifying the Occurrence Sites of Different RNA Modifications by Incorporating Collective Effects of Nucleotides into PseKNC.

TL;DR: A novel platform called “iRNA-PseColl” has been developed, formed by incorporating both the individual and collective features of the sequence elements into the general pseudo K-tuple nucleotide composition (PseKNC) of RNA via the chemicophysical properties and density distribution of its constituent nucleotides.
Journal ArticleDOI

2L-piRNA: A Two-Layer Ensemble Classifier for Identifying Piwi-Interacting RNAs and Their Function

TL;DR: By incorporating the physicochemical properties of nucleotides into the pseudo K-tuple nucleotide composition (PseKNC), a powerful predictor called 2L-piRNA is proposed, a two-layer ensemble classifier in which the first layer is for identifying whether a query RNA molecule is piRNA or non-pi RNA, and the second layer for identifyingWhether a piRNA is with or without the function of instructing target mRNA deadenylation.
References
More filters
Journal ArticleDOI

The KEGG resource for deciphering the genome

TL;DR: A knowledge-based approach for network prediction is developed, which is to predict, given a complete set of genes in the genome, the protein interaction networks that are responsible for various cellular processes.
Journal ArticleDOI

Some remarks on protein attribute prediction and pseudo amino acid composition.

TL;DR: This review is to discuss each of the five procedures of the introduction of pseudo amino acid composition (PseAAC), its different modes and applications as well as its recent development, particularly in how to use the general formulation of PseAAC to reflect the core and essential features that are deeply hidden in complicated protein sequences.
Journal ArticleDOI

Prediction of protein structural classes.

TL;DR: The very high success rate for both the training- set proteins and the testing-set proteins, which has been further validated by a simulated analysis and a jackknife analysis, indicates that it is possible to predict the structural class of a protein according to its amino acid composition if an ideal and complete database can be established.
Journal ArticleDOI

Recent progress in protein subcellular location prediction

TL;DR: The cell is deemed to be the most basic structural and functional unit of all living organisms and often is called a ‘‘building block of life’’ and playing a critical role in generating energy in the eukaryotic cell.
Journal ArticleDOI

iAMP-2L: A two-level multi-label classifier for identifying antimicrobial peptides and their functional types

TL;DR: A multi-label classifier was developed based on the pseudo amino acid composition and fuzzy K-nearest neighbor algorithm, where the components of PseAAC were featured by incorporating five physicochemical properties, called iAMP-2L.