scispace - formally typeset
Journal ArticleDOI

Epigenetic Target Fishing with Accurate Machine Learning Models.

TLDR
In this article, a large-scale study of 26 318 compounds with a quantitative measure of biological activity for 55 protein targets with epigenetic activity was conducted. And the results indicated that the models reported herein have considerable potential to identify small molecules with epigenetics activity.
Abstract
Epigenetic targets are of significant importance in drug discovery research, as demonstrated by the eight approved epigenetic drugs for treatment of cancer and the increasing availability of chemogenomic data related to epigenetics. This data represents many structure-activity relationships that have not been exploited thus far to develop predictive models to support medicinal chemistry efforts. Herein, we report the first large-scale study of 26 318 compounds with a quantitative measure of biological activity for 55 protein targets with epigenetic activity. We built predictive models with high accuracy for small molecules' epigenetic target profiling through a systematic comparison of the machine learning models trained on different molecular fingerprints. The models were thoroughly validated, showing mean precisions of up to 0.952 for the epigenetic target prediction task. Our results indicate that the models reported herein have considerable potential to identify small molecules with epigenetic activity. Therefore, our results were implemented as a freely accessible web application.

read more

Citations
More filters
Journal ArticleDOI

Epigenetic Target Profiler: A Web Server to Predict Epigenetic Targets of Small Molecules.

TL;DR: The Epigenetic Target Profiler (ETP) as discussed by the authors uses a consensus model based on two binary classification models for each target, relying on support vector machines and built on molecular fingerprints of different design.
Journal ArticleDOI

Advances in the Exploration of the Epigenetic Relevant Chemical Space.

TL;DR: In this paper, the authors review the chemical spaces explored for epigenetic drug discovery and discuss the advances in using structure-activity relationships stored in public chemogenomic databases, and discuss current efforts to chart and identify novel regions of the epigenetic relevant chemical space.
Journal ArticleDOI

Chemoinformatic Analysis of Isothiocyanates: Their Impact in Nature and Medicine.

TL;DR: In this article, a comprehensive analysis of ITCs present in public domain databases, including natural products, food chemicals, macromolecular targets of drugs, and the Protein Data Bank was conducted.
Journal ArticleDOI

KUALA: a machine learning-driven framework for kinase inhibitors repositioning

TL;DR: In this paper , a multi-target priority score and a repurposing threshold were proposed to suggest the best repurposable and non-repurposeable molecules for each target.
Journal ArticleDOI

Virtual Special Issue: Epigenetics 2022.

TL;DR: The Altmetric Attention Score as mentioned in this paper is a quantitative measure of the attention that a research article has received online, which is used to measure the importance of an article in the literature.
References
More filters
Journal Article

Scikit-learn: Machine Learning in Python

TL;DR: Scikit-learn is a Python module integrating a wide range of state-of-the-art machine learning algorithms for medium-scale supervised and unsupervised problems, focusing on bringing machine learning to non-specialists using a general-purpose high-level language.
Journal ArticleDOI

Greedy function approximation: A gradient boosting machine.

TL;DR: A general gradient descent boosting paradigm is developed for additive expansions based on any fitting criterion, and specific algorithms are presented for least-squares, least absolute deviation, and Huber-M loss functions for regression, and multiclass logistic likelihood for classification.
Journal ArticleDOI

Neural networks and physical systems with emergent collective computational abilities

TL;DR: A model of a system having a large number of simple equivalent components, based on aspects of neurobiology but readily adapted to integrated circuits, produces a content-addressable memory which correctly yields an entire memory from any subpart of sufficient size.
Journal ArticleDOI

The random subspace method for constructing decision forests

TL;DR: A method to construct a decision tree based classifier is proposed that maintains highest accuracy on training data and improves on generalization accuracy as it grows in complexity.
Journal ArticleDOI

SMILES, a chemical language and information system. 1. introduction to methodology and encoding rules

TL;DR: This chapter discusses the construction of Benzenoid and Coronoid Hydrocarbons through the stages of enumeration, classification, and topological properties in a number of computers used for this purpose.