scispace - formally typeset
Open AccessJournal ArticleDOI

Integrating logic-based machine learning and virtual screening to discover new drugs

TLDR
Early retrieved compounds showed high topological differences to molecules used as training data, showing the strength of this method for scaffold hopping, and the method was benchmarked on the Directory of Useful Decoys datasets.
Abstract
Investigational Novel Drug Discovery by Example (INDDEx™) is a technology developed to guide hit to lead discovery by learning rules from existing active compounds that link activity to chemical substructure. INDDEx is based on Inductive Logic Programming [1], which learns easily interpretable qualitative logic rules from active ligands that give an insight into chemistry, relate molecular substructure to activity, and can be used to guide the next steps of drug design chemistry. Support Vector Machines weight the rules to produce a quantitative model of structure-activity relationships. Whereas earlier testing [2,3] was performed on single dataset examples, this talk presents the largest and fullest test of the method. The method was benchmarked on the Directory of Useful Decoys (DUD) datasets [4], using the same methodology described in the paper on the assessment of LASSO [5] and DOCK. For each of the DUD datasets, the known active ligands were mixed with all the decoy compounds in DUD, and the retrieval rates of INDDEx and DUD were measured when they were trained on 2, 4, and 8 of the known active ligands (Figure 2). Early retrieved compounds showed high topological differences to molecules used as training data, showing the strength of this method for scaffold hopping. This work was supported by a BBSRC case studentship with Equinox Pharma Ltd (http://www.equinoxpharma.com). Figure 1 Recovery of actives in each of the DUD datasets from all decoys in the DUD, averaged across all 40 datasets.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Ensemble learning method for the prediction of new bioactive molecules.

TL;DR: A recently developed combination of different boosting methods (Adaboost) for the prediction of new bioactive molecules generates better results than other machine learning methods and is suitable for inclusion among the in silico tools for use in cheminformatics, computational chemistry and molecular biology.
Journal ArticleDOI

Cheminformatics analysis of the AR agonist and antagonist datasets in PubChem.

TL;DR: A comprehensive cheminformatics analysis was performed to investigate the structural characteristics and discontinued structure–activity relationship of the pairwise AR datasets and the identified molecular scaffold characteristics, MMPs as well as activity cliffs might provide useful information when designing new lead compounds for the androgen receptor.
References
More filters
Journal ArticleDOI

Inverse entailment and PROGOL

TL;DR: Mode-Directed Inverse Entailment (MDIE) is introduced as a generalisation and enhancement of previous approaches for inverting deduction and an implementation of MDIE in the Progol system is described.
Journal ArticleDOI

Benchmarking sets for molecular docking.

TL;DR: A directory of useful decoys (DUD), with 2950 ligands for 40 different targets, leading to a database of 98,266 compounds, which allowed 40x40 cross-docking, where the enrichments of each ligand set could be compared for all 40 targets, enabling a specificity metric for the docking screens.
Journal ArticleDOI

A novel logic-based approach for quantitative toxicology prediction

TL;DR: The SVILP approach has a major advantage in that it uses ILP automatically and consistently to derive rules, mostly novel, describing fragments that are toxicity alerts, and has the potential of tackling many problems relevant to chemoinformatics including in silico drug design.
Journal ArticleDOI

LASSO—ligand activity by surface similarity order: a new tool for ligand based virtual screening

TL;DR: It is shown that over a wide range of receptor families, eHiTS LASSO is consistently able to enrich screened databases and provides scaffold hopping ability.
Related Papers (5)