Integrating logic-based machine learning and virtual screening to discover new drugs

doi:10.1186/1758-2946-4-S1-O10

Open Access | Journal Article

Christopher R. Reynolds, +1 more

- 01 May 2012 -

Journal of Cheminformatics

- Vol. 4, Iss: 1, pp 1-2

TLDR

Early retrieved compounds showed high topological differences to molecules used as training data, showing the strength of this method for scaffold hopping, and the method was benchmarked on the Directory of Useful Decoys datasets.

Abstract:

Investigational Novel Drug Discovery by Example (INDDEx™) is a technology developed to guide hit-to-lead discovery by learning rules from existing active compounds that link activity to chemical substructure. INDDEx is based on Inductive Logic Programming [1], which learns easily interpretable qualitative logic rules from active ligands. These rules give an insight into chemistry, relate molecular substructure to activity, and can be used to guide the next steps of drug design chemistry. Support Vector Machines weight the rules to produce a quantitative model of structure-activity relationships. Whereas earlier testing [2,3] was performed on single dataset examples, this talk presents the largest and fullest test of the method. The method was benchmarked on the Directory of Useful Decoys (DUD) datasets [4], using the same methodology described in the paper on the assessment of LASSO [5] and DOCK. For each of the DUD datasets, the known active ligands were mixed with all the decoy compounds in DUD, and the retrieval rates of INDDEx and DUD were measured when they were trained on 2, 4, and 8 of the known active ligands (Figure 1). Early retrieved compounds showed high topological differences to molecules used as training data, showing the strength of this method for scaffold hopping.

This work was supported by a BBSRC case studentship with Equinox Pharma Ltd (http://www.equinoxpharma.com).

Figure 1: Recovery of actives in each of the DUD datasets from all decoys in the DUD, averaged across all 40 datasets.
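The two ideas in the abstract, scoring compounds by weighted logic rules and measuring how many actives are recovered early in a ranked list, can be illustrated with a minimal sketch. Everything below is hypothetical: the substructure labels, the three rules, the weights (hand-picked here, not learned by a Support Vector Machine), and the ten-compound library are invented for illustration and are not taken from INDDEx or from DUD.

```python
from typing import Callable, Dict, FrozenSet, List, Sequence

# Assumption: a molecule is reduced to a set of substructure labels.
Molecule = FrozenSet[str]
# A qualitative rule either fires on a molecule or it does not.
Rule = Callable[[Molecule], bool]

# Hypothetical rules of the kind a logic-based learner might produce.
RULES: List[Rule] = [
    lambda m: {"aromatic_ring", "amide"} <= m,  # both substructures present
    lambda m: "sulfonamide" in m,
    lambda m: "nitro" in m,
]
# Hypothetical weights; in the described method an SVM would supply these.
WEIGHTS: List[float] = [2.0, 1.0, -1.5]


def score(mol: Molecule, rules: Sequence[Rule], weights: Sequence[float]) -> float:
    """Sum the weights of every rule that fires on the molecule."""
    return sum(w for rule, w in zip(rules, weights) if rule(mol))


def recovery(scores: Sequence[float], is_active: Sequence[bool],
             fractions: Sequence[float]) -> Dict[float, float]:
    """Fraction of all actives found in the top-scoring fraction of the library."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    total_actives = sum(is_active)
    result = {}
    for f in fractions:
        top = order[: max(1, round(f * len(order)))]
        result[f] = sum(is_active[i] for i in top) / total_actives
    return result


# Toy library: two actives mixed with eight decoys (all invented).
LIBRARY = [
    (frozenset({"aromatic_ring", "amide"}), True),
    (frozenset({"sulfonamide", "aromatic_ring"}), True),
    (frozenset({"aromatic_ring"}), False),
    (frozenset({"nitro", "amide"}), False),
    (frozenset({"aromatic_ring", "amide", "nitro"}), False),
    (frozenset({"ester"}), False),
    (frozenset({"sulfonamide", "nitro"}), False),
    (frozenset({"alkene"}), False),
    (frozenset({"nitro"}), False),
    (frozenset({"ether"}), False),
]

scores = [score(mol, RULES, WEIGHTS) for mol, _ in LIBRARY]
labels = [active for _, active in LIBRARY]
curve = recovery(scores, labels, fractions=[0.1, 0.2, 1.0])
print(curve)  # {0.1: 0.5, 0.2: 1.0, 1.0: 1.0}
```

In this toy case one of the two actives is recovered in the top 10% of the ranked library and both in the top 20%. A recovery curve of the kind described for Figure 1 would plot such values over many cutoffs and average them across datasets.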

References

Inverse entailment and PROGOL

Benchmarking sets for molecular docking.

Support vector inductive logic programming outperforms the naive Bayes classifier and inductive logic programming for the classification of bioactive chemical compounds

A novel logic-based approach for quantitative toxicology prediction

LASSO—ligand activity by surface similarity order: a new tool for ligand based virtual screening
