scispace - formally typeset
Search or ask a question
Topic

Quantitative structure–activity relationship

About: Quantitative structure–activity relationship is a research topic. Over the lifetime, 7560 publications have been published within this topic receiving 144670 citations. The topic is also known as: QSAR & Quantitative Structure-Activity Relationship.


Papers
More filters
Journal ArticleDOI
TL;DR: There is additional evidence that there exists no correlation between the values of q2 for the training set and accuracy of prediction (R2) for the test set and it is argued that this observation is a general property of any QSAR model developed with LOO cross-validation.
Abstract: Quantitative Structure–Activity Relationship (QSAR) models are used increasingly to screen chemical databases and/or virtual chemical libraries for potentially bioactive molecules. These developments emphasize the importance of rigorous model validation to ensure that the models have acceptable predictive power. Using k nearest neighbors (kNN) variable selection QSAR method for the analysis of several datasets, we have demonstrated recently that the widely accepted leave-one-out (LOO) cross-validated R2 (q2) is an inadequate characteristic to assess the predictive ability of the models [Golbraikh, A., Tropsha, A. Beware of q2! J. Mol. Graphics Mod. 20, 269-276, (2002)]. Herein, we provide additional evidence that there exists no correlation between the values of q2 for the training set and accuracy of prediction (R2) for the test set and argue that this observation is a general property of any QSAR model developed with LOO cross-validation. We suggest that external validation using rationally selected training and test sets provides a means to establish a reliable QSAR model. We propose several approaches to the division of experimental datasets into training and test sets and apply them in QSAR studies of 48 functionalized amino acid anticonvulsants and a series of 157 epipodophyllotoxin derivatives with antitumor activity. We formulate a set of general criteria for the evaluation of predictive power of QSAR models.

591 citations

Journal ArticleDOI
TL;DR: The predictions for the two external sets with 37 diverse compounds using multiple QSPR models indicate that the best linear models with four descriptors are sufficiently effective for predictive use and may be used as general utilities to screen the blood–brain barrier partitioning of drugs in a high-throughput fashion.
Abstract: In this study, the relationships between the brain-blood concentration ratio of 96 structurally diverse compounds with a large number of structurally derived descriptors were investigated. The linear models were based on molecular descriptors that can be calculated for any compound simply from a knowledge of its molecular structure. The linear correlation coefficients of the models were optimized by genetic algorithms (GAs), and the descriptors used in the linear models were automatically selected from 27 structurally derived descriptors. The GA optimizations resulted in a group of linear models with three or four molecular descriptors with good statistical significance. The change of descriptor use as the evolution proceeds demonstrates that the octane/water partition coefficient and the partial negative solvent-accessible surface area multiplied by the negative charge are crucial to brain-blood barrier permeability. Moreover, we found that the predictions using multiple QSPR models from GA optimization gave quite good results in spite of the diversity of structures, which was better than the predictions using the best single model. The predictions for the two external sets with 37 diverse compounds using multiple QSPR models indicate that the best linear models with four descriptors are sufficiently effective for predictive use. Considering the ease of computation of the descriptors, the linear models may be used as general utilities to screen the blood-brain barrier partitioning of drugs in a high-throughput fashion.

564 citations

Book
01 Jan 1993
TL;DR: Biological data - the additivity of group contributions parameters quantitative models statistical methods design of test series in QSAR applications of Hansch analysis applications of Free Wilson analysis and related models.
Abstract: Biological data - the additivity of group contributions parameters quantitative models statistical methods design of test series in QSAR applications of Hansch analysis applications of Free Wilson analysis and related models.

526 citations

Journal ArticleDOI
TL;DR: The present study reports that the web application can be easily used for identification of the X-outliers for training set compounds and detection of the test compounds residing outside the AD using the descriptor pool of the training and test sets.

517 citations

Journal ArticleDOI
TL;DR: The derivation of a simple QSAR model that permits the prediction of log BB for large compound sets, such as virtual combinatorial libraries, from a set of 55 diverse organic compounds is described.

507 citations


Network Information
Related Topics (5)
Ligand (biochemistry)
26.5K papers, 1M citations
80% related
Aqueous solution
189.5K papers, 3.4M citations
77% related
Active site
28.6K papers, 1.1M citations
77% related
Alkyl
223.5K papers, 2M citations
77% related
Reaction rate constant
42.9K papers, 1M citations
76% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
2023510
20221,020
2021284
2020356
2019334
2018313