Journal ArticleDOI

Beware of q2

TLDR
It is argued that a high LOO q2 is a necessary but not a sufficient condition for a model to have high predictive power, and that this is a general property of QSAR models developed using LOO cross-validation.
Abstract
Validation is a crucial aspect of any quantitative structure-activity relationship (QSAR) modeling. This paper examines one of the most popular validation criteria, the leave-one-out cross-validated R2 (LOO q2). Often, a high value of this statistic (q2 > 0.5) is considered proof of the high predictive ability of the model. In this paper, we show that this assumption is generally incorrect. In the case of 3D QSAR, the lack of correlation between a high LOO q2 and high predictive ability of a QSAR model was established earlier [Pharm. Acta Helv. 70 (1995) 149; J. Chemomet. 10 (1996) 95; J. Med. Chem. 41 (1998) 2553]. In this paper, we use two-dimensional (2D) molecular descriptors and the k nearest neighbors (kNN) QSAR method to analyze several datasets. No correlation between the values of q2 for the training set and predictive ability for the test set was found for any of the datasets. Thus, a high value of LOO q2 appears to be a necessary but not a sufficient condition for the model to have high predictive power. We argue that this is a general property of QSAR models developed using LOO cross-validation. We emphasize that external validation is the only way to establish a reliable QSAR model. We formulate a set of criteria for evaluating the predictive ability of QSAR models.
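The distinction the abstract draws between LOO q2 on the training set and predictive ability on an external test set can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`knn_predict`, `loo_q2`, `external_r2`), the unweighted Euclidean kNN predictor, and the choice of 1 - SS_res/SS_tot as the external statistic are all assumptions made here for illustration, and the paper's own criteria may be defined differently.

```python
import numpy as np


def knn_predict(X_train, y_train, X_query, k=3):
    """Predict each query value as the mean activity of its k nearest
    training compounds (plain Euclidean distance; an illustrative choice)."""
    # Pairwise distances, shape (n_query, n_train)
    d = np.linalg.norm(X_query[:, None, :] - X_train[None, :, :], axis=2)
    nearest = np.argsort(d, axis=1)[:, :k]
    return y_train[nearest].mean(axis=1)


def loo_q2(X, y, k=3):
    """Leave-one-out q2 = 1 - PRESS / SS_tot, computed on the training set only.
    Each compound is predicted by a model built without it."""
    n = len(y)
    preds = np.empty(n)
    for i in range(n):
        keep = np.arange(n) != i  # drop compound i
        preds[i] = knn_predict(X[keep], y[keep], X[i:i + 1], k)[0]
    press = np.sum((y - preds) ** 2)
    return 1.0 - press / np.sum((y - y.mean()) ** 2)


def external_r2(X_train, y_train, X_test, y_test, k=3):
    """One common external statistic (assumed here, not taken from the paper):
    1 - SS_res / SS_tot on compounds never used for model building."""
    preds = knn_predict(X_train, y_train, X_test, k)
    ss_res = np.sum((y_test - preds) ** 2)
    return 1.0 - ss_res / np.sum((y_test - y_test.mean()) ** 2)


if __name__ == "__main__":
    # Synthetic descriptors and activities, purely for demonstration.
    rng = np.random.default_rng(0)
    X = rng.normal(size=(60, 5))
    y = X[:, 0] + 0.5 * rng.normal(size=60)
    X_tr, y_tr, X_te, y_te = X[:40], y[:40], X[40:], y[40:]
    print("LOO q2 (training):", loo_q2(X_tr, y_tr))
    print("external R2 (test):", external_r2(X_tr, y_tr, X_te, y_te))
```

The two numbers are computed on disjoint compounds, which is the point of the abstract: a model can score well on the first while the second has to be measured separately on held-out data.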


Citations
Journal ArticleDOI

Modeling of the Inhibition Constant (Ki) of Some Cruzain Ketone-Based Inhibitors Using 2D Spatial Autocorrelation Vectors and Data-Diverse Ensembles of Bayesian-Regularized Genetic Neural Networks

TL;DR: It was derived that atomic van der Waals volume distributions at topological lags 3, 5, and 6 in the 2D topological structure of the inhibitors have a high nonlinear influence on the inhibition constants.
Journal ArticleDOI

Optimization of biaryl piperidine and 4-amino-2-biarylurea MCH1 receptor antagonists using QSAR modeling, classification techniques and virtual screening

TL;DR: This paper presents the results of an optimization study on biaryl piperidine and 4-amino-2-biarylurea MCH1 receptor antagonists, carried out using quantitative structure-activity relationships (QSARs), classification, and virtual screening techniques.
Journal ArticleDOI

Ecotoxicological modelling of cosmetics for aquatic organisms: A QSTR approach.

TL;DR: Externally validated quantitative structure–toxicity relationship models were developed for the toxicity of cosmetic ingredients to three ecotoxicologically relevant organisms, namely Pseudokirchneriella subcapitata, Daphnia magna and Pimephales promelas, following the OECD guidelines and using the partial least squares (PLS) regression technique.
Journal ArticleDOI

General Linearized Biexponential Model for QSAR Data Showing Bilinear-Type Distribution

TL;DR: A general linearized biexponential (LinBiExp) model is proposed that can adequately describe data showing bilinear-type distribution as a function not just of the often-employed lipophilicity descriptors but of any descriptor (e.g., molecular volume).
Journal ArticleDOI

Evaluation of shear strength parameters of granulated waste rubber using artificial neural networks and group method of data handling

TL;DR: In this article, a prediction model using the combinatorial algorithm in group method of data handling (GMDH) is proposed for the shear strength and vertical strain in the arrangement of closed-form equations.
References
Book

Graph theory

Frank Harary
Journal ArticleDOI

Comparative molecular field analysis (CoMFA). 1. Effect of shape on binding of steroids to carrier proteins.

TL;DR: The main features of the CoMFA approach, exemplified by analyses of the affinities of 21 varied steroids to corticosteroid and testosterone-binding globulins, and a number of advances in the methodology of molecular graphics are described.
Book

Substituent constants for correlation analysis in chemistry and biology
