Journal ArticleDOI
Beware of q2
Reads0
Chats0
TLDR
It is argued that the high value of LOO q2 appears to be the necessary but not the sufficient condition for the model to have a high predictive power, which is the general property of QSAR models developed using LOO cross-validation.Abstract:
Validation is a crucial aspect of any quantitative structure-activity relationship (QSAR) modeling. This paper examines one of the most popular validation criteria, leave-one-out cross-validated R2 (LOO q2). Often, a high value of this statistical characteristic (q2 > 0.5) is considered as a proof of the high predictive ability of the model. In this paper, we show that this assumption is generally incorrect. In the case of 3D QSAR, the lack of the correlation between the high LOO q2 and the high predictive ability of a QSAR model has been established earlier [Pharm. Acta Helv. 70 (1995) 149; J. Chemomet. 10(1996)95; J. Med. Chem. 41 (1998) 2553]. In this paper, we use two-dimensional (2D) molecular descriptors and k nearest neighbors (kNN) QSAR method for the analysis of several datasets. No correlation between the values of q2 for the training set and predictive ability for the test set was found for any of the datasets. Thus, the high value of LOO q2 appears to be the necessary but not the sufficient condition for the model to have a high predictive power. We argue that this is the general property of QSAR models developed using LOO cross-validation. We emphasize that the external validation is the only way to establish a reliable QSAR model. We formulate a set of criteria for evaluation of predictive ability of QSAR models.read more
Citations
More filters
Journal ArticleDOI
Statistical Modeling of Soil Moisture, Integrating Satellite Remote-Sensing (SAR) and Ground-Based Data
TL;DR: This work integrates active polarimetric satellite remote-sensing data with ground-based in-situ data across an agricultural monitoring site in Canada and applies a grouped step-wise algorithm to iteratively select best-performing predictors of soil moisture.
Journal ArticleDOI
Machine Learning-Based Modeling with Optimization Algorithm for Predicting Mechanical Properties of Sustainable Concrete
Muhammad Izhar Shah,Shazim Ali Memon,Muhammad Sohaib Khan Niazi,Muhammad Nasir Amin,Fahid Aslam,Muhammad Faisal Javed +5 more
TL;DR: MEP-based modeling with PSO could be an effective tool for accurate modeling of the concrete properties, thus directly contributing to the construction sector by consuming waste and protecting the environment.
Journal ArticleDOI
Prediction of inherent viscosity for polymers containing natural amino acids from the theoretical derived molecular descriptors
TL;DR: In this paper, an artificial neural network (ANN) was used for the prediction of inherent viscosity (η inh ) of a data set of 75 optically active polymers containing natural amino acids.
Journal ArticleDOI
CORAL: Predictions of rate constants of hydroxyl radical reaction using representation of the molecular structure obtained by combination of SMILES and Graph approaches
Andrey A. Toropov,Alla P. Toropova,S.E. Martyanov,Emilio Benfenati,Giuseppina Gini,Danuta Leszczynska,Jerzy Leszczynski +6 more
TL;DR: In this article, a hybrid representation of molecular structure by combination of simplified molecular input line entry system (SMILES) and the molecular graph can improve the predictive potential of CORAL models.
Journal ArticleDOI
Monte Carlo based modelling approach for designing and predicting cytotoxicity of 2-phenylindole derivatives against breast cancer cell line MCF7
Ruchi Gaikwad,Soumajit Ghorai,Sk. Abdul Amin,Nilanjan Adhikari,Tarun Patel,Kalpataru Das,Tarun Jha,Shovanlal Gayen +7 more
TL;DR: Results from the analysis were further used to design and predict some probable new 2-phenylindole derivatives having promising cytotoxicity (IC50 < 55 nM) against MCF7.
References
More filters
Journal ArticleDOI
Comparative molecular field analysis (CoMFA). 1. Effect of shape on binding of steroids to carrier proteins.
TL;DR: The main features of the CoMFA approach, exemplified by analyses of the affinities of 21 varied steroids to corticosteroid and testosterone-binding globulins, and a number of advances in the methodology of molecular graphics are described.
Book
Substituent constants for correlation analysis in chemistry and biology
Corwin Hansch,Albert J. Leo +1 more
TL;DR: In this paper, the book is the window to get in the world and you can open the world easily, and these wise words are really familiar with you, so bring home now the book enPDFd substituent constants for correlation analysis in chemistry and biology to be your sources when going to read.