Journal Article
Beware of q2
TL;DR: It is argued that the high value of LOO q2 appears to be the necessary but not the sufficient condition for the model to have a high predictive power, which is the general property of QSAR models developed using LOO cross-validation.
Abstract:
Validation is a crucial aspect of any quantitative structure-activity relationship (QSAR) modeling. This paper examines one of the most popular validation criteria, leave-one-out cross-validated R2 (LOO q2). Often, a high value of this statistical characteristic (q2 > 0.5) is considered proof of the high predictive ability of the model. In this paper, we show that this assumption is generally incorrect. In the case of 3D QSAR, the lack of correlation between a high LOO q2 and high predictive ability of a QSAR model has been established earlier [Pharm. Acta Helv. 70 (1995) 149; J. Chemomet. 10 (1996) 95; J. Med. Chem. 41 (1998) 2553]. In this paper, we use two-dimensional (2D) molecular descriptors and the k nearest neighbors (kNN) QSAR method for the analysis of several datasets. No correlation between the values of q2 for the training set and predictive ability for the test set was found for any of the datasets. Thus, a high value of LOO q2 appears to be a necessary but not a sufficient condition for the model to have high predictive power. We argue that this is a general property of QSAR models developed using LOO cross-validation. We emphasize that external validation is the only way to establish a reliable QSAR model. We formulate a set of criteria for evaluation of the predictive ability of QSAR models.
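The distinction the abstract draws between internal (LOO) and external validation can be illustrated with a minimal sketch. It is not the authors' implementation: the kNN regressor here is a plain unweighted average over Euclidean neighbors, the data are synthetic, and all function names (`knn_predict`, `loo_q2`, `external_r2`) are hypothetical. LOO q2 is computed in the usual way as 1 − PRESS/SS, where PRESS sums the squared errors of predictions made for each training compound with that compound left out. The external R2 is assumed here to be referenced to the test-set mean; other conventions exist.

```python
import random


def knn_predict(train_X, train_y, x, k=3):
    """Predict an activity as the mean activity of the k nearest training points."""
    # Squared Euclidean distance from x to every training point, paired with its activity.
    dists = sorted(
        (sum((a - b) ** 2 for a, b in zip(row, x)), y)
        for row, y in zip(train_X, train_y)
    )
    nearest = dists[:k]
    return sum(y for _, y in nearest) / len(nearest)


def one_minus_press_over_ss(y_true, y_pred):
    """Return 1 - PRESS/SS, with SS taken around the mean of y_true.

    Assumes y_true is not constant (SS must be non-zero).
    """
    mean_y = sum(y_true) / len(y_true)
    press = sum((yt - yp) ** 2 for yt, yp in zip(y_true, y_pred))
    ss = sum((yt - mean_y) ** 2 for yt in y_true)
    return 1.0 - press / ss


def loo_q2(X, y, k=3):
    """Leave-one-out q2: each compound is predicted from all the others."""
    preds = []
    for i in range(len(X)):
        rest_X = X[:i] + X[i + 1:]
        rest_y = y[:i] + y[i + 1:]
        preds.append(knn_predict(rest_X, rest_y, X[i], k))
    return one_minus_press_over_ss(y, preds)


def external_r2(train_X, train_y, test_X, test_y, k=3):
    """R2 for a held-out test set predicted from the training set only."""
    preds = [knn_predict(train_X, train_y, x, k) for x in test_X]
    return one_minus_press_over_ss(test_y, preds)


# Synthetic illustration only: two made-up descriptors and a noisy linear activity.
random.seed(0)
X = [[random.uniform(0, 1), random.uniform(0, 1)] for _ in range(40)]
y = [2.0 * a + b + random.gauss(0, 0.1) for a, b in X]
train_X, train_y = X[:30], y[:30]
test_X, test_y = X[30:], y[30:]

print("LOO q2 (training set):", round(loo_q2(train_X, train_y), 3))
print("external R2 (test set):", round(external_r2(train_X, train_y, test_X, test_y), 3))
```

Reporting both numbers side by side follows the abstract's recommendation: q2 only measures internal consistency on the training set, so the test-set figure is the one that speaks to predictive ability.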
Citations
Journal Article
Systematic Comparison and Comprehensive Evaluation of 80 Amino Acid Descriptors in Peptide QSAR Modeling.
Peng Zhou, Qian Liu, Ting Wu, Qingqing Miao, Shuyong Shang, Heyi Wang, Zheng Chen, Shaozhou Wang, Heyan Wang
TL;DR: In this article, the authors exhaustively collect 80 published AADs and comprehensively evaluate their modeling performance (including fitting ability, internal stability, and predictive power) on 8 QSAR-oriented peptide sample sets (QPSs) by employing 2 sophisticated machine learning methods (MLMs).
Journal Article
Synthesis, antimicrobial, anticancer evaluation and QSAR studies of 6-methyl-4-[1-(2-substituted-phenylamino-acetyl)-1H-indol-3-yl]-2-oxo/thioxo-1,2,3,4-tetrahydropyrimidine-5-carboxylic acid ethyl esters.
Sandeep Kumar Sharma, Pradeep Kumar, Balasubramanian Narasimhan, Kalavathy Ramasamy, Vasudevan Mani, Rakesh Kumar Mishra, Abu Bakar Abdul Majeed
TL;DR: The QSAR studies demonstrated the importance of the topological parameter, the Balaban index (J), followed by the lipophilic parameter, log P, in describing the antimicrobial activity of the synthesized compounds.
Journal Article
A novel approach to predict aquatic toxicity from molecular structure.
Juan A. Castillo-Garit, Yovani Marrero-Ponce, Jeanette Escobar, Francisco Torrens, Richard Rotondo
TL;DR: The non-stochastic and stochastic linear indices appear to provide an interesting alternative to costly and time-consuming experiments for determining toxicity in aquatic toxicity models developed using the Dragon software.
Journal Article
Quantitative Structure–Activity Relationship Models of Clinical Pharmacokinetics: Clearance and Volume of Distribution
Vijay K. Gombar, Stephen D. Hall
TL;DR: These QSAR models avoid uncertainty associated with preclinical-to-clinical extrapolation and require two-dimensional structure drawing as the sole input to predict systemic CL and steady-state Vd (Vdss) from intravenous (iv) dosing in humans.
Journal Article
CORAL: Building up the model for bioconcentration factor and defining it’s applicability domain
Andrey A. Toropov, Alla P. Toropova, Anna Lombardo, Alessandra Roncaglioni, Emilio Benfenati, Giuseppina Gini
TL;DR: This work used CORAL to evaluate the applicability domain of the QSAR models, taking a model of bioconcentration factor (logBCF) as an example, and introduced a new function, which uses Δ(obs) = logBCF(expr) − logBCF(calc) of the predictions on the chemicals in the training set, which increased the model's predictivity and reliability.
References
Journal Article
Comparative molecular field analysis (CoMFA). 1. Effect of shape on binding of steroids to carrier proteins.
TL;DR: The main features of the CoMFA approach, exemplified by analyses of the affinities of 21 varied steroids to corticosteroid and testosterone-binding globulins, and a number of advances in the methodology of molecular graphics are described.
Book
Substituent constants for correlation analysis in chemistry and biology
Corwin Hansch, Albert J. Leo