Journal ArticleDOI
On Some Aspects of Variable Selection for Partial Least Squares Regression Models
Partha Pratim Roy,Kunal Roy +1 more
Reads0
Chats0
TLDR
In this article, the optimum variable selection strategy for Partial Least Squares (PLS) regression using a model dataset of cytoprotection data is explored, where the compounds of the dataset were classified using K-means clustering technique applied on standardized descriptor matrix and ten combinations of training and test sets were generated based on the obtained clusters.Abstract:
This paper tries to explore the optimum variable selection strategy for Partial Least Squares (PLS) regression using a model dataset of cytoprotection data. The compounds of the dataset were classified using K-means clustering technique applied on standardized descriptor matrix and ten combinations of training and test sets were generated based on the obtained clusters. For a particular training set, PLS models were developed with a number of components optimized by leave-one-out Q2 and then the developed models were validated (externally) using the test set compounds. For each set, PLS model was initially constructed using all descriptors (variables). The variables having least standardized values of regression coefficients were deleted and the next model was developed with a reduced set of variables. These steps were performed several times until further reduction in number of variables did not improve Q2 value. In each case, statistical parameters like predictive R2 (R2pred), squared correlation coefficient between observed and predicted values with (r2) and without () intercept and Root Mean Square Error of Prediction (RMSEP) were calculated from the test set compounds. In case of all ten sets, Q2 values steadily increase on deletion of variables while R2pred values do not show any specific trend. In no case, the highest Q2 and highest R2pred appear in the same trial, i.e., with the same combinations of variables. This suggests that from the viewpoint of external predictability, choice of variables for PLS based on Q2 value may not be optimum. Moreover, a clear separation of r2 and r02 curves in some sets suggests that such models may not be truly predictive in spite of acceptable R2pred values. Another observation is that coefficient of determination R2 for the training set is more immune to changes on deletion of variables than the validation parameters like Q2 and R2pred. Finally, a new parameter rm2 has been suggested to indicate external predictability of QSAR models.read more
Citations
More filters
Journal ArticleDOI
Novel coumarins active against Trypanosoma cruzi and toxicity assessment using the animal model Caenorhabditis elegans.
Fabiana Gomes Nascimento Soares,Gabriela Göethel,Luciano Porto Kagami,Gustavo Machado das Neves,Elisa Sauer,Estefanía Birriel,Javier Varela,Itamar Luís Gonçalves,Gilsane Lino von Poser,Mercedes González,Daniel Fábio Kawano,Fávero Reisdorfer Paula,Eduardo Borges de Melo,Solange Cristina Garcia,Hugo Cerecetto,Vera Lucia Eifler-Lima +15 more
TL;DR: Results from the QSAR-3D study indicate that the volume and hydrophobicity of the substituents have a significant impact on the trypanocidal activities for derivatives that cause more than 50% of inhibition.
Journal ArticleDOI
QSAR study and rustic ligand-based virtual screening in a search for aminooxadiazole derivatives as PIM1 inhibitors
Adnane Aouidate,Adib Ghaleb,Mounir Ghamali,Samir Chtita,A. Ousaa,M’barek Choukrad,Abdelouahid Sbai,Mohammed Bouachrine,Tahar Lakhlifi +8 more
TL;DR: This approach can be easily handled by chemists to distinguish, which ones among the future designed aminooxadiazoles structures could be lead-like and those that couldn’t be, thus, they can be eliminated in the early stages of drug discovery process.
Journal ArticleDOI
Prediction of uplift capacity of suction caisson in clay using functional network and multivariate adaptive regression spline
TL;DR: In this paper, two recently developed AI techniques, functional network (FN) and multivariate adaptive regression spline (MARS), have been used to predict the uplift capacity of suction caisson in clay.
Journal ArticleDOI
Insight into the interactions between novel isoquinolin-1,3-dione derivatives and cyclin-dependent kinase 4 combining QSAR and molecular docking.
Junxia Zheng,Kong Hao,James M. Wilson,Jialiang Guo,Yiqun Chang,Mengjia Yang,Gaokeng Xiao,Pinghua Sun +7 more
TL;DR: Based on the QSAR and docking models, twenty new potent molecules have been designed and predicted better than the most active compound 12 in the literatures and contribute towards the development of more active CDK4 subtype-selective inhibitors.
Journal ArticleDOI
In silico screening for identification of novel β-1,3-glucan synthase inhibitors using pharmacophore and 3D-QSAR methodologies
Potshangbam Angamba Meetei,Ravindranath S. Rathore,Ravindranath S. Rathore,N. Prakash Prabhu,Vaibhav Vindal +4 more
TL;DR: A robust pharmacophore model was developed and structure activity relationship analysis of 42 pyridazinone derivatives as β-1,3-glucan synthase inhibitors revealed the pharmacokinetic efficiency of these compounds.
References
More filters
Book
Cluster Analysis
TL;DR: This fourth edition of the highly successful Cluster Analysis represents a thorough revision of the third edition and covers new and developing areas such as classification likelihood and neural networks for clustering.
Journal ArticleDOI
Beware of q2
TL;DR: It is argued that the high value of LOO q2 appears to be the necessary but not the sufficient condition for the model to have a high predictive power, which is the general property of QSAR models developed using LOO cross-validation.
Journal ArticleDOI
PLS regression methods
TL;DR: In this paper, the mathematical and statistical structure of PLS regression is developed and the PLS decomposition of the data matrices involved in model building is analyzed. But the PLP regression algorithm can be interpreted in a model building setting.
Burger's medicinal chemistry and drug discovery
Manfred E. Wolff,William Foye,Robert Southgate,Neal Frederick Osborne,Michael J. Pearson,George Burton,SmithKline Beecham +6 more