Building Predictive Models in R Using the caret Package
Reads0
Chats0
TLDR
The caret package, short for classification and regression training, contains numerous tools for developing predictive models using the rich set of models available in R to simplify model training and tuning across a wide variety of modeling techniques.Abstract:
The caret package, short for classification and regression training, contains numerous tools for developing predictive models using the rich set of models available in R. The package focuses on simplifying model training and tuning across a wide variety of modeling techniques. It also includes methods for pre-processing training data, calculating variable importance, and model visualizations. An example from computational chemistry is used to illustrate the functionality on a real data set and to benchmark the benefits of parallel processing with several types of models.read more
Citations
More filters
Journal ArticleDOI
Data fusion of vis–NIR and PXRF spectra to predict soil physical and chemical properties
Yakun Zhang,Alfred E. Hartemink +1 more
TL;DR: In this paper, a front-end data fusion method was used to combine visible near-infrared (vis-NIR) and portable X-ray fluorescence (PXRF) spectra for predicting different soil properties and investigated the contribution of different sensor data.
Journal ArticleDOI
Molecular Mechanisms Driving Switch Behavior in Xylem Cell Differentiation
Gina Turco,Joel Rodriguez-Medina,Stefan Siebert,Diane Han,Miguel Á. Valderrama-Gómez,Hannah E. Vahldick,Christine N. Shulse,Benjamin J. Cole,Celina E. Juliano,Diane E. Dickel,Michael A. Savageau,Siobhan M. Brady +11 more
TL;DR: This system provides an important model to study the emergent properties that may give rise to totipotency relative to terminal differentiation and reveals xylem cell subtypes.
Journal ArticleDOI
Highly interconnected enhancer communities control lineage-determining genes in human mesenchymal stem cells.
Jesper Grud Skat Madsen,Maria Stahl Madsen,Alexander Rauch,Sofie Traynor,Elvira Laila Van Hauwaert,Anders K. Haakonsson,Biola M. Javierre,Mette Hyldahl,Peter Fraser,Peter Fraser,Susanne Mandrup +10 more
TL;DR: It is found that enhancers form an elaborate network that is dynamic during differentiation and coupled with changes in enhancer activity, and that HICE are important for both signal integration and compartmentalization of the genome.
Journal ArticleDOI
Feature specific quantile normalization enables cross-platform classification of molecular subtypes using gene expression data.
TL;DR: Using feature specific quantile normalization (FSQN), a method to normalize and classify RNA-seq data using machine learning classifiers trained on DNA microarray data and molecular subtypes in two datasets: breast invasive carcinoma (BRCA) and colorectal cancer (CRC).
Journal ArticleDOI
Reduced neonatal brain-derived neurotrophic factor is associated with autism spectrum disorders.
Kristin Skogstrand,Kristin Skogstrand,Christian M. Hagen,Christian M. Hagen,Nis Borbye-Lorenzen,Nis Borbye-Lorenzen,Michael Christiansen,Michael Christiansen,Michael Christiansen,Jonas Bybjerg-Grauholm,Jonas Bybjerg-Grauholm,Marie Bækvad-Hansen,Marie Bækvad-Hansen,Thomas Werge,Thomas Werge,Thomas Werge,Anders D. Børglum,Ole Mors,Ole Mors,Merethe Nordentoft,Merethe Nordentoft,Preben Bo Mortensen,David M. Hougaard,David M. Hougaard +23 more
TL;DR: The low newborn blood levels of BDNF in children developing ASD is an important finding, suggesting that lower BDNF levels in newborns contributes to the etiology of ASD and indicates new directions for further research.
References
More filters
BookDOI
Modern Applied Statistics with S
W. N. Venables,Brian D. Ripley +1 more
TL;DR: A guide to using S environments to perform statistical analyses providing both an introduction to the use of S and a course in modern statistical methods.
Classification and Regression by randomForest
Andy Liaw,Matthew C. Wiener +1 more
TL;DR: random forests are proposed, which add an additional layer of randomness to bagging and are robust against overfitting, and the randomForest package provides an R interface to the Fortran programs by Breiman and Cutler.
Modern Applied Statistics With S
TL;DR: The modern applied statistics with s is universally compatible with any devices to read, and is available in the digital library an online access to it is set as public so you can download it instantly.
Proceedings ArticleDOI
Validity of the single processor approach to achieving large scale computing capabilities
TL;DR: In this paper, the authors argue that the organization of a single computer has reached its limits and that truly significant advances can be made only by interconnection of a multiplicity of computers in such a manner as to permit cooperative solution.