scispace - formally typeset
Open AccessJournal ArticleDOI

Building Predictive Models in R Using the caret Package

Max Kuhn
- 10 Nov 2008 - 
- Vol. 28, Iss: 5, pp 1-26
Reads0
Chats0
TLDR
The caret package, short for classification and regression training, contains numerous tools for developing predictive models using the rich set of models available in R to simplify model training and tuning across a wide variety of modeling techniques.
Abstract
The caret package, short for classification and regression training, contains numerous tools for developing predictive models using the rich set of models available in R. The package focuses on simplifying model training and tuning across a wide variety of modeling techniques. It also includes methods for pre-processing training data, calculating variable importance, and model visualizations. An example from computational chemistry is used to illustrate the functionality on a real data set and to benchmark the benefits of parallel processing with several types of models.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Topographic Cortico-cerebellar Networks Revealed by Visual Attention and Working Memory.

TL;DR: The findings indicate that recruitment by visuospatial attentional functions within cerebellar lobule VIIb/VIIIa is highly specific, and the topographic arrangement of these functions is mirrored in frontal and parietal cortex.
Journal ArticleDOI

Downscaling MODIS land surface temperature over a heterogeneous area: An investigation of machine learning techniques, feature selection, and impacts of mixed pixels

TL;DR: Compared against the LST derived from Landsat-8 thermal imageries, ELM required the least computational effort, and when it was combined with SVM-RFE, general efficiency of the downscaling procedure was increased substantially.
Journal ArticleDOI

Hyper-temporal remote sensing for digital soil mapping: Characterizing soil-vegetation response to climatic variability

TL;DR: In this article, a case study in a semiarid landscape of southeastern Arizona, USA is presented, where surface soil texture and coarse fragment classes were predicted using a 28-year time series of Landsat TM derived normalized difference vegetation index (NDVI) and modeled using support vector machine (SVM) classification, and results evaluated relative to more traditional RS approaches (e.g., mono-, bi-, and multi-temporal).
Journal ArticleDOI

Large-Scale Examination of Spatio-Temporal Patterns of Drifting Fish Aggregating Devices (dFADs) from Tropical Tuna Fisheries of the Indian and Atlantic Oceans

TL;DR: dFADs drift at sea on average for 39.5 days, with time at sea being shorter and distance travelled longer in the Indian than in the Atlantic Ocean, suggesting that 1,500-2,000 may be lost onshore each year.
Journal ArticleDOI

Structure Based Thermostability Prediction Models for Protein Single Point Mutations with Machine Learning Tools

TL;DR: Folding free energy change calculation from Rosetta, structural information of the point mutations as well as amino acid physical properties were obtained for building thermostability prediction models with informatics modeling tools.
References
More filters
BookDOI

Modern Applied Statistics with S

TL;DR: A guide to using S environments to perform statistical analyses providing both an introduction to the use of S and a course in modern statistical methods.

Classification and Regression by randomForest

TL;DR: random forests are proposed, which add an additional layer of randomness to bagging and are robust against overfitting, and the randomForest package provides an R interface to the Fortran programs by Breiman and Cutler.

Modern Applied Statistics With S

TL;DR: The modern applied statistics with s is universally compatible with any devices to read, and is available in the digital library an online access to it is set as public so you can download it instantly.
Proceedings ArticleDOI

Validity of the single processor approach to achieving large scale computing capabilities

TL;DR: In this paper, the authors argue that the organization of a single computer has reached its limits and that truly significant advances can be made only by interconnection of a multiplicity of computers in such a manner as to permit cooperative solution.