Building Predictive Models in R Using the caret Package
Reads0
Chats0
TLDR
The caret package, short for classification and regression training, contains numerous tools for developing predictive models using the rich set of models available in R to simplify model training and tuning across a wide variety of modeling techniques.Abstract:
The caret package, short for classification and regression training, contains numerous tools for developing predictive models using the rich set of models available in R. The package focuses on simplifying model training and tuning across a wide variety of modeling techniques. It also includes methods for pre-processing training data, calculating variable importance, and model visualizations. An example from computational chemistry is used to illustrate the functionality on a real data set and to benchmark the benefits of parallel processing with several types of models.read more
Citations
More filters
Journal ArticleDOI
Blood DNA methylation and COVID-19 outcomes.
Joseph Balnis,Andy Madrid,Kirk J. Hogan,Lisa A. Drake,Hau Chieng,Anupama Tiwari,Catherine Vincent,Amit Chopra,Peter A. Vincent,Michael D. Robek,Harold A. Singer,Reid S. Alisch,Ariel Jaitovich +12 more
TL;DR: This paper performed a genome-wide analysis of circulating blood DNA CpG methylation using the Infinium Human MethylationEPIC BeadChip on 124 blood samples from hospitalized COVID-19-positive and COVID19-negative patients and compared these data with previously reported data from 39 healthy individuals collected before the pandemic.
Posted Content
Algorithmic Fairness in Education.
René F. Kizilcec,Hansol Lee +1 more
TL;DR: This introduction to algorithmic fairness in education draws parallels to prior literature on educational access, bias, and discrimination, and examines core components of algorithmic systems to identify sources of bias and discrimination in the process of developing and deploying these systems.
Journal ArticleDOI
Microbiome Learning Repo (ML Repo): A public repository of microbiome regression and classification tasks.
TL;DR: This work presents Microbiome Learning Repo (ML Repo), a public, web-based repository of 33 curated classification and regression tasks from 15 published human microbiome datasets, and expects it to be an important resource for algorithm developers.
Journal ArticleDOI
Differential immune profiles distinguish the mutational subtypes of gastrointestinal stromal tumor
Gerardo A. Vitiello,Timothy G. Bowler,Mengyuan Liu,Benjamin D. Medina,Jennifer Q. Zhang,Nesteene J. Param,Jennifer K. Loo,Rachel L. Goldfeder,Frédéric Chibon,Ferdinand Rossi,Shan Zeng,Ronald P. DeMatteo +11 more
TL;DR: In this paper, the authors performed RNA sequencing of 75 human GIST tumors from 75 patients, comprising the largest cohort of GISTs sequenced to date, to discover differences in the immune infiltrates of KIT and PDGFRA-mutant GIST.
Journal ArticleDOI
The developing premature infant gut microbiome is a major factor shaping the microbiome of neonatal intensive care unit rooms
Brandon Brooks,Matthew R. Olm,Brian Firek,Robyn Baker,David Geller-McGrath,Sophia R. Reimer,Karina R. Soenjoyo,Jennifer S. Yip,Dylan Dahan,Dylan Dahan,Brian C. Thomas,Michael J. Morowitz,Jillian F. Banfield +12 more
TL;DR: The data suggests that hospitalized infants, in combination with their caregivers, shape the microbiome of NICU rooms, suggesting microbes seeding NICU surfaces are sourced from reservoirs within the room and that these reservoirs contain actively dividing cells.
References
More filters
BookDOI
Modern Applied Statistics with S
W. N. Venables,Brian D. Ripley +1 more
TL;DR: A guide to using S environments to perform statistical analyses providing both an introduction to the use of S and a course in modern statistical methods.
Classification and Regression by randomForest
Andy Liaw,Matthew C. Wiener +1 more
TL;DR: random forests are proposed, which add an additional layer of randomness to bagging and are robust against overfitting, and the randomForest package provides an R interface to the Fortran programs by Breiman and Cutler.
Modern Applied Statistics With S
TL;DR: The modern applied statistics with s is universally compatible with any devices to read, and is available in the digital library an online access to it is set as public so you can download it instantly.
Proceedings ArticleDOI
Validity of the single processor approach to achieving large scale computing capabilities
TL;DR: In this paper, the authors argue that the organization of a single computer has reached its limits and that truly significant advances can be made only by interconnection of a multiplicity of computers in such a manner as to permit cooperative solution.