Book

An introduction to statistical learning

TLDR
An introduction to statistical learning provides an accessible overview of the essential toolset for making sense of the vast and complex data sets that have emerged in science, industry, and other sectors in the past twenty years.
Abstract
An Introduction to Statistical Learning provides an accessible overview of the field of statistical learning, an essential toolset for making sense of the vast and complex data sets that have emerged in fields ranging from biology to finance to marketing to astrophysics in the past twenty years. This book presents some of the most important modeling and prediction techniques, along with relevant applications. Topics include linear regression, classification, resampling methods, shrinkage approaches, tree-based methods, support vector machines, clustering, and more. Color graphics and real-world examples are used to illustrate the methods presented. Since the goal of this textbook is to facilitate the use of these statistical learning techniques by practitioners in science, industry, and other fields, each chapter contains a tutorial on implementing the analyses and methods presented in R, an extremely popular open source statistical software platform. Two of the authors co-wrote The Elements of Statistical Learning (Hastie, Tibshirani and Friedman, 2nd edition 2009), a popular reference book for statistics and machine learning researchers. An Introduction to Statistical Learning covers many of the same topics, but at a level accessible to a much broader audience. This book is targeted at statisticians and non-statisticians alike who wish to use cutting-edge statistical learning techniques to analyze their data. The text assumes only a previous course in linear regression and no knowledge of matrix algebra.


Citations
Journal Article

mockrobiota: a Public Resource for Microbiome Bioinformatics Benchmarking

TL;DR: This work presents mockrobiota, a public resource for sharing, validating, and documenting mock community data resources, and outlines how it is intended to expand and evolve to meet the changing needs of the omics community.
Journal Article

Predicting Motor Insurance Claims Using Telematics Data—XGBoost versus Logistic Regression

TL;DR: This study compared the relative performance of logistic regression and XGBoost for predicting the existence of accident claims using telematics data, and showed that logistic regression is a suitable model given its interpretability and good predictive capacity.
Journal Article

China’s energy consumption in construction and building sectors: An outlook to 2100

TL;DR: Wang et al. used ridge regression to derive the coefficients of the STIRPAT model, countering the impact of multicollinearity on the regression results. The results show that China's CBS energy consumption will continue to increase from the present, reach a peak of between 1155 and 1243 million tons of standard coal equivalent (Mtce) in 2050, and then decrease to 942-1116 Mtce in 2100.
Journal Article

A Benchmarking Between Deep Learning, Support Vector Machine and Bayesian Threshold Best Linear Unbiased Prediction for Predicting Ordinal Traits in Plant Breeding.

TL;DR: This paper explores the genomic-based prediction performance of two popular machine learning methods, the multilayer perceptron (MLP) and the support vector machine (SVM), against the Bayesian threshold genomic best linear unbiased prediction (TGBLUP) model.
Posted Content

Quantification of Model Uncertainty in RANS Simulations: A Review

TL;DR: This review examines both the parametric and structural uncertainties in turbulence models, and introduces the fundamentals of uncertainty propagation and Bayesian inference in the context of RANS model uncertainty quantification.