scispace - formally typeset
BookDOI

An introduction to statistical learning

TLDR
An introduction to statistical learning provides an accessible overview of the essential toolset for making sense of the vast and complex data sets that have emerged in science, industry, and other sectors in the past twenty years.
Abstract
Statistics An Intduction to Stistical Lerning with Applications in R An Introduction to Statistical Learning provides an accessible overview of the fi eld of statistical learning, an essential toolset for making sense of the vast and complex data sets that have emerged in fi elds ranging from biology to fi nance to marketing to astrophysics in the past twenty years. Th is book presents some of the most important modeling and prediction techniques, along with relevant applications. Topics include linear regression, classifi cation, resampling methods, shrinkage approaches, tree-based methods, support vector machines, clustering, and more. Color graphics and real-world examples are used to illustrate the methods presented. Since the goal of this textbook is to facilitate the use of these statistical learning techniques by practitioners in science, industry, and other fi elds, each chapter contains a tutorial on implementing the analyses and methods presented in R, an extremely popular open source statistical soft ware platform. Two of the authors co-wrote Th e Elements of Statistical Learning (Hastie, Tibshirani and Friedman, 2nd edition 2009), a popular reference book for statistics and machine learning researchers. An Introduction to Statistical Learning covers many of the same topics, but at a level accessible to a much broader audience. Th is book is targeted at statisticians and non-statisticians alike who wish to use cutting-edge statistical learning techniques to analyze their data. Th e text assumes only a previous course in linear regression and no knowledge of matrix algebra.

read more

Citations
More filters
Journal ArticleDOI

A comparative analysis of data preparation algorithms for customer churn prediction

TL;DR: This study benchmarks an optimized logit model against eight state-of-the-art data mining techniques that use standard input data, including real-world cross-sectional data from a large European telecommunication provider and finds effective data preparation improves AUC up to 14.5% and top decile lift up to 34%.
Journal ArticleDOI

Machine learning in geo- and environmental sciences: From small to large scale

TL;DR: The goal of this review paper is to provide the first comprehensive review of the recently developed methods in the ML algorithms and describe their application to porous media and geoscience.
Journal ArticleDOI

Insights to fracture stimulation design in unconventional reservoirs based on machine learning modeling

TL;DR: A comprehensive data mining process to evaluate well production performance in Montney Formations in western Canadian sedimentary basin finds that random forest has the best prediction performance for the first-year oil production.
Journal ArticleDOI

Comparison of Sentinel-2 and Landsat 8 imagery for forest variable prediction in boreal region

TL;DR: In this paper, the performance of Sentinel-2 and Landsat 8 data for forest variable prediction in the boreal forest of Southern Finland was compared with the brute force forward selection method with twelve modelling setups to train multivariable prediction models with either multilayer perceptron (MLP) or regression tree models.
Journal ArticleDOI

Machine-Learning Models for Sales Time Series Forecasting

TL;DR: The results show that using stacking techniques, a stacking approach for building regression ensemble of single models, can improve the performance of predictive models for sales time series forecasting.
Related Papers (5)