scispace - formally typeset
BookDOI

An introduction to statistical learning

TLDR
An introduction to statistical learning provides an accessible overview of the essential toolset for making sense of the vast and complex data sets that have emerged in science, industry, and other sectors in the past twenty years.
Abstract
Statistics An Intduction to Stistical Lerning with Applications in R An Introduction to Statistical Learning provides an accessible overview of the fi eld of statistical learning, an essential toolset for making sense of the vast and complex data sets that have emerged in fi elds ranging from biology to fi nance to marketing to astrophysics in the past twenty years. Th is book presents some of the most important modeling and prediction techniques, along with relevant applications. Topics include linear regression, classifi cation, resampling methods, shrinkage approaches, tree-based methods, support vector machines, clustering, and more. Color graphics and real-world examples are used to illustrate the methods presented. Since the goal of this textbook is to facilitate the use of these statistical learning techniques by practitioners in science, industry, and other fi elds, each chapter contains a tutorial on implementing the analyses and methods presented in R, an extremely popular open source statistical soft ware platform. Two of the authors co-wrote Th e Elements of Statistical Learning (Hastie, Tibshirani and Friedman, 2nd edition 2009), a popular reference book for statistics and machine learning researchers. An Introduction to Statistical Learning covers many of the same topics, but at a level accessible to a much broader audience. Th is book is targeted at statisticians and non-statisticians alike who wish to use cutting-edge statistical learning techniques to analyze their data. Th e text assumes only a previous course in linear regression and no knowledge of matrix algebra.

read more

Citations
More filters
Journal ArticleDOI

Towards an ensemble based system for predicting the number of software faults

TL;DR: Main impact of this work is to allow better utilization of testing resources helping in early and quick identification of most of the faults in the software system.
Journal ArticleDOI

Using worldwide edaphic data to model plant species niches: An assessment at a continental extent.

TL;DR: It is demonstrated that global edaphic data adds useful information for plant ENMs, particularly valuable for studies of species that are distributed in regions where more detailed information on soil properties is poor or does not exist.
Journal ArticleDOI

A comprehensive experimental evaluation of orthogonal polynomial expanded random vector functional link neural networks for regression

TL;DR: This research shows a significant improvement of network performance when one uses tansig activation function and Chebyshev orthogonal polynomial for regression problems and may be used as guidelines for OPE-RVFLNN development and implementation for regression Problems.
Journal ArticleDOI

Spatial prediction of soil water retention in a Páramo landscape: Methodological insight into machine learning using random forest

TL;DR: In this paper, the authors presented a detailed methodological insight to model setup and validation of the underlying machine learning approach with random forest, which performed well predicting volumetric water contents between 0.55 cm 3 cm−3.
Journal ArticleDOI

Determining the Points of Change in Time Series of Polarimetric SAR Data

TL;DR: The likelihood ratio test statistic for the homogeneity of several complex variance-covariance matrices is presented and a factorization is given into a product of test statistics that each tests simpler hypotheses of homogeneity up to a certain point and that are independent if the hypothesis of total homogeneity is true.
Related Papers (5)