scispace - formally typeset
Search or ask a question
Book

An Introduction to Statistical Learning: with Applications in R

TL;DR: This book presents some of the most important modeling and prediction techniques, along with relevant applications, that have emerged in fields ranging from biology to finance to marketing to astrophysics in the past twenty years.
Abstract: An Introduction to Statistical Learning provides an accessible overview of the field of statistical learning, an essential toolset for making sense of the vast and complex data sets that have emerged in fields ranging from biology to finance to marketing to astrophysics in the past twenty years. This book presents some of the most important modeling and prediction techniques, along with relevant applications. Topics include linear regression, classification, resampling methods, shrinkage approaches, tree-based methods, support vector machines, clustering, and more. Color graphics and real-world examples are used to illustrate the methods presented. Since the goal of this textbook is to facilitate the use of these statistical learning techniques by practitioners in science, industry, and other fields, each chapter contains a tutorial on implementing the analyses and methods presented in R, an extremely popular open source statistical software platform. Two of the authors co-wrote The Elements of Statistical Learning (Hastie, Tibshirani and Friedman, 2nd edition 2009), a popular reference book for statistics and machine learning researchers. An Introduction to Statistical Learning covers many of the same topics, but at a level accessible to a much broader audience. This book is targeted at statisticians and non-statisticians alike who wish to use cutting-edge statistical learning techniques to analyze their data. The text assumes only a previous course in linear regression and no knowledge of matrix algebra.
Citations
More filters
Journal ArticleDOI
01 Dec 2017
TL;DR: The current status of AI applications in healthcare, in the three major areas of early detection and diagnosis, treatment, as well as outcome prediction and prognosis evaluation, are surveyed and its future is discussed.
Abstract: Artificial intelligence (AI) aims to mimic human cognitive functions. It is bringing a paradigm shift to healthcare, powered by increasing availability of healthcare data and rapid progress of analytics techniques. We survey the current status of AI applications in healthcare and discuss its future. AI can be applied to various types of healthcare data (structured and unstructured). Popular AI techniques include machine learning methods for structured data, such as the classical support vector machine and neural network, and the modern deep learning, as well as natural language processing for unstructured data. Major disease areas that use AI tools include cancer, neurology and cardiology. We then review in more details the AI applications in stroke, in the three major areas of early detection and diagnosis, treatment, as well as outcome prediction and prognosis evaluation. We conclude with discussion about pioneer AI systems, such as IBM Watson, and hurdles for real-life deployment of AI.

1,785 citations

Journal ArticleDOI
Hal R. Varian1
TL;DR: A few tools for manipulating and analyzing big data such as decision trees, support vector machines, neural nets, deep learning, and so on may allow for more effective ways to model complex relationships.
Abstract: Computers are now involved in many economic transactions and can capture data associated with these transactions, which can then be manipulated and analyzed. Conventional statistical and econometric techniques such as regression often work well, but there are issues unique to big datasets that may require different tools. First, the sheer size of the data involved may require more powerful data manipulation tools. Second, we may have more potential predictors than appropriate for estimation, so we need to do some kind of variable selection. Third, large datasets may allow for more flexible relationships than simple linear models. Machine learning techniques such as decision trees, support vector machines, neural nets, deep learning, and so on may allow for more effective ways to model complex relationships. In this essay, I will describe a few of these tools for manipulating and analyzing big data. I believe that these methods have a lot to offer and should be more widely known and used by econom...

1,120 citations

Posted Content
TL;DR: This work discusses core RL elements, including value function, in particular, Deep Q-Network (DQN), policy, reward, model, planning, and exploration, and important mechanisms for RL, including attention and memory, unsupervised learning, transfer learning, multi-agent RL, hierarchical RL, and learning to learn.
Abstract: We give an overview of recent exciting achievements of deep reinforcement learning (RL). We discuss six core elements, six important mechanisms, and twelve applications. We start with background of machine learning, deep learning and reinforcement learning. Next we discuss core RL elements, including value function, in particular, Deep Q-Network (DQN), policy, reward, model, planning, and exploration. After that, we discuss important mechanisms for RL, including attention and memory, unsupervised learning, transfer learning, multi-agent RL, hierarchical RL, and learning to learn. Then we discuss various applications of RL, including games, in particular, AlphaGo, robotics, natural language processing, including dialogue systems, machine translation, and text generation, computer vision, neural architecture design, business management, finance, healthcare, Industry 4.0, smart grid, intelligent transportation systems, and computer systems. We mention topics not reviewed yet, and list a collection of RL resources. After presenting a brief summary, we close with discussions. Please see Deep Reinforcement Learning, arXiv:1810.06339, for a significant update.

935 citations


Cites background from "An Introduction to Statistical Lear..."

  • ...Bishop (2011), Hastie et al. (2009), and Murphy (2012) are popular machine learning textbooks; James et al. (2013) gives an introduction to machine learning; Provost and Fawcett (2013) and Kuhn and Johnson (2013) discuss practical issues in machine learning applications; and Simeone (2017) is a…...

    [...]

Journal ArticleDOI
14 Jun 2018-Cell
TL;DR: A primer on machine learning for life scientists is provided, including an introduction to deep learning, which could impact disease biology, drug discovery, microbiome research, and synthetic biology.

619 citations


Cites background from "An Introduction to Statistical Lear..."

  • ...Categories of Machine-Learning Methods There are two overarching categories of machine learning methods—namely, unsupervised and supervised learning (see James et al., 2013; Rencher, 2002)....

    [...]

Journal ArticleDOI
01 Aug 2020
TL;DR: This Viewpoint provides a framework for the application of digital technologies in pandemic management and response, highlighting ways in which successful countries have adopted these technologies for pandemic planning, surveillance, testing, contact tracing, quarantine, and health care.
Abstract: Summary With high transmissibility and no effective vaccine or therapy, COVID-19 is now a global pandemic Government-coordinated efforts across the globe have focused on containment and mitigation, with varying degrees of success Countries that have maintained low COVID-19 per-capita mortality rates appear to share strategies that include early surveillance, testing, contact tracing, and strict quarantine The scale of coordination and data management required for effective implementation of these strategies has—in most successful countries—relied on adopting digital technology and integrating it into policy and health care This Viewpoint provides a framework for the application of digital technologies in pandemic management and response, highlighting ways in which successful countries have adopted these technologies for pandemic planning, surveillance, testing, contact tracing, quarantine, and health care

578 citations

Trending Questions (1)
What are the key concepts and techniques used in statistical learning?

Key concepts and techniques in statistical learning include linear regression, classification, resampling methods, shrinkage approaches, tree-based methods, support vector machines, and clustering, as outlined in the book "An Introduction to Statistical Learning."