Data Structures for Statistical Computing in Python

doi:10.25080/MAJORA-92BF1922-00A

Open AccessProceedings ArticleDOI

Data Structures for Statistical Computing in Python

Wes McKinney

- pp 56-61

Chats0

TLDR

P pandas is a new library which aims to facilitate working with data sets common to finance, statistics, and other related fields and to provide a set of fundamental building blocks for implementing statistical models.

Abstract:

In this paper we are concerned with the practical issues of working with data sets common to finance, statistics, and other related fields. pandas is a new library which aims to facilitate working with these data sets and to provide a set of fundamental building blocks for implementing statistical models. We will discuss specific design issues encountered in the course of developing pandas with relevant examples and some comparisons with the R language. We conclude by discussing possible future directions for statistical computing and data analysis using Python.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Deep learning for cellular image analysis

Erick Moen, +5 more

- 27 May 2019 -

Nature Methods

TL;DR: The intersection between deep learning and cellular image analysis is reviewed and an overview of both the mathematical mechanics and the programming frameworks of deep learning that are pertinent to life scientists are provided.

...read moreread less

Journal ArticleDOI

Pingouin: statistics in Python

Raphael Vallat

TL;DR: This presentation explains why Python is far behind the R programming language when it comes to general statistics and why many scientists still rely heavily on R to perform their statistical analyses.

...read moreread less

Journal ArticleDOI

Using DeepLabCut for 3D markerless pose estimation across species and behaviors

Tanmay Nath, +5 more

- 21 Jun 2019 -

Nature Protocols

TL;DR: This protocol describes how to use an open-source toolbox, DeepLabCut, to train a deep neural network to precisely track user-defined features with limited training data, which allows noninvasive behavioral tracking of movement.

...read moreread less

Journal ArticleDOI

Time Series FeatuRe Extraction on basis of Scalable Hypothesis tests (tsfresh – A Python package)

Maximilian Christ, +4 more

- 13 Sep 2018 -

Neurocomputing

TL;DR: The Python package tsfresh (Time Series FeatuRe Extraction on basis of Scalable Hypothesis tests) accelerates this process by combining 63 time series characterization methods, which by default compute a total of 794 time series features, with feature selection on basis automatically configured hypothesis tests.

...read moreread less

Journal ArticleDOI

Fastai: A Layered API for Deep Learning

Jeremy Howard, +1 more

- 11 Feb 2020 -

Information-an International Interdiscip...

TL;DR: This paper has used this library to successfully create a complete deep learning course, which was able to write more quickly than using previous approaches, and the code was more clear.

...read moreread less