scispace - formally typeset
Open AccessProceedings ArticleDOI

Data Structures for Statistical Computing in Python

Wes McKinney
- pp 56-61
Reads0
Chats0
TLDR
P pandas is a new library which aims to facilitate working with data sets common to finance, statistics, and other related fields and to provide a set of fundamental building blocks for implementing statistical models.
Abstract
In this paper we are concerned with the practical issues of working with data sets common to finance, statistics, and other related fields. pandas is a new library which aims to facilitate working with these data sets and to provide a set of fundamental building blocks for implementing statistical models. We will discuss specific design issues encountered in the course of developing pandas with relevant examples and some comparisons with the R language. We conclude by discussing possible future directions for statistical computing and data analysis using Python.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Deep learning for cellular image analysis

TL;DR: The intersection between deep learning and cellular image analysis is reviewed and an overview of both the mathematical mechanics and the programming frameworks of deep learning that are pertinent to life scientists are provided.
Journal ArticleDOI

Pingouin: statistics in Python

TL;DR: This presentation explains why Python is far behind the R programming language when it comes to general statistics and why many scientists still rely heavily on R to perform their statistical analyses.
Journal ArticleDOI

Using DeepLabCut for 3D markerless pose estimation across species and behaviors

TL;DR: This protocol describes how to use an open-source toolbox, DeepLabCut, to train a deep neural network to precisely track user-defined features with limited training data, which allows noninvasive behavioral tracking of movement.
Journal ArticleDOI

Time Series FeatuRe Extraction on basis of Scalable Hypothesis tests (tsfresh – A Python package)

TL;DR: The Python package tsfresh (Time Series FeatuRe Extraction on basis of Scalable Hypothesis tests) accelerates this process by combining 63 time series characterization methods, which by default compute a total of 794 time series features, with feature selection on basis automatically configured hypothesis tests.
Journal ArticleDOI

Fastai: A Layered API for Deep Learning

TL;DR: This paper has used this library to successfully create a complete deep learning course, which was able to write more quickly than using previous approaches, and the code was more clear.
Related Papers (5)