Data Structures for Statistical Computing in Python
Wes McKinney
- pp 56-61
Reads0
Chats0
TLDR
P pandas is a new library which aims to facilitate working with data sets common to finance, statistics, and other related fields and to provide a set of fundamental building blocks for implementing statistical models.Abstract:
In this paper we are concerned with the practical issues of working with data sets common to finance, statistics, and other related fields. pandas is a new library which aims to facilitate working with these data sets and to provide a set of fundamental building blocks for implementing statistical models. We will discuss specific design issues encountered in the course of developing pandas with relevant examples and some comparisons with the R language. We conclude by discussing possible future directions for statistical computing and data analysis using Python.read more
Citations
More filters
Posted Content
A Framework for Data-Driven Robotics
Serkan Cabi,Sergio Gomez Colmenarejo,Alexander Novikov,Ksenia Konyushkova,Scott Reed,Rae Jeong,Konrad Zolna,Yusuf Aytar,David Budden,Mel Vecerik,Oleg P. Sushkov,David J. Barker,Jonathan Scholz,Misha Denil,Nando de Freitas,Ziyu Wang +15 more
TL;DR: It is shown that using the framework presented, it is possible to train agents to perform a variety of challenging manipulation tasks including stacking rigid objects and handling cloth and to learn a robot policy offline using batch RL.
Posted ContentDOI
A Robust Role for Motor Cortex
TL;DR: A new role for motor cortex is proposed: extending the robustness of sub-cortical movement systems, specifically to unexpected situations demanding rapid motor responses adapted to environmental context, and the implications of this idea for current and future research are discussed.
Journal ArticleDOI
Hellinger Distance Weighted Ensemble for imbalanced data stream classification
TL;DR: The classifier ensemble for classifying binary, non-stationary and imbalanced data streams where the Hellinger Distance is used to prune the ensemble to prove the hdwe method's usefulness.
Posted Content
Optimizing Stochastic Gradient Descent in Text Classification Based on Fine-Tuning Hyper-Parameters Approach. A Case Study on Automatic Classification of Global Terrorist Attacks.
TL;DR: The research concludes that using a grid-search to find the hyper-parameters optimize SGD classification, not in the pre-classification settings only, but also in the performance of the classifiers in terms of accuracy and execution time.
Related Papers (5)
Scikit-learn: Machine Learning in Python
SciPy 1.0: fundamental algorithms for scientific computing in Python.
Pauli Virtanen,Ralf Gommers,Travis E. Oliphant,Matt Haberland,Matt Haberland,Tyler Reddy,David Cournapeau,Evgeni Burovski,Pearu Peterson,Warren Weckesser,Jonathan Bright,Stefan van der Walt,Matthew Brett,Joshua Wilson,K. Jarrod Millman,Nikolay Mayorov,Andrew Nelson,Eric Jones,Robert Kern,Eric B. Larson,CJ Carey,Ilhan Polat,Yu Feng,Eric Moore,Jake Vanderplas,Denis Laxalde,Josef Perktold,Robert Cimrman,Ian Henriksen,Ian Henriksen,E. A. Quintero,Charles R. Harris,Anne M. Archibald,Antônio H. Ribeiro,Fabian Pedregosa,Paul van Mulbregt,SciPy . Contributors +36 more
Array programming with NumPy
Charles R. Harris,K. Jarrod Millman,Stefan van der Walt,Stefan van der Walt,Ralf Gommers,Pauli Virtanen,David Cournapeau,Eric Wieser,Julian Taylor,Sebastian Berg,Nathaniel J. Smith,Robert Kern,Matti Picus,Stephan Hoyer,Marten H. van Kerkwijk,Matthew Brett,Matthew Brett,Allan Haldane,Jaime Fernández del Río,Mark Wiebe,Mark Wiebe,Pearu Peterson,Pierre Gérard-Marchant,Kevin Sheppard,Tyler Reddy,Warren Weckesser,Hameer Abbasi,Christoph Gohlke,Travis E. Oliphant +28 more