Data Structures for Statistical Computing in Python
Author(s) -
Wes McKinney
Publication year - 2010
Publication title -
proceedings of the python in science conferences
Language(s) - English
Resource type - Conference proceedings
ISSN - 2575-9752
DOI - 10.25080/majora-92bf1922-00a
Subject(s) - python (programming language) , computer science , data science , data exploration , statistical analysis , data structure , computational statistics , theoretical computer science , data mining , software engineering , programming language , machine learning , statistics , visualization , mathematics
In this paper we are concerned with the practical issues of working with data sets common to finance, statistics, and other related fields. pandas is a new library which aims to facilitate working with these data sets and to provide a set of fundamental building blocks for implementing statistical models. We will discuss specific design issues encountered in the course of developing pandas with relevant examples and some comparisons with the R language. We conclude by discussing possible future directions for statistical computing and data analysis using Python.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom