scispace - formally typeset
Open AccessJournal ArticleDOI

Array programming with NumPy

Reads0
Chats0
TLDR
In this paper, the authors review how a few fundamental array concepts lead to a simple and powerful programming paradigm for organizing, exploring and analysing scientific data, and their evolution into a flexible interoperability layer between increasingly specialized computational libraries is discussed.
Abstract
Array programming provides a powerful, compact and expressive syntax for accessing, manipulating and operating on data in vectors, matrices and higher-dimensional arrays. NumPy is the primary array programming library for the Python language. It has an essential role in research analysis pipelines in fields as diverse as physics, chemistry, astronomy, geoscience, biology, psychology, materials science, engineering, finance and economics. For example, in astronomy, NumPy was an important part of the software stack used in the discovery of gravitational waves1 and in the first imaging of a black hole2. Here we review how a few fundamental array concepts lead to a simple and powerful programming paradigm for organizing, exploring and analysing scientific data. NumPy is the foundation upon which the scientific Python ecosystem is constructed. It is so pervasive that several projects, targeting audiences with specialized needs, have developed their own NumPy-like interfaces and array objects. Owing to its central position in the ecosystem, NumPy increasingly acts as an interoperability layer between such array computation libraries and, together with its application programming interface (API), provides a flexible framework to support the next decade of scientific and industrial analysis. NumPy is the primary array programming library for Python; here its fundamental concepts are reviewed and its evolution into a flexible interoperability layer between increasingly specialized computational libraries is discussed.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Testing new-physics models with global comparisons to collider measurements: the Contur toolkit

TL;DR: This manual describes the design choices that have been made in the Contur tool, as well as detailing pitfalls and common issues to avoid, and hopes that with the help of this documentation, external groups will be able to run their own Contur studies, for example when proposing a new model, or pitching a new search.
Journal ArticleDOI

pyaneti – II. A multidimensional Gaussian process approach to analysing spectroscopic time-series

TL;DR: Pyaneti as mentioned in this paper uses a built-in multi-dimensional Gaussian process approach to model radial velocity and activity indicator time-series with different underlying covariance functions and allows multi-band and single transit modelling; it runs on Python 3 and features overall improvements in performance.
Journal ArticleDOI

GWAS and ExWAS of blood mitochondrial DNA copy number identifies 71 loci and highlights a potential causal role in dementia

- 13 Jan 2022 - 
TL;DR: Chong et al. as discussed by the authors developed a method for array-based mtDNA-CN estimation suitable for biobank-scale studies, called AutoMitoC, which applied to 395,781 UKBiobank study participants and performed genome-and exome-wide association studies, identifying novel common and rare genetic determinants.
Journal ArticleDOI

Revealing the impact of global heating on North Atlantic circulation using transparent machine learning

TL;DR: The North Atlantic Ocean is key to climate through its role in heat transport and storage as mentioned in this paper, but the physical drivers of this change are poor. But the North Atlantic ocean is also key to heat storage and storage.
Journal Article

Perfect density models cannot guarantee anomaly detection

TL;DR: A closer look at the behavior of distribution densities through the lens of reparametrization is taken and it is shown that these quantities carry less meaningful information than previously thought, beyond estimation issues or the curse of dimensionality.
References
More filters
Journal Article

Scikit-learn: Machine Learning in Python

TL;DR: Scikit-learn is a Python module integrating a wide range of state-of-the-art machine learning algorithms for medium-scale supervised and unsupervised problems, focusing on bringing machine learning to non-specialists using a general-purpose high-level language.
Journal ArticleDOI

Matplotlib: A 2D Graphics Environment

TL;DR: Matplotlib is a 2D graphics package used for Python for application development, interactive scripting, and publication-quality image generation across user interfaces and operating systems.
Journal ArticleDOI

SciPy 1.0--Fundamental Algorithms for Scientific Computing in Python

TL;DR: SciPy as discussed by the authors is an open source scientific computing library for the Python programming language, which includes functionality spanning clustering, Fourier transforms, integration, interpolation, file I/O, linear algebra, image processing, orthogonal distance regression, minimization algorithms, signal processing, sparse matrix handling, computational geometry, and statistics.
Proceedings ArticleDOI

TensorFlow: a system for large-scale machine learning

TL;DR: TensorFlow as mentioned in this paper is a machine learning system that operates at large scale and in heterogeneous environments, using dataflow graphs to represent computation, shared state, and the operations that mutate that state.
Related Papers (5)