z-logo
open-access-imgOpen Access
Nested data structures in array frameworks
Author(s) -
J. Pivarski,
D. Lange,
P. Elmer
Publication year - 2020
Publication title -
journal of physics. conference series
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.21
H-Index - 85
eISSN - 1742-6596
pISSN - 1742-6588
DOI - 10.1088/1742-6596/1525/1/012053
Subject(s) - computer science , array data structure , syntax , computational science , code (set theory) , programming language , data structure , multiset , theoretical computer science , mathematics , artificial intelligence , set (abstract data type) , combinatorics
The need for nested data structures and combinatorial operations on arbitrary length lists has prevented particle physicists from adopting array-based data analysis frameworks, such as R, MATLAB, Numpy, and Pandas. These array frameworks work well for purely rectangular tables and hypercubes, but arrays of variable length arrays, called “jagged arrays,” are out of their scope. However, jagged arrays are a fundamental feature of particle physics data, as well as combining them to search for particle decays. To bridge this gap, we developed the awkward-array library, and in this paper we present feedback from some of the first physics groups using it for their analyses. They report similar computational performance to analysis code written in C++, but are split on the ease-of-use of array syntax. In a series of four phone interviews, all users noted how different array programming is from imperative programming, but whereas some found it easier in all aspects, others said it was more difficult to write, yet easier to read.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here