Data availability, reusability, and analytic reproducibility: evaluating the impact of a mandatory open data policy at the journal Cognition
Author(s) -
Tom E Hardwicke,
Maya B Mathur,
Kyle MacDonald,
Gustav Nilsonne,
George C. Banks,
Mallory C. Kidwell,
Alicia Hofelich Mohr,
Elizabeth Clayton,
Erica J. Yoon,
Michael Tessler,
Richie L. Lenne,
Sara Altman,
Bria Long,
Michael C. Frank
Publication year - 2018
Publication title -
royal society open science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.84
H-Index - 51
ISSN - 2054-5703
DOI - 10.1098/rsos.180448
Subject(s) - credibility , data sharing , computer science , observational study , margin (machine learning) , open data , data quality , value (mathematics) , quality (philosophy) , science policy , reusability , cognition , data science , data mining , psychology , medicine , statistics , business , political science , mathematics , service (business) , world wide web , alternative medicine , psychiatry , marketing , philosophy , law , pathology , epistemology , machine learning , public administration , software , programming language
Access to data is a critical feature of an efficient, progressive and ultimately self-correcting scientific ecosystem. But the extent to which in-principle benefits of data sharing are realized in practice is unclear. Crucially, it is largely unknown whether published findings can be reproduced by repeating reported analyses upon shared data (‘analytic reproducibility’). To investigate this, we conducted an observational evaluation of a mandatory open data policy introduced at the journal Cognition . Interrupted time-series analyses indicated a substantial post-policy increase in data available statements (104/417, 25% pre-policy to 136/174, 78% post-policy), although not all data appeared reusable (23/104, 22% pre-policy to 85/136, 62%, post-policy). For 35 of the articles determined to have reusable data, we attempted to reproduce 1324 target values. Ultimately, 64 values could not be reproduced within a 10% margin of error. For 22 articles all target values were reproduced, but 11 of these required author assistance. For 13 articles at least one value could not be reproduced despite author assistance. Importantly, there were no clear indications that original conclusions were seriously impacted. Mandatory open data policies can increase the frequency and quality of data sharing. However, suboptimal data curation, unclear analysis specification and reporting errors can impede analytic reproducibility, undermining the utility of data sharing and the credibility of scientific findings.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom