Open Access

Decision-Making in Research Tasks with Sequential Testing

Author(s) - Thomas Pfeiffer, David G. Rand, Anna Dreber

Publication year - 2009

Publication title - PLoS ONE

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.99

H-Index - 332

ISSN - 1932-6203

DOI - 10.1371/journal.pone.0004607

Subject(s) - false positive paradox, computer science, reliability (semiconductor), statistical hypothesis testing, prior probability, multiple comparisons problem, fraction (chemistry), false positives and false negatives, test (biology), simple (philosophy), machine learning, econometrics, artificial intelligence, bayesian probability, statistics, mathematics, biology, paleontology, power (physics), physics, chemistry, philosophy, organic chemistry, epistemology, quantum mechanics

Background: A recent controversial essay by J. P. A. Ioannidis in PLoS Medicine argued that, in some research fields, most published findings are false. Theoretical reasoning shows that small effect sizes, error-prone tests, low prior probabilities of the tested hypotheses, and biases in the evaluation and publication of research findings all increase the fraction of false positives. These findings raise concerns about the reliability of research. However, they rest on a very simple scenario of scientific research in which single tests are used to evaluate independent hypotheses.

Methodology/Principal Findings: In this study, we present computer simulations and experimental approaches for analyzing more realistic scenarios. In these scenarios, research tasks are solved sequentially, i.e. subsequent tests can be chosen depending on previous results. We investigate simple sequential testing as well as scenarios in which only a selected subset of results can be published and used for future rounds of test choice. Results from computer simulations indicate that, for the tasks analyzed in this study, the fraction of false findings among the positive findings declines over several rounds of testing if the most informative tests are performed. Our experiments show that human subjects frequently perform the most informative tests, leading to a decline in false positives as expected from the simulations.

Conclusions/Significance: For the research tasks studied here, findings tend to become more reliable over time. We also find that performance in the experimental settings where not all performed tests could be published was surprisingly inefficient. Our results may help optimize existing procedures used in the practice of scientific research and provide guidance for the development of novel forms of scholarly communication.
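The single-test reasoning mentioned in the background can be illustrated with a short calculation. The sketch below is not the authors' simulation: it assumes a deliberately simplified setting in which the same hypothesis is retested with independent tests and only hypotheses that were positive in every round are carried forward. It does not model the choice of the most informative test. The function names and the default values for the significance level (`alpha`) and `power` are illustrative assumptions.

```python
def false_positive_fraction(prior, alpha=0.05, power=0.8):
    """Fraction of positive results that are false for a single test.

    prior: probability that a tested hypothesis is true (assumed value).
    alpha: false positive rate of the test (assumed value).
    power: probability of detecting a true effect (assumed value).
    """
    true_pos = power * prior
    false_pos = alpha * (1.0 - prior)
    return false_pos / (true_pos + false_pos)


def fractions_over_rounds(prior, rounds, alpha=0.05, power=0.8):
    """False fraction among hypotheses that were positive in every round so far.

    Simplifying assumption: each round is an independent retest, so the
    posterior probability after a positive result is the next round's prior.
    """
    fractions = []
    p_true = prior
    for _ in range(rounds):
        false_frac = false_positive_fraction(p_true, alpha, power)
        fractions.append(false_frac)
        p_true = 1.0 - false_frac  # Bayesian update after a positive result
    return fractions


if __name__ == "__main__":
    for round_no, frac in enumerate(fractions_over_rounds(0.1, 3), start=1):
        print(f"round {round_no}: false fraction among positives = {frac:.4f}")
```

Under these assumed parameters, lowering `prior` or `power`, or raising `alpha`, increases the single-test false fraction, while repeated independent confirmation lowers it from round to round. This is consistent in direction with the abstract's claims, though it is far simpler than the tasks studied in the paper.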
