
Proof of Concept Example for Use of Simulation to Allow Data Pooling Despite Privacy Restrictions
Author(s) -
Teresa Filshtein,
Xiang Li,
Scott C Zimmerman,
Sarah F Ackley,
M. Maria Glymour,
Melinda C. Power
Publication year - 2021
Publication title -
epidemiology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.901
H-Index - 173
eISSN - 1531-5487
pISSN - 1044-3983
DOI - 10.1097/ede.0000000000001373
Subject(s) - pooling , computer science , proof of concept , internet privacy , data science , computer security , artificial intelligence , operating system
Integrating results from multiple samples is often desirable, but privacy restrictions may preclude full data pooling, and most datasets do not include fully harmonized variable sets. We propose a simulation-based method leveraging partial information across datasets to guide creation of synthetic data based on explicit assumptions about the underlying causal structure that permits pooled analyses that adjust for all desired confounders in the context of privacy restrictions.