z-logo
open-access-imgOpen Access
Data Sanitization to Reduce Private Information Leakage from Functional Genomics
Author(s) -
Gamze Gürsoy,
Prashant S. Emani,
Charlotte M. Bran,
Otto Jolanki,
Arif Harmanci,
J. Seth Strattan,
J. Michael Cherry,
Andrew D. Miranker,
Mark Gerstein
Publication year - 2020
Publication title -
cell
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 26.304
H-Index - 776
eISSN - 1097-4172
pISSN - 0092-8674
DOI - 10.1016/j.cell.2020.09.036
Subject(s) - genomics , biology , functional genomics , raw data , computational biology , data sharing , protocol (science) , personal genomics , incentive , genome , dna sequencing , computer science , gene , genetics , medicine , alternative medicine , pathology , microeconomics , economics , programming language
The generation of functional genomics datasets is surging, because they provide insight into gene regulation and organismal phenotypes (e.g., genes upregulated in cancer). The intent behind functional genomics experiments is not necessarily to study genetic variants, yet they pose privacy concerns due to their use of next-generation sequencing. Moreover, there is a great incentive to broadly share raw reads for better statistical power and general research reproducibility. Thus, we need new modes of sharing beyond traditional controlled-access models. Here, we develop a data-sanitization procedure allowing raw functional genomics reads to be shared while minimizing privacy leakage, enabling principled privacy-utility trade-offs. Our protocol works with traditional Illumina-based assays and newer technologies such as 10x single-cell RNA sequencing. It involves quantifying the privacy leakage in reads by statistically linking study participants to known individuals. We carried out these linkages using data from highly accurate reference genomes and more realistic environmental samples.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom