Providing Data With High Utility And No Disclosure Risk For The Public and Researchers: An Evaluation By Advanced Statistical Disclosure Risk Methods
Author(s) -
Matthias Templ
Publication year - 2014
Publication title -
austrian journal of statistics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.342
H-Index - 9
ISSN - 1026-597X
DOI - 10.17713/ajs.v43i4.43
Subject(s) - confidentiality , data set , computer science , set (abstract data type) , internet privacy , computer security , data mining , data science , actuarial science , business , artificial intelligence , programming language
The demand of data from surveys, registers or other data sets containing sensible information on people or enterprises have been increased significantly over the last years. However, before providing data to the public or to researchers, confidentiality has to be respected for any data set containing sensible individual information. Confidentiality can be achieved by applying statistical disclosure control (SDC) methods to the data. The research on SDC methods becomes more and more important in the last years because of an increase of the awareness on data privacy and because of the fact that more and more data are provided to the public or to researchers. However, for legal reasons this is only visible when the released data has (very) low disclosure risk. In this contribution existing disclosure risk methods are review and summarized. These methods are finally applied on a popular real-world data set - the Structural Earnings Survey (SES) of Austria. It is shown that the application of few selected anonymisation methods leads to well-protected anonymised data with high data utility and low information loss.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom