Premium
The probability of identification: applying ideas from forensic statistics to disclosure risk assessment
Author(s) -
Skinner C. J.
Publication year - 2007
Publication title -
journal of the royal statistical society: series a (statistics in society)
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.103
H-Index - 84
eISSN - 1467-985X
pISSN - 0964-1998
DOI - 10.1111/j.1467-985x.2006.00457.x
Subject(s) - microdata (statistics) , identification (biology) , computer science , actuarial science , control (management) , risk assessment , forensic identification , computer security , census , artificial intelligence , sociology , business , geography , population , botany , demography , biology , archaeology
Summary. The paper establishes a correspondence between statistical disclosure control and forensic statistics regarding their common use of the concept of ‘probability of identification’. The paper then seeks to investigate what lessons for disclosure control can be learnt from the forensic identification literature. The main lesson that is considered is that disclosure risk assessment cannot, in general, ignore the search method that is employed by an intruder seeking to achieve disclosure. The effects of using several search methods are considered. Through consideration of the plausibility of assumptions and ‘worst case’ approaches, the paper suggests how the impact of search method can be handled. The paper focuses on foundations of disclosure risk assessment, providing some justification for some modelling assumptions underlying some existing record level measures of disclosure risk. The paper illustrates the effects of using various search methods in a numerical example based on microdata from a sample from the 2001 UK census.