Flexible data anonymization using ARX—Current status and challenges ahead

Author(s) - Prasser Fabian, Eicher Johanna, Spengler Helmut, Bild Raffael, Kuhn Klaus A.

Publication year - 2020

Publication title - Software: Practice and Experience

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.437

H-Index - 70

eISSN - 1097-024X

pISSN - 0038-0644

DOI - 10.1002/spe.2812

Subject(s) - computer science, scalability, data anonymization, data science, data quality, set (abstract data type), data mining, field (mathematics), context (archaeology), risk analysis (engineering), information privacy, computer security, database, engineering, medicine, paleontology, metric (unit), operations management, mathematics, pure mathematics, biology, programming language

Summary - The race for innovation has turned into a race for data. The rapid development of new technologies, especially in the field of artificial intelligence, is accompanied by new ways of accessing, integrating, and analyzing sensitive personal data. Examples include financial transactions, social network activities, location traces, and medical records. As a consequence, adequate and careful privacy management has become a significant challenge. New data protection regulations, for example in the EU and China, are direct responses to these developments. Data anonymization is an important building block of data protection concepts, as it reduces privacy risks by altering data. The development of anonymization tools involves significant challenges, however. For instance, the effectiveness of different anonymization techniques depends on context, and thus tools need to support a large set of methods to ensure that the usefulness of data is not overly affected by risk-reducing transformations. Despite these requirements, existing solutions typically support only a small set of methods. In this work, we describe how we have extended ARX, an open source data anonymization tool, to support almost arbitrary combinations of a wide range of techniques in a scalable manner. We then review the spectrum of methods supported and discuss their compatibility within the novel framework. The results of an extensive experimental comparison show that our approach outperforms related solutions in terms of scalability and output data quality, while supporting a much broader range of techniques. Finally, we discuss practical experiences with ARX and present remaining issues and challenges ahead.
