z-logo
open-access-imgOpen Access
Privacy preserving big data publishing: a scalable k ‐anonymization approach using MapReduce
Author(s) -
Mehta Brijesh B.,
Rao Udai Pratap
Publication year - 2017
Publication title -
iet software
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.305
H-Index - 43
eISSN - 1751-8814
pISSN - 1751-8806
DOI - 10.1049/iet-sen.2016.0264
Subject(s) - computer science , data anonymization , data publishing , scalability , k anonymity , volume (thermodynamics) , big data , anonymity , information privacy , data mining , identification (biology) , publishing , database , computer security , political science , law , physics , botany , quantum mechanics , biology
Big data is collected and processed using different sources and tools that lead to privacy issues. Privacy preserving data publishing techniques such as k ‐anonymity, l ‐diversity, and t ‐closeness are used to de‐identify the data; however, the chances of re‐identification are always remain present since data is collected from multiple sources. Owing to the large volume of data, less generalisation or suppression is required to achieve the same level of privacy, which is also known as ‘large crowd effect’, although it is always challenging to handle such a large data for anonymization. MapReduce handles large volume of data and distributes the data into the smaller chunks across the multiple nodes; consequently, the full advantage of large volume of data is underachieved. Therefore, scalability of privacy preserving techniques becomes a challenging area of research. The authors explore this area and propose an algorithm named scalable k ‐anonymization (SKA) using MapReduce for privacy preserving big data publishing. The authors also compare the approach with existing approaches that results into a remarkable improvement of the data utility and significantly enhances the performance in terms of running time.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here