
Hadoop solution for large data protection
Author(s) -
N. MASLOVA,
Olha Polovynka
Publication year - 2021
Publication title -
fìziko-matematične modelûvannâ ta ìnformacìjnì tehnologìï/fìzìko-matematične modelûvannâ ta ìnformacìjnì tehnologìï
Language(s) - English
Resource type - Journals
eISSN - 2617-5258
pISSN - 1816-1545
DOI - 10.15407/fmmit2021.33.023
Subject(s) - computer science , cloud computing , database , big data , work (physics) , gateway (web page) , data processing , process (computing) , data mining , operating system , world wide web , engineering , mechanical engineering
Investigated one of large data problems of - providing protection in the process of accumulation and processing. The case of application of Hadoop technology and its latest modification Apache Hadoop 3.3.0 is considered. A solution is proposed with strengthening the protection of processed data, connecting the Apache Knox Gateway, Apache Ranger and Apache Atlas tools. The possibil-ity of using data obtained as a result of the work of local databases, electronic archives, database management systems and individual users is provided. The solution also features the use of a pri-vate cloud and cryptographic algorithms. An example of the implementation of a secure solution to the problem of Intelligent Data Analysis is given on the example of a parallel version of the problem of finding association rules when working with unstructured data of large volumes.