Open Access
Automated Deployment of Data Lake
Author(s) - J Ganavi
Publication year - 2021
Publication title - International Journal for Research in Applied Science and Engineering Technology
Language(s) - English
Resource type - Journals
ISSN - 2321-9653
DOI - 10.22214/ijraset.2021.37946
Subject(s) - software deployment , cloud computing , computer science , database , architecture , core (optical fiber) , world wide web , operating system , telecommunications , art , visual arts
Abstract: A data lake is a central repository that stores structured and unstructured data, regardless of source or format. The automated data lake deployment described here is a reference implementation that deploys a highly available, cost-effective data lake architecture on the AWS Cloud, along with a user-friendly console for searching and requesting datasets. The solution automatically configures the core AWS services needed to tag, search, share, transform, analyse, and govern specific subsets of data within a company or with external users. Through the console, users can search and browse the datasets available for their business needs.
Keywords: Data Lake, Cloud Computing, AWS, EC2, S3, Athena, Glue, CloudFormation.
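
As a rough illustration of what "automated deployment" can mean here, the sketch below builds a minimal AWS CloudFormation template in Python containing an S3 bucket for storage and a Glue database for the catalog, two of the services named in the keywords. It is not the template used by the solution described in the abstract. The function name `build_data_lake_template`, the logical IDs `DataLakeBucket` and `DataLakeDatabase`, and the database name `data_lake_catalog` are all assumptions made for this example.

```python
import json


def build_data_lake_template(bucket_logical_id="DataLakeBucket",
                             database_name="data_lake_catalog"):
    """Return a minimal CloudFormation template as a plain Python dict.

    Hypothetical skeleton for illustration only: one versioned, encrypted
    S3 bucket for raw data and one Glue database to catalog it.
    """
    return {
        "AWSTemplateFormatVersion": "2010-09-09",
        "Description": "Illustrative data lake skeleton (assumed, not the real solution)",
        "Resources": {
            # Storage layer: the bucket that holds the datasets.
            bucket_logical_id: {
                "Type": "AWS::S3::Bucket",
                "Properties": {
                    "VersioningConfiguration": {"Status": "Enabled"},
                    "BucketEncryption": {
                        "ServerSideEncryptionConfiguration": [
                            {"ServerSideEncryptionByDefault": {"SSEAlgorithm": "AES256"}}
                        ]
                    },
                },
            },
            # Catalog layer: a Glue database that Athena can query against.
            "DataLakeDatabase": {
                "Type": "AWS::Glue::Database",
                "Properties": {
                    "CatalogId": {"Ref": "AWS::AccountId"},
                    "DatabaseInput": {"Name": database_name},
                },
            },
        },
        "Outputs": {
            "BucketName": {"Value": {"Ref": bucket_logical_id}},
        },
    }


if __name__ == "__main__":
    # Print the template as JSON, ready to be saved to a file.
    print(json.dumps(build_data_lake_template(), indent=2))
```

The printed JSON could be saved to a file and passed to CloudFormation to create the stack. A full data lake deployment would need considerably more, such as the search console, access controls, and transformation jobs, none of which are modelled here.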