
STRENGTHNING THE PRODUCTIVITY OF STORAGE FOR BIG DATA STORAGE SYSTEMS USING DISTRIBUTED DEDUPLICATION
Publication year - 2020
Publication title -
international journal for innovative engineering and management research
Language(s) - English
Resource type - Journals
ISSN - 2456-5083
DOI - 10.48047/ijiemr/v09/i12/114
Subject(s) - computer science , data deduplication , upload , cloud computing , cloud storage , server , database , operating system , file server , file system , computer network , versioning file system , stub file , computer file
Cloud storage is one of the key features of cloud computing, which helps cloud users outsourcelarge numbers of data without upgrading their devices. However, Cloud Service Providers (CSPs) data storagefaces problems with data redundancy. The data deduplication technique aims at eliminating redundantinformation segments and maintains one single instance of the data set, even if any number of users own similardata set. Since blocks of data are spread on many servers, each block of the file has to be downloaded beforerestoring the file to decrease system output.We suggest a cloud storage server data recovery module to improve file access efficiency and reduce time spenton network bandwidth. Device coding is used in the suggested method to store blocks in distributed cloudstorage, and for data integrity, MD5 (Message Digest 5) is used. Running recovery algorithm helps the user toretrieve a file directly from the cloud servers without downloading every block. The scheme proposed improvessystem time efficiency and the ability to access the stored data quickly. This reduces bandwidth consumption andreduces overhead user processing while downloading the data file.