A Survey on Deduplication Checking in Cloud Computing

Prasad Chavan, Aishwarya Dupatre, Saiprasad Chalikwar, N. M. Waghdarikar, N. S. Kawathekar


Cloud computing has been one of the most prominent buzzwords of the last few years, yet people have, perhaps surprisingly, been using it for more than 10 years. Gmail, Facebook, Dropbox, Skype, PayPal, and Salesforce.com are all examples of cloud solutions, although their users have rarely thought about them in these terms. The main idea behind the cloud is that information can be accessed over the internet without detailed knowledge of the underlying infrastructure that delivers it. One of the major services offered by cloud computing is cloud storage. With cloud storage, data is stored on multiple third-party servers that the user does not manage, and the user does not know exactly where the data is saved. As the volume of data grows every day, handling, managing, and above all storing that data has become a major problem for individuals and organizations. This article surveys the storage space occupied by duplicate data in the cloud. While data grows day by day, a considerable share of cloud storage is occupied by duplicate copies, so data needs to be checked for duplicates before it is uploaded to the cloud.

