Existing mechanisms for data deduplication

2021 
Abstract The success of any business today depends on using data effectively. Business analytics requires repeated access to the same data, and because each organization collects data from different sources, much of that data is duplicated. Duplicated data increases the time needed to extract the desired data as well as the probability of errors, which makes data duplication a major challenge in modern business. Data deduplication, often called single-instance storage, is the procedure that identifies duplicate copies and eliminates them, thereby reducing storage overhead. Many techniques are used in data deduplication to determine a single instance of stored data. Applied in the appropriate situation, deduplication technology can have a substantial impact and provide a better solution. This chapter provides a comprehensive review of the different techniques and existing mechanisms used for data deduplication.
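As an illustration only, the idea of single-instance storage described in the abstract can be sketched with content hashing: each piece of data is identified by a digest of its bytes, and a payload is stored only the first time its digest is seen. The abstract does not name a specific technique, so the class `SingleInstanceStore`, its method names, and the choice of SHA-256 are assumptions made for this sketch, not the chapter's method.

```python
import hashlib


class SingleInstanceStore:
    """Hypothetical in-memory store that keeps one copy per distinct content."""

    def __init__(self):
        self._blobs = {}  # digest -> stored bytes (one copy per distinct content)
        self._refs = {}   # logical name -> digest of its content

    def put(self, name: str, data: bytes) -> bool:
        """Store data under name; return True only if new bytes were stored."""
        digest = hashlib.sha256(data).hexdigest()
        is_new = digest not in self._blobs
        if is_new:
            self._blobs[digest] = data
        # Duplicates only add a reference to the existing copy.
        self._refs[name] = digest
        return is_new

    def get(self, name: str) -> bytes:
        """Return the content previously stored under name."""
        return self._blobs[self._refs[name]]

    def stored_bytes(self) -> int:
        """Total size of the unique payloads actually kept."""
        return sum(len(blob) for blob in self._blobs.values())


store = SingleInstanceStore()
store.put("report_a.csv", b"id,value\n1,10\n")
store.put("report_b.csv", b"id,value\n1,10\n")  # same content, no new copy kept
```

In this sketch both names resolve to the same stored bytes, so the storage cost is that of a single copy. Identifying duplicates by exact digest only catches byte-identical data; near-duplicate records collected from different sources would need a different matching approach.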