Ddelta: A Deduplication-Inspired Fast Delta Compression Approach

Wen Xia,Hong Jiang,Dan Feng,Lei Tian,Min Fu,Yukun Zhou
DOI: https://doi.org/10.1016/j.peva.2014.07.016
IF: 2.205
2014-01-01
Performance Evaluation
Abstract:Delta compression is an efficient data reduction approach to removing redundancy among similar data chunks and files in storage systems. One of the main challenges facing delta compression is its low encoding speed, a worsening problem in face of the steadily increasing storage and network bandwidth and speed. In this paper, we present Ddelta, a deduplication-inspired fast delta compression scheme that effectively leverages the simplicity and efficiency of data deduplication techniques to improve delta encoding/decoding performance. The basic idea behind Ddelta is to (1) accelerate the delta encoding and decoding processes by a novel approach of combining Gear-based chunking and Spooky-based fingerprinting for fast identification of duplicate strings for delta calculation, and (2) exploit content locality of redundant data to detect more duplicates by greedily scanning the areas immediately adjacent to already detected duplicate chunks/strings. Our experimental evaluation of a Ddelta prototype based on real-world datasets shows that Ddelta achieves an encoding speedup of 2.5x-8x and a decoding speedup of 2x-20x over the classic delta-compression approaches Xdelta and Zdelta while achieving a comparable level of compression ratio. (C) 2014 Elsevier B.V. All rights reserved.
What problem does this paper attempt to address?