Data Roll-Back for CDP Based on Deduplication

Ma Jingwei,Wang Kebin,Zhao Bin,Ma Liang,Wang Gang,Liu Xiaoguang
2012-01-01
Journal of Computer Research and Development
Abstract:Continuous data protection system based on data deduplication is an ideal way of data backup. It can achieve time-continuous data protection, and can largely reduce the storage overhead. The speed of the rollback of the system seriously affects the whole system's performance. According to the features of continuous data protection based on data deduplication, we implemented two rapid rollback methods for this system. One is normal rollback method and the other is zero-copy rollback method. The normal method reads every block of metadata in the system and copy the corresponding data block to the right position of the target volume. This is suitable for the rollback of the entire volume and large size data. For small size data, the zero-copy method will be used. It emulates a physical volume to the user, then the user can read or mount the volume and get the data blocks which are really needed. To take use the locality of the data, an in memory cache is implemented in this method. We tested the speed of the two methods and experimental results show that the two methods can roll back the data very well, and the cache can largely speed up the process of zero-copy rollback method.
What problem does this paper attempt to address?