ELSA: Efficient Long-Term Secure Storage of Large Datasets

Matthias Geihs,Johannes Buchmann
DOI: https://doi.org/10.48550/arXiv.1810.11888
2018-10-29
Abstract:An increasing amount of information today is generated, exchanged, and stored digitally. This also includes long-lived and highly sensitive information (e.g., electronic health records, governmental documents) whose integrity and confidentiality must be protected over decades or even centuries. While there is a vast amount of cryptography-based data protection schemes, only few are designed for long-term protection. Recently, Braun et al. (AsiaCCS'17) proposed the first long-term protection scheme that provides renewable integrity protection and information-theoretic confidentiality protection. However, computation and storage costs of their scheme increase significantly with the number of stored data items. As a result, their scheme appears suitable only for protecting databases with a small number of relatively large data items, but unsuitable for databases that hold a large number of relatively small data items (e.g., medical record databases). In this work, we present a solution for efficient long-term integrity and confidentiality protection of large datasets consisting of relatively small data items. First, we construct a renewable vector commitment scheme that is information-theoretically hiding under selective decommitment. We then combine this scheme with renewable timestamps and information-theoretically secure secret sharing. The resulting solution requires only a single timestamp for protecting a dataset while the state of the art requires a number of timestamps linear in the number of data items. We implemented our solution and measured its performance in a scenario where 12 000 data items are aggregated, stored, protected, and verified over a time span of 100 years. Our measurements show that our new solution completes this evaluation scenario an order of magnitude faster than the state of the art.
Cryptography and Security
What problem does this paper attempt to address?
The problem that this paper attempts to solve is: How to efficiently protect the integrity and confidentiality of large - scale data sets during long - term storage, especially when the data set consists of a large number of relatively small data items. Current long - term protection schemes are usually costly in terms of calculation and storage, especially when dealing with databases containing a large number of small data items. Therefore, these schemes have limitations in practical applications, such as in scenarios where long - term protection is required for medical records or government documents. To meet this challenge, the paper proposes a new solution - ELSA (Efficient Long - Term Secure Storage of Large Datasets), aiming to achieve efficient and secure long - term data protection through the following technical means: 1. **Renewable Vector Commitment Scheme**: A renewable vector commitment scheme is constructed, which has information - theoretic hiding under selective de - commitment. This commitment scheme allows a single short - term commitment for multiple data items, while supporting renewable integrity protection and information - theoretic confidentiality. 2. **Renewable Timestamp**: Combine the renewable timestamp technology to ensure the long - term integrity protection of data. 3. **Proactive Secret Sharing**: Use information - theoretically secure secret sharing technology to ensure the long - term confidentiality of data. Through these technical means, ELSA can significantly reduce the communication and computational complexity required to protect large - scale data sets. Specifically, compared with the existing state - of - the - art schemes, ELSA only needs one timestamp to protect the entire data set, instead of one timestamp for each data item. This not only reduces the number of timestamps but also significantly reduces the overall complexity of the system. Experimental results show that when dealing with a database containing 12,000 data items, the performance of ELSA is 17 times higher than that of existing schemes, and the memory consumption is reduced by 101 times. In addition, the storage space for timestamps and commitments is also greatly reduced, from 1.75 GB to 17.27 MB. These improvements make ELSA a practical solution for storing and protecting large and complex databases such as medical records or government document databases.