Machine Learning Predictors for Min-Entropy Estimation

Javier Blanco-Romero,Vicente Lorenzo,Florina Almenares Mendoza,Daniel Díaz-Sánchez
2024-06-28
Abstract:This study investigates the application of machine learning predictors for min-entropy estimation in Random Number Generators (RNGs), a key component in cryptographic applications where accurate entropy assessment is essential for cybersecurity. Our research indicates that these predictors, and indeed any predictor that leverages sequence correlations, primarily estimate average min-entropy, a metric not extensively studied in this context. We explore the relationship between average min-entropy and the traditional min-entropy, focusing on their dependence on the number of target bits being predicted. Utilizing data from Generalized Binary Autoregressive Models, a subset of Markov processes, we demonstrate that machine learning models (including a hybrid of convolutional and recurrent Long Short-Term Memory layers and the transformer-based GPT-2 model) outperform traditional NIST SP 800-90B predictors in certain scenarios. Our findings underscore the importance of considering the number of target bits in min-entropy assessment for RNGs and highlight the potential of machine learning approaches in enhancing entropy estimation techniques for improved cryptographic security.
Machine Learning,Cryptography and Security,Information Theory
What problem does this paper attempt to address?
The paper attempts to address the issue of using machine learning predictors in random number generators (RNG) to estimate min-entropy. Specifically, the researchers found that these predictors mainly estimate average min-entropy rather than min-entropy in the traditional sense. The paper explores the relationship between average min-entropy and traditional min-entropy, with a particular focus on the impact of the number of target bits on this relationship. By using Generalized Binary Autoregressive Models, the study shows that in certain cases, machine learning models (including hybrid models with convolutional and recurrent long short-term memory layers as well as transformer-based GPT-2 models) outperform traditional predictors recommended by NIST SP 800-90B. This finding highlights the importance of considering the number of target bits when using machine learning predictors to evaluate the min-entropy of RNGs.