A New Hashing Function: Statistical Behaviour and Algorithm

Zhiyu Tian, Shibai Tong, Shiyuan Yang
DOI: https://doi.org/10.1145/174263.174264
1993-01-01
ACM SIGIR Forum
Abstract: Existing hashing functions have various limitations. In this paper, a new hashing function is proposed that divides the range of the key-values into a number of equal segments and maps the key-values in each segment linearly onto the whole address range. The paper analyzes the statistical behavior of the function and shows that, in theory, the distribution of the resulting hash values can always be made to approach uniform by increasing the number of segments, provided the key-values can be regarded as continuous. Two methods for determining the number of segments, one deterministic and one probabilistic, are also proposed, along with the algorithm.
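The segment-and-map idea in the abstract can be sketched as follows. This is one plausible reading of the description, not the paper's own formulation: the function name `segment_hash`, its parameters, the half-open key range, and the use of floor rounding are all assumptions made for illustration. The abstract does not describe how the number of segments is chosen, so it is simply passed in here.

```python
def segment_hash(key, key_min, key_max, num_segments, table_size):
    """Hash `key` by splitting [key_min, key_max) into equal segments and
    mapping each segment linearly onto addresses 0 .. table_size - 1.

    Hypothetical sketch: names and rounding are assumptions, not the
    paper's definitions.
    """
    if not key_min <= key < key_max:
        raise ValueError("key outside the assumed key range")
    if num_segments < 1 or table_size < 1:
        raise ValueError("num_segments and table_size must be positive")

    # Width of one segment of the key range.
    width = (key_max - key_min) / num_segments
    # Position of the key inside its own segment, in [0, width).
    offset = (key - key_min) % width
    # Stretch the segment linearly over the whole address range.
    address = int(offset * table_size / width)
    # Guard against floating-point rounding at the segment boundary.
    return min(address, table_size - 1)


# Keys 5 and 30 sit at the same relative position in segments 0 and 1
# (segment width 25), so they map to the same address.
print(segment_hash(5, 0, 100, 4, 10))
print(segment_hash(30, 0, 100, 4, 10))
```

Under this reading, every segment is overlaid onto the same address range, so the hash-value distribution is the sum of the key densities across the segments. That is consistent with the abstract's claim that more segments push the result toward uniform when the keys can be treated as continuous.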