Replication strategy for spatial data based on access correlation

Shaoming Pan,Yanwen Chong,Hong Li,Xicheng Tan
DOI: https://doi.org/10.13245/j.hust.170501
2017-01-01
Abstract:Due to the limited high-speed caching space and the massive dataset, replication strategy based solely on data′s popularities cannot work when users′ access behaviors change suddenly, a comprehensive replication strategy used both data′s popularities and data′ relationships to select replicas was proposed.First, hotspot data were selected based on their popularities to reduce the size of data and then reduce computational overhead.Then, access correlations were computed based on their relationships.Finally, some data could be selected as replicas and stored into high-speed caching system according to their access correlations with the data being requested, so as to prepare the next data for users in advance and to reduce average request response time.Experimental results show that the proposed comprehensive replication strategy can achieve a lower average request response time than some other algorithms by about 5.9% ~ 29.9%.
What problem does this paper attempt to address?