CinHBa: A Secondary Index with Hotscore Caching Policy on Key-Value Data Store.

Wei Ge,Yihua Huang,Di Zhao,Shengmei Luo,Chunfeng Yuan,Wenhui Zhou,Yun Tang,Juan Zhou
DOI: https://doi.org/10.1007/978-3-319-14717-8_47
2014-01-01
Abstract:We are now entering the era of big data. HBase comes out to organize data as key-value pairs and support fast queries on rowkeys, but queries on non-rowkey column are a blind spot of HBase. It is the main topic of this paper to provide high-performance query capability on non-rowkey column. An effective secondary index model is proposed, and the prototype system CinHBa is implemented. Furthermore, a novel caching policy, Hotscore Algorithm, is introduced in CinHBa to cache hottest index data into memory to improve query performance. Experiment evaluation shows that query response time of CinHBa is far less than native HBase without secondary index on 10M records. Besides that, CinHBa has good data scalability.
What problem does this paper attempt to address?