THUIR-DB: A large-scale, highly-efficient index, fast-access key-value store

Bin Liang,Yiqun Liu,Min Zhang,Shaoping Ma,Liyun Ru,Kuo Zhang
2013-01-01
Journal of Computational Information Systems
Abstract:Rapid data growth in web-based applications poses great challenges to the design and implementation of key-value storage systems (key-value stores). In this paper, we present the design of a highly efficient single machine key-value store called THUIR-DB, which features a highly-compacted index structure and a fast querying strategy. Like Google's LevelDB, THUIR-DB also doesn't focus on distributing, based on which we can build distributing key-value data store. Experimental results based on Google's N-gram dataset show great improvement with THUIR-DB in both time and memory efficiency compared with some widely adopted open source key-value stores such as LevelDB and Tokyo Cabinet. Experiment shows that THUIR-DB costs 1.06 bit index for every record and gains 1.2 million ops/sec throughputs based on the 0.7 billion-scale dataset. We have already applied THUIR-DB to our online system Weibo Xunren(xunren.thuir.org). Copyright © 2013 Binary Information Press.
What problem does this paper attempt to address?