An Effective Directory Index Framework Taking Advantages of Hash Table and B+-Tree

LIU Xianzhuo, WANG Jinlin, ZHU Ming, DENG Feng, SUN Peng
DOI: https://doi.org/10.7652/xjtuxb201304018
2013-01-01
Abstract: A new directory index framework that combines the advantages of hash functions and the B+-tree is proposed for the storage systems of Web servers. It addresses the significant delays that existing file-system directory indexing structures incur when processing large numbers of files or directories, especially when a single directory holds hundreds of thousands of files. The framework uses hash functions to normalize variable-length file names to integers, so that file names can be found through a hash table in O(1) complexity. In addition, the inode numbers of the files in a directory are indexed by a high-fanout B+-tree to support fast insertion and removal. A benchmark of a simple implementation shows that the proposed framework can easily handle up to half a million files in a single directory and significantly improves performance: searching entries is at least 40% faster than Ext3, and creating or deleting files is at least 73% faster than Ext3 or Reiserfs.
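The two-part structure described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and every name in it is hypothetical: `DirectoryIndex`, its methods, and the choice of CRC32 as the name hash are assumptions made for illustration. A sorted list maintained with `bisect` stands in for the B+-tree, so it preserves the ordered inode index but not the logarithmic insertion cost or high fanout of a real B+-tree.

```python
import bisect
import zlib


class DirectoryIndex:
    """Toy directory index: hashed names -> inode, plus an ordered inode index."""

    def __init__(self):
        # Hash table: integer hash of a name -> list of (name, inode) pairs.
        # The list resolves collisions between different names.
        self._by_hash = {}
        # Sorted list of inode numbers; a stand-in for the B+-tree (assumption).
        self._inodes = []

    @staticmethod
    def _name_hash(name):
        # Normalize a variable-length name to an integer (CRC32 is an assumption).
        return zlib.crc32(name.encode("utf-8"))

    def create(self, name, inode):
        bucket = self._by_hash.setdefault(self._name_hash(name), [])
        if any(n == name for n, _ in bucket):
            raise FileExistsError(name)
        bucket.append((name, inode))
        bisect.insort(self._inodes, inode)  # keep the inode index ordered

    def lookup(self, name):
        # Expected O(1): one hash computation plus a short bucket scan.
        for n, inode in self._by_hash.get(self._name_hash(name), []):
            if n == name:
                return inode
        return None

    def delete(self, name):
        h = self._name_hash(name)
        bucket = self._by_hash.get(h, [])
        for i, (n, inode) in enumerate(bucket):
            if n == name:
                del bucket[i]
                if not bucket:
                    del self._by_hash[h]
                # Remove the inode from the ordered index as well.
                del self._inodes[bisect.bisect_left(self._inodes, inode)]
                return inode
        raise FileNotFoundError(name)

    def list_inodes(self):
        # Inode numbers in ascending order, as a B+-tree leaf scan would yield.
        return list(self._inodes)
```

In this sketch, `lookup` touches only the hash table, while `create` and `delete` update both structures, which mirrors the division of work the abstract describes.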