Highly Scalable Metadata Distribution Algorithm in Mass Storage System

Wei Wu,Chang-sheng Xie,Jian-zhong Huang,Cheng-feng Zhang
DOI: https://doi.org/10.1007/s11741-008-0509-1
2008-01-01
Abstract:Distribution of metadata in a metadata server cluster is important in mass storage system. A good distribution algorithm has a significant influence on the system performance, availability and scalability. Subtree partition and hash are two traditional metadata distribution algorithms used in distributed file systems. They both have a defect in system scalability. This paper proposes a new directory hash (DH) algorithm. By treating directory as the key value of hash function, implementing concentrated storage of metadata, pipelining operations and prefetching technology, DH algorithm can enhance the system scalability on the premise without sacrificing system performance.
What problem does this paper attempt to address?