A Client-Side Directory Prefetching Mechanism for GlusterFS

Zijian Liu,Fang Dong,Junxue Zhang,Pengcheng Zhou,Zhuqing Xu,Junzhou Luo
DOI: https://doi.org/10.1109/smc.2016.7844850
2016-01-01
Abstract:Distributed file system has the characteristics of large capacity, good scalability and high reliability, which make it widely used in many areas involving large-scale data storage. It offers simplified, highly-available services for users to access data. However, due to the non-metadata design, the performance of traversal operation on large directories in those non-metadata distributed file systems is poor. With the increasing amount of files, it severely affects the user experience. In this paper, we present a directory prefetching mechanism on the client side to reduce directory traversal operation latency in non-metadata distributed file system. The mechanism, combined with the client's cache, adopts the directory access history to predict future access pattern and fetches the content of the directory without user intervention. Our goal is to reduce the overall access latency in the non-metadata distributed file system in order to better satisfy the user experience.
What problem does this paper attempt to address?