Dynamic Load Balance Algorithm for Big-data Distributed Storage
ZHANG Li-zong,CUI Yuan,LUO Guang-chun,CHEN Ai-guo,LU Guo-ming,WANG Xiao-xue
DOI: https://doi.org/10.11896/j.issn.1002-137X.2017.05.032
2017-01-01
Computer Science
Abstract:Distributed storage is the major approach for handling the “Big Data”.Currently,the major technology is hadoop distributed file system (HDFS),which has been beset by the issues of scalability and write latency.In official 2.0 version,a new feature‘ HDFS Federation’ addresses this limitation by adding support for multiple NarneNodes/name spaces to HDFS.However,it does not take the isomerism of NameNode into account,and still lacks of dynamic load balance ability.Consequently,a dynamic load balance algorithm for HDFS NameNode was proposed,and it dynamically allocated the metadata into a NameNodes cluster with multiple copies,in order to improve the performance of metadata utilizations.In addition,the proposed algorithm increases the readability by the adoption of metadata caches,and improves the stability by a built-in failover mechanism.Finally,an experiment was carried out,to illustrate and evaluate the utilizations of the proposed algorithm.