Improving Downloading Performance in Hadoop Distributed File System

曹宁,吴中海,刘宏志,张齐勋
DOI: https://doi.org/10.3724/sp.j.1087.2010.02060
2010-01-01
Journal of Computer Applications
Abstract: To address problems such as low download efficiency and imbalanced DataNode load in the Hadoop Distributed File System (HDFS), this paper proposes two optimization methods. The first improves the overall process of downloading a file; the second optimizes the download of a single block with a parallel download algorithm that dynamically allocates load according to speed. Mathematical analysis and experiments show that both methods improve efficiency. Moreover, combining the two methods makes downloading more efficient and more stable, while the load on DataNodes is balanced to some extent.
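The abstract does not spell out the block-level algorithm, so the following is only a minimal sketch of one plausible reading of "allocating load by speed": a block replicated on several DataNodes is split into contiguous byte ranges, one per replica, sized in proportion to each replica's measured download speed. The function name `allocate_ranges` and the speed values are hypothetical and not taken from the paper.

```python
def allocate_ranges(block_size, speeds):
    """Split [0, block_size) into contiguous (offset, length) byte ranges,
    one per replica, sized in proportion to each replica's measured speed.

    Illustrative assumption only; the paper's actual algorithm may differ.
    """
    if block_size < 0:
        raise ValueError("block_size must be non-negative")
    if not speeds or any(s < 0 for s in speeds) or sum(speeds) == 0:
        raise ValueError("speeds must be non-negative with a positive sum")

    total = sum(speeds)
    ranges = []
    offset = 0
    cumulative = 0
    for i, speed in enumerate(speeds):
        cumulative += speed
        if i == len(speeds) - 1:
            # The last replica takes whatever is left, so every byte is covered.
            end = block_size
        else:
            # Clamp to guard against floating-point rounding overshoot.
            end = min(block_size, int(block_size * cumulative / total))
        ranges.append((offset, end - offset))
        offset = end
    return ranges


if __name__ == "__main__":
    # Hypothetical 64 MB block and per-replica speeds (arbitrary units).
    for node, (start, length) in enumerate(allocate_ranges(64 * 1024 * 1024, [10, 30, 40])):
        print(f"replica {node}: offset={start} length={length}")
```

To approximate the dynamic behaviour the abstract mentions, a client could call such a function again on the bytes still outstanding whenever it re-measures replica speeds, so that a DataNode that slows down is handed a smaller share of the remaining work.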