Load Balance Of The Original Pages Storage In The Distributed Crawling System

Weizhe Zhang,Hongli Zhang,Jinfu Chen,Chuanliang Yu
2012-01-01
Abstract:Load balancing plays a key role in making full and equal usage of the machines in the cluster system, and in shortening system response time. Storage and computation should be fully considered into load balancing. In order to improve the parallelism of crawlers' writing and reduce the response time of the crawlers' writing, this paper proposes a new method on the sensitive division of the channel date storage-compute-sensitive partition. Experiments have proven that this method is the optimal load partition strategy.
What problem does this paper attempt to address?