Optimization Strategy for Organization and Distribution of Documents in Web Server Cluster

Pu Yan
2006-01-01
Computer Science
Abstract:Web server cluster has been widely used to increase the performance of Web servers. For a large website, how to organize and distribute the Web documents is imperative. This paper proposes a strategy to organize and distribute Web documents in Web server cluster, whose aim is to reduce the average response time of cluster system. Through the analysis of Web servers’ logs, discover the client’s access patterns, then group the webpages closely related into webpage clusters, then calculate every webpage cluster’s copy number according to their load, finally distribute the copies of webpage clusters among servers. Such strategy can reduce the overhead of TCP connection migration, and achieve load balancing among servers, thereby reducing the average response time of cluster system. In addition, compared with the method of mirror image, our strategy saves storage space, and reduces the overhead of maintaining the coherence of the documents.
What problem does this paper attempt to address?