Challenges, Techniques and Directions in Building XSeek: an XML Search Engine

Ziyang Liu, Peng Sun, Yu Huang, Yichuan Cai, Yi Chen
2009-01-01
Abstract: Yahoo! is building a set of scalable, highly available data storage and processing services and deploying them in a cloud model to make application development and ongoing maintenance significantly easier. In this paper we discuss the vision and requirements, as well as the components that will go into the cloud. We highlight the challenges and research questions that arise from trying to build a comprehensive web-scale cloud infrastructure, emphasizing data storage and processing capabilities. (The Yahoo! cloud infrastructure also includes components for provisioning, virtualization, and edge content delivery, but these aspects are only briefly touched on.)