Migrating a Digital Library to a Private Cloud

Jian Wu,Pradeep B. Teregowda,Kyle Williams,Madian Khabsa,Douglas Jordan,Eric Treece,Zhaohui Wu,C. Lee Giles
DOI: https://doi.org/10.1109/IC2E.2014.77
2014-01-01
Abstract:A private cloud deployment of an infrastructure as a service (IaaS) clusteris a cost effective solution to many small and intermediate digital libraries and maybe companies. As a working online digital library search engine, the physical infrastructure of CiteSeerX represents many of the clusters for a typical digital library in terms of size and functionalities. CiteSeerX used to run on a cluster consisting of eighteen loosely coupled physical machines. In this work we share the experiences and lessons learned through migrating CiteSeerX into a private cloud environment using virtualization technique. We also discuss alternative solutions including a public cloud deployment using Amazon EC2 and EBS services. We found that the private cloud via virtualization is a better model for a digital library system like CiteSeerX. We also report system status, activities and proposed variations after the new system has been running for over half a year.
What problem does this paper attempt to address?