Design and implementation of a data space for the bioinformatics community

Jin Zhang, Jinlei Jiang, Rui Fang, Yongwei Wu
2011-01-01
Abstract: Applying grid technology to bioinformatics has made great progress. However, existing grid systems lack a dedicated file system, which makes it difficult to transfer data between clients and the grid and to move, share, and manage data within the grid environment. This hinders the further application of grid technology in bioinformatics. To address these issues, a new data space solution for the bioinformatics community of the China National Grid (CNGrid) was designed and implemented on the basis of Tsinghua Cloud, a cloud computing platform developed by our team. The solution uses Carrier and Corsair, two key components of Tsinghua Cloud, as the underlying file system and the storage management tool, respectively. It facilitates hierarchical data sharing and management in the grid environment and sets an example for solving similar issues. The sharing mechanism of the system is divided into three levels: each user is assigned a certain amount of private personal space, each group is assigned an appropriate amount of storage space, and a global shared space is built for popular bioinformatics resources.