A Packaging Approach for Massive Amounts of Small Geospatial Files with HDFS.

Jifeng Cui,Yong Zhang,Chao Li,Chunxiao Xing
DOI: https://doi.org/10.1007/978-3-642-32281-5_20
2012-01-01
Abstract:The efficiency of dealing with massive small geospatial files deeply affects the performance of Web Geography Information System (WebGIS). The Hadoop Distributed File System (HDFS) is scalable to satisfy the requirement of massive data files storage, but not efficient in dealing with small files. In this paper, we proposed a method to pack a group of small files into one large logical file, and set up Hilbert spatial index inside the block with their spatial adjacency relation. The experimentation proved that this method reduces the size of block indices and increases the speed to search and retrieve the massive small spatial files. © 2012 Springer-Verlag.
What problem does this paper attempt to address?