Processing Technology of Massive Human Health Data Based on Hadoop

Miao Liu,Junsheng Yu,Zhijiao Chen,Jinglin Guo,Jun Zhao
DOI: https://doi.org/10.2991/mmebc-16.2016.284
2016-01-01
Abstract:With the development of science and medical industry, people pay more and more attention to their health status. And massive human health data are generated in this process. As an important component of the cloud computing technology, the open source framework Hadoop provides us with a platform for storing and processing massive data. For the bottleneck of the existing Hadoop framework to deal with the small files in human health data, this paper proposes two optimization strategy: index optimization and metadata prefetching. At the end of the paper, the simulation results show that the method has excellent performance.
What problem does this paper attempt to address?