A collaborative analysis framework for distributed gridded environmental data

Hao Xu,Sha Li,Yuqi Bai,Wenhao Dong,Wenyu Huang,Shiming Xu,Yanluan Lin,Bin Wang,Fanghua Wu,Xiaoge Xin,Li Zhang,Zaizhi Wang,Tongwen Wu
DOI: https://doi.org/10.1016/j.envsoft.2018.09.007
IF: 5.471
2019-01-01
Environmental Modelling & Software
Abstract:As the amount of environmental data expands exponentially worldwide, researchers are challenged to efficiently analyze data maintained in multiple data centers. Because distributed data access, server-side analysis, multinode collaboration, and extensible analytic functions are still research gaps in this field, this paper introduces a collaborative analysis framework for gridded environmental data, i.e. CAFE. Multiple CAFE nodes can collaborate to perform complex data analysis. Analytic functions are performed near where data are stored. A web-based user interface allows researchers to search for data of interest, submit analytic tasks, check the status of tasks, visualize the analysis results, and download the resulting data files. CAFE facilitates overall research efficiency by dramatically lowering the amount of data that must be transmitted from data centers to researchers for analysis. The results of this study may lead to the further development of collaborative computing paradigm for environmental data analysis.
What problem does this paper attempt to address?