Filter-embedding Semiring Fusion for Programming with MapReduce

Kento Emoto,Sebastian Fischer,Zhenjiang Hu
DOI: https://doi.org/10.1007/s00165-012-0241-8
2012-01-01
Formal Aspects of Computing
Abstract:We show that MapReduce, the de facto standard for large scale data-intensive parallel programming, can be equipped with a programming theory in calculational form. By integrating the generate-and-test programming paradigm and semirings for aggregation of results, we propose a novel parallel programming framework for MapReduce. The framework consists of two important calculation theorems: the shortcut fusion theorem of semiring homomorphisms bridges the gap between specifications and efficient implementations, and the filter-embedding theorem helps to develop parallel programs in a systematic and incremental way.
What problem does this paper attempt to address?