Matrix Multiplication on Two Interconnected Processors

Brett A. Becker,Alexey Lastovetsky
DOI: https://doi.org/10.1109/clustr.2006.311901
2006-01-01
Abstract:This paper presents a new partitioning algorithm to perform matrix multiplication on two interconnected heterogeneous processors. Data is partitioned in a way which minimizes the total volume of communication between the processors compared to more general partitionings, resulting in a lower total execution time whenever the power ratio between the processors is greater than 3:1. The algorithm has interesting and important applicability, particularly as the top-level partitioning in a hierarchal algorithm that is to perform matrix multiplication on two interconnected clusters of computers
What problem does this paper attempt to address?