On the communication cost of distributed database processing

Luk, W.S.,Xiao Wang,Franky Ling
DOI: https://doi.org/10.1109/DCS.1988.12556
1988-01-01
Abstract:Various communication aspects of locally distributed database processing are studied, using some distributed sorting and distributed hash-based join algorithms as examples. The algorithms are implemented on diskless workstations connected by an Ethernet network to simulate a distributed main memory system environment. This experimental testbed is described. Raw communication performance data (i.e. memory-to-memory data transfer timing) are presented. The effects of the underlying distributed operating system and the speed of the processor on the communication performance are shown. Two distributed sorting algorithms are used as examples to study the issue of concurrent transmissions of messages. Distributed hash join is used as a case study for communication/local-processing tradeoff. The idea of load sharing among a number of sites to speed up the join operation is introduced.<>
What problem does this paper attempt to address?