Lossy Bulk Synchronous Parallel Processing Model for Very Large Scale Grids

Elankovan Sundararajan,Aaron Harwood,Kotagiri Ramamohanarao
IF: 4.755
2006-01-01
Clinical Orthopaedics and Related Research
Abstract:The performance of a parallel algorithm in a very large scale grid is significantly influenced by the underlyin g Internet protocols and inter-connectivity. Many grid program- ming platforms use TCP due to its reliability, usually with some optimizations to reduce its costs. However, TCP does not perform well in a high bandwidth and high delay network environment. On the other hand, UDP is the fastest protocol avail- able because it omits connection setup process, acknowledgments and retransmissions sacrificing reliable transfer. Many ne w bulk data transfer schemes using UDP for data transmission such as RBUDP, Tsunami, and SABUL have been introduced and shown to have better performance compared to TCP. In this paper, we consider the use of UDP and examine the relationship between packet loss and speedup with respect to the number of grid nodes. Our measurement suggests that packet loss rates between 5%- 15% on average are not uncommon between PlanetLab nodes that are widely distributed over the Internet. We show that transmitting multiple copies of same packet produces higher speedup. We show the minimum number of packet duplication required to maximize the possible speedup for a given number of nodes using a BSP based model. Our work demonstrates that by using an appropriate number of packet copies, we can increase performance of parallel program.
What problem does this paper attempt to address?