Practical Byzantine Reliable Broadcast on Partially Connected Networks (Extended version)

Silvia Bonomi,Jérémie Decouchant,Giovanni Farina,Vincent Rahli,Sébastien Tixeuil
2024-02-27
Abstract:In this paper, we consider the Byzantine reliable broadcast problem on authenticated and partially connected networks. The state-of-the-art method to solve this problem consists in combining two algorithms from the literature. Handling asynchrony and faulty senders is typically done thanks to Gabriel Bracha's authenticated double-echo broadcast protocol, which assumes an asynchronous fully connected network. Danny Dolev's algorithm can then be used to provide reliable communications between processes in the global fault model, where up to f processes among N can be faulty in a communication network that is at least 2f+1-connected. Following recent works that showed that Dolev's protocol can be made more practical thanks to several optimizations, we show that the state-of-the-art methods to solve our problem can be optimized thanks to layer-specific and cross-layer optimizations. Our simulations with the Omnet++ network simulator show that these optimizations can be efficiently combined to decrease the total amount of information transmitted or the protocol's latency (e.g., respectively, -25% and -50% with a 16B payload, N=31 and f=4) compared to the state-of-the-art combination of Bracha's and Dolev's protocols.
Distributed, Parallel, and Cluster Computing,Data Structures and Algorithms,Networking and Internet Architecture
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to achieve Byzantine Reliable Broadcast (BRB) in partially - connected networks. Specifically, the paper focuses on how to deal with asynchrony and faulty senders in a partially - connected and authenticated network. Existing solutions are usually a combination of two algorithms: Gabriel Bracha's double - echo broadcast protocol (used to handle asynchrony and faulty senders) and Danny Dolev's algorithm (used to provide reliable communication between processes in the global failure model). However, these methods have performance bottlenecks in practical applications, such as a high total amount of transmitted information and high protocol latency. The main contributions of the paper are as follows: 1. It describes how to extend the existing optimizations of Dolev's RC algorithm to the combination of Bracha and Dolev algorithms. 2. It proposes 12 new modifications that can be applied to the combination of Bracha and Dolev algorithms, some of which are cross - layer, and demonstrates the practical feasibility of these modifications through experiments. 3. It evaluates the impact of each modification on protocol latency and throughput, using actual deployments under multiple settings. 4. It details how to combine these modifications according to network asynchrony, connectivity, and load size to optimize latency and/or throughput, thus helping future deployments for specific purposes. Through these contributions, the paper aims to further optimize the Byzantine reliable broadcast protocol in partially - connected networks, making it more practical and efficient.