High-Throughput and Memory-Efficient Parallel Viterbi Decoder for Convolutional Codes on GPU

Alireza Mohammadidoost, Matin Hashemi
DOI: https://doi.org/10.48550/arXiv.2011.09337
2020-11-18
Distributed, Parallel, and Cluster Computing
Abstract: This paper describes a parallel implementation of the Viterbi decoding algorithm. The Viterbi decoder is widely used in many state-of-the-art wireless systems. The proposed solution improves both throughput and memory usage through optimizations such as a unified kernel implementation and parallel traceback. Experimental evaluations show that the proposed solution achieves higher throughput than previous GPU-accelerated solutions.
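For readers unfamiliar with the algorithm, the following is a minimal sequential sketch of hard-decision Viterbi decoding, showing the two phases the abstract refers to: the forward add-compare-select pass and the traceback. It is not the paper's GPU implementation. The rate-1/2, constraint-length-3 code with generators (7, 5) in octal is an assumption chosen for brevity, and the names `encode` and `viterbi_decode` are hypothetical.

```python
# Sequential reference sketch of hard-decision Viterbi decoding.
# Assumption: rate-1/2, constraint length K=3, generators (7, 5) octal.
# This is NOT the paper's GPU kernel; it only illustrates the algorithm.

K = 3
GENERATORS = (0b111, 0b101)
NUM_STATES = 1 << (K - 1)
INF = float("inf")


def parity(x):
    """Return the XOR of all bits of x."""
    return bin(x).count("1") & 1


def encode(bits):
    """Convolutionally encode a list of 0/1 bits, starting from state 0."""
    state = 0
    out = []
    for b in bits:
        reg = (b << (K - 1)) | state  # newest bit sits in the MSB
        out.extend(parity(reg & g) for g in GENERATORS)
        state = reg >> 1
    return out


def viterbi_decode(received, num_bits):
    """Decode 2*num_bits received hard bits back into num_bits message bits."""
    metrics = [0] + [INF] * (NUM_STATES - 1)  # encoder starts in state 0
    survivors = []  # survivors[t][next_state] = (prev_state, input_bit)

    # Forward pass: add-compare-select over every trellis stage.
    for t in range(num_bits):
        r = received[2 * t: 2 * t + 2]
        new_metrics = [INF] * NUM_STATES
        decisions = [None] * NUM_STATES
        for state in range(NUM_STATES):
            if metrics[state] == INF:
                continue  # state not reachable yet
            for b in (0, 1):
                reg = (b << (K - 1)) | state
                expected = [parity(reg & g) for g in GENERATORS]
                branch = sum(e != x for e, x in zip(expected, r))  # Hamming
                nxt = reg >> 1
                cand = metrics[state] + branch
                if cand < new_metrics[nxt]:
                    new_metrics[nxt] = cand
                    decisions[nxt] = (state, b)
        metrics = new_metrics
        survivors.append(decisions)

    # Traceback: follow survivor decisions backwards from the best end state.
    state = min(range(NUM_STATES), key=metrics.__getitem__)
    decoded = []
    for decisions in reversed(survivors):
        state, b = decisions[state]
        decoded.append(b)
    decoded.reverse()
    return decoded


message = [1, 0, 1, 1, 0, 0, 1, 0, 0, 0]
coded = encode(message)
noisy = list(coded)
noisy[3] ^= 1  # inject a single bit error
recovered = viterbi_decode(noisy, len(message))
```

In this sketch the traceback is a strictly serial walk over the stored survivor decisions, and the survivor table grows with the message length. These are the throughput and memory costs that the abstract says the proposed parallel traceback and unified kernel target.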