EKENet: Efficient knowledge enhanced network for real-time scene parsing

Ao Luo,Fan Yang,Xin Li,Rui Huang,Hong Cheng
DOI: https://doi.org/10.1016/j.patcog.2020.107671
IF: 8
2021-03-01
Pattern Recognition
Abstract:<p>Scene parsing is essential for many high-level AI applications, such as intelligent vehicles and traffic surveillance. In this work, we propose a highly efficient and powerful deep convolutional neural network, namely Efficient Knowledge Enhanced Network (EKENet), for parsing scenes in real-time. Unlike most existing approaches that compromise efficiency for the sake of high accuracy, EKENet achieves an ideal trade-off between the two. Our EKENet is built upon a novel building block, namely Efficient Dual ion (EDA) block, which employs an efficiently parallel convolution structure for extracting spatial features and modeling cross-channel correlations in a dual fashion. Additionally, a novel <em>light-weight</em> Encoding-Enhancing (EE) module is designed to enhance our EKENet, which can efficiently encode high-level knowledge extracted from top layers to guide the learning of low-level features from bottom layers.</p><p>Extensive experiments on challenging benchmarks, Cityscapes and CamVid datasets, demonstrate that EKENet achieves the new state-of-the-art performance in terms of speed and accuracy tradeoff.</p>
computer science, artificial intelligence,engineering, electrical & electronic
What problem does this paper attempt to address?