Progressive Dictionary Learning with Hierarchical Structure for Scalable Video Coding

Xin Tang,Wenrui Dai,Hongkai Xiong
DOI: https://doi.org/10.1109/dcc.2015.28
IF: 10.6
2017-01-01
IEEE Transactions on Image Processing
Abstract:To enable learning-based video coding for transmission over heterogenous networks, this paper proposes a scalable video coding framework by progressive dictionary learning. With the hierarchical B-picture prediction structure, the inter-predicted frames would be reconstructed in terms of the spatio-temporal dictionary in a successive sense. Within the progressive dictionary learning, the training set is enriched with the samples from the reconstructed frames in the coarse layer. Through minimizing the expected cost, the stochastic gradient descent is leveraged to update the dictionary for practical coding. It is demonstrated that the learning-based scalable framework can effectively guarantee the consistency of motion trajectory with the well-designed spatio-temporal dictionary.
What problem does this paper attempt to address?