Low bit-rate SNR scalable video coding based on overcomplete dictionary learning and sparse representation

In the current decade, scalability has been developed in video coding (VC) schemes to reply end-user demands and heterogeneity of networks. In this paper, a low bit-rate signal-to-noise ratio (SNR) scalable VC based on dictionary learning (DL) and sparse representation is proposed. A notable feature...

Full description

Saved in:
Bibliographic Details
Published inMultidimensional systems and signal processing Vol. 31; no. 2; pp. 465 - 489
Main Authors Irannejad, Maziar, Mahdavi-Nasab, Homayoun
Format Journal Article
LanguageEnglish
Published New York Springer US 01.04.2020
Springer Nature B.V
Subjects
Online AccessGet full text
ISSN0923-6082
1573-0824
DOI10.1007/s11045-019-00671-6

Cover

More Information
Summary:In the current decade, scalability has been developed in video coding (VC) schemes to reply end-user demands and heterogeneity of networks. In this paper, a low bit-rate signal-to-noise ratio (SNR) scalable VC based on dictionary learning (DL) and sparse representation is proposed. A notable feature of SNR scalability compared to spatial and temporal versions is that there are not any limitations in the number of enhancement layers, making it more applicable to adapt to different conditions. In this research, unlike traditional VC in which the discrete cosine transform (DCT) coefficients of video signals are quantized to obtain different SNR qualities, sparse codes are applied. Sparse coding is done over trained overcomplete dictionaries, for which three different DL algorithms, namely MOD, K-SVD, and RLS-DLA, are utilized and compared. The dictionaries are trained over the DCT domain of general natural images, to achieve higher compression and prevent blocking artifacts. The results of the proposed method are compared with non-scalable coding based on DL, and scalable and non-scalable coding schemes based on complete DCT dictionary employed in traditional VC standards such as MPEG.X and H.26X. The results show that, although video scalability naturally decreases the quality compared to non-scalable coding, the proposed scheme presents superior subjective and rate–distortion performance compared to non-scalable and scalable VC based on the traditional DCT quantization. Moreover, among the three DL methods applied, RLS-DLA achieves superior results both for non-scalable and scalable VC.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:0923-6082
1573-0824
DOI:10.1007/s11045-019-00671-6