Rate distortion optimization over large scale video corpus with machine learning

We present an efficient codec-agnostic method for bitrate allocation over a large scale video corpus with the goal of minimizing the average bitrate subject to constraints on average and minimum quality. Our method clusters the videos in the corpus such that videos within one cluster have similar ra...

Full description

Saved in:

Bibliographic Details
Main Authors	John, Sam, Gadde, Akshay, Adsumilli, Balu
Format	Journal Article
Language	English
Published	27.08.2020
Subjects	Computer Science - Information Theory Computer Science - Learning Computer Science - Multimedia Mathematics - Information Theory
Online Access	Get full text
DOI	10.48550/arxiv.2008.12408

Cover

More Information
Summary:	We present an efficient codec-agnostic method for bitrate allocation over a large scale video corpus with the goal of minimizing the average bitrate subject to constraints on average and minimum quality. Our method clusters the videos in the corpus such that videos within one cluster have similar rate-distortion (R-D) characteristics. We train a support vector machine classifier to predict the R-D cluster of a video using simple video complexity features that are computationally easy to obtain. The model allows us to classify a large sample of the corpus in order to estimate the distribution of the number of videos in each of the clusters. We use this distribution to find the optimal encoder operating point for each R-D cluster. Experiments with AV1 encoder show that our method can achieve the same average quality over the corpus with$22\%$less average bitrate.
DOI:	10.48550/arxiv.2008.12408