MMM’21: Towards Optimal Multirate Encoding for HTTP Adaptive Streaming

The International MultiMedia Modeling Conference (MMM)

June 22-24, 2021, Prague, Czech Republic

[PDF][Slides][Video]

Hadi Amirpour (Alpen-Adria-Universität Klagenfurt), Ekrem Çetinkaya (Alpen-Adria-Universität Klagenfurt), Christian Timmerer (Alpen-Adria-Universität Klagenfurt), and Mohammad Ghanbari (School of Computer Science and Electronic Engineering, University of Essex, Colchester, UK)

Abstract: HTTP Adaptive Streaming (HAS) enables high-quality streaming of video content. In HAS, videos are divided into short intervals called segments, and each segment is encoded at various quality/bitrates to adapt to the available bandwidth. Multiple encodings of the same content impose high costs for video content providers. To reduce the time-complexity of encoding multiple representations, state-of-the-art methods typically encode the highest quality representation first and reuse the information gathered during its encoding to accelerate the encoding of the remaining representations. As encoding the highest quality representation requires the highest time-complexity compared to the lower quality representations, it would be a bottleneck in parallel encoding scenarios and the overall time-complexity will be limited to the time-complexity of the highest quality representation. In this paper and to address this problem, we consider all representations from the highest to the lowest quality representation as a potential, single reference to accelerate the encoding of the other, dependent representations. We formulate a set of encoding modes and assess their performance in terms of BD-Rate and time-complexity, using both VMAF and PSNR as objective metrics. Experimental results show that encoding a middle quality representation as a reference, can significantly reduce the maximum en-coding complexity and hence it is an efficient way of encoding multiple representations in parallel. Based on this fact, a fast multirate encoding method is proposed which utilizes depth and prediction mode of a middle quality representation to accelerate the encoding of the dependent representations.

Keywords: HEVC, Video Encoding, Multirate Encoding, DASH

This entry was posted in ATHENA. Bookmark the permalink.