Statistical early termination model for fast mode decision and reference frame selection in multiview video coding

Yun ZHANG, Sam KWONG, Gangyi JIANG, Xu WANG, Mei YU

Research output: Journal Publications, Journal Article (refereed), peer-review

31 Citations (Scopus)

Abstract

Multiview Video Coding (MVC) adopts exhaustive variable-size mode decision and multiple reference frame selection to significantly improve compression efficiency at each macroblock. However, these two technologies increase the computational complexity of MVC encoders tremendously. In this paper, we propose an efficient Statistical DIRECT Mode Early Termination (SDMET) model, which estimates the rate-distortion degradation, false acceptance rate, and false rejection rate of early DIRECT mode decision. The model adaptively adjusts the rate-distortion cost threshold according to not only the quantization parameter but also the video content and motion properties. Experimental results show that SDMET reduces computational complexity by 42.40% to 65.60% for fast mode decision. When it is jointly optimized with fast multi-reference frame selection, the proposed overall algorithm achieves a 79.57% to 89.21% reduction in computational complexity with unnoticeable rate-distortion degradation. Additionally, the proposed SDMET and the overall fast mode decision algorithm can be applied to both temporal views and inter-view views in MVC. © 2011 IEEE.
Original language: English
Pages (from-to): 45200
Journal: IEEE Transactions on Broadcasting
Volume: 58
Issue number: 1
Publication status: Published - Mar 2012
Externally published: Yes

Bibliographical note

This work was supported in part by Hong Kong RGC General Research Fund (GRF) Projects 9041495 (CityU 115109) and in part by the National Natural Science Foundation of China under Grants 61071120, 60872094, and 61102088.

Keywords

  • Digital video broadcasting
  • early termination
  • mode decision
  • multiview video
  • video coding
