Efficient multi-view video coding using 3D motion estimation and virtual frame

Research output: Contribution to journal › Article › peer-review

11 Citations (Scopus)

Abstract

The three-dimensional (3D), i.e., multi-view, extension of the High Efficiency Video Coding (HEVC) standard compresses multi-view video better than simulcast coding (i.e., encoding each view independently with HEVC) by exploiting both inter-view and intra-view redundancy. However, it introduces a random access frame delay (RAFD) problem and requires substantial computational time. This paper proposes three novel techniques to address these problems. First, a simulcast video coding technique is proposed in which each view is encoded individually using a hierarchical bi-predictive structure with an extra virtual reference frame generated by dynamic background modelling of the corresponding view (popularly known as McFIS, the most common frame in a scene). Second, a technique based on 3D motion estimation (3DME) is proposed, in which a 3D frame is formed from the frames of all views at the same time instant and motion estimation for the current 3D frame is carried out using dual 3D reference frames. Finally, a modification of the 3DME dual-reference-frame technique (3DME-McFIS) is proposed in which a 3D McFIS serves as the second reference frame. Experimental results confirm that the three proposed techniques reduce overall computational time and the RAFD problem while achieving comparable or better rate-distortion performance than the HEVC-3D extension. Specifically, the proposed 3DME-McFIS technique outperforms the HEVC-3D coding standard by improving PSNR by 0.90 dB on average, reducing computational time by 50%, and reducing the RAFD problem.
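
The dual-reference 3D motion estimation idea can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the 3D frame layout, block shape, cost function, or search strategy, so everything here is an assumption made for illustration. The sketch stores a 3D frame as `[view][row][col]`, shares one motion vector across all views of a block, uses sum of absolute differences (SAD) as the cost, and runs a small full search. The function names `make_3d_frame`, `sad_3d`, and `best_match` are hypothetical.

```python
# Illustrative sketch only. Layout, cost, and search are assumptions,
# not details taken from the paper.

def make_3d_frame(views_at_t):
    """Stack the same-time frames of all views into one 3D frame [view][row][col]."""
    height, width = len(views_at_t[0]), len(views_at_t[0][0])
    for view in views_at_t:
        if len(view) != height or any(len(row) != width for row in view):
            raise ValueError("all views must share one resolution")
    # Copy rows so later edits to the inputs do not alter the 3D frame.
    return [[row[:] for row in view] for view in views_at_t]


def sad_3d(cur, ref, y, x, dy, dx, size):
    """SAD of a size x size block, accumulated across every view."""
    total = 0
    for v in range(len(cur)):
        for i in range(size):
            for j in range(size):
                total += abs(cur[v][y + i][x + j] - ref[v][y + dy + i][x + dx + j])
    return total


def best_match(cur, refs, y, x, size=2, search=1):
    """Full search over each reference 3D frame.

    Returns (cost, ref_index, dy, dx) for the lowest-cost candidate;
    ties are broken by the smaller tuple.
    """
    height, width = len(cur[0]), len(cur[0][0])
    best = None
    for r, ref in enumerate(refs):
        for dy in range(-search, search + 1):
            for dx in range(-search, search + 1):
                # Skip candidates whose block would leave the frame.
                if not (0 <= y + dy and y + dy + size <= height):
                    continue
                if not (0 <= x + dx and x + dx + size <= width):
                    continue
                cand = (sad_3d(cur, ref, y, x, dy, dx, size), r, dy, dx)
                if best is None or cand < best:
                    best = cand
    return best
```

In this sketch, `refs` would hold two 3D reference frames. For the 3DME-McFIS variant, the second entry would be a 3D frame built from each view's background model in place of a second decoded frame; how that background model is generated is outside the scope of this example.
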
Original language: English
Pages (from-to): 544-554
Number of pages: 11
Journal: Neurocomputing
Volume: 175
Issue number: A
Early online date: Nov 2015
Publication status: Published - Jan 2016
