7 Citations (Scopus)
6 Downloads (Pure)


H.264/MVC multi-view video coding provides a better compression rate compared to the simulcast coding using hierarchical B-picture prediction structure exploiting inter- and intra-view redundancy. However, this technique imposes random access frame delay as well as requiring huge computational time. In this paper a novel technique is proposed using 3D motion estimation (3D-ME) to overcome the problems. In the 3D-ME technique, a 3D frame is formed using the same temporal frames of all views and ME is carried out for the current 3D frame using the immediate previous 3D frame as a reference frame. As the correlation among the intra-view images is higher compared to the correlation among the inter-view images, the proposed 3D-ME technique reduces the overall computational time and eliminates the frame delay with comparable rate-distortion (RD) performance compared to H.264/MVC. Another technique is also proposed in the paper where an extra reference 3D frame comprising dynamic background frames (the most common frame of a scene i.e., McFIS) of each view is used for 3D-ME. Experimental results reveal that the proposed 3D-ME-McFIS technique outperforms the H.264/MVC in terms of improved RD performance by reducing computational time and by eliminating the random access frame delay.
Original languageEnglish
Title of host publication2012 IEEE international conference on acoustics, speech, and signal processing: Proceedings
EditorsHideaki Sakai
Place of PublicationUSA
PublisherIEEE, Institute of Electrical and Electronics Engineers
Number of pages4
ISBN (Electronic)9781467300469
ISBN (Print)9781467300452
Publication statusPublished - 2012
Event2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) - Kyoto International Conference Center (KICC), Kyoto, Japan
Duration: 25 Mar 201230 Mar 2012


Conference2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
OtherThe latest research results on both theories and applications on signal processing will be presented and discussed among participants from all over the world. Video/Speech Signal processing used in human interface between Robots and Personal users will be highlighted.


Dive into the research topics of '3D Motion estimation for 3D video coding'. Together they form a unique fingerprint.

Cite this