TY - JOUR
T1 - Efficient multiview video coding using 3-D coding and saliency-based bit allocation
AU - Paul, Manoranjan
N1 - Includes bibliographical references.
PY - 2018/6
Y1 - 2018/6
N2 - Capturing a scene using multiple cameras from different angles is expected to provide the necessary interactivity in the 3-D space to satisfy end-users' demands for observing objects and actions from different angles and depths. Existing multiview video coding (MVC) technologies face tradeoff among rate-distortion performance, random access frame delay, i.e., interactivity, and computational time. To address above mentioned tradeoffs, a novel cuboid MVC strategy is proposed with 3-D frame referencing structure to improve interactivity and computational time, an additional reference frame to improve rate-distortion performance for occluded areas, and visual attention-based bit allocation to provide better perceptual video quality. The experimental results reveal that the proposed scheme provides better interactivity, reduced computational time, and better perceptual quality compared to the 3D-HEVC implementation, HTM 15.0.
AB - Capturing a scene using multiple cameras from different angles is expected to provide the necessary interactivity in the 3-D space to satisfy end-users' demands for observing objects and actions from different angles and depths. Existing multiview video coding (MVC) technologies face tradeoff among rate-distortion performance, random access frame delay, i.e., interactivity, and computational time. To address above mentioned tradeoffs, a novel cuboid MVC strategy is proposed with 3-D frame referencing structure to improve interactivity and computational time, an additional reference frame to improve rate-distortion performance for occluded areas, and visual attention-based bit allocation to provide better perceptual video quality. The experimental results reveal that the proposed scheme provides better interactivity, reduced computational time, and better perceptual quality compared to the 3D-HEVC implementation, HTM 15.0.
KW - 3D DCT
KW - 3D motion estimation
KW - Bit rate
KW - Cameras
KW - Correlation
KW - Encoding
KW - Three-dimensional displays
KW - Uncovered background
KW - variable bit allocation.
KW - Video coding
KW - Video sequences
KW - visual attention modeling
UR - http://www.scopus.com/inward/record.url?scp=85040047680&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85040047680&partnerID=8YFLogxK
U2 - 10.1109/TBC.2017.2781118
DO - 10.1109/TBC.2017.2781118
M3 - Article
AN - SCOPUS:85040047680
SN - 0018-9316
VL - 64
SP - 235
EP - 246
JO - IEEE Transactions on Broadcasting
JF - IEEE Transactions on Broadcasting
IS - 2
ER -