In high efficiency video coding (HEVC) scheme, the split structure of the largest coding-unit (LCU) is highly variable and laborious due to its recursive quad-tree representation. Thus, the computational complexity of the HEVC encoder increases dramatically. A fast HEVC encoding method is proposed to predict the split structure of the current LCU by using the same structure of the co-located LCU in the reference or temporally-previous frame, in order to reduce the encoding computational complexity. Simulation results show that the proposed method can achieve a reduction in encoding time of 21.3% on average, with a negligible average PSNR loss and a negligible bit rate increase compared with the original HEVC encoding scheme.
KEYWORDS: Video, 3D image processing, 3D displays, Image processing, Video processing, Video compression, Optical engineering, Data modeling, Image compression, Video coding
We present the design and implementation of an 8-view three-dimensional (3-D) video player system based on a multi-core processor environment, which operates faster than existing video player systems. We propose a structure for obtaining near-optimum speed by parallelizing the component modules to process large volumes of 8-view image data at a high speed. In order to exploit task parallelism in the video player, we have designed a multiple-video decoder, a view synthesizer that creates the 3-D image from the multi-view image data, and a 3-D renderer for an 8-view display in a pipeline structure. For load balancing and exploiting data parallelism in the modules, the decoder is divided into units of viewpoint, and the view synthesizer is geometrically divided based on the synthesized images. As a result of this experiment, 8-view images are correctly synthesized, and the sense of three dimensions can be observed when watching the images on the 8-view auto-stereoscopic display. The proposed structure could be used to process large volumes of multi-view image data at high speeds by utilizing the maximum capacity of the multi-core processors.
KEYWORDS: Image compression, Computer programming, Video coding, Video, Binary data, Optical engineering, Cameras, 3D video compression, 3D image processing, Video compression
This work proposes an efficient bit-plane-based lossless depth-map coding method for an MPEG 3-D video coding scheme. This method uses the distinctive image characteristics between bit planes of the depth map. In the simulations, the performances of the proposed coding method are compared with those of the conventional lossless coding methods, i.e., JPEG-LS, JPEG-2000, and H.264/AVC, in terms of the intra- and also intercoding modes. In intracoding mode, the proposed method achieves the highest compression ratios as 14.28:1 on average. JPEG-LS, JPEG2000, H.264/AVC (CAVLC), and H.264/AVC (CABAC) obtain the compression ratios as 9.74:1, 7.68:1, 9.13:1, and 10.97:1, respectively. In intercoding mode, the proposed method also accomplishes the highest compression ratios as 28.91:1 on average. However, H.264/AVC (CAVLC) and H.264/AVC (CABAC) obtain the compression ratios as 19.82:1 and 23.45:1, respectively.
A lot of 3D contents haven been widely used for multimedia services, however, real 3D video contents have been
adopted for a limited applications such as a specially designed 3D cinema. This is because of the difficulty of capturing
real 3D video contents and the limitation of display devices available in a market. However, diverse types of display
devices for stereoscopic video contents for real 3D video contents have been recently released in a market. Especially, a
mobile phone with a stereoscopic camera has been released in a market, which provides a user as a consumer to have
more realistic experiences without glasses, and also, as a content creator to take stereoscopic images or record the
stereoscopic video contents. However, a user can only store and display these acquired stereoscopic contents with his/her
own devices due to the non-existence of a common file format for these contents. This limitation causes a user not share
his/her contents with any other users, which makes it difficult the relevant market to stereoscopic contents is getting
expanded.
Therefore, this paper proposes the common file format on the basis of ISO base media file format for stereoscopic
contents, which enables users to store and exchange pure stereoscopic contents. This technology is also currently under
development for an international standard of MPEG as being called as a stereoscopic video application format.
Multimedia service can be categorized as conversational, streaming, and download services, which differ in temporal and semantic quality requirements. Performance of SVC (Scalable video coding) is analyzed in the three dimensional (spatio-temporal) frequency domain. Based on the analysis, SVC scheme can be modified to satisfy various requirements as being compliant to the international standards. Decoding without enhancement layer data results in drift phenomenon. Several drift-free techniques are analyzed.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.