26 September 2017 Deep linear autoencoder and patch clustering-based unified one-dimensional coding of image and video
Author Affiliations +
Abstract
This paper proposes a unified one-dimensional (1-D) coding framework of image and video, which depends on deep learning neural network and image patch clustering. First, an improved K-means clustering algorithm for image patches is employed to obtain the compact inputs of deep artificial neural network. Second, for the purpose of best reconstructing original image patches, deep linear autoencoder (DLA), a linear version of the classical deep nonlinear autoencoder, is introduced to achieve the 1-D representation of image blocks. Under the circumstances of 1-D representation, DLA is capable of attaining zero reconstruction error, which is impossible for the classical nonlinear dimensionality reduction methods. Third, a unified 1-D coding infrastructure for image, intraframe, interframe, multiview video, three-dimensional (3-D) video, and multiview 3-D video is built by incorporating different categories of videos into the inputs of patch clustering algorithm. Finally, it is shown in the results of simulation experiments that the proposed methods can simultaneously gain higher compression ratio and peak signal-to-noise ratio than those of the state-of-the-art methods in the situation of low bitrate transmission.
© 2017 SPIE and IS&T 1017-9909/2017/$25.00 © 2017 SPIE and IS&T
Honggui Li "Deep linear autoencoder and patch clustering-based unified one-dimensional coding of image and video," Journal of Electronic Imaging 26(5), 053016 (26 September 2017). https://doi.org/10.1117/1.JEI.26.5.053016
Received: 23 May 2017; Accepted: 6 September 2017; Published: 26 September 2017
Lens.org Logo
CITATIONS
Cited by 1 scholarly publication.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Image compression

Video coding

Simulation of CCA and DLA aggregates

Video

Reconstruction algorithms

3D image processing

Chromium

RELATED CONTENT

Joint bit allocation for 3D video coding based on virtual...
Proceedings of SPIE (October 29 2014)
Enhancements to MPEG4 MVC for depth compression
Proceedings of SPIE (August 04 2010)
Compression of full-parallax integral 3D-TV image data
Proceedings of SPIE (May 15 1997)
Predictive Coding of Depth Images Across Multiple Views
Proceedings of SPIE (March 05 2007)
High-compression video coding: a novel approach
Proceedings of SPIE (May 01 1994)

Back to Top