Paper
21 June 2024 Improved deep learning image compression model: performance optimization based on convolutional modules and local attention mechanism
Ruihua Liu, Lihang Xu, Siyu Duan, He Yan
Author Affiliations +
Proceedings Volume 13167, International Conference on Remote Sensing, Mapping, and Image Processing (RSMIP 2024); 131672N (2024) https://doi.org/10.1117/12.3029659
Event: International Conference on Remote Sensing, Mapping and Image Processing (RSMIP 2024), 2024, Xiamen, China
Abstract
Deep learning image compression, using neural networks, improves compression over traditional methods like JPEG. These methods enhance visual quality at lower bit rates by learning better image representations. However, they struggle with capturing broad context compared to local features. To address this, we propose enhancements: a new convolutional module with stacked layers and advanced operations, and a spatial attention block ("Shuffle attention") for better feature extraction. These boost performance. Our method is faster and requires fewer parameters than state-of-the-art techniques on Kodak and CLIC datasets. Despite slightly lower rate-distortion performance, our Composite Conv module and spatial attention block effectively extract global features and reduce encoding time. In conclusion, our work advances deep learning image compression by mitigating convolutional network limitations, enhancing compression efficiency while preserving quality.
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Ruihua Liu, Lihang Xu, Siyu Duan, and He Yan "Improved deep learning image compression model: performance optimization based on convolutional modules and local attention mechanism", Proc. SPIE 13167, International Conference on Remote Sensing, Mapping, and Image Processing (RSMIP 2024), 131672N (21 June 2024); https://doi.org/10.1117/12.3029659
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Image compression

Image quality

Convolution

Deep learning

Performance modeling

Image enhancement

Image restoration

Back to Top