Image composition creates a new image by cutting a region out of one image and pasting it onto another. The resulting composite, however, often looks unrealistic because the two parts are inconsistent. Image harmonization addresses this by adjusting the color, illumination, and visual style of the foreground so that it becomes compatible with the background, yielding a realistic composite. Nevertheless, previous harmonization techniques mostly learn a direct mapping from the composite image to the real image, ignoring the significant role of the background's visual style as a guide for that mapping. In this work, we formulate image harmonization as a style transfer problem: we treat the foreground as the content image and the background as the style image, and transform the foreground's style using the background's style features. To this end, we propose a self-attention-based module that learns the mapping between foreground and background features via a modified self-attention mechanism. The module computes the degree of correlation between the foreground and the background according to their semantic distributions, and simultaneously aligns the channel-wise mean and variance of the foreground features with those of the background features. To investigate the effectiveness of the proposed module comprehensively, we conduct extensive experiments and ablation studies on the iHarmony4 benchmark dataset. The results show that our module outperforms the baseline on this task and that our harmonized images look much closer to the real images.
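The two operations the abstract describes can be illustrated with a minimal NumPy sketch. This is not the paper's implementation; the function names (`align_statistics`, `attention_weights`), the flattened `(C, N)` feature layout, and the plain scaled dot-product attention are illustrative assumptions standing in for the modified self-attention module.

```python
import numpy as np

def align_statistics(fg, bg, eps=1e-5):
    # Hypothetical sketch: align the channel-wise mean and variance of
    # foreground features fg (shape C x N_fg) with those of background
    # features bg (shape C x N_bg), in the style of AdaIN normalization.
    fg_mean = fg.mean(axis=1, keepdims=True)
    fg_std = fg.std(axis=1, keepdims=True) + eps
    bg_mean = bg.mean(axis=1, keepdims=True)
    bg_std = bg.std(axis=1, keepdims=True) + eps
    # Normalize the foreground, then rescale/shift it to the background stats.
    return bg_std * (fg - fg_mean) / fg_std + bg_mean

def attention_weights(fg, bg):
    # Toy scaled dot-product correlation: each foreground position attends
    # over all background positions; rows sum to 1 after the softmax.
    scores = fg.T @ bg / np.sqrt(fg.shape[0])          # (N_fg, N_bg)
    scores -= scores.max(axis=1, keepdims=True)        # numerical stability
    w = np.exp(scores)
    return w / w.sum(axis=1, keepdims=True)
```

In this toy form, `align_statistics` captures the statistic-alignment half of the module, while `attention_weights` captures the semantic-correlation half; the paper combines both inside one attention module rather than as two separate functions.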
Keywords: Composites, Image quality, Visualization, Image processing, Education and training, Semantics, Convolution