Multi-label image classification model based on multi-scale semantic attention and graph attention network

Lu Jiang; JiHua Ye; ShunJie Xiao; Yi Zong; AiWen Jiang

doi:10.1117/12.3005199

20 October 2023

Proceedings Volume 12916, Third International Conference on Signal Image Processing and Communication (ICSIPC 2023); 129160Q (2023) https://doi.org/10.1117/12.3005199
Event: Third International Conference on Signal Image Processing and Communication (ICSIPC 2023), 2023, Kunming, China

Abstract

Current research on multi-label image classification mainly focuses on exploiting the correlation between labels to improve classification accuracy. In existing methods, however, label correlation is computed from statistical information about the data. Such correlation is global and dataset-dependent, so it does not suit every sample. In addition, the feature information of small objects is easily lost during image feature extraction, which lowers classification accuracy for small objects. To address these problems, this paper proposes a multi-label image classification model based on multi-scale semantic attention and a graph attention network. [...] vector, followed by feature fusion to enhance the feature information of small objects; the self-attention mechanism in the graph attention module is then used to adaptively mine the correlation between categories in the image, and an attention regularization loss is proposed. The model reaches an mAP of 95.5% on VOC 2007 and 83.4% on MS-COCO 2014, two public datasets, and most of its metrics are better than those of existing state-of-the-art methods.
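As a rough illustration of how a graph attention module can derive per-image correlations between categories, the following is a minimal sketch of a generic single-head graph attention layer over a fully connected graph of category nodes. It is not the authors' implementation: the function names (`project`, `graph_attention`), the feature dimensions, and the random inputs are all assumptions made for the example. Only the count of 20 categories is taken from VOC 2007.

```python
import math
import random


def leaky_relu(x, slope=0.2):
    return x if x > 0 else slope * x


def project(H, W):
    """Multiply each row of H (C x d_in) by W (d_in x d_out)."""
    return [[sum(h[k] * W[k][j] for k in range(len(W)))
             for j in range(len(W[0]))] for h in H]


def graph_attention(H, W, a, slope=0.2):
    """Generic single-head graph attention over category nodes (hypothetical helper).

    H: C category feature vectors, W: shared projection,
    a: attention vector of length 2 * d_out.
    Returns the updated node features and the C x C attention matrix.
    """
    Z = project(H, W)
    C, d_out = len(Z), len(Z[0])
    # a^T [z_i || z_j] splits into a source term and a destination term.
    src = [sum(z[k] * a[k] for k in range(d_out)) for z in Z]
    dst = [sum(z[k] * a[d_out + k] for k in range(d_out)) for z in Z]
    alpha = []
    for i in range(C):
        scores = [leaky_relu(src[i] + dst[j], slope) for j in range(C)]
        top = max(scores)  # subtract the max for a numerically stable softmax
        exps = [math.exp(s - top) for s in scores]
        total = sum(exps)
        alpha.append([e / total for e in exps])
    # Each node becomes an attention-weighted sum of all projected nodes.
    H_out = [[sum(alpha[i][j] * Z[j][k] for j in range(C))
              for k in range(d_out)] for i in range(C)]
    return H_out, alpha


random.seed(0)
C, d_in, d_out = 20, 16, 8  # 20 categories as in VOC 2007; dimensions are arbitrary
H = [[random.gauss(0, 1) for _ in range(d_in)] for _ in range(C)]
W = [[random.gauss(0, 0.1) for _ in range(d_out)] for _ in range(d_in)]
a = [random.gauss(0, 0.1) for _ in range(2 * d_out)]
H_out, alpha = graph_attention(H, W, a)
print(len(H_out), len(H_out[0]))
```

Because the attention matrix `alpha` is recomputed from the category features of each input, the resulting correlations vary per sample, in contrast to a fixed co-occurrence matrix estimated once from dataset statistics.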

(2023) Published by SPIE. Downloading of the abstract is permitted for personal use only.

Citation

Lu Jiang, JiHua Ye, ShunJie Xiao, Yi Zong, and AiWen Jiang "Multi-label image classification model based on multi-scale semantic attention and graph attention network", Proc. SPIE 12916, Third International Conference on Signal Image Processing and Communication (ICSIPC 2023), 129160Q (20 October 2023); https://doi.org/10.1117/12.3005199
