Paper
27 March 2024 UAV aerial image segmentation method based on improved U-Net
Bin Gao
Author Affiliations +
Proceedings Volume 13105, International Conference on Computer Graphics, Artificial Intelligence, and Data Processing (ICCAID 2023); 1310503 (2024) https://doi.org/10.1117/12.3026722
Event: 3rd International Conference on Computer Graphics, Artificial Intelligence, and Data Processing (ICCAID 2023), 2023, Qingdao, China
Abstract
With the rapid development of UAV technology, the demand for image recognition segmentation acquired from the air is gradually growing. In this paper, we propose an improved U-Net architecture designed to enhance the effectiveness of UAV image segmentation tasks. Firstly, the traditional convolution module is replaced by a depth-separable convolution to reduce the number of parameters and computational cost and to improve the lightweight performance of the model; secondly, an Axial Attention mechanism is introduced to enhance the model's ability to capture long-range dependencies and spatial information; and finally, the Focal Loss is employed to solve the category imbalance problem. We conducted extensive experimental evaluations on the Aeroscapes dataset containing images captured by UAVs from different altitudes, and the experimental results show that the improved U-Net model exhibits significant improvements in the UAV image segmentation task compared to the traditional U-Net model. Our model improves 6.7% in miou and 2.7% in precision by 2.9% and 3.3% in F1 score. Thus, the method proposed in this paper has potential applications for improving the automation and accuracy of aerial image analysis.
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Bin Gao "UAV aerial image segmentation method based on improved U-Net", Proc. SPIE 13105, International Conference on Computer Graphics, Artificial Intelligence, and Data Processing (ICCAID 2023), 1310503 (27 March 2024); https://doi.org/10.1117/12.3026722
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Image segmentation

Unmanned aerial vehicles

Performance modeling

Data modeling

Education and training

Statistical modeling

Convolutional neural networks

Back to Top