14 March 2024
Few-shot object detection based on self-supervised feature pyramid network
Wen Lv, Xinwei Qi, Hongbo Shi, Shuai Tan, Bing Song, Yang Tao
Abstract

In few-shot object detection, the limited amount of labeled data cannot adequately represent all possible scenarios and objects, so the model is unable to fully learn the features and attributes of novel classes. In some cases, novel classes are confused with base classes because their features are similar, which leads to inaccurate detections. During the two-stage fine-tuning phase, when the novel class data differ significantly from the base class data, the candidate boxes generated by a backbone network trained on the base classes may not suit the novel class targets. An open research problem is therefore how to mine novel class knowledge during feature extraction, to compensate for the scarcity of feature samples and improve sensitivity in recognizing novel classes. To enrich the feature representation of novel classes, we propose a self-supervised feature pyramid network. This approach explores novel class attributes in the lower-level network, thereby encouraging the feature extractor to generate candidate boxes that are consistent with the novel class targets and making the backbone network more sensitive to novel classes. We validate the proposed framework by comparing it with state-of-the-art methods on two popular datasets, achieving an improvement of up to 5.2% on the standard PASCAL VOC benchmark and 1.4% on the challenging COCO benchmark.

© 2023 SPIE and IS&T
Wen Lv, Xinwei Qi, Hongbo Shi, Shuai Tan, Bing Song, and Yang Tao "Few-shot object detection based on self-supervised feature pyramid network," Journal of Electronic Imaging 33(2), 023021 (14 March 2024). https://doi.org/10.1117/1.JEI.33.2.023021
Received: 29 August 2023; Accepted: 13 November 2023; Published: 14 March 2024
KEYWORDS
Object detection
Education and training
Feature extraction
Data modeling
Statistical modeling
Visualization
Sensors