Poster + Paper
20 June 2024 Real-time deep learning-based object recognition in augmented reality
Author Affiliations +
Conference Poster
Abstract
Augmented reality is a visualization technology that displays information by adding virtual images to the real world. Effective implementation of augmented reality requires recognition of the current scene. Identifying objects in real-time video on computationally limited hardware requires significant effort. One way to solve this problem is to create a hybrid system that, based on machine learning and computer vision technology, processes and analyzes visual data to identify and classify real-world objects. The proposed architecture is based on a combination of the Vuforia augmented system, which provides good performance by balancing prediction accuracy and efficiency. First, the Vuforia neural network architecture allows convenient interaction with AR in Unity and provides initial conditions for detecting 3D objects. The augmented reality construction algorithm is based on the ARCore framework and the OpenGL interface for embedded systems. The system integrates recognition data with an AR platform to display corresponding 3D models, allowing users to interact with them through the functionality of the AR application. This method also involves the development of an enhanced user interface for AR, making the augmented environment more accessible for navigation and control. Experimental research has shown that the proposed method significantly improves the accuracy of object recognition and the ease of working with 3D models in AR.
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
V. Egipko, M. Zhdanova, N. Gapon, V. Voronin, and E. Semenishchev "Real-time deep learning-based object recognition in augmented reality", Proc. SPIE 13000, Real-time Processing of Image, Depth, and Video Information 2024, 130000P (20 June 2024); https://doi.org/10.1117/12.3024957
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Augmented reality

Image filtering

3D modeling

Object recognition

Tunable filters

Deep learning

Image processing

Back to Top