Poster + Presentation + Paper
4 January 2023 Deep learning in tasks of interior objects recognition and 3D reconstruction
Author Affiliations +
Conference Poster
Abstract
3D data visualization is a non-trivial effort, however high-quality data processing and visualization is crucial in all spheres of computer vision tasks, especially if our tasks include work with real environment and require precise results. Many industries can benefit from automated object detection and its analysis. Effective environment information retrieving and its digitization open up great prospects in robotics and in the design of such systems that require scene reconstruction into point clouds. This solution offers new possibilities for mixed reality systems also. For example, with restored scene data we can add a virtual light source and illuminate the room, or it becomes possible to cast reflections of virtual objects in mirrors. A breakthrough in neural networks training on point clouds occurred recently after the "PointNet" architecture implementation, and the trend in working with 3D data continues to grow. Current research is aimed at implementing the interior objects recognition and 3D reconstruction approach that works with interior scenes and low-quality incomplete information from lidars. This method enables the selection of interior objects from the scene as well as the determination of their location and dimensions. PointNet neural network architecture trained on the ScanNet dataset was used to annotate and segment the point cloud. To create a triangle grid, the neural network "Total3D understanding" was employed. As a result, was built an interior environment reconstruction method using RGB images and point clouds as input data. A simple interior of a room reconstruction example is provided, along with the result quality assessment.
Conference Presentation
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Maksim Sorokin, Dmitry Zhdanov, Andrey Zhdanov, Igor Potemin, and Yan Wang "Deep learning in tasks of interior objects recognition and 3D reconstruction", Proc. SPIE 12317, Optoelectronic Imaging and Multimedia Technology IX, 1231718 (4 January 2023); https://doi.org/10.1117/12.2643991
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
3D modeling

Mixed reality

Cameras

Neural networks

Object recognition

Visualization

Back to Top