Paper
1 August 2023 Assessment of depression level based on multi-scale 3D convolution and expression videos
Qishuang Cao, Junlong Gao, Zeying Lv, Mi Li
Author Affiliations +
Proceedings Volume 12754, Third International Conference on Computer Vision and Pattern Analysis (ICCPA 2023); 127540S (2023) https://doi.org/10.1117/12.2684233
Event: 2023 3rd International Conference on Computer Vision and Pattern Analysis (ICCPA 2023), 2023, Hangzhou, China
Abstract
The automatic diagnosis of depression using deep learning has recently shown significant progress, and 3DCNN has been applied to assessing depression levels from videos. There are two problems to be encountered applying 3DCNN, including insufficient data to train the model and an insufficient diversity of features extracted on a single scale. An approach is presented in this paper to address these issues. Firstly, we augmented the raw data by adding noise and applying rotations to increase the amount of the training set. Secondly, we constructed a 3DCNN to extract more diverse features from multiscales. We evaluated our proposed method on the AVEC 2014. Our approach achieved MAE=7.13 and RMSE=9.16, demonstrating its effectiveness in improving the accuracy of depression assessment compared to existing approaches.
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Qishuang Cao, Junlong Gao, Zeying Lv, and Mi Li "Assessment of depression level based on multi-scale 3D convolution and expression videos", Proc. SPIE 12754, Third International Conference on Computer Vision and Pattern Analysis (ICCPA 2023), 127540S (1 August 2023); https://doi.org/10.1117/12.2684233
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Feature extraction

Video

Education and training

Convolution

Data modeling

Deep learning

Overfitting

Back to Top