Paper
21 August 2023 Learning rate range test for the vision transformer
Rinka Kiriyama, Akio Sashima, Ikuko Shimizu
Author Affiliations +
Abstract
The solutions obtained by training the deep neural network are highly dependent on the parameters including the learning rate. Therefore, finding the appropriate settings for training deep neural networks is very important. In particular, it is necessary to find the better settings for SOTA models of Vision Transformer(ViT), whose structure is different from ordinal models. In this paper, we focus on the learning rate to find a better value using the Learning Rate Range Test (LRRT). Through our experiments, we found that the appropriate LR is located where the decrease in loss value stops in the LRRT. In addition, we discuss about the effects of the number of epochs and the LR warm up.
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Rinka Kiriyama, Akio Sashima, and Ikuko Shimizu "Learning rate range test for the vision transformer", Proc. SPIE 12783, International Conference on Images, Signals, and Computing (ICISC 2023), 1278303 (21 August 2023); https://doi.org/10.1117/12.2692013
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Lawrencium

Education and training

Transformers

Visual process modeling

Computer vision technology

Neural networks

Deep learning

Back to Top