14 February 2020 Large-scale book page retrieval by deep hashing networks
Proceedings Volume 11430, MIPPR 2019: Pattern Recognition and Computer Vision; 1143023 (2020) https://doi.org/10.1117/12.2541935
Event: Eleventh International Symposium on Multispectral Image Processing and Pattern Recognition (MIPPR2019), 2019, Wuhan, China
Abstract
Printed books are increasingly accompanied by electronic resources such as video, audio, games, augmented reality, and other mobile apps. However, most of these resources are inconvenient to access, because the association between a printed book and its electronic resources is not automatically available [2]. To bridge a book page and its corresponding electronic resources, this paper presents a large-scale book page retrieval method based on a deep hashing network. There are three main contributions. First, a pipeline is proposed that makes a Convolutional Neural Network (CNN) trained for another, unrelated task usable for book page retrieval. Second, the deep hashing network maps the high-dimensional features extracted from the CNN to low-dimensional binary hash codes in Hamming space, which both speeds up retrieval and reduces feature storage. Third, a large-scale dataset consisting of 1.55M book page images is collected. Experimental results on the 1.55M book page dataset show that the proposed deep hashing network achieves a Top-1 hit rate of 92.1% with a response time of less than 0.6 seconds on a desktop computer with a GeForce 1080Ti GPU.
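The abstract does not give the network architecture or the hashing loss, so the following is only a minimal sketch of the general idea of retrieval with binary codes in Hamming space. Everything in it is an assumption made for illustration: a fixed random projection followed by a sign threshold stands in for the learned deep hashing network, and the names `make_projection`, `hash_code`, `hamming`, and `top1` are hypothetical, not taken from the paper.

```python
import random


def make_projection(dim, bits, seed=0):
    """Hypothetical stand-in for the learned hashing layer:
    a fixed random matrix with `bits` rows and `dim` columns."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(bits)]


def hash_code(feature, projection):
    """Map a real-valued feature vector to a binary code packed into an int.
    Each bit is 1 when the corresponding projected activation is positive."""
    code = 0
    for row in projection:
        activation = sum(w * x for w, x in zip(row, feature))
        code = (code << 1) | (1 if activation > 0 else 0)
    return code


def hamming(a, b):
    """Hamming distance between two packed codes: count the differing bits."""
    return bin(a ^ b).count("1")


def top1(query_code, index):
    """Return the page id whose stored code is closest to the query code."""
    return min(index, key=lambda page_id: hamming(query_code, index[page_id]))


if __name__ == "__main__":
    rng = random.Random(1)
    dim, bits = 128, 64  # assumed sizes, not the paper's settings
    projection = make_projection(dim, bits)

    # Toy "database" of page features; a real system would take these from a CNN.
    pages = {f"page_{i}": [rng.gauss(0.0, 1.0) for _ in range(dim)] for i in range(1000)}
    index = {pid: hash_code(feat, projection) for pid, feat in pages.items()}

    # Query with a slightly perturbed copy of one page's feature vector.
    query = [x + rng.gauss(0.0, 0.1) for x in pages["page_42"]]
    print(top1(hash_code(query, projection), index))
```

Storing each page as a short packed integer rather than a high-dimensional float vector is what reduces storage, and comparing codes with an XOR and a bit count is what makes the nearest-neighbour search cheap. A learned hashing network would replace the random projection here.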
© (2020) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Jiao Huo, Yi Zhao, and Leyuan Liu "Large-scale book page retrieval by deep hashing networks", Proc. SPIE 11430, MIPPR 2019: Pattern Recognition and Computer Vision, 1143023 (14 February 2020); https://doi.org/10.1117/12.2541935
KEYWORDS
Feature extraction
Convolutional neural networks
Binary data
Video
Video acceleration
Databases
Image retrieval