Paper
4 March 2015 An approach to the segmentation of multi-page document flow using binary classification
Onur Agin, Cagdas Ulas, Mehmet Ahat, Can Bekar
Author Affiliations +
Proceedings Volume 9443, Sixth International Conference on Graphic and Image Processing (ICGIP 2014); 944311 (2015) https://doi.org/10.1117/12.2178778
Event: Sixth International Conference on Graphic and Image Processing (ICGIP 2014), 2014, Beijing, China
Abstract
In this paper, we present a method for segmentation of document page flow applied to heterogeneous real bank documents. The approach is based on the content of images and it also incorporates font based features inside the documents. Our method involves a bag of visual words (BoVW) model on the designed image based feature descriptors and a novel approach to combine the consecutive pages of a document into a single feature vector that represents the transition between these pages. The transitions here could be represented by one of the two different classes: continuity of the same document or beginning of a new document. Using the transition feature vectors, we utilize three different binary classifiers to make predictions on the relationship between consecutive pages. Our initial results demonstrate that the proposed method can exhibit promising performance for document flow segmentation at this stage.
© (2015) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Onur Agin, Cagdas Ulas, Mehmet Ahat, and Can Bekar "An approach to the segmentation of multi-page document flow using binary classification", Proc. SPIE 9443, Sixth International Conference on Graphic and Image Processing (ICGIP 2014), 944311 (4 March 2015); https://doi.org/10.1117/12.2178778
Lens.org Logo
CITATIONS
Cited by 3 scholarly publications and 1 patent.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Visualization

Binary data

Visual process modeling

Image segmentation

Feature extraction

Performance modeling

Optical character recognition

Back to Top