An approach to the segmentation of multi-page document flow using binary classification

Onur Agin; Cagdas Ulas; Mehmet Ahat; Can Bekar

doi:10.1117/12.2178778

4 March 2015

Proceedings Volume 9443, Sixth International Conference on Graphic and Image Processing (ICGIP 2014); 944311 (2015) https://doi.org/10.1117/12.2178778
Event: Sixth International Conference on Graphic and Image Processing (ICGIP 2014), 2014, Beijing, China

Abstract

In this paper, we present a method for segmenting document page flow, applied to heterogeneous real bank documents. The approach is based on image content and also incorporates font-based features extracted from the documents. Our method applies a bag of visual words (BoVW) model to the designed image-based feature descriptors and introduces a novel way of combining consecutive pages of a document into a single feature vector that represents the transition between them. Each transition belongs to one of two classes: continuation of the same document or beginning of a new document. Using the transition feature vectors, we apply three different binary classifiers to predict the relationship between consecutive pages. Our initial results indicate that the proposed method shows promising performance for document flow segmentation.
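
The pipeline outlined in the abstract (per-page BoVW histograms, one feature vector per pair of consecutive pages, and a binary decision per transition) can be illustrated with a minimal sketch. The abstract does not say how the two pages are combined, which descriptors are used, or which three classifiers were tested, so everything specific here is an assumption made for illustration: the concatenation-plus-absolute-difference transition vector, the function names, and the placeholder `toy_rule`, which stands in for a trained binary classifier. It is not the authors' implementation.

```python
from math import dist
from typing import Callable, List, Sequence

Vector = Sequence[float]


def bovw_histogram(descriptors: Sequence[Vector], codebook: Sequence[Vector]) -> List[float]:
    """Return an L1-normalised histogram of nearest-codeword assignments.

    `codebook` is assumed to have been learned beforehand (for example by
    clustering descriptors from training pages); that step is not shown.
    """
    counts = [0.0] * len(codebook)
    for d in descriptors:
        nearest = min(range(len(codebook)), key=lambda k: dist(d, codebook[k]))
        counts[nearest] += 1.0
    total = sum(counts)
    return [c / total for c in counts] if total else counts


def transition_vector(prev_hist: Vector, next_hist: Vector) -> List[float]:
    """Combine two consecutive page histograms into one feature vector.

    Assumption: concatenation plus element-wise absolute difference.
    The paper's actual combination scheme is not given in the abstract.
    """
    diff = [abs(a - b) for a, b in zip(prev_hist, next_hist)]
    return list(prev_hist) + list(next_hist) + diff


def segment_flow(
    page_hists: Sequence[Vector],
    is_new_document: Callable[[Vector], bool],
) -> List[List[int]]:
    """Group page indices into documents using a binary transition classifier."""
    if not page_hists:
        return []
    documents = [[0]]
    for i in range(1, len(page_hists)):
        t = transition_vector(page_hists[i - 1], page_hists[i])
        if is_new_document(t):
            documents.append([i])      # class "beginning of a new document"
        else:
            documents[-1].append(i)    # class "continuity of the same document"
    return documents


def toy_rule(t: Vector) -> bool:
    """Hypothetical stand-in for a trained classifier: thresholds the
    total absolute difference stored in the last third of the vector."""
    k = len(t) // 3
    return sum(t[2 * k:]) > 0.6
```

In practice `is_new_document` would wrap the prediction call of whichever trained binary classifier is being evaluated. Passing it as a callable keeps the segmentation loop independent of that choice.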

Citation

Onur Agin, Cagdas Ulas, Mehmet Ahat, and Can Bekar "An approach to the segmentation of multi-page document flow using binary classification", Proc. SPIE 9443, Sixth International Conference on Graphic and Image Processing (ICGIP 2014), 944311 (4 March 2015); https://doi.org/10.1117/12.2178778
