SEGMENTATION AND ALIGNMENT OF MULTI-ORIENTED AND CURVED TEXT LINES FROM DOCUMENT IMAGES

T.K. BOAZ1*, C.J. PRABHAKAR2
1Department of Computer Science, Kuvempu University, Shivamogga - 577 451, Karnataka, India.
2Department of Computer Science, Kuvempu University, Shivamogga - 577 451, Karnataka, India.
* Corresponding Author : kipyego3@yahoo.com.au

Received : 24-12-2014     Accepted : 14-05-2015     Published : 04-06-2015
Volume : 6     Issue : 1       Pages : 426 - 434
Int J Mach Intell 6.1 (2015):426-434

Keywords : Text-Lines detection, Text-line segmentation, oriented gradients
Academic Editor : Dr Chitra Desai
Conflict of Interest : None declared

Cite - MLA : BOAZ, T.K. and PRABHAKAR, C.J. "SEGMENTATION AND ALIGNMENT OF MULTI-ORIENTED AND CURVED TEXT LINES FROM DOCUMENT IMAGES." International Journal of Machine Intelligence 6.1 (2015):426-434.

Cite - APA : BOAZ, T.K., PRABHAKAR, C.J. (2015). SEGMENTATION AND ALIGNMENT OF MULTI-ORIENTED AND CURVED TEXT LINES FROM DOCUMENT IMAGES. International Journal of Machine Intelligence, 6 (1), 426-434.

Cite - Chicago : BOAZ, T.K. and C.J., PRABHAKAR. "SEGMENTATION AND ALIGNMENT OF MULTI-ORIENTED AND CURVED TEXT LINES FROM DOCUMENT IMAGES." International Journal of Machine Intelligence 6, no. 1 (2015):426-434.

Copyright : © 2015, T.K. BOAZ and C.J. PRABHAKAR, Published by Bioinfo Publications. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution and reproduction in any medium, provided the original author and source are credited.

Abstract

In this paper, we present a novel approach to segment and align multi-oriented and curved text-lines from document images. We assumed that the input document image contains text-lines with arbitrary orientation and identified the arbitrary text string based on projection profile. We employed anisotropic Gaussian filter bank on the identified arbitrary text region in order to smooth the text region, which helps to detect the ridges which is a representative of a text-line path. The ridges are then labeled and a cubic B-spline is fitted to the text-line path points. The orientation and curvature features of the text-line path is estimated using orientated gradients for each point and corresponding curvature to these text-line path are computed. Text is aligned along the horizontally transformed line by rotating individual characters based on the computed curvature information. Finally, the aligned text-lines are extracted, which can be fed into OCR for recognition. The evaluation metrics was evaluated at text-line segmentation level and the results posted show a significant improvement. The resulting system is proven to provide better results than most state of the art algorithms.