T.K. BOAZ1*, C.J. PRABHAKAR2
1Department of Computer Science, Kuvempu University, Shivamogga - 577 451, Karnataka, India.
2Department of Computer Science, Kuvempu University, Shivamogga - 577 451, Karnataka, India.
* Corresponding Author : kipyego3@yahoo.com.au
Received : 24-12-2014 Accepted : 14-05-2015 Published : 04-06-2015
Volume : 6 Issue : 1 Pages : 426 - 434
Int J Mach Intell 6.1 (2015):426-434
Keywords : Text-Lines detection, Text-line segmentation, oriented gradients
Academic Editor : Dr Chitra Desai
Conflict of Interest : None declared
In this paper, we present a novel approach to segment and align multi-oriented and curved text-lines from document images. We assumed that the input document image contains text-lines with arbitrary orientation and identified the arbitrary text string based on projection profile. We employed anisotropic Gaussian filter bank on the identified arbitrary text region in order to smooth the text region, which helps to detect the ridges which is a representative of a text-line path. The ridges are then labeled and a cubic B-spline is fitted to the text-line path points. The orientation and curvature features of the text-line path is estimated using orientated gradients for each point and corresponding curvature to these text-line path are computed. Text is aligned along the horizontally transformed line by rotating individual characters based on the computed curvature information. Finally, the aligned text-lines are extracted, which can be fed into OCR for recognition. The evaluation metrics was evaluated at text-line segmentation level and the results posted show a significant improvement. The resulting system is proven to provide better results than most state of the art algorithms.