Segmentation Based Word Spotting Method for Handwritten Documents

Thontadari C, Prabhakar C. J.

Abstract


In this paper, we present a segmentation based word spotting method for handwritten document images using Co-occurrence Histograms of Oriented Gradients (Co-HOG) descriptor. The drawback of Histogram of Oriented Gradients (HOG) is that HOG ignores spatial information of adjacent pixels where as the Co-HOG take into account spatial contextual information by capturing the co-occurrence of orientation pairs of neighbouring pixels. In order to construct Co-HOG descriptor for word spotting, we divide a word image into blocks and Co-HOG features are extracted from each block and finally concatenate them to form a feature descriptor. The proposed method is evaluated using precision and recall metrics through experimentation conducted on popular GW dataset and confirmed that our method outperforms for this dataset. 

Full Text:

PDF

References


Dalal, N., & Triggs, B. (2005, June). Histograms of oriented gradients for human detection. In Computer Vision and Pattern Recognition, IEEE Computer Society Conference on Vol. 1, 886-893.

Rodrıguez, J. A., & Perronnin, F. (2008). Local gradient histogram features for word spotting in unconstrained handwritten documents. In International Conference on Frontiers in Handwriting Recognition, Canada, 7-12.

Terasawa, K., & Tanaka, Y. (2009). Slit style HOG feature for document image word spotting. In Document Analysis and Recognition, 2009. ICDAR'09. 10th International Conference, 116-120.

Corvee, E., & Bremond, F. (2010). Body parts detection for people tracking using trees of histogram of oriented gradient descriptors. In Advanced Video and Signal Based Surveillance, Seventh IEEE International Conference, 469-475.

Deniz, O., Bueno, G., Salido, J., & De la Torre, F. (2011). Face recognition using histograms of oriented gradients. PR Letters,32(12),1598-1603.

Shu, C., Ding, X., & Fang, C. (2011). Histogram of the oriented gradient for face recognition. Tsinghua Science & Technology, 16(2), 216-224.

Newell, A. J., & Griffin, L. D. (2011). Multiscale histogram of oriented gradient descriptors for robust character recognition. In DAR, 1085-1089.

Minetto, R., Thome, N., Cord, M., Leite, N. J., & Stolfi, J. (2013). T-HOG: An effective gradient-based descriptor for single line text regions. Pattern recognition, 46(3), 1078-1090.

Arrospide, J., Salgado, L., & Marinas, J. (2013). HOG-like gradient-based descriptor for visual vehicle detection. In Intelligent Vehicles Symposium (IV), IEEE, 223-228.

Saidani, A., Kacem Echi, A., & Belaid, A. (2015).Arabic/Latin and Machine-printed/Handwritten Word Discrimination using HOG-based Shape Descriptor. ELCVIA: 0001-23.

Almazan, J., Gordo, A., Fornés, A., & Valveny, E. (2014). Segmentation-free word spotting with exemplar svms. PR, 47(12), 3967-3978.

Watanabe, T., Ito, S., & Yokoi, K. (2009). Co-occurrence histograms of oriented gradients for pedestrian detection. In Advances in Image and Video Technology, Springer Heidelberg, 37-47.

Ren, H., Heng, C. K., Zheng, W., Liang, L., & Chen, X. (2010). Fast object detection using boosted co-occurrence histograms of oriented gradients. In Image Processing, 17th IEEE International Conference, 2705-2708.

Almazan, J., Gordo, A., Fornés, A., & Valveny, E. (2013). Handwritten word spotting with corrected attributes. In Proceedings of the IEEE International Conference on Computer Vision, 1017-1024.

Tian, S., Lu, S., Su, B., & Tan, C. L. (2013). Scene text recognition using co-occurrence of histogram of oriented gradients. In DAR, 12th International Conference on 912-916.

Su, B., Lu, S., Tian, S., Lim, J. H., & Tan, C. L. (2014). Character recognition in natural scenes using convolutional co-occurrence hog. In 22 International Conference on PR, 2926-2931.

Tian, S., Bhattacharya, U., Lu, S., Su, B., Wang, Q., Wei, X., & Tan, C. L. (2015). Multilingual scene character recognition with co-occurrence of histogram of oriented gradients. Pattern Recognition, 51, 125-134.

Manmatha, R., Han, C., & Riseman, E. M. (1996). Word spotting: A new approach to indexing handwriting. In Computer Vision and Pattern Recognition,Proceedings, IEEE Computer Society Conference, 631-637.

Manmatha, R., & Rath, T. (2003). Indexing of handwritten historicaldocuments-recent progress. In Proceedings Symposium on Document Image Understanding Technology, 77-86.

Fischer, A., Keller, A., Frinken, V., & Bunke, H. (2010). HMM-based word spotting in handwritten documents using subword models. In PR, International conference,3416-3419.

Wollmer, M., Eyben, F., Keshet, J., Graves, A., Schuller, B., & Rigoll, G. (2009). Robust discriminative keyword spotting for emotionally colored spontaneous speech using bidirectional LSTM networks. In Acoustics, Speech and Signal Processing, 3949-3952.

Frinken, V., Fischer, A., Manmatha, R., & Bunke, H. (2011). A novel word spotting method based on recurrent neural networks. PAMI,34(2), 211-224.

Huang, L., Yin, F., Chen, Q. H., & Liu, C. L. (2013). Keyword spotting in unconstrained handwritten Chinese documents using contextual word model. Image and Vision Computing, 31(12), 958-968.

Wshah, S., Kumar, G., & Govindaraju, V. (2014). Statistical script independent word spotting in offline handwritten documents. Pattern Recognition, 47(3), 1039-1050.

Rusinol, M., Aldavert, D., Toledo, R., & Llados, J. (2015). Efficient segmentation-free keyword spotting in historical document collections. Pattern Recognition, 48(2), 545-555.

Rath, T. M., & Manmatha, R. (2007). Word spotting for historical documents. IJDAR, 9, (2-4),139-152.




DOI: https://doi.org/10.23956/ijarcsse/V7I6/0127

Refbacks

  • There are currently no refbacks.




© International Journals of Advanced Research in Computer Science and Software Engineering (IJARCSSE)| All Rights Reserved | Powered by Advance Academic Publisher.