A Two-stage Approach for Word Searching in Handwritten Document Images

Ankur   Goyal; Pronita   Mukherjee; Dipra  Mitra; Shiv   Kant; Khalid  Almalki; Suliman   Mohamed Fati

doi:10.56294/dm202554

Authors

Ankur Goyal Department of CSE, Symbiosis Institute of Technology, Symbiosis International Deemed University, Pune, Maharashtra, India Author
Pronita Mukherjee Department of CSE, Gargi Memorial Institute of Technology, Kolkata West Bengal, India Author
Dipra Mitra Department of CSE, Amity University Jharkhand, Ranchi, Jharkhand India Author
Shiv Kant Department of Computer Science & Engineering (AI & DS), Greater Noida Institute of Technology (GNIOT), Greater Noida, Delhi/NCR, India Author
Khalid Almalki Assistant Professor, Department of Computer Science, College of Computing and Informatics, Saudi Electronic University, Riyadh Author
Suliman Mohamed Fati Associate Professor and Chair of Information Systems Department, College of Computer and Information Sciences, Prince Sultan University, Riyadh-11586, Saudi Arabia Author

DOI:

https://doi.org/10.56294/dm202554

Keywords:

Feature ex-traction, Antlion Algorithm for feature section, comparative study with existing algorithm

Abstract

Introduction; Despite the rise of electronic papers, handwritten paper documents remain important. Current technologies make document digitization, storage, compression, and transmission easy and affordable. But semi-automatic document image processing needs specific technology to extract document information accurately. Typed textual searches are used to get information from Digital Libraries.
Objective; Generally, in a document, there exists a varying number of characters in different words. That is why searching a word in a whole document is incorporate mismatched word images in the fetched word image and also increases the time consumption to complete the task.
Method; Keeping this idea in mind, the words having different number of characters with respect to the search word are discarded at the beginning as preprocessing.
Result; To confirm the outstanding words in the document page as probable search word, a voting-based approach has been used for doing this, a modified HOG feature descriptor is extracted from each word image, then 5 distance-matching metrics are calculated, fed to a voting schema with the help of threshold value of each metrics, calculated beforehand.
Conclusion; Here 3 types of voting is performed, first 2, with the varying no of metrics vote for positivity of the search word and in the last one three distance metrics are used among which if more than one votes for the positivity the model will indicate the word as a search word.

References

1. Bhowmik S, Malakar S, Sarkar R, Basu S, Kundu M, Nasipuri M. Off-line Bangla handwritten word recognition: a holistic approach. Neural Comput Appl. 2019;31:5783–98. DOI: https://doi.org/10.1007/s00521-018-3389-1

2. Basu S, Das N, Sarkar R, Kundu M, Nasipuri M, Basu DK. A hierarchical approach to recognition of handwritten Bangla characters. Pattern Recognit. 2009;42(7):1467–84. DOI: https://doi.org/10.1016/j.patcog.2009.01.008

3. Rath TM, Manmatha R. Word spotting for historical documents. Int J Doc Anal Recognit. 2007;9:139–52. DOI: https://doi.org/10.1007/s10032-006-0027-8

4. Begum N, Goyal A. Analysis of legal case document automated summarizer. In: 2021 6th International Conference on Signal Processing, Computing and Control (ISPCC). IEEE; 2021. p. 533–8. DOI: https://doi.org/10.1109/ISPCC53510.2021.9609442

5. Sharma S, Choudhary S, Sharma VK, Goyal A, Balihar MM. Image watermarking in frequency domain using Hu’s invariant moments and firefly algorithm. no April. 2022;1–15. DOI: https://doi.org/10.5815/ijigsp.2022.02.01

6. Rath TM, Manmatha R. Word image matching using dynamic time warping. In: 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003 Proceedings. IEEE; 2003. p. II–II.

7. Dalal N, Triggs B. Histograms of oriented gradients for human detection. In: 2005 IEEE computer society conference on computer vision and pattern recognition (CVPR’05). Ieee; 2005. p. 886–93. DOI: https://doi.org/10.1109/CVPR.2005.177

8. Zagoris K, Ergina K, Papamarkos N. A document image retrieval system. Eng Appl Artif Intell. 2010;23(6):872–9. DOI: https://doi.org/10.1016/j.engappai.2010.03.002

9. Retsinas G, Louloudis G, Stamatopoulos N, Gatos B. Efficient learning-free keyword spotting. IEEE Trans Pattern Anal Mach Intell. 2018;41(7):1587–600. DOI: https://doi.org/10.1109/TPAMI.2018.2845880

10. Pantke W, Dennhardt M, Fecker D, Märgner V, Fingscheidt T. An historical handwritten arabic dataset for segmentation-free word spotting-hadara80p. In: 2014 14th International Conference on Frontiers in Handwriting Recognition. IEEE; 2014. p. 15–20. DOI: https://doi.org/10.1109/ICFHR.2014.11

11. Rusiñol M, Aldavert D, Toledo R, Lladós J. Efficient segmentation-free keyword spotting in historical document collections. Pattern Recognit. 2015;48(2):545–55. DOI: https://doi.org/10.1016/j.patcog.2014.08.021

12. Malakar S, Ghosh M, Sarkar R, Nasipuri M. Development of a two-stage segmentation-based word searching method for handwritten document images. J Intell Syst. 2019;29(1):719–35. DOI: https://doi.org/10.1515/jisys-2017-0384

A Two-stage Approach for Word Searching in Handwritten Document Images

Authors

DOI:

Keywords:

Abstract

References

Downloads

Published

Issue

Section

License

How to Cite

compendex