A Complete Bengali OCR: A Novel Hybrid Approach to Handwritten Bengali Character Recognition

A. F. R. Rahman, M. Kaykobad

Abstract


A complete Bengali OCR is presented incorporating a novel hybrid approach to handwritten Bengali character recognition. The idea is to combine structural analysis and template matching techniques in order to recognise the handwritten Bengali characters. Handwritten Bengali characters are inherently cursive and there is an absence of well-defined strokes. In this approach, the character set has been separated into different distinct sub-classes based one some distinguishable structural features. Details of several approaches to detect these structural features are presented. Structural and syntactic features have been generated from the training samples to generate distinct character signatures. A match dictionary has been devised based on these character signatures, that helps in collecting multiple prototypes from the training samples. A revised form of continuity analysis is applied to match the test characters to the characters in the match dictionary. This complete OCR has been thoroughly implemented and tested and very promising results have been achieved in this direction.

Keywords


Handwritten Bengali character recognition, hierarchical structure, syntactic recognition

Full Text:

PDF


Creative Commons License
This work is licensed under a Creative Commons Attribution-NoDerivatives 4.0 International License.

Crossref Similarity Check logo

Crossref logologo_doaj