A Study on Word Learning and Error Type for Character Correction in Hangul Character Recognition


The Transactions of the Korea Information Processing Society (1994 ~ 2000), Vol. 3, No. 5, pp. 1273-1280, Sep. 1996
DOI: 10.3745/KIPSTE.1996.3.5.1273

Abstract

To achieve high recognition accuracy, a text recognition system must pass its recognized text through a post-processing stage that uses contextual information. We present a system that combines multiple knowledge sources to post-process the output of an optical character recognition (OCR) system. The knowledge sources include word characteristics, the types of misrecognized Hangul characters, and Hangul word learning. In this paper, characters misrecognized by OCR systems are collected and analyzed. As input, we used a Korean dictionary of approximately 150,000 words and Korean language texts from Korean elementary, middle, and high schools. We found that only 10.7% of the words in these school texts appeared in the dictionary. We also classified the error types of Korean character recognition by OCR systems. For Hangul word learning, we utilized the indexes of texts. With these multiple knowledge sources, we could predict the proper word from a large set of candidate words.
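The abstract does not give the algorithm itself, so the following is only a minimal sketch of how an error-type table and a learned word list might be combined to pick a correction. The confusion pairs, the word counts, and the names CONFUSIONS, LEXICON, candidates, and correct are all hypothetical and are not taken from the paper.

```python
from itertools import product

# Hypothetical error-type table: a recognized syllable mapped to the
# syllables it might have been confused with (illustrative pairs only).
CONFUSIONS = {
    "고": ["교"],
    "틴": ["턴"],
}

# Hypothetical lexicon: words with counts, standing in for the knowledge
# gathered from a dictionary and from word learning on texts.
LEXICON = {"학교": 120, "학생": 95, "패턴": 12}


def candidates(word):
    """Generate every string obtained by swapping confusable syllables."""
    options = [[ch] + CONFUSIONS.get(ch, []) for ch in word]
    return ["".join(chars) for chars in product(*options)]


def correct(word):
    """Return the most frequent known candidate, or the word unchanged."""
    if word in LEXICON:
        return word
    known = [c for c in candidates(word) if c in LEXICON]
    return max(known, key=LEXICON.get) if known else word
```

Under these assumptions, correct("학고") yields "학교", because substituting the confusable syllable produces a word that is in the lexicon, while a word with no known candidate is returned unchanged. A real system would presumably weight candidates by how likely each error type is rather than by word count alone.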


Cite this article
[IEEE Style]
T. K. Kim and B. H. Lee, "A Study on Word Learning and Error Type for Character Correction in Hangul Character Recognition," The Transactions of the Korea Information Processing Society (1994 ~ 2000), vol. 3, no. 5, pp. 1273-1280, 1996. DOI: 10.3745/KIPSTE.1996.3.5.1273.

[ACM Style]
Kim Tae Kyun and Lee Byeong Hee. 1996. A Study on Word Learning and Error Type for Character Correction in Hangul Character Recognition. The Transactions of the Korea Information Processing Society (1994 ~ 2000), 3, 5, (1996), 1273-1280. DOI: 10.3745/KIPSTE.1996.3.5.1273.