Retrieving Information from Korean OCR Text Database


The Transactions of the Korea Information Processing Society (1994 ~ 2000), Vol. 6, No. 4, pp. 833-840, Apr. 1999
10.3745/KIPSTE.1999.6.4.833,   PDF Download:

Abstract

The texts constructed with Optical Character Recognition (OCR) contain more errors than those constructed with keyboard typing. Therefore, in order to retrieve useful information from OCR texts, we need to develop an effective automatic indexing method. In this paper, we investigate automatic indexing methods that can retrieve information effectively from Korean OCR text database with the character-level recognition ratio of 90%. Experimental result shows that 2-gram indexing provides similar retrieval effectiveness to morpheme-based indexing for the Korean OCR text database.


Statistics
Show / Hide Statistics

Statistics (Cumulative Counts from September 1st, 2017)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article
[IEEE Style]
L. J. Ho, L. C. Sik, H. S. Hwa, K. J. Hyung, "Retrieving Information from Korean OCR Text Database," The Transactions of the Korea Information Processing Society (1994 ~ 2000), vol. 6, no. 4, pp. 833-840, 1999. DOI: 10.3745/KIPSTE.1999.6.4.833.

[ACM Style]
Lee Joon Ho, Lee Chung Sik, Hahn Sun Hwa, and Kim Jin Hyung. 1999. Retrieving Information from Korean OCR Text Database. The Transactions of the Korea Information Processing Society (1994 ~ 2000), 6, 4, (1999), 833-840. DOI: 10.3745/KIPSTE.1999.6.4.833.