A Model of Natural Language Information Retrieval Using Main Keywords and Sub - Keywords


The Transactions of the Korea Information Processing Society (1994 ~ 2000), Vol. 4, No. 12, pp. 3052-3062, Dec. 1997
10.3745/KIPSTE.1997.4.12.3052,   PDF Download:

Abstract

An Information Retrieval(IR) is to retrieve relevant information that satisfies user's information needs. However a major role of IR systems is not just the generation of sets of relevant documents, but to help determine which documents are most likely to be relevant to the given requirements. Various attempts have been made in the recent past to use syntactic analysis methods for the generation of complex construction that are essential for content identification in various automatic text analysis systems. Unfortunately, it is known that methods based on syntactic understanding alone are not sufficiently powerful to produce complete analyses of arbitrary text samples. In this paper, we present a document ranking method based on two-level ranking. The first level is used to retrieve the documents, and the second level to reorder the retrieved documents. The main keywords used in the first level can be defined as nouns and/or compound nouns that possess good document discrimination powers. The sub-keywords used in the second level can be also defined as adjectives, adverbs, and/or verbs that are not main keywords, and function words. An empirical study was conducted from a Korean encyclopedia with 23,113 entries and 161 Korean natural language queries collected by end users. 85% of the natural language queries contained sub-keywords. The two-level document ranking methods provides significant improvement in retrieval effectiveness over traditional ranking methods.


Statistics
Show / Hide Statistics

Statistics (Cumulative Counts from September 1st, 2017)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article
[IEEE Style]
K. H. Kyu and P. S. Young, "A Model of Natural Language Information Retrieval Using Main Keywords and Sub - Keywords," The Transactions of the Korea Information Processing Society (1994 ~ 2000), vol. 4, no. 12, pp. 3052-3062, 1997. DOI: 10.3745/KIPSTE.1997.4.12.3052.

[ACM Style]
Kang Hyun Kyu and Park Se Young. 1997. A Model of Natural Language Information Retrieval Using Main Keywords and Sub - Keywords. The Transactions of the Korea Information Processing Society (1994 ~ 2000), 4, 12, (1997), 3052-3062. DOI: 10.3745/KIPSTE.1997.4.12.3052.