Query Expansion Based on Word Graphs Using Pseudo Non-Relevant Documents and Term Proximity


The KIPS Transactions:PartB, Vol. 19, No. 3, pp. 189-194, Jun. 2012
DOI: 10.3745/KIPSTB.2012.19.3.189

Abstract

In this paper, we propose a query expansion method based on word graphs that uses both pseudo-relevant and pseudo non-relevant documents to improve information retrieval performance. An initially retrieved document is assigned to a core cluster if it contains the core query terms, which are extracted using query term combinations and the degree of query term proximity; otherwise, it is assigned to a non-core cluster. Documents in the core cluster are treated as pseudo-relevant documents, and documents in the non-core cluster are treated as pseudo non-relevant documents. Each cluster is represented as a graph in which each node represents a term and each edge represents the proximity between that term and a query term. The weight of a term is calculated by subtracting its weight in the non-core cluster graph from its weight in the core cluster graph, so a term with a high weight in the non-core cluster graph is unlikely to be selected as an expansion term. Expansion terms are selected according to these term weights. Experimental results on the TREC WT10g test collection show that the proposed method achieves a 9.4% improvement in mean average precision over the language model.
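
The weighting step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the edge-weight formula, so the sketch assumes that a term's weight in a cluster graph is the sum of inverse token distances to query terms within a fixed window. The function names (graph_term_weights, expansion_terms), the window size, and the toy documents are all hypothetical, and the documents are assumed to be already split into core and non-core clusters.

```python
from collections import defaultdict


def graph_term_weights(docs, query_terms, window=5):
    """Sum a proximity score for each non-query term in one cluster.

    Assumption: an edge between a term and a query term is scored as
    1 / distance (in tokens) when the two occur within `window` tokens.
    The abstract does not specify the actual proximity function.
    """
    query = set(query_terms)
    weights = defaultdict(float)
    for tokens in docs:
        query_positions = [i for i, t in enumerate(tokens) if t in query]
        for i, term in enumerate(tokens):
            if term in query:
                continue
            for qp in query_positions:
                dist = abs(i - qp)  # always >= 1 because term is not a query term
                if dist <= window:
                    weights[term] += 1.0 / dist
    return weights


def expansion_terms(core_docs, non_core_docs, query_terms, k=3, window=5):
    """Rank terms by (core-graph weight) minus (non-core-graph weight)."""
    core = graph_term_weights(core_docs, query_terms, window)
    non_core = graph_term_weights(non_core_docs, query_terms, window)
    scores = {t: w - non_core.get(t, 0.0) for t, w in core.items()}
    # Sort by descending score, breaking ties alphabetically.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    # Keep only terms whose core weight exceeds their non-core weight.
    return [t for t, s in ranked[:k] if s > 0]


# Toy example with pre-tokenized documents (hypothetical data).
core_docs = [["solar", "panel", "cost"], ["solar", "panel", "planet"]]
non_core_docs = [["solar", "planet", "orbit"], ["planet", "solar", "system"]]
print(expansion_terms(core_docs, non_core_docs, ["solar"]))
# prints ['panel', 'cost']
```

In this toy example, "planet" appears near the query term in both clusters, but its larger weight in the non-core graph makes its final score negative, so it is not chosen as an expansion term.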



Cite this article
[IEEE Style]
S. H. Jo and K. S. Lee, "Query Expansion Based on Word Graphs Using Pseudo Non-Relevant Documents and Term Proximity," The KIPS Transactions:PartB, vol. 19, no. 3, pp. 189-194, 2012. DOI: 10.3745/KIPSTB.2012.19.3.189.

[ACM Style]
Seung Hyeon Jo and Kyung Soon Lee. 2012. Query Expansion Based on Word Graphs Using Pseudo Non-Relevant Documents and Term Proximity. The KIPS Transactions:PartB, 19, 3, (2012), 189-194. DOI: 10.3745/KIPSTB.2012.19.3.189.