Finding Frequent Itemsets based on Open Data Mining in Data Streams


The KIPS Transactions:PartD, Vol. 10, No. 3, pp. 447-458, Jun. 2003
DOI: 10.3745/KIPSTD.2003.10.3.447

Abstract

Conventional data mining methods assume that the data set of a knowledge discovery process is fixed and available before the process begins. This assumption holds only when the target of mining is the static knowledge embedded in a specific data set. In addition, a conventional method requires considerable computing time to produce a mining result from a large data set. For these reasons, it is almost impossible to apply such a method to real-time analysis of a data stream, where new transactions are generated continuously and an up-to-date mining result that includes the newly generated transactions is needed as quickly as possible. This paper proposes a new mining concept for this purpose: open data mining in a data stream. In open data mining, whenever a new transaction is generated, the updated mining result over all transactions, including the new one, is obtained immediately. To implement this mechanism efficiently, it is necessary to delay the insertion of newly identified information from recent transactions and to prune insignificant information from the mining result of past transactions. The characteristics of the proposed algorithm are analyzed through a series of experiments.
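
The abstract names two mechanisms, delayed insertion and pruning, without giving their details. The following is a minimal Python sketch of how such a stream monitor could look. It is not the algorithm from the paper: the class name OpenItemsetMonitor, the three thresholds, the max_size limit, and the rule that a new itemset starts with a count of 1 are all assumptions made for illustration.

```python
from itertools import combinations


class OpenItemsetMonitor:
    """Illustrative sketch only; not the algorithm from the paper.

    All names and thresholds here are hypothetical.
    """

    def __init__(self, min_support, insert_threshold, prune_threshold, max_size=2):
        self.min_support = min_support            # support needed to be reported
        self.insert_threshold = insert_threshold  # subset support needed before tracking
        self.prune_threshold = prune_threshold    # support below which tracking stops
        self.max_size = max_size                  # largest itemset size considered
        self.n = 0                                # transactions seen so far
        self.counts = {}                          # frozenset -> count since tracking began

    def _support(self, itemset):
        return self.counts.get(itemset, 0) / self.n

    def add(self, transaction):
        """Process one transaction and return the updated frequent itemsets."""
        self.n += 1
        items = sorted(set(transaction))
        for size in range(1, self.max_size + 1):
            for combo in combinations(items, size):
                itemset = frozenset(combo)
                if itemset in self.counts:
                    self.counts[itemset] += 1
                elif size == 1:
                    # Single items are always tracked.
                    self.counts[itemset] = 1
                elif all(self._support(itemset - {x}) >= self.insert_threshold
                         for x in itemset):
                    # Delayed insertion: start tracking a larger itemset only
                    # once every subset one item smaller is significant.
                    self.counts[itemset] = 1
        # Pruning: stop tracking multi-item sets that became insignificant.
        stale = [s for s in self.counts
                 if len(s) > 1 and self._support(s) < self.prune_threshold]
        for itemset in stale:
            del self.counts[itemset]
        return self.frequent_itemsets()

    def frequent_itemsets(self):
        """Map each tracked itemset with support >= min_support to its support."""
        return {s: c / self.n for s, c in self.counts.items()
                if c / self.n >= self.min_support}
```

Calling add() once per incoming transaction returns the current result immediately, which is the behavior the abstract describes for open data mining. Because a delayed itemset starts counting only when it is inserted, its count is a lower bound on its true count; how the paper handles this is not stated in the abstract.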



Cite this article
[IEEE Style]
J. H. Chang and W. S. Lee, "Finding Frequent Itemsets based on Open Data Mining in Data Streams," The KIPS Transactions:PartD, vol. 10, no. 3, pp. 447-458, 2003. DOI: 10.3745/KIPSTD.2003.10.3.447.

[ACM Style]
Joong Hyuk Chang and Won Suk Lee. 2003. Finding Frequent Itemsets based on Open Data Mining in Data Streams. The KIPS Transactions:PartD, 10, 3, (2003), 447-458. DOI: 10.3745/KIPSTD.2003.10.3.447.