Clustering of Web Objects with Similar Popularity Trends


The KIPS Transactions:PartD, Vol. 15, No. 4, pp. 485-494, Aug. 2008
10.3745/KIPSTD.2008.15.4.485,   PDF Download:

Abstract

Huge amounts of various web items such as keywords, images, and web pages are being made widely available on the Web. The popularities of such web items continuously change over time, and mining temporal patterns in popularities of web items is an important problem that is useful for several web applications. For example, the temporal patterns in popularities of search keywords help web search enterprises predict future popular keywords, enabling them to make price decisions when marketing search keywords to advertisers. However, presence of millions of web items makes it difficult to scale up previous techniques for this problem. This paper proposes an efficient method for mining temporal patterns in popularities of web items. We treat the popularities of web items as time-series, and propose gapmeasure to quantify the similarity between the popularities of two web items. To reduce the computation overhead for this measure, an efficient method using the Fast Fourier Transform (FFT) is presented. We assume that the popularities of web items are not necessarily following any probabilistic distribution or periodic. For finding clusters of web items with similar popularity trends, we propose to use a density-based clustering algorithm based on the gap measure. Our experiments using the popularity trends of search keywords obtained from the Google Trends web site illustrate the scalability and usefulness of the proposed approach in real-world applications.


Statistics
Show / Hide Statistics

Statistics (Cumulative Counts from September 1st, 2017)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article
[IEEE Style]
W. K. Loh, "Clustering of Web Objects with Similar Popularity Trends," The KIPS Transactions:PartD, vol. 15, no. 4, pp. 485-494, 2008. DOI: 10.3745/KIPSTD.2008.15.4.485.

[ACM Style]
Wong Kee Loh. 2008. Clustering of Web Objects with Similar Popularity Trends. The KIPS Transactions:PartD, 15, 4, (2008), 485-494. DOI: 10.3745/KIPSTD.2008.15.4.485.