Enhancing clustering blog documents by utilizing author/reader comments | Zendy

Beibei Li | Zendy; Shuting Xu | Zendy; Jun Zhang | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Enhancing clustering blog documents by utilizing author/reader comments

Author(s) -

Beibei Li,

Shuting Xu,

Jun Zhang

Publication year - 2007

Publication title -

citeseer x (the pennsylvania state university)

Language(s) - English

Resource type - Conference proceedings

DOI - 10.1145/1233341.1233359

Subject(s) - cluster analysis , computer science , upload , information retrieval , world wide web , web page , hits algorithm , the internet , document clustering , resource (disambiguation) , web search engine , web navigation , artificial intelligence , computer network

Blogs are a new form of internet phenomenon and a vast everincreasing information resource. Mining blog files for information is a very new research direction in data mining. Blog files are different from standard web files and may need specialized mining strategies. We propose to include the title, body, and comments of the blog pages in clustering datasets from blog documents. In particular, we argue that the author/reader comments of the blog pages may have more discriminating effect in clustering blog documents. We constructed a word-page matrix by downloading blog pages from a well-known website and experimented a k-means clustering algorithm with different weights assigned to the title, body, and comment parts. Our experimental results show that assigning a larger weight value to the blog comments helps the k-means algorithm produce better clustering solutions. The experimental results confirm our hypothesis that the author/reader comments of the blog files are very useful in discriminating blog files.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research