z-logo
open-access-imgOpen Access
Exploiting Homogeneity of Density in Incremental Hierarchical Clustering
Author(s) -
Dwi H. Widyantoro
Publication year - 2006
Publication title -
itb journal of engineering science
Language(s) - English
Resource type - Journals
ISSN - 1978-3051
DOI - 10.5614/itbj.eng.sci.2006.38.2.1
Subject(s) - homogeneity (statistics) , cluster analysis , hierarchical clustering , computer science , data mining , artificial intelligence , machine learning
Hierarchical clustering is an important tool in many applications. As it involves a large data set that proliferates over time, reclustering the data set periodically is not an efficient process. Therefore, the ability to incorporate a new data set incrementally into an existing hierarchy becomes increasingly demanding. This article describes HOMOGEN, a system that employs a new algorithm for generating a hierarchy of concepts and clusters incrementally from a stream of observations. The system aims to construct a hierarchy that satisfies the homogeneity and the monotonicity properties. Working in a bottom-up fashion, a new observation is placed in the hierarchy and a sequence of hierarchy restructuring processes is performed only in regions that have been affected by the presence of the new observation. Additionally, it combines multiple restructuring techniques that address different restructuring objectives to get a synergistic effect. The system has been tested on a variety of domains including structured and unstructured data sets. The experimental results reveal that the system is able to construct a concept hierarchy that is consistent regardless of the input data order and whose quality is comparable to the quality of those produced by non incremental clustering algorithms.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom