PriTri: An Innovative Algorithm for Clustering Categorical Data in Data Warehouse
Author(s) -
S. Hari Ganesh,
Author C.Chandrasekar
Publication year - 2011
Publication title -
international journal of computer applications
Language(s) - English
Resource type - Journals
ISSN - 0975-8887
DOI - 10.5120/2448-3307
Subject(s) - computer science , categorical variable , data warehouse , cluster analysis , data mining , data science , information retrieval , artificial intelligence , machine learning
the process of data mining to extract knowledge from large data set needs great potential to extract the hidden nuggets. To cluster the numerical data there are enormous clustering technique. Data mining for categorical data(qualitative and quantitative) the most frequently used algorithms are k-means, k-mediods and fuzzy rule all these methods needs a threshold value to overcome this problem. This paper propose an algorithm to optimize the number of clusters and it also uses novel way to construct the data mart using the concept of multiprocessing Pri-tri algorithm. Keywordsmining, clustering, k-means, Multiprocessing.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom