Applying Naive Bayes Classifier to Document Clustering | Zendy

Jie Ji | Zendy; Qiangfu Zhao | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Applying Naive Bayes Classifier to Document Clustering

Author(s) -

Jie Ji,

Qiangfu Zhao

Publication year - 2010

Publication title -

journal of advanced computational intelligence and intelligent informatics

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.172

H-Index - 20

eISSN - 1343-0130

pISSN - 1883-8014

DOI - 10.20965/jaciii.2010.p0624

Subject(s) - naive bayes classifier , computer science , cluster analysis , initialization , probabilistic classification , artificial intelligence , classifier (uml) , machine learning , bayes classifier , bayes' theorem , pattern recognition (psychology) , data mining , probabilistic logic , bayesian probability , support vector machine , programming language

Document clustering partitions sets of unlabeled documents so that documents in clusters share common concepts. A Naive Bayes Classifier (BC) is a simple probabilistic classifier based on applying Bayes’ theorem with strong (naive) independence assumptions. BC requires a small amount of training data to estimate parameters required for classification. Since training data must be labeled, we propose an Iterative Bayes Clustering (IBC) algorithm. To improve IBC performance, we propose combining IBC with Comparative Advantage-based (CA) initialization method. Experimental results show that our proposal improves performance significantly over classical clustering methods.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research