Bisecting K-Means for Clustering Web Log data | Zendy

Ruchika Patil | Zendy; Amreen Khan | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Bisecting K-Means for Clustering Web Log data

Author(s) -

Ruchika Patil,

Amreen Khan

Publication year - 2015

Publication title -

international journal of computer applications

Language(s) - English

Resource type - Journals

ISSN - 0975-8887

DOI - 10.5120/20448-2799

Subject(s) - computer science , cluster analysis , information retrieval , world wide web , data mining , database , artificial intelligence

Web usage mining is the area of web mining which deals with extraction of useful knowledge from web log information produced by web servers. One of the most important tasks of Web Usage Mining (WUM) is web user clustering which forms groups of users exhibiting similar interests or similar browsing patterns. This paper presents results of clustering techniques for Web log data using K-means and Bisecting K-means algorithm. Clusters are formed with respect to similar IP address and packet combinations. The clustering framework is further used as an approach for intrusion detection from the log files. The system is trained first by labeling the classes and then tested to check for any intrusions. Recommendation output is generated which help in classifying the whether the input IP’s are “safe” or “infected”. Comparison of both algorithms is done and performance is evaluated with respect to time and accuracy. From the experimental results, it is found that Bisecting K-means overcomes the major drawbacks of basic K-means algorithm.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research