z-logo
open-access-imgOpen Access
Cluster-Based News Representative Generation with Automatic Incremental Clustering
Author(s) -
Irsal Shabirin,
Ali Ridho Barakbah,
Iwan Syarif
Publication year - 2019
Publication title -
emitter international journal of engineering technology
Language(s) - English
Resource type - Journals
eISSN - 2443-1168
pISSN - 2355-391X
DOI - 10.24003/emitter.v7i2.378
Subject(s) - metadata , cluster analysis , computer science , centroid , information retrieval , document clustering , word (group theory) , column (typography) , cluster (spacecraft) , data mining , world wide web , artificial intelligence , mathematics , telecommunications , geometry , frame (networking) , programming language
Nowadays, a large volume of news circulates around the Internet in one day, amounting to more than two thousand news. However, some of these news have the same topic and content, trapping readers among different sources of news that say similar things. This research proposes a new approach to provide a representative news automatically through the Automatic Incremental Clustering method. This method began with the Data Acquisition process, Keyword Extraction, and Metadata Aggregation to produce a news metadata matrix. The news metadata matrix consisted of types of word in the column and news section of each line. Furthermore, the news on the matrix were grouped by the Automatic Incremental Clustering method based on the number of word similarities that arised, calculated using the Euclidean Distance approach, and was done automatically and real-time. Each cluster (topic) determined one representing news as a Representative News based on the location of the news closest to the midpoint/centroid on the cluster. This study used 101 news as experimental data and produced 87 news clusters with 85.14% precision ratio.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here