k-means Improvement by Dynamic Pre-aggregates | Zendy

Nabil El Malki | Zendy; Franck Ravat | Zendy; Olivier Teste | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

k-means Improvement by Dynamic Pre-aggregates

Author(s) -

Nabil El Malki,

Franck Ravat,

Olivier Teste

Publication year - 2019

Language(s) - English

Resource type - Conference proceedings

DOI - 10.5220/0007675201330140

Subject(s) - cluster analysis , computer science , process (computing) , algorithm , volume (thermodynamics) , iterative method , speedup , k means clustering , data mining , parallel computing , artificial intelligence , physics , quantum mechanics , operating system

The k-means algorithm is one well-known of clustering algorithms. k-means requires iterative and repetitive accesses to data up to performing the same calculations several times on the same data. However, intermediate results that are difficult to predict at the beginning of the k-means process are not recorded to avoid recalculating some data in subsequent iterations. These repeated calculations can be costly, especially when it comes to clustering massive data. In this article, we propose to extend the k-means algorithm by introducing pre-aggregates. These aggregates can then be reused to avoid redundant calculations during successive iterations. We show the interest of the approach by several experiments. These last ones show that the more the volume of data is important, the more the pre-aggregations speed up the algorithm.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research