Open Access
A New Semimetric for Interval Data
Author(s) - Irani Hazarika
Publication year - 2019
Publication title - International Journal of Recent Technology and Engineering
Language(s) - English
Resource type - Journals
ISSN - 2277-3878
DOI - 10.35940/ijrte.c5422.098319
Subject(s) - cluster analysis, interval data, data mining, measure (data warehouse), interval (graph theory), computer science, norm (philosophy), similarity (geometry), similarity measure, field (mathematics), mathematics, artificial intelligence, image (mathematics), combinatorics, political science, pure mathematics, law
Interval data, a special case of symbolic data, is becoming increasingly common in many application areas, including data mining. Measuring the dissimilarity or similarity between two intervals is an important task in data mining. This paper analyzes ten desirable properties that a measure for interval data should satisfy to be suitable for applications such as clustering and classification. It verifies whether these properties are satisfied by three existing measures (the L1-norm, L2-norm, and L∞-norm) and proposes a new dissimilarity measure for interval data. The existing distance measures are compared with the proposed measure by applying the well-known K-Means algorithm to six interval datasets. The proposed measure gives better clustering accuracy than the existing measures on most of the datasets.
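
As a rough illustration of the three existing measures named in the abstract, the sketch below computes L1, L2, and L∞ distances between two interval-valued observations. It assumes the commonly used formulation in which each interval is a (lower, upper) pair and the norm is taken over the differences of corresponding bounds; the abstract does not state the paper's exact definitions, so this may differ from them. The function names and the Interval type are hypothetical, and the proposed semimetric is not shown because the abstract does not define it.

```python
import math
from typing import Sequence, Tuple

# Assumption: an interval is stored as (lower, upper); an observation is a
# sequence of such intervals, one per feature.
Interval = Tuple[float, float]


def l1_distance(x: Sequence[Interval], y: Sequence[Interval]) -> float:
    """Sum of absolute differences of lower and upper bounds."""
    return sum(abs(a1 - a2) + abs(b1 - b2)
               for (a1, b1), (a2, b2) in zip(x, y))


def l2_distance(x: Sequence[Interval], y: Sequence[Interval]) -> float:
    """Euclidean norm of the bound differences."""
    return math.sqrt(sum((a1 - a2) ** 2 + (b1 - b2) ** 2
                         for (a1, b1), (a2, b2) in zip(x, y)))


def linf_distance(x: Sequence[Interval], y: Sequence[Interval]) -> float:
    """Largest absolute bound difference over all features."""
    return max(max(abs(a1 - a2), abs(b1 - b2))
               for (a1, b1), (a2, b2) in zip(x, y))


# Two observations described by two interval-valued features each.
x = [(1.0, 3.0), (0.0, 2.0)]
y = [(2.0, 5.0), (0.0, 4.0)]
print(l1_distance(x, y), l2_distance(x, y), linf_distance(x, y))
```

Any of these functions could stand in for the point-to-centroid distance in a K-Means loop over interval data, which is the kind of comparison the abstract describes.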
