z-logo
Premium
Symbolic Data Analysis: another look at the interaction of Data Mining and Statistics
Author(s) -
Brito Paula
Publication year - 2014
Publication title -
wiley interdisciplinary reviews: data mining and knowledge discovery
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.506
H-Index - 47
eISSN - 1942-4795
pISSN - 1942-4787
DOI - 10.1002/widm.1133
Subject(s) - representation (politics) , variable (mathematics) , symbolic data analysis , data mining , cluster analysis , computer science , external data representation , data analysis , domain (mathematical analysis) , statistics , mathematics , theoretical computer science , artificial intelligence , mathematical analysis , politics , political science , law
Symbolic Data Analysis ( SDA ) provides a framework for the representation and analysis of data that comprehends inherent variability. While in Data Mining and classical Statistics the data to be analyzed usually presents one single value for each variable, that is no longer the case when the entities under analysis are not single elements, but groups gathered on the basis of some given criteria. Then, for each variable, variability inherent to each group should be taken into account. Also, when analysing concepts, such as botanic species, disease descriptions, car models, and so on, data entail intrinsic variability, which should be explicitly considered. To this purpose, new variable types have been introduced, whose realizations are not single real values or categories, but sets, intervals, or, more generally, distributions over a given domain. SDA provides methods for the (multivariate) analysis of such data, where the variability expressed in the data representation is taken into account, using various approaches. This article is categorized under: Fundamental Concepts of Data and Knowledge > Data Concepts Fundamental Concepts of Data and Knowledge > Knowledge Representation Technologies > Structure Discovery and Clustering

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here