Tools in data science for better processing
Author(s) -
Nur Syahela Hussien,
Sarina Sulaiman,
Siti Mariyam Shamsuddin
Publication year - 2016
Publication title -
aip conference proceedings
Language(s) - English
Resource type - Conference proceedings
eISSN - 1551-7616
pISSN - 0094-243X
DOI - 10.1063/1.4954530
Subject(s) - computer science , data mining , cluster analysis , java , set (abstract data type) , visualization , data set , data visualization , modular design , analytics , association rule learning , data science , information retrieval , machine learning , artificial intelligence , programming language
Analysing the data is an important part of a research in data science. There are many tools that can be used in analysing a data set to get the experiment results for classification, clustering and others. However, the researchers are concerned about how to increase the efficiency in analysing a data set. In this paper, three open source tools which are the Waikato Environment for Knowledge Analysis (WEKA), Konstanz Information Miner (KNIME) and Salford Predictive Modular (SPM) were compared to identify the better processing tools in evaluating the presented data. All of these tools have their own different characteristics. WEKA can handle pre-processing of data and then analyses it based on different algorithms. It is suitable to be used for classification, regression, clustering, association rules, and visualisation. The algorithms can be applied directly to a data set or called from its own Java code. KNIME is more inclined towards producing graphical view, while SPM is a highly accurate and ultra-fast analytics which also data mines platforms for any sizes, complexity or organisation. The results illustrate the tools capability in analysing data sets and evaluators in an efficient and effective manner.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom