z-logo
open-access-imgOpen Access
Extracting and Transforming Heterogeneous Data from XML files for Big Data
Author(s) -
Tanuja Das,
Ramesh Saha,
Goutam Kumar Saha
Publication year - 2019
Publication title -
international journal of engineering and advanced technology
Language(s) - English
Resource type - Journals
ISSN - 2249-8958
DOI - 10.35940/ijeat.b3438.129219
Subject(s) - computer science , big data , process (computing) , data transformation , xml , analytics , transformation (genetics) , data science , data mining , data warehouse , database , world wide web , biochemistry , chemistry , gene , operating system
Digital technology is fast changing in the recent years and with this change, the number of data systems, sources, and formats has also increased exponentially. So the process of extracting data from these multiple source systems and transforming it to suit for various analytics processes is gaining importance at an alarming rate. In order to handle Big Data, the process of transformation is quite challenging, as data generation is a continuous process. In this paper, we extract data from various heterogeneous sources from the web and try to transform it into a form which is vastly used in data warehousing so that it caters to the analytical needs of the machine learning community.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here