z-logo
open-access-imgOpen Access
Big Data Workflows: A Reference Architecture and the DATAVIEW System
Author(s) -
Andrey Kashlev,
Shiyong Lu,
Aravind Mohan
Publication year - 2017
Publication title -
services transactions on big data
Language(s) - English
Resource type - Journals
eISSN - 2326-442X
pISSN - 2326-4411
DOI - 10.29268/stbd.2017.4.1.1
Subject(s) - workflow , computer science , architecture , reference architecture , big data , software engineering , database , data science , computer architecture , operating system , geography , software architecture , archaeology
The big data era is here, a natural result of the digital revolution of the last few decades. The emergence of big data in virtually all areas of life raises a fundamental question how can we turn large volumes of bits and bytes into insights and possibly values? The answer to this question is often hindered by three big data challenges: volume, velocity, and variety. While scientific workflows have been used extensively in structuring complex scientific data analysis processes, they fall short in meeting the three big data challenges on the one hand, and in leveraging the dynamic resource provisioning capability of cloud computing on the other hand. To address such limitations, we propose and develop the concept of big data workflow as the next generation of data-centric workflow technologies. In this paper we: 1) identify the key challenges for running big data workflows in the cloud; 2) propose a reference architecture for big data workflow management systems (BDWFMSs) that addresses these challenges, 3) develop DATAVIEW, a big data workflow management system, to validate our proposed reference architecture, 4) design and run two big data workflows in the automotive and astronomy domains to showcase applications of our DATAVIEW system.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom