Toward an intensional approach to transformation classification
Author(s) - Renear Allen H., Wang Xinrui
Publication year - 2018
Publication title - Proceedings of the Association for Information Science and Technology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.193
H-Index - 14
ISSN - 2373-9231
DOI - 10.1002/pra2.2018.14505501045
Subject(s) - computer science , sort , metadata , workflow , variety (cybernetics) , process (computing) , information retrieval , data science , transformation (genetics) , data type , artificial intelligence , world wide web , database , biochemistry , chemistry , gene , operating system , programming language
Generating one dataset from another is a fundamental activity in data science: data curators convert datasets to different file formats, create data subsets, generate metadata, integrate data from multiple sources, and so on; data analysts generate summaries and classifications, create visualizations, and derive data about one sort of thing from data about another sort of thing. Sometimes these transformations occur in independent single episodes, sometimes as part of an extended structured process or scientific workflow. Although such transformations have been studied from a variety of perspectives, there has been little effort to develop a general classification based on intrinsic (rather than functional) characteristics, apart from computational complexity. With this paper, we hope to motivate a classification of transformations based on the relationships between the intensional features of the input and output datasets, that is, their propositional and conceptual content. Intensional entities are the fundamental components of scientific reasoning and explanation and consequently deserve a uniquely central role in the analysis of information work. We believe such a classification would be a valuable contribution to the data curation curriculum. This paper is an introduction to that project.
