z-logo
open-access-imgOpen Access
Arangopipe, a tool for machine learning meta-data management
Author(s) -
Jörg Schad,
Rajiv Sambasivan,
C. Woodward
Publication year - 2021
Publication title -
data science
Language(s) - English
Resource type - Journals
eISSN - 2451-8492
pISSN - 2451-8484
DOI - 10.3233/ds-210034
Subject(s) - computer science , machine learning , workflow , metadata , artificial intelligence , pipeline (software) , interface (matter) , key (lock) , active learning (machine learning) , salient , database , world wide web , programming language , operating system , bubble , maximum bubble pressure method
Experimenting with different models, documenting results and findings, and repeating these tasks are day-to-day activities for machine learning engineers and data scientists. There is a need to keep control of the machine-learning pipeline and its metadata. This allows users to iterate quickly through experiments and retrieve key findings and observations from historical activity. This is the need that Arangopipe serves. Arangopipe is an open-source tool that provides a data model that captures the essential components of any machine learning life cycle. Arangopipe provides an application programming interface that permits machine-learning engineers to record the details of the salient steps in building their machine learning models. The components of the data model and an overview of the application programming interface is provided. Illustrative examples of basic and advanced machine learning workflows are provided. Arangopipe is not only useful for users involved in developing machine learning models but also useful for users deploying and maintaining them.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here