z-logo
open-access-imgOpen Access
MIDI2vec: Learning MIDI embeddings for reliable prediction of symbolic music metadata
Author(s) -
Pasquale Lisena,
Albert MeroñoPeñuela,
Raphaël Troncy
Publication year - 2022
Publication title -
semantic web
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.862
H-Index - 45
eISSN - 2210-4968
pISSN - 1570-0844
DOI - 10.3233/sw-210446
Subject(s) - midi , computer science , metadata , scalability , graph , embedding , crowdsourcing , machine learning , artificial intelligence , theoretical computer science , information retrieval , world wide web , database , operating system
An important problem in large symbolic music collections is the low availability of high-quality metadata, which is essential for various information retrieval tasks. Traditionally, systems have addressed this by relying either on costly human annotations or on rule-based systems at a limited scale. Recently, embedding strategies have been exploited for representing latent factors in graphs of connected nodes. In this work, we propose MIDI2vec, a new approach for representing MIDI files as vectors based on graph embedding techniques. Our strategy consists of representing the MIDI data as a graph, including the information about tempo, time signature, programs and notes. Next, we run and optimise node2vec for generating embeddings using random walks in the graph. We demonstrate that the resulting vectors can successfully be employed for predicting the musical genre and other metadata such as the composer, the instrument or the movement. In particular, we conduct experiments using those vectors as input to a Feed-Forward Neural Network and we report good comparable accuracy scores in the prediction with respect to other approaches relying purely on symbolic music, avoiding feature engineering and producing highly scalable and reusable models with low dimensionality. Our proposal has real-world applications in automated metadata tagging for symbolic music, for example in digital libraries for musicology, datasets for machine learning, and knowledge graph completion.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here