z-logo
open-access-imgOpen Access
Computer Audition: From Task-Specific Machine Learning to Foundation Models
Author(s) -
Andreas Triantafyllopoulos,
Iosif Tsangko,
Alexander Gebhard,
Annamaria Mesaros,
Tuomas Virtanen,
Bjorn W. Schuller
Publication year - 2025
Publication title -
proceedings of the ieee
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 2.383
H-Index - 287
eISSN - 1558-2256
pISSN - 0018-9219
DOI - 10.1109/jproc.2025.3593952
Subject(s) - general topics for engineers , engineering profession , aerospace , bioengineering , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , fields, waves and electromagnetics , geoscience , nuclear engineering , robotics and control systems , signal processing and analysis , transportation , power, energy and industry applications , communication, networking and broadcast technologies , photonics and electrooptics
Foundation models (FMs) are increasingly spearheading recent advances on a variety of tasks that fall under the purview of computer audition—i.e., the use of machines to understand sounds. They feature several advantages over traditional pipelines: among others, the ability to consolidate multiple tasks in a single model, the option to leverage knowledge from other modalities, and the readily available interaction with human users. Naturally, these promises have created substantial excitement in the audio community and have led to a wave of early attempts to build new, general-purpose FMs for audio. In the present contribution, we give an overview of computational audio analysis as it transitions from traditional pipelines toward auditory FMs. Our work highlights the key operating principles that underpin those models and showcases how they can accommodate multiple tasks that the audio community previously tackled separately.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom