Open Access
Modeling Nonlinear Transfer Functions from Speech Envelopes to Encephalography with Neural Networks
Author(s) -
Tobias de Taillez,
Florian Denk,
Bojana Mirkovic,
Birger Kollmeier,
Bernd T. Meyer
Publication year - 2019
Publication title -
international journal of psychological studies
Language(s) - English
Resource type - Journals
eISSN - 1918-722X
pISSN - 1918-7211
DOI - 10.5539/ijps.v11n4p1
Subject(s) - electroencephalography , artificial neural network , stimulus (psychology) , speech recognition , nonlinear system , neurophysiology , transfer function , computer science , linear model , contrast (vision) , pattern recognition (psychology) , artificial intelligence , psychology , neuroscience , machine learning , physics , cognitive psychology , quantum mechanics , electrical engineering , engineering
Diferent linear models have been proposed to establish a link between an auditory stimulus and the neurophysiological response obtained through electroencephalography (EEG). We investigate if non-linear mappings can be modeled with deep neural networks trained on continuous speech envelopes and EEG data obtained in an auditory attention two-speaker scenario. An artificial neural network was trained to predict the EEG response related to the attended and unattended speech envelopes. After training, the properties of the DNN-based model are analyzed by measuring the transfer function between input envelopes and predicted EEG signals by using click-like stimuli and frequency sweeps as input patterns. Using sweep responses allows to separate the linear and nonlinear response components also with respect to attention. The responses from the model trained on normal speech resemble event-related potentials despite the fact that the DNN was not trained to reproduce such patterns. These responses are modulated by attention, since we obtain significantly lower amplitudes at latencies of 110 ms, 170 ms and 300 ms after stimulus presentation for unattended processing in contrast to the attended. The comparison of linear and nonlinear components indicates that the largest contribution arises from linear processing (75%), while the remaining 25% are attributed to nonlinear processes in the model. Further, a spectral analysis showed a stronger 5 Hz component in modeled EEG for attended in contrast to unattended predictions. The results indicate that the artificial neural network produces responses consistent with recent findings and presents a new approach for quantifying the model properties.