Quantifying Expectation Modulation in Human Speech Processing

Martijn Bentum, L. ten Bosch, A. van den Bosch, Mirjam Ernestus

Onderzoeksoutput: Hoofdstuk in boek/boekdeelBijdrage aan conferentie proceedingsWetenschappelijkpeer review

1 Citaat (Scopus)


The mismatch between top-down predicted and bottom-up perceptual input is an important mechanism of perception according to the predictive coding framework (Friston, [1]). In this paper we develop and validate a new information-theoretic measure that quantifies the mismatch between expected and observed auditory input during speech processing. We argue that such a mismatch measure is useful for the study of speech processing. To compute the mismatch measure, we use naturalistic speech materials containing approximately 50,000 word tokens. For each word token we first estimate the prior word probability distribution with the aid of statistical language modelling, and next use automatic speech recognition to update this word probability distribution based on the unfolding speech signal. We validate the mismatch measure with multiple analyses, and show that the auditory-based update improves the probability of the correct word and lowers the uncertainty of the word probability distribution. Based on these results, we argue that it is possible to explicitly estimate the mismatch between predicted and perceived speech input with the cross entropy between word expectations computed before and after an auditory update.
Originele taal-2Engels
TitelProc. Interspeech 2019
StatusGepubliceerd - 18 sep. 2019


Duik in de onderzoeksthema's van 'Quantifying Expectation Modulation in Human Speech Processing'. Samen vormen ze een unieke vingerafdruk.

Citeer dit