Monitoring Cognitive Workload Using Vocal Tract and Voice Source Features

Eydis Huld Magnusdottir; Michal Borsky; Manuela Meier; Kamilla Johannsdottir; Jon Gudnason

doi:10.3311/PPee.10414

Authors

Eydis Huld Magnusdottir

Affiliation

Center for Analysis and Design of Intelligent Agents, Reykjavik University, Iceland
Michal Borsky

Affiliation

Center for Analysis and Design of Intelligent Agents, Reykjavik University, Iceland
Manuela Meier

Affiliation

Center for Analysis and Design of Intelligent Agents, Reykjavik University, Iceland; Department of Electrical and Computer Engineering, Technical University of Munich, Germany
Kamilla Johannsdottir

Affiliation

Center for Analysis and Design of Intelligent Agents, Reykjavik University, Iceland
Jon Gudnason

Affiliation

Center for Analysis and Design of Intelligent Agents, Reykjavik University, Iceland

Abstract

Monitoring cognitive workload from speech signals has received much attention from researchers in the past few years as it has the potential to improve performance and fidelity in human decision making. The bulk of the research has focused on classifying speech from talkers participating in cognitive workload experiments using simple reading tasks, memory span tests and the Stroop test, typically into three levels of low, medium and high cognitive workload. This study focuses on using parameters extracted from the vocal tract and the voice source components of the speech signal for cognitive workload monitoring. The experiment used in this study contains 98 participants, the levels were obtained by using a reading task and three Stroop tasks which were randomly ordered for each participant and an adequate rest time was used inbetween tasks to mitigate the effect of cognitive workload from one task affecting the subsequent one. Vocal tract features were obtained from the first three formants and voice source features were extracted using signal analysis on the inverse filtered speech signal. The results show that on their own, the vocal tract features outperform the voice source features. The MCR of 33.92% ± 1.05 was achieved with a SVM classifier. A weighted combination of vocal tract and voice source features classified with SWM classifier fused at the output level achieved the lowest MCR of 32.5%.

Keywords:

speech science, voice source signal, vocal tract features, computational paralinguistic

Citation data from Crossref and Scopus

Monitoring Cognitive Workload Using Vocal Tract and Voice Source Features

Authors

Abstract

Keywords:

Citation data from Crossref and Scopus

Published Online

How to Cite

Issue

Section

Make a Submission