AUTHOR=Baird Alice , Triantafyllopoulos Andreas , Zänkert Sandra , Ottl Sandra , Christ Lukas , Stappen Lukas , Konzok Julian , Sturmbauer Sarah , Meßner Eva-Maria , Kudielka Brigitte M. , Rohleder Nicolas , Baumeister Harald , Schuller Björn W. TITLE=An Evaluation of Speech-Based Recognition of Emotional and Physiological Markers of Stress JOURNAL=Frontiers in Computer Science VOLUME=3 YEAR=2021 URL=https://www.frontiersin.org/journals/computer-science/articles/10.3389/fcomp.2021.750284 DOI=10.3389/fcomp.2021.750284 ISSN=2624-9898 ABSTRACT=
Life in modern societies is fast-paced and full of stress-inducing demands. The development of stress monitoring methods is a growing area of research due to the personal and economic advantages that timely detection provides. Studies have shown that speech-based features can be utilised to robustly predict several physiological markers of stress, including emotional state, continuous heart rate, and the stress hormone, cortisol. In this contribution, we extend previous works by the authors, utilising three German language corpora including more than 100 subjects undergoing a Trier Social Stress Test protocol. We present cross-corpus and transfer learning results which explore the efficacy of the speech signal to predict three physiological markers of stress—sequentially measured saliva-based cortisol, continuous heart rate as beats per minute (BPM), and continuous respiration. For this, we extract several features from audio as well as video and apply various machine learning architectures, including a temporal context-based Long Short-Term Memory Recurrent Neural Network (LSTM-RNN). For the task of predicting cortisol levels from speech, deep learning improves on results obtained by conventional support vector regression—yielding a Spearman correlation coefficient (