Abstract: We present a deep convolutional recurrent neural network for speech emotion recognition based on log-Mel filterbank energies, where the convolutional layers are responsible for discriminative feature learning. Based on the hypothesis that a better understanding of the internal structure of an utterance helps reduce misclassification, we further propose a convolutional attention mechanism to learn the utterance structure relevant to the task. In addition, we quantitatively measure the performance gain contributed by each module in our model in order to characterize the nature of emotion expressed in speech. The experimental results on the eNTERFACE'05 emotion database validate our hypothesis and demonstrate an absolute improvement of 4.62% over the state-of-the-art approach.
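To make the described architecture concrete, the sketch below shows one plausible arrangement of the components named in the abstract: convolutional layers over log-Mel filterbank energies, a recurrent layer over the resulting frame-level features, and a convolutional attention mechanism that pools frames into an utterance-level representation. This is an illustrative sketch only, not the authors' exact model; all layer sizes, kernel widths, and the class `ConvRecurrentAttention` are assumptions (the six output classes match the eNTERFACE'05 emotion categories).

```python
# Minimal sketch of a convolutional recurrent network with convolutional
# attention over log-Mel filterbank energies. Hyperparameters are illustrative.
import torch
import torch.nn as nn
import torch.nn.functional as F

class ConvRecurrentAttention(nn.Module):
    def __init__(self, n_mels=40, n_classes=6, hidden=128):
        super().__init__()
        # Convolutional layers: discriminative feature learning on the
        # time-frequency representation.
        self.conv = nn.Sequential(
            nn.Conv2d(1, 32, kernel_size=3, padding=1), nn.ReLU(),
            nn.MaxPool2d((1, 2)),
            nn.Conv2d(32, 64, kernel_size=3, padding=1), nn.ReLU(),
            nn.MaxPool2d((1, 2)),
        )
        feat_dim = 64 * (n_mels // 4)
        # Recurrent layer: models temporal structure across frames.
        self.rnn = nn.LSTM(feat_dim, hidden, batch_first=True, bidirectional=True)
        # Convolutional attention: scores each frame from its local context.
        self.attn = nn.Conv1d(2 * hidden, 1, kernel_size=5, padding=2)
        self.fc = nn.Linear(2 * hidden, n_classes)

    def forward(self, x):                        # x: (B, 1, T, n_mels)
        h = self.conv(x)                         # (B, 64, T, n_mels // 4)
        B, C, T, M = h.shape
        h = h.permute(0, 2, 1, 3).reshape(B, T, C * M)
        h, _ = self.rnn(h)                       # (B, T, 2 * hidden)
        scores = self.attn(h.transpose(1, 2))    # (B, 1, T)
        alpha = F.softmax(scores, dim=-1)        # attention weights over frames
        context = torch.bmm(alpha, h).squeeze(1) # (B, 2 * hidden) utterance vector
        return self.fc(context)                  # class logits

# Example: a batch of 4 utterances, 300 frames, 40 log-Mel bands.
if __name__ == "__main__":
    model = ConvRecurrentAttention()
    logits = model(torch.randn(4, 1, 300, 40))
    print(logits.shape)  # torch.Size([4, 6])
```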