Ensemble learning with attention-integrated convolutional recurrent neural network for imbalanced speech emotion recognition

Xusheng Ai, Victor S. Sheng, Wei Fang, Charles X. Ling, Chunhua Li

Research output: Contribution to journalArticlepeer-review

8 Scopus citations

Abstract

This article addresses observation duplication and lack of whole picture problems for ensemble learning with the attention model integrated convolutional recurrent neural network (ACRNN) in imbalanced speech emotion recognition. Firstly, we introduce Bagging with ACRNN and the observation duplication problem. Then Redagging is devised and proved to address the observation duplication problem by generating bootstrap samples from permutations of observations. Moreover, Augagging is proposed to get oversampling learner to participate in majority voting for addressing the lack of whole picture problem. Finally, Extensive experiments on IEMOCAP and Emo-DB samples demonstrate the superiority of our proposed methods (i.e., Redagging and Augagging).

Original languageEnglish
Pages (from-to)199909-199919
Number of pages11
JournalIEEE Access
Volume8
DOIs
StatePublished - 2020

Keywords

  • Convolutional neural network
  • Ensemble learning
  • Imbalance learning
  • Recurrent neural network
  • Speech emotion recognition

Fingerprint

Dive into the research topics of 'Ensemble learning with attention-integrated convolutional recurrent neural network for imbalanced speech emotion recognition'. Together they form a unique fingerprint.

Cite this