Multimodal emotional state recognition using sequence-dependent deep hierarchical features

Pablo Barros , Doreen Jirak , Cornelius Weber , Stefan Wermter

Neural Networks Volume 72, pages 140--151, doi: 10.1016/j.neunet.2015.09.009 - Dec 2015 Open Access

Associated documents :

Emotional state recognition has become an important topic for humanârobot interaction in the past years. By determining emotion expressions, robots can identify important variables of human behavior and use these to communicate in a more human-like fashion and thereby extend the interaction possibilities. Human emotions are multimodal and spontaneous, which makes them hard to be recognized by robots. Each modality has its own restrictions and constraints which, together with the non-structured behavior of spontaneous expressions, create several difficulties for the approaches present in the literature, which are based on several explicit feature extraction techniques and manual modality fusion. Our model uses a hierarchical feature representation to deal with spontaneous emotions, and learns how to integrate multiple modalities for non-verbal emotion recognition, making it suitable to be used in an HRI scenario. Our experiments show that a significant improvement of recognition accuracy is achieved when we use hierarchical features and multimodal information, and our model improves the accuracy of state-of-the-art approaches from 82.5% reported in the literature to 91.3% for a benchmark dataset on spontaneous emotion expressions.

@Article{BJWW15, 
 	 author =  {Barros, Pablo and Jirak, Doreen and Weber, Cornelius and Wermter, Stefan},  
 	 title = {Multimodal emotional state recognition using sequence-dependent deep hierarchical features}, 
 	 journal = {Neural Networks},
 	 number = {},
 	 volume = {72},
 	 pages = {140--151},
 	 year = {2015},
 	 month = {Dec},
 	 publisher = {Elsevier},
 	 doi = {10.1016/j.neunet.2015.09.009}, 
 }