Gaussian Mixture Hidden Conditional Random Fields for Emotional Speech Classification

Authors: La The Vinh*


In this study, we investigate in the use of hidden conditional random fields model to classify emotional speech. We introduce a novel hidden conditional random fields model, which is able to approximate complex distributions using a mixture of full covariance Gaussian density functions. In our experiments, we extracted Mel-frequency cepstral coefficients (MFCC) features from the well-known Berlin emotional speech dataset and eNTERFACE 2005 dataset. After that, we used the 10-fold cross validation rule to train, evaluate and compare our proposed model with the conventional learning method, hidden Markov model (HMM) and the existing hidden conditional random fields model, which can only utilize diagonal covariance Gaussian distributions. The experiments show that our method achieves significant improvement (p-value < 0.05) regarding the classification accuracy


Emotion classification, Conditional Random Fields, HMM, GMM
Pages : 76-80

Related Articles:

Authors : Trieu Viet Phuong, Trinh Quang Thong, Nguyen Thi Lan Huong*
Authors : Dang Nhu Dinh, Vu Van Yem, Hoang Phuong Chi*, Dao Ngoc Chien
Authors : Dang Thai Son, Sayan Mukherjee, Thang Manh Hoang*