Reduced Feature Set for Emotion Based Spoken Utterances of Normal and Special Children Using Multivariate Analysis and Decision Trees

M. A. Siddiqui, S. A. Ali, N. G. Haider

Abstract


The current paper deals with the use of multivariate data analysis and decision tree methods in order to reduce the feature set for the normal and special children speech in four different emotions: anger, happiness, neutral and sadness. Ten features were extracted, by an algorithm implemented in a previous study to classify the speech emotions of normal and special children. In the current study, the best features are selected using multivariate analysis: principal component analysis (PCA), factor analysis and decision tree. Step by step PCA is applied to reduce the feature set according to the variables that are collinear. The obtained reduced feature sets are applicable to both normal and special children samples. Experimental results revealed that PCA yields the feature set comprising pitch, intensity, formant, LPCC and rate of acceleration. Factor analysis provides three feature sets out of which the feature set comprising of Rasta PLP, MFCC, ZCR, and intensity provides the best result. Decision tree yields a feature set comprising energy, pitch and LPCC.


Keywords


speech emotions; PCA; factor analysis; decision tree; features

Full Text:

PDF

References


S. Ramakrishnan, “Recognition of Emotion from Speech: A Review”, in: Speech Enhancement, Modeling and Recognition- Algorithms and Applications, pp. 121-138, InTech, 2012

S. Pahune, N. Mishra, “Emotion Recognition through Combination of Speech and Image Processing: A Review”, International Journal on Recent and Innovation Trends in Computing and Communication, Vol. 3, No. 2, pp. 134-137, 2015

B. Schuller, A. Batliner, D. Seppi, S.Steidl, T. Vogt, J. Wagner, L. Devillers, L. Vidrascu, N. Amir, L. Kessous, V. Aharonson, “The relevance of feature type for the automatic classification of emotional user states: low level descriptors and functional”, in: INTERSPEECH 2007, Antwerp, Belgium, pp. 2253-2256, August 27-31, 2007

B. Schuller, G. Rigoll, “Recognizing interest in conversational speech–comparing bag of frames and supra-segmental features”, INTERSPEECH, Brighton, UK, pp. 1999-2002, September 6-10, 2009

Y. Zhou, Y. Sun, L. Yang, Y. Yan, “Applying articulatory features to speech emotion recognition”, IEEE 9th International Conference on Research Challenges in Computer Science, Shanghai, China, December 28-29, 2009

S. Alghowinem, R. Goecke, M. Wagner, J. Epps, G. Parker, M. Breakspear, “Characterizing Depressed Speech for Classification”, INTERSPEECH, Florence, Italy, pp. 2534-2538, August 25-29, 2013

K. M.Chung, D. Jung, “Validity and reliability of the Korean version of autism spectrum disorders comorbid for children (ASD-CC)”, Research in Autism Spectrum Disorders, Vol. 39, pp.1-10, 2017

M. A. Siddiqui, N. G. Haider, S. A. Ali, S. Hina, “A: Novel Approach for Features Extraction towards Classifying Normal and Special Children Speech Emotions in Urdu Language”, International Journal of Computer Science and Network Security, Vol. 17, No. 7, pp. 188-195, 2017

L. E. Aik, L. C Kiang, Z. B. Mohamed, T. W Hong, “A review on the multivariate statistical methods for dimensional reduction studies”, in: AIP Conference Proceedings, Perlis, Malaysia, Vol. 1847, No. 1, AIP Publishing, 2017

K. Morris, P. D. McNicholas, “Clustering, classification, discriminant analysis, and dimension reduction via generalized hyperbolic mixtures”, Computational Statistics and Data Analysis, Vol. 97, pp. 133-150, 2016

Y. W. Lin, B. C Deng, Q. S Xu,Y. H. Yun, Y. Z. Liang, “The equivalence of partial least squares and principal component regression in the sufficient dimension reduction framework”, Chemometrics and Intelligent Laboratory Systems, Vol. 150, pp. 58-64, 2016

K. Mallick, S. Bhattacharyya, “Uncorrelated Local Maximum Margin Criterion: An Efficient Dimensionality reduction Method for Text Classification”, Procedia Technology, Vol. 4, pp. 370-374, 2012

Y. Jingjie, X. Wang, W. Gu, L. Ma, “Speech Emotion Recognition Based on Sparse Representation




eISSN: 1792-8036     pISSN: 2241-4487