Reduced Feature Set for Emotion Based Spoken Utterances of Normal and Special Children Using Multivariate Analysis and Decision Trees
Abstract
The current paper deals with the use of multivariate data analysis and decision tree methods in order to reduce the feature set for the normal and special children speech in four different emotions: anger, happiness, neutral and sadness. Ten features were extracted, by an algorithm implemented in a previous study to classify the speech emotions of normal and special children. In the current study, the best features are selected using multivariate analysis: principal component analysis (PCA), factor analysis and decision tree. Step by step PCA is applied to reduce the feature set according to the variables that are collinear. The obtained reduced feature sets are applicable to both normal and special children samples. Experimental results revealed that PCA yields the feature set comprising pitch, intensity, formant, LPCC and rate of acceleration. Factor analysis provides three feature sets out of which the feature set comprising of Rasta PLP, MFCC, ZCR, and intensity provides the best result. Decision tree yields a feature set comprising energy, pitch and LPCC.
Keywords:
speech emotions, PCA, factor analysis, decision tree, featuresDownloads
References
S. Ramakrishnan, “Recognition of Emotion from Speech: A Review”, in: Speech Enhancement, Modeling and Recognition- Algorithms and Applications, pp. 121-138, InTech, 2012 DOI: https://doi.org/10.5772/39246
S. Pahune, N. Mishra, “Emotion Recognition through Combination of Speech and Image Processing: A Review”, International Journal on Recent and Innovation Trends in Computing and Communication, Vol. 3, No. 2, pp. 134-137, 2015
B. Schuller, A. Batliner, D. Seppi, S.Steidl, T. Vogt, J. Wagner, L. Devillers, L. Vidrascu, N. Amir, L. Kessous, V. Aharonson, “The relevance of feature type for the automatic classification of emotional user states: low level descriptors and functional”, in: INTERSPEECH 2007, Antwerp, Belgium, pp. 2253-2256, August 27-31, 2007
B. Schuller, G. Rigoll, “Recognizing interest in conversational speech–comparing bag of frames and supra-segmental features”, INTERSPEECH, Brighton, UK, pp. 1999-2002, September 6-10, 2009
Y. Zhou, Y. Sun, L. Yang, Y. Yan, “Applying articulatory features to speech emotion recognition”, IEEE 9th International Conference on Research Challenges in Computer Science, Shanghai, China, December 28-29, 2009 DOI: https://doi.org/10.1109/ICRCCS.2009.26
S. Alghowinem, R. Goecke, M. Wagner, J. Epps, G. Parker, M. Breakspear, “Characterizing Depressed Speech for Classification”, INTERSPEECH, Florence, Italy, pp. 2534-2538, August 25-29, 2013
K. M.Chung, D. Jung, “Validity and reliability of the Korean version of autism spectrum disorders comorbid for children (ASD-CC)”, Research in Autism Spectrum Disorders, Vol. 39, pp.1-10, 2017 DOI: https://doi.org/10.1016/j.rasd.2017.03.006
M. A. Siddiqui, N. G. Haider, S. A. Ali, S. Hina, “A: Novel Approach for Features Extraction towards Classifying Normal and Special Children Speech Emotions in Urdu Language”, International Journal of Computer Science and Network Security, Vol. 17, No. 7, pp. 188-195, 2017
L. E. Aik, L. C Kiang, Z. B. Mohamed, T. W Hong, “A review on the multivariate statistical methods for dimensional reduction studies”, in: AIP Conference Proceedings, Perlis, Malaysia, Vol. 1847, No. 1, AIP Publishing, 2017 DOI: https://doi.org/10.1063/1.4983858
K. Morris, P. D. McNicholas, “Clustering, classification, discriminant analysis, and dimension reduction via generalized hyperbolic mixtures”, Computational Statistics and Data Analysis, Vol. 97, pp. 133-150, 2016 DOI: https://doi.org/10.1016/j.csda.2015.10.008
Y. W. Lin, B. C Deng, Q. S Xu,Y. H. Yun, Y. Z. Liang, “The equivalence of partial least squares and principal component regression in the sufficient dimension reduction framework”, Chemometrics and Intelligent Laboratory Systems, Vol. 150, pp. 58-64, 2016 DOI: https://doi.org/10.1016/j.chemolab.2015.11.003
K. Mallick, S. Bhattacharyya, “Uncorrelated Local Maximum Margin Criterion: An Efficient Dimensionality reduction Method for Text Classification”, Procedia Technology, Vol. 4, pp. 370-374, 2012 DOI: https://doi.org/10.1016/j.protcy.2012.05.057
Y. Jingjie, X. Wang, W. Gu, L. Ma, “Speech Emotion Recognition Based on Sparse Representation
Downloads
How to Cite
License
Authors who publish with this journal agree to the following terms:
- Authors retain the copyright and grant the journal the right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) after its publication in ETASR with an acknowledgement of its initial publication in this journal.