Reduced Feature Set for Emotion Based Spoken Utterances of Normal and Special Children Using Multivariate Analysis and Decision Trees
This paper applies multivariate data analysis and decision tree methods to reduce the feature set for the speech of normal and special children in four emotions: anger, happiness, neutral, and sadness. Ten features were extracted by an algorithm implemented in a previous study to classify the speech emotions of normal and special children. In the current study, the best features are selected using multivariate analysis, namely principal component analysis (PCA) and factor analysis, and a decision tree. PCA is applied step by step to reduce the feature set according to which variables are collinear. The resulting reduced feature sets are applicable to samples from both normal and special children. Experimental results show that PCA yields a feature set comprising pitch, intensity, formant, LPCC, and rate of acceleration. Factor analysis provides three feature sets, of which the set comprising RASTA-PLP, MFCC, ZCR, and intensity gives the best result. The decision tree yields a feature set comprising energy, pitch, and LPCC.
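The idea of using PCA to spot collinear variables can be illustrated with a minimal sketch. It is not the authors' procedure: the synthetic data, the function name `pca_collinear_screen`, the 95% variance cutoff, and the cosine threshold are all assumptions made for illustration. The sketch computes PCA from the correlation matrix, keeps the leading components, and treats two features as collinear when their loading vectors point in nearly the same (or opposite) direction.

```python
import numpy as np


def pca_collinear_screen(X, names, var_kept=0.95, cos_thresh=0.95):
    """Flag features whose PCA loading vectors are nearly parallel.

    Hypothetical helper for illustration only; thresholds are assumptions.
    """
    # PCA on standardized data is an eigendecomposition of the correlation matrix.
    R = np.corrcoef(X, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(R)
    order = np.argsort(eigvals)[::-1]            # sort components by variance
    eigvals = np.clip(eigvals[order], 0.0, None)
    eigvecs = eigvecs[:, order]

    # Keep the smallest number of components reaching the variance cutoff.
    cum = np.cumsum(eigvals) / eigvals.sum()
    k = min(int(np.searchsorted(cum, var_kept)) + 1, len(names))

    # Loadings: one row per feature, one column per retained component.
    loadings = eigvecs[:, :k] * np.sqrt(eigvals[:k])
    unit = loadings / np.linalg.norm(loadings, axis=1, keepdims=True)
    cos = unit @ unit.T                          # cosine between loading vectors

    # Greedy pass: keep the first feature of each collinear group.
    dropped = []
    for i in range(len(names)):
        if names[i] in dropped:
            continue
        for j in range(i + 1, len(names)):
            if names[j] not in dropped and abs(cos[i, j]) > cos_thresh:
                dropped.append(names[j])
    kept = [n for n in names if n not in dropped]
    return kept, dropped, k


# Synthetic stand-in data (not from the paper): "energy" is built to be
# almost a linear function of "intensity", so the two are collinear.
rng = np.random.default_rng(0)
n = 200
pitch = rng.normal(200.0, 30.0, n)
intensity = rng.normal(65.0, 5.0, n)
energy = 2.0 * intensity + rng.normal(0.0, 0.1, n)
zcr = rng.normal(0.1, 0.02, n)

names = ["pitch", "intensity", "energy", "zcr"]
X = np.column_stack([pitch, intensity, energy, zcr])

kept, dropped, k = pca_collinear_screen(X, names)
print("components retained:", k)
print("kept:", kept)
print("dropped as collinear:", dropped)
```

Which member of a collinear pair is kept here depends only on column order; in practice that choice would rest on domain knowledge or on downstream classification accuracy.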