- ris
- BibTeX

Comparison of speaker dependent and speaker independent emotion recognition

Rybka, Jan; Janicki, Artur

Struktura obiektu

Rybka, Jan ; Janicki, Artur

Korbicz, Józef - red. ; Uciński, Dariusz - red.

Comparison of speaker dependent and speaker independent emotion recognition

AMCS, Volume 23 (2013)

peech processing ; emotion recognition ; EMO-DB ; support vector machines ; artificial neural networks

This paper describes a study of emotion recognition based on speech analysis. The introduction to the theory contains a review of emotion inventories used in various studies of emotion recognition as well as the speech corpora applied, methods of speech parametrization, and the most commonly employed classification algorithms. ; In the current study the EMO-DB speech corpus and three selected classifiers, the k-Nearest Neighbor (k-NN), the Artificial Neural Network (ANN) and Support Vector Machines (SVMs), were used in experiments. SVMs turned out to provide the best classification accuracy of 75.44% in the speaker dependent mode, that is, when speech samples from the same speaker were included in the training corpus. ; Various speaker dependent and speaker independent configurations were analyzed and compared. Emotion recognition in speaker dependent conditions usually yielded higher accuracy results than a similar but speaker independent configuration. The improvement was especially well observed if the base recognition ratio of a given speaker was low. Happiness and anger, as well as boredom and neutrality, proved to be the pairs of emotions most often confused.

Zielona Góra: Uniwersytet Zielonogórski

10.2478/amcs-2013-0060

AMCS, volume 23, number 4 (2013) ; kliknij tutaj, żeby przejść

Biblioteka Uniwersytetu Zielonogórskiego