Object structure

Creator:

Rybka, Jan ; Janicki, Artur

Contributor:

Korbicz, Józef - red. ; Uciński, Dariusz - red.

Title:

Comparison of speaker dependent and speaker independent emotion recognition

Group publication title:

AMCS, Volume 23 (2013)

Subject and Keywords:

peech processing ; emotion recognition ; EMO-DB ; support vector machines ; artificial neural networks

Abstract:

This paper describes a study of emotion recognition based on speech analysis. The introduction to the theory contains a review of emotion inventories used in various studies of emotion recognition as well as the speech corpora applied, methods of speech parametrization, and the most commonly employed classification algorithms. ; In the current study the EMO-DB speech corpus and three selected classifiers, the k-Nearest Neighbor (k-NN), the Artificial Neural Network (ANN) and Support Vector Machines (SVMs), were used in experiments. SVMs turned out to provide the best classification accuracy of 75.44% in the speaker dependent mode, that is, when speech samples from the same speaker were included in the training corpus. ; Various speaker dependent and speaker independent configurations were analyzed and compared. Emotion recognition in speaker dependent conditions usually yielded higher accuracy results than a similar but speaker independent configuration. The improvement was especially well observed if the base recognition ratio of a given speaker was low. Happiness and anger, as well as boredom and neutrality, proved to be the pairs of emotions most often confused.

Publisher:

Zielona Góra: Uniwersytet Zielonogórski

Date:

2013

Resource Type:

artykuł

DOI:

10.2478/amcs-2013-0060

Pages:

797-808

Source:

AMCS, volume 23, number 4 (2013) ; click here to follow the link

Language:

eng

Rights:

Biblioteka Uniwersytetu Zielonogórskiego