EXTRACTING AND RECOGNISING MUSIC FEATURES THROUGH MULTI-MODAL EMOTION RECOGNITION, 140-146.

Chi Xu

doi:10.2316/J.2024.201-0380

EXTRACTING AND RECOGNISING MUSIC FEATURES THROUGH MULTI-MODAL EMOTION RECOGNITION, 140-146.

Chi Xu

Keywords

Multi-modal emotion recognition, music features, Hevner model, convolutional neural network

Abstract

Music is one of the ways to express emotions. Recognising and extracting the emotional features of music and recommending them to diﬀerent audiences according to diﬀerent emotions is currently a hot research topic. In this paper, the Hevner model was used to describe diﬀerent music emotions, and a multi-modal emotion recognition approach was adopted to extract features of music in two modalities: audio and lyrics text. The convolutional neural network (CNN) model classiﬁed the emotional features of the data set. The experimental results showed that compared with single- modal recognition, the precision, and recall rate of the multi-modal emotion recognition proposed in this paper were both increased by more than 30%, with an increase of about 0.3 in value. At the same time, the ten-fold cross-validation accuracy of music emotion recognition under the CNN method was 95.36%, and the recognition time was 17.66 s, which was better than the support vector machine (SVM) and the Bayesian models. Better emotional classiﬁcation of music is an important foundation for accurately recommending music to diﬀerent audiences. The experimental results prove that in the future, the multi-modal emotion recognition approach can be used to extract music features and classify music using the CNN model, and this method has high accuracy.

Important Links:

DOI: 10.2316/J.2024.201-0380
From Journal (201) Mechatronic Systems and Control - 2024

Go Back