Bogem, Nagalaxmi, Lakshmi, V. and Jangirala, Srinivas
(2025)
Speech emotion recognition using machine learning.
AIP Conference Proceedings, 3283.
040038.
ISSN 0094-243X
Abstract
Speech, our natural communication tool, conveys not just words, but emotions too. Speech Emotion Recognition (SER) aims to decode these hidden feelings, bridging the gap between humans and machines. While annotating emotions and capturing their subjectivity pose challenges, SER unlocks immense potential. Our work proposes an ANN-based system to extract emotions from speech, focusing on crucial acoustic features. Leveraging existing datasets and models, we trained our ANN to recognize seven emotions: anger, disgust, fear, happiness, neutral, sadness, and surprise. The proposed model achieved remarkable accuracy, reaching 100% during training and 99% during validation. This work contributes to advancing SER technology, paving the way for more empathetic human-computer interactions.
Item Type: | Article |
---|---|
Keywords: | Speech communication| Human-computer interaction | Acoustics | Machine learning |
Subjects: | Physical, Life and Health Sciences > Computer Science Physical, Life and Health Sciences > Engineering and Technology Social Sciences and humanities > Psychology > Cognitive Psychology |
JGU School/Centre: | Jindal Global Business School |
Depositing User: | Mr. Gautam Kumar |
Date Deposited: | 14 May 2025 10:25 |
Last Modified: | 14 May 2025 10:25 |
Official URL: | https://doi.org/10.1063/5.0266070 |
URI: | https://pure.jgu.edu.in/id/eprint/9503 |
Downloads
Downloads per month over past year