تجاوز إلى المحتوى الرئيسي

Enhancing emotion prediction using deep learning and distributed federated systems with SMOTE oversampling technique.

Author name : Hedi . . Hamdi
Publication Date : 2024-12-21
Journal Name : Alexandria Engineering Journal

Abstract

Facial Expression Recognition (FER) categorizes various human emotions by analyzing the features of the face, so it plays a vital role in recognizing emotions. Prior studies have focused on the issue of recognizing emotions through voices or speech. Addressing the existing method issues, this approach aims to detect voices and three-dimensional images using appropriate datasets and novel deep-learning techniques. In this research, the valid Audio-Visual datasets Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS), Acted Facial Expression in the Wild (AFEW), and eNTERFACE’05 datasets are chosen for analysis. RAVDESS dataset contains audio, AFEW, and eNTERFACE and has three-dimensional images of humans, i.e., 3D images. SMOTE technique is presented for solving overfitting problems to balance the dataset by oversampling and under-sampling process. The research employs the Federated 3D-CNN technique to predict the accurate emotions of humans. The 3D Convolutional Neural Network (3DCNN) predicts accurate information of a person at any angle in image processing. Mel Frequency Cepstrum Coefficient (MFCC) is used to extract and fine-tune the voices. A significant contribution of Federated Learning with 3D-Convolutional Neural Network is executed for multiple clients at a time through global and local updates of weights. The proposed framework achieves a prediction accuracy of 95.72 % when compared with existing methods. This approach helps in many applications, such as analyzing emotions, healthcare, etc.

Keywords

Audio-visual Convolutional neural network Deep learning Recognising emotions Federated system SMOTE

Publication Link

https://doi.org/10.1016/j.aej.2024.07.081

Block_researches_list_suggestions

Suggestions to read

Generalized first approximation Matsumoto metric
AMR SOLIMAN MAHMOUD HASSAN
HIDS-IoMT: A Deep Learning-Based Intelligent Intrusion Detection System for the Internet of Medical Things
Ahlem . Harchy Ep Berguiga
Structure–Performance Relationship of Novel Azo-Salicylaldehyde Disperse Dyes: Dyeing Optimization and Theoretical Insights
EBTSAM KHALEFAH H ALENEZY
“Synthesis and Characterization of SnO₂/α-Fe₂O₃, In₂O₃/α-Fe₂O₃, and ZnO/α-Fe₂O₃ Thin Films: Photocatalytic and Antibacterial Applications”
Asma Arfaoui
تواصل معنا