Skip to main content
 

 

 

Innovative Forward Fusion Feature Selection Algorithm for Sentiment Analysis Using Supervised Classification

Author name : AHMED MOSA H ALSAYAT
Publication Date : 2023-02-05
Journal Name : Applied Sciences

Abstract

Sentiment analysis is considered one of the significant trends of the recent few years. Due to the high importance and increasing use of social media and electronic services, the need for reviewing and enhancing the provided services has become crucial. Revising the user services is based mainly on sentiment analysis methodologies for analyzing users’ polarities to different products and applications. Sentiment analysis for Arabic reviews is a major concern due to high morphological linguistics and complex polarity terms expressed in the reviews. In addition, the users can present their orientation towards a service or a product by using a hybrid or mix of polarity terms related to slang and standard terminologies. This paper provides a comprehensive review of recent sentiment analysis methods based on lexicon or machine learning (ML). The comparison provides a clear vision of the number of classes, the used dialect, the annotated algorithms, and their performance. The proposed methodology is based on cross-validation of Arabic data using a k-fold mechanism that splits the dataset into training and testing folds; subsequently, the data preprocessing is executed to clean sentiments from unwanted terms that can affect data analysis. A vectorization of the dataset is then applied using TF–IDF for counting word and polarity terms. Furthermore, a feature selection stage is processed using Pearson, Chi2, and Random Forest (RF) methods for mapping the compatibility between input and target features. This paper also proposed an algorithm called the forward fusion feature for sentiment analysis (FFF-SA) to provide a feature selection that applied different machine learning (ML) classification models for each chunk of k features and accumulative features on the Arabic dataset. The experimental results measured and scored all accuracies between the feature importance method and ML models. The best accuracy is recorded with the Naïve Bayes (NB) model with the RF method.

Keywords

Sentiment analysis; machine learning; cross-validation; vectorization; feature importance

Publication Link

https://doi.org/10.3390/app13042074

Block_researches_list_suggestions

Suggestions to read

“Synthesis and Characterization study of SnO2/α-Fe2O3, In2O3/α-Fe2O3 and ZnO/α-Fe2O3 thin films and its application as transparent conducting electrode in silicon heterojunction solar cell”
Asma Arfaoui
Oral cancer stem cells: A comprehensive review of key drivers of treatment resistance and tumor recurrence
DR KALADHAR REDDY AILENI
Modeling the Social Factors Affecting Students Satisfaction with Online Learning: A Structural Equation Modeling Approach
ABDULHAMEED RAKAN ALENEZI
Higher Knee Muscles Co-Contractions are Observed in Individuals Exhibiting Loading Asymmetry Early after ACL Reconstruction. The Combined Sections Meeting
ABDULMAJEED BARAKAT MUBARAK ALFAYYADH
Contact