Abstract: Purpose of research. The purpose of the study is to evaluate selected machine learning models for data processing in terms of speed and efficiency when analyzing sentiment, or consumer opinions, in business intelligence. To highlight existing developments, an overview of modern sentiment analysis methods and models is given, showing their advantages and disadvantages. Materials and methods. To improve the sentiment analysis process built on existing methods and models, the process must be adjusted to the growing changes in today's information flows. It is therefore crucial for researchers to explore ways of updating existing tools, either by combining them or by developing them further, so that they suit modern tasks and give a clearer understanding of the results of their processing. We present a comparison of several deep learning models, including convolutional neural networks (CNN), recurrent neural networks (RNN), and bidirectional long short-term memory (BiLSTM), evaluated using different word embedding approaches, including Bidirectional Encoder Representations from Transformers (BERT) and its variants, FastText, and Word2Vec. Data augmentation was conducted using a simple data augmentation approach. This project uses natural language processing (NLP), deep learning, and models such as LSTM, CNN, SVM with TF-IDF, AdaBoost, and Naive Bayes, as well as combinations of these models. Results. The study produced model results that were verified against user reviews, and model accuracy was compared to see which of the individual models and their combinations, including CNN with LSTM, scored highest; SVM with TF-IDF vectorization was most effective for this unbalanced data set. The constructed model produced the following indexes: ROC AUC - 0.82, precision - 0.92, F1 - 0.82, Precision - 0.82, and Recall - 0.82. More research and model implementation can be done to find a better model. Conclusion. Natural language text analysis has advanced considerably in recent years, and it is possible that such problems will be completely solved in the near future. Several ML models and a combination of CNN with LSTM were examined, but SVM with the TF-IDF vectorizer proved most effective for this unbalanced data set. In general, both deep learning and feature-based selection methods can be used to solve some of the most pressing problems. Deep learning is useful when the most relevant features are not known in advance, while feature-based [...] classification algorithm. A combination of both approaches can also be used to further improve the efficiency of the algorithm.
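
The indexes reported in the abstract (precision, recall, F1, ROC AUC) behave differently from plain accuracy on an unbalanced data set. The following is a minimal sketch of how such metrics can be computed, not the study's own code: the labels, scores, and the 0.5 threshold are invented for illustration, and the function names are hypothetical.

```python
from typing import Sequence, Tuple


def accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def precision_recall_f1(
    y_true: Sequence[int], y_pred: Sequence[int]
) -> Tuple[float, float, float]:
    """Precision, recall, and F1 for the positive class (label 1)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """Probability that a random positive outranks a random negative.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("ROC AUC needs at least one example of each class")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg
    )
    return wins / (len(pos) * len(neg))


# Invented toy data: 2 minority-class reviews (label 1) out of 10.
y_true = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]
# Hypothetical classifier scores (higher means "more likely class 1").
scores = [0.9, 0.4, 0.6, 0.3, 0.2, 0.2, 0.1, 0.1, 0.05, 0.05]
# Assumed decision threshold of 0.5.
y_pred = [1 if s >= 0.5 else 0 for s in scores]

# Baseline that always predicts the majority class.
majority_pred = [0] * len(y_true)

print("model accuracy:", accuracy(y_true, y_pred))
print("model precision/recall/F1:", precision_recall_f1(y_true, y_pred))
print("model ROC AUC:", roc_auc(y_true, scores))
print("majority accuracy:", accuracy(y_true, majority_pred))
print("majority precision/recall/F1:", precision_recall_f1(y_true, majority_pred))
```

On this toy data the majority-class baseline matches the model's accuracy while its recall and F1 are zero, which is why accuracy alone is a weak criterion for comparing models on unbalanced reviews.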
