Outlier Detection Using Machine Learning Algorithms Integrated with Bayesian Optimization

Xihua Liu,Tao Ma,Wenjie Gao,Xinglin Zhu,Yong Wen,Wenqiang Pan
DOI: https://doi.org/10.1109/admit57209.2022.00033
2022-01-01
Abstract:This study aimed to discuss the performance of machine learning algorithms on the outlier detection problem and to investigate the effect of Bayesian Optimization Algorithm (BOA) on improving the performance of machine learning mod-els.Firstly, Classification And Regression Tree (CART), Extreme Gradient Boosting (XGB), and Light Gradient Boosting Machine (LGBM) models were constructed. The models were trained using default parameters, and it was found that XGB and LSTM performed much better than CART. secondly, the best hyperparameters were searched using grid search, random search and Bayesian optimization, respectively, and the comparison revealed that BOA could search for the best hyperparameters in the shortest time, resulting in the maximum model performance. Finally, by analyzing the ROC and PRC curves of the three ML models tuned to the optimal hyperparameters using BOA, it is found that the combination of LSTM+BOA has the best detection effect on outlier points with an average precision of 0.911. Through the results of this study, it is confirmed that the machine learning model tuned with BOA can have a robust outlier detection capability, and the method can be applied to other method can be applied to other outlier detection problems, such as network intrusion detection.
What problem does this paper attempt to address?