Research on financial fraud algorithm based on federal learning and big data technology

Xinye Sha
2024-05-07
Abstract: As financial business becomes increasingly digitized, financial fraud is growing more complex and harder to detect, which severely challenges the risk prevention and control capabilities of financial institutions. At the same time, the rapid development of big data technology provides massive potential information resources, and federated learning, an emerging distributed machine learning paradigm, enables multi-party collaborative modeling while protecting data privacy. This paper first describes the basic principles and advantages of federated learning, and its particular value in breaking down data silos and protecting user privacy. To meet the needs of financial fraud detection, it then discusses the design of a federated learning architecture suited to this scenario, including selecting a suitable model type (such as a neural network) and setting reasonable data partitioning and update rules. The core of the paper is the design and implementation of a financial fraud detection algorithm based on federated learning. Within the federated learning framework, each participant trains the model locally and exchanges only model parameters rather than raw data, enabling iterative optimization of the global model while protecting data privacy. To verify the effectiveness of the proposed algorithm, an experimental study is designed and carried out: a real-world financial fraud dataset is selected, and fraud detection performance is compared between traditional centralized learning and federated learning. The experimental results show that the federated learning-based financial fraud algorithm substantially reduces the risk of data privacy leakage while maintaining high detection accuracy.
Computational Engineering, Finance, and Science
What problem does this paper attempt to address?
The paper studies a financial fraud detection algorithm based on federated learning and big data technology. As financial business becomes more digitized, fraud has become more complex and harder to detect, posing severe challenges to the risk control of financial institutions. Big data technology supplies large volumes of potentially useful information, and federated learning, an emerging distributed machine learning paradigm, enables multi-party collaborative modeling while protecting data privacy. The paper first explains the basic principles and advantages of federated learning, and its particular value in breaking down data silos and protecting user privacy. To meet the needs of financial fraud detection, it then discusses the design of a federated learning architecture suited to this scenario, including selecting an appropriate model type (such as a neural network) and setting reasonable data partitioning and update rules. Within the federated learning framework, each participant trains the model locally and exchanges only model parameters instead of raw data, so that the global model is optimized iteratively while data privacy is protected. In the experimental part, the paper selects a real financial fraud dataset and compares fraud detection performance between traditional centralized learning and federated learning, with evaluation metrics including accuracy, recall, and F1 score. The experimental results show that, compared with centralized learning, the federated learning approach substantially reduces the risk of data privacy leakage while maintaining high detection accuracy.
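The local-training and parameter-exchange loop described above can be illustrated with a minimal sketch of federated averaging. This is not the paper's implementation: the logistic-regression model, the synthetic one-feature data, the three participants, and all function names (`local_train`, `federated_average`, `run_federated`) are assumptions chosen only to keep the example small. Sample-count-weighted averaging is one common aggregation rule and is assumed here; the abstract does not specify which rule the paper uses.

```python
import math
import random

def sigmoid(z):
    # Numerically safe logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def local_train(weights, data, lr=0.5, epochs=5):
    """Run a few epochs of SGD on one participant's private data."""
    w = list(weights)
    for _ in range(epochs):
        for x, y in data:
            err = sigmoid(sum(wi * xi for wi, xi in zip(w, x))) - y
            w = [wi - lr * err * xi for wi, xi in zip(w, x)]
    return w

def federated_average(client_weights, client_sizes):
    """Average parameter vectors, weighted by local sample counts."""
    total = sum(client_sizes)
    dim = len(client_weights[0])
    return [
        sum(w[j] * n for w, n in zip(client_weights, client_sizes)) / total
        for j in range(dim)
    ]

def make_client_data(rng, n):
    """Hypothetical toy data: label 1 ("fraud") when the feature is positive."""
    data = []
    for _ in range(n):
        v = rng.uniform(-1.0, 1.0)
        data.append(([v, 1.0], 1 if v > 0 else 0))  # second entry is a bias term
    return data

def run_federated(rounds=10, seed=0):
    rng = random.Random(seed)
    clients = [make_client_data(rng, n) for n in (40, 60, 100)]
    global_w = [0.0, 0.0]
    for _ in range(rounds):
        # Each participant starts from the current global model and trains locally.
        updates = [local_train(global_w, data) for data in clients]
        # Only parameters and sample counts reach the aggregator, never raw records.
        global_w = federated_average(updates, [len(d) for d in clients])
    return global_w, clients

def accuracy(weights, data):
    """Fraction of samples classified correctly at a 0.5 threshold."""
    correct = sum(
        (sigmoid(sum(w * xi for w, xi in zip(weights, x))) >= 0.5) == bool(y)
        for x, y in data
    )
    return correct / len(data)
```

Calling `run_federated()` returns the aggregated parameters together with the per-participant datasets, so `accuracy(global_w, clients[0])` can be checked for each participant. A centralized baseline for comparison would call `local_train` once on the concatenation of all participants' data, which is exactly the data pooling that the federated setting avoids.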