Expert-Driven Monitoring of Operational ML Models

Joran Leest, Claudia Raibulet, Ilias Gerostathopoulos, Patricia Lago
2024-01-22
Abstract: We propose Expert Monitoring, an approach that leverages domain expertise to enhance the detection and mitigation of concept drift in machine learning (ML) models. Our approach supports practitioners by consolidating domain expertise related to concept drift-inducing events, making this expertise accessible to on-call personnel, and enabling automatic adaptability with expert oversight.
Machine Learning, Software Engineering
What problem does this paper attempt to address?
This paper addresses how to use domain experts' knowledge to improve the detection and mitigation of concept drift in operational machine learning (ML) models.

### Problem Background

As ML models become widely used in modern software systems, changes in data distribution (for example, those caused by external events or data integrity issues) may degrade model performance. This phenomenon is known as **data drift**, which includes:

- **Feature Drift**: changes in the input data distribution \(P(X)\).
- **Concept Drift**: changes in the conditional probability distribution \(P(Y|X)\) of the target variable given the input.

Concept drift has a significant impact on model performance, especially in sensitive areas such as credit card fraud detection, and may lead to discriminatory behavior. Timely detection and mitigation of concept drift are therefore crucial.

### Existing Challenges

1. **Concept Drift Detection in Unlabeled Data**: In the real world, labeled data is often delayed or missing, which makes detection based on metrics such as accuracy difficult.
2. **Understanding the Characteristics of Detected Concept Drift**: Even when concept drift can be detected, it remains challenging to understand its characteristics, such as severity, frequency, duration, and transition speed.
3. **Limitations of Automated Tools**: Existing automated concept drift detection and mitigation tools are not commonly used in practice; practitioners rely instead on human intervention and shift-based monitoring.

### Solution

To address these problems, the paper proposes the **Expert Monitoring** approach, which consists of three steps:

1. **Scenario Specification**:
   - Through expert knowledge acquisition and retrospective analysis, collect events that may cause concept drift and standardize them.
   - Use a Bayesian model to quantify the experts' subjective beliefs about changes in feature distributions, estimating parameters such as the mean and standard deviation.
2. **Scenario Identification**:
   - After feature drift is detected at runtime, use Bayesian model comparison to infer which scenario is occurring.
   - Calculate the posterior probability \(P(M|D)\) of each scenario model and compare it with the reference model to obtain a Bayes factor that quantifies the evidence for the scenario.
3. **Response Mechanism**:
   - Select the most appropriate response strategy based on the Bayes factor, either to assist ML engineers in making decisions or to automatically trigger mitigation measures.

By integrating the knowledge of domain experts, this method aims to improve the detection accuracy and response efficiency for concept drift, thereby ensuring the stability and reliability of machine learning models in practical applications.
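
To make the scenario identification and response steps more concrete, here is a minimal Python sketch of Bayesian model comparison for a single numeric feature. It is an illustration under simplifying assumptions, not the paper's implementation: each scenario is assumed to encode an expert's belief about the feature mean as a Gaussian prior, the observation noise `obs_std` is assumed known and shared across models, and the names `ScenarioModel`, `log_bayes_factor`, `posterior_probabilities`, and `choose_response`, along with the thresholds 3 and 10, are hypothetical choices made for this example.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class ScenarioModel:
    """Expert belief about a feature's mean under one scenario (hypothetical)."""
    name: str
    prior_mean: float  # mean the expert expects for the feature
    prior_std: float   # expert's uncertainty about that mean


def log_marginal_likelihood(sample_mean, n, obs_std, model):
    """log p(sample_mean | M) for Gaussian data with known obs_std.

    If mu ~ N(prior_mean, prior_std^2) and x_i ~ N(mu, obs_std^2), then the
    mean of n observations is N(prior_mean, prior_std^2 + obs_std^2 / n).
    """
    var = model.prior_std ** 2 + obs_std ** 2 / n
    diff = sample_mean - model.prior_mean
    return -0.5 * math.log(2 * math.pi * var) - diff ** 2 / (2 * var)


def log_bayes_factor(sample_mean, n, obs_std, scenario, reference):
    """Log Bayes factor of the scenario model against the reference model."""
    return (log_marginal_likelihood(sample_mean, n, obs_std, scenario)
            - log_marginal_likelihood(sample_mean, n, obs_std, reference))


def posterior_probabilities(sample_mean, n, obs_std, models, priors=None):
    """Posterior P(M|D) for each model; uniform model priors by default."""
    if priors is None:
        priors = [1.0 / len(models)] * len(models)
    log_joint = [math.log(p) + log_marginal_likelihood(sample_mean, n, obs_std, m)
                 for p, m in zip(priors, models)]
    # Normalize with log-sum-exp to avoid underflow for unlikely models.
    peak = max(log_joint)
    log_norm = peak + math.log(sum(math.exp(v - peak) for v in log_joint))
    return {m.name: math.exp(v - log_norm) for m, v in zip(models, log_joint)}


def choose_response(log_bf, alert_at=3.0, mitigate_at=10.0):
    """Map a log Bayes factor to a response; thresholds are assumptions."""
    if log_bf >= math.log(mitigate_at):
        return "auto_mitigate"
    if log_bf >= math.log(alert_at):
        return "alert_engineer"
    return "keep_monitoring"


if __name__ == "__main__":
    # Illustrative numbers only: a feature whose mean is normally near 100.
    reference = ScenarioModel("reference", prior_mean=100.0, prior_std=5.0)
    promotion = ScenarioModel("holiday_promotion", prior_mean=150.0, prior_std=10.0)
    log_bf = log_bayes_factor(145.0, 50, 20.0, promotion, reference)
    print(choose_response(log_bf))
    print(posterior_probabilities(145.0, 50, 20.0, [reference, promotion]))
```

Because both models share the same observation noise, the terms of the marginal likelihood that do not depend on the sample mean cancel in the ratio, so comparing the distributions of the sample mean is enough in this simplified setting. Working in log space keeps the comparison numerically stable when one model is far more likely than the other.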