Abstract:Survival analysis (SA) models have been widely studied in mining electronic health records (EHRs), particularly in forecasting the risk of critical conditions for prioritizing high-risk patients. However, their vulnerability to adversarial attacks is much less explored in the literature. Developing black-box perturbation algorithms and evaluating their impact on state-of-the-art survival models brings two benefits to medical applications. First, it can effectively evaluate the robustness of models in pre-deployment testing. Also, exploring how subtle perturbations would result in significantly different outcomes can provide counterfactual insights into the clinical interpretation of model prediction. In this work, we introduce SurvAttack, a novel black-box adversarial attack framework leveraging subtle clinically compatible, and semantically consistent perturbations on longitudinal EHRs to degrade survival models' predictive performance. We specifically develop a greedy algorithm to manipulate medical codes with various adversarial actions throughout a patient's medical history. Then, these adversarial actions are prioritized using a composite scoring strategy based on multi-aspect perturbation quality, including saliency, perturbation stealthiness, and clinical meaningfulness. The proposed adversarial EHR perturbation algorithm is then used in an efficient SA-specific strategy to attack a survival model when estimating the temporal ranking of survival urgency for patients. To demonstrate the significance of our work, we conduct extensive experiments, including baseline comparisons, explainability analysis, and case studies. The experimental results affirm our research's effectiveness in illustrating the vulnerabilities of patient survival models, model interpretation, and ultimately contributing to healthcare quality.

BadCLM: Backdoor Attack in Clinical Language Models for Electronic Health Records

B3: Backdoor Attacks Against Black-box Machine Learning Models

BAD-FM: Backdoor Attacks Against Factorization-Machine Based Neural Network for Tabular Data Prediction

Backdoor Attack on Unpaired Medical Image-Text Foundation Models: A Pilot Study on MedCLIP

Identify Susceptible Locations in Medical Records via Adversarial Attacks on Deep Predictive Models

Machine Learning with Electronic Health Records is vulnerable to Backdoor Trigger Attacks

MedAttacker: Exploring Black-Box Adversarial Attacks on Risk Prediction Models in Healthcare

Composite Backdoor Attacks Against Large Language Models

BadCLIP: Dual-Embedding Guided Backdoor Attack on Multimodal Contrastive Learning

Clinical Risk Prediction Using Language Models: Benefits And Considerations

Exposing Vulnerabilities in Clinical LLMs Through Data Poisoning Attacks: Case Study in Breast Cancer

Medical MLLM is Vulnerable: Cross-Modality Jailbreak and Mismatched Attacks on Medical Multimodal Large Language Models

Mitigating Backdoor Threats to Large Language Models: Advancement and Challenges

BadAgent: Inserting and Activating Backdoor Attacks in LLM Agents

Adversarial Attacks on Large Language Models in Medicine

CBAs: Character-level Backdoor Attacks Against Chinese Pre-trained Language Models

Neutralizing Backdoors through Information Conflicts for Large Language Models

Universal Vulnerabilities in Large Language Models: Backdoor Attacks for In-context Learning

BadPre: Task-agnostic Backdoor Attacks to Pre-trained NLP Foundation Models

Longitudinal Adversarial Attack on Electronic Health Records Data

SurvAttack: Black-Box Attack On Survival Models through Ontology-Informed EHR Perturbation