Abstract: Poisoning attacks are a primary threat to machine learning models, aiming to compromise their performance and reliability by manipulating training datasets. This paper introduces a novel attack, the Outlier-Oriented Poisoning (OOP) attack, which manipulates the labels of the samples most distant from the decision boundaries. The paper also investigates the adverse impact of such attacks on different machine learning algorithms in a multiclass classification scenario, analyzing their variance and the correlation between poisoning level and performance degradation. To ascertain the severity of the OOP attack at different degrees of poisoning (5% to 25%), we analyzed variance, accuracy, precision, recall, F1-score, and false positive rate for the chosen ML models. Benchmarking our OOP attack, we have analyzed key characteristics of multiclass machine learning algorithms and their sensitivity to poisoning attacks. Our experimentation used three publicly available datasets: IRIS, MNIST, and ISIC. Our analysis shows that KNN and GNB are the most affected algorithms, with accuracy decreasing by 22.81% and 56.07% and false positive rate increasing to 17.14% and 40.45%, respectively, for the IRIS dataset with 15% poisoning. Further, Decision Trees and Random Forest are the most resilient algorithms, with the least accuracy disruption of 12.28% and 17.52%, respectively, with 15% poisoning of the IRIS dataset. We have also analyzed the correlation between the number of dataset classes and the performance degradation of models. Our analysis highlights that the number of classes is inversely proportional to the performance degradation, specifically the decrease in model accuracy, which normalizes as the number of classes increases. Further, our analysis identifies that an imbalanced dataset distribution can aggravate the impact of poisoning on machine learning models.
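The label-flipping idea in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract does not say how distance from the decision boundary is measured, so the sketch assumes a nearest-centroid surrogate and uses the gap between the distance to the closest other-class centroid and the distance to the sample's own centroid as a proxy. The function names (`centroids`, `margin`, `oop_poison`), the random choice of replacement label, and the `rate` parameter are all hypothetical.

```python
import math
import random


def centroids(X, y):
    """Return the mean feature vector of each class."""
    sums, counts = {}, {}
    for xi, yi in zip(X, y):
        acc = sums.setdefault(yi, [0.0] * len(xi))
        for j, v in enumerate(xi):
            acc[j] += v
        counts[yi] = counts.get(yi, 0) + 1
    return {c: [v / counts[c] for v in s] for c, s in sums.items()}


def margin(xi, yi, cents):
    """Proxy for distance from the decision boundary (an assumption):
    distance to the nearest other-class centroid minus distance to the
    sample's own class centroid. Larger means deeper inside its class."""
    own = math.dist(xi, cents[yi])
    other = min(math.dist(xi, c) for k, c in cents.items() if k != yi)
    return other - own


def oop_poison(X, y, rate, seed=0):
    """Flip the labels of the `rate` fraction of samples with the
    largest margin. Returns (poisoned_labels, flipped_indices)."""
    rng = random.Random(seed)
    cents = centroids(X, y)
    classes = sorted(cents)
    n_flip = round(rate * len(y))
    # Rank samples from farthest to nearest the (proxy) boundary.
    ranked = sorted(range(len(y)),
                    key=lambda i: margin(X[i], y[i], cents),
                    reverse=True)
    flipped = ranked[:n_flip]
    poisoned = list(y)
    for i in flipped:
        # Replacement label chosen uniformly among the other classes.
        poisoned[i] = rng.choice([c for c in classes if c != y[i]])
    return poisoned, flipped
```

Calling `oop_poison(X, y, rate=0.15)` on a list of feature vectors `X` and labels `y` would return a poisoned copy of the labels at the 15% level discussed above. A surrogate closer to the victim model (for example, one exposing a signed decision function) could replace `margin` without changing the rest of the sketch.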

Hyperparameter Learning under Data Poisoning: Analysis of the Influence of Regularization via Multiobjective Bilevel Optimization

Data Poisoning in LLMs: Jailbreak-Tuning and Scaling Laws

Regularization Helps with Mitigating Poisoning Attacks: Distributionally-Robust Machine Learning Using the Wasserstein Distance

Outlier-Oriented Poisoning Attack: A Grey-box Approach to Disturb Decision Boundaries by Perturbing Outliers in Multiclass Learning

Certified Robustness to Data Poisoning in Gradient-Based Training

Exploring the Limits of Model-Targeted Indiscriminate Data Poisoning Attacks

Robust Linear Regression Against Training Data Poisoning

PoisonBench: Assessing Large Language Model Vulnerability to Data Poisoning

With Great Dispersion Comes Greater Resilience: Efficient Poisoning Attacks and Defenses for Linear Regression Models

Reinforcement Learning For Data Poisoning on Graph Neural Networks

On the Effectiveness of Mitigating Data Poisoning Attacks with Gradient Shaping

Is poisoning a real threat to LLM alignment? Maybe more so than you think

On the Relevance of Byzantine Robust Optimization Against Data Poisoning

A Separation Result Between Data-oblivious and Data-aware Poisoning Attacks

Stronger Data Poisoning Attacks Break Data Sanitization Defenses

Amplifying Membership Exposure via Data Poisoning

Data Poisoning Attacks on Regression Learning and Corresponding Defenses

What Distributions are Robust to Indiscriminate Poisoning Attacks for Linear Learners?

Poisoning Attacks against Support Vector Machines

Fragile Giants: Understanding the Susceptibility of Models to Subpopulation Attacks

MetaPoison: Practical General-purpose Clean-label Data Poisoning