Abstract: Numerous studies have demonstrated in depth that the elements of a deep reinforcement learning (DRL) model (e.g., the reward) are vulnerable, which limits the widespread deployment of DRL in crucial domains such as intelligent traffic signal control (ITSC). Although partial poisoning attacks with insidious rewards can be made undetectable by directly applying regularization or cumulative-reward restrictions, these constraints are somewhat one-dimensional and fail to account for the time dependence of DRL. Moreover, the adversary should avoid injecting perturbations while the agents’ policies are unstable, so as to maximize the benefit of the attack strategy. The challenge is therefore to perturb the DRL model stealthily, with as few disruption steps or modifications to the original samples as possible, while preserving the attack’s efficiency. In this work, we introduce two black-box reward-space attack strategies in which the adversary actively learns a malicious adversarial policy. The first, the Multi Constraint Stealthy-time Attack, is updated with the penalties earned by attacking crucial moments and is restricted by action confidence and the total number of perturbations, which keeps the attack times stealthy. The second, the Multi Objective Stealthy-modification Attack, is modeled as a multi-objective optimization problem in which the adversary balances attack performance against stealthy modification through a weighting factor ω. Extensive simulations in SUMO, including comparative assessment and analysis of the attack distribution, show a dramatic increase in average travel time, indicating that our attacks put pressure on the traffic flow and confirming the efficacy of the proposed attack strategies.
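The abstract does not state how ω enters the objective. As a rough illustration only, one common way to trade off two objectives with a single weight is weighted-sum scalarization. The sketch below assumes that form; the function name `scalarized_adversary_objective`, its arguments, and the restriction of ω to [0, 1] are hypothetical and are not taken from the paper.

```python
def scalarized_adversary_objective(attack_gain: float,
                                   modification_size: float,
                                   omega: float) -> float:
    """Weighted-sum trade-off between attack effect and stealth.

    Assumption (not from the paper): the adversary maximizes
        omega * attack_gain - (1 - omega) * modification_size
    where attack_gain measures damage to the victim (e.g., added
    travel time) and modification_size measures how far the poisoned
    reward deviates from the original one.
    """
    if not 0.0 <= omega <= 1.0:
        raise ValueError("omega must lie in [0, 1]")
    return omega * attack_gain - (1.0 - omega) * modification_size


if __name__ == "__main__":
    # A larger omega favors attack performance; a smaller one favors stealth.
    for omega in (0.0, 0.5, 1.0):
        print(omega, scalarized_adversary_objective(2.0, 1.0, omega))
```

Under this assumed form, ω = 1 ignores stealth entirely and ω = 0 only penalizes the size of the modification, so intermediate values express the balance the abstract describes.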

Promoting or Hindering: Stealthy Black-Box Attacks Against DRL-Based Traffic Signal Control

Uniformity of Markov Elements in Deep Reinforcement Learning for Traffic Signal Control

MARNet: Backdoor Attacks Against Cooperative Multi-Agent Reinforcement Learning

Exploring the Vulnerability of Deep Reinforcement Learning-based Emergency Control for Low Carbon Power Systems

Adversarial Attacks and Defense in Deep Reinforcement Learning (DRL)-Based Traffic Signal Controllers

Less is More: A Stealthy and Efficient Adversarial Attack Method for DRL-based Autonomous Driving Policies

Attacking Deep Reinforcement Learning-Based Traffic Signal Control Systems with Colluding Vehicles

Collaborative Attack Sequence Generation Model Based on Multiagent Reinforcement Learning for Intelligent Traffic Signal System

A Deep Reinforcement Learning Approach to Traffic Signal Control With Temporal Traffic Pattern Mining

DNLight: Learning Efficient Evaluation for Traffic Signal Control

Stop-and-Go: Exploring Backdoor Attacks on Deep Reinforcement Learning-based Traffic Congestion Control Systems

Reinforcement Learning-based Security Enhancement for Controlled Optimization of Phases in Intelligent Traffic Signal System

A Deep Reinforcement Learning Approach for Isolated Intersection Traffic Signal Control with Long-Short Term Memory Network

Don't Watch Me: A Spatio-Temporal Trojan Attack on Deep-Reinforcement-Learning-Augment Autonomous Driving

Good Learning, Bad Performance: A Novel Attack Against RL-Based Congestion Control Systems

Adaptive Optimization of Traffic Signal Timing Via Deep Reinforcement Learning

RA-TSC: Learning Adaptive Traffic Signal Control Strategy Via Deep Reinforcement Learning

Adaptive urban traffic signal control based on enhanced deep reinforcement learning

Stealthy and efficient adversarial attacks against deep reinforcement learning

Reinforcement Learning based Cyberattack Model for Adaptive Traffic Signal Controller in Connected Transportation Systems

Smarter and Safer Traffic Signal Controlling Via Deep Reinforcement Learning