Abstract: Voice-controlled (VC) systems, such as mobile phones and smart speakers, enable users to operate smart devices through voice commands. Previous work (e.g., LightCommands) shows that attackers can trigger VC systems to respond to arbitrary audio commands by injecting light signals. However, LightCommands only discusses attacks on devices with a single microphone, whereas newer devices typically use microphone arrays with sensor fusion to better capture sound from different distances. By replicating the LightCommands experiments on these newer devices, we find that simply widening the light spot so that it overlaps multiple microphone apertures, as LightCommands does, is insufficient to wake up a device that uses sensor fusion. Adapting the LightCommands approach to microphone arrays is challenging because it requires multiple sound amplifiers, each of which needs an independent power driver with its own settings. The number of additional devices grows with the number of microphone apertures, significantly increasing the complexity of building and deploying the attack equipment. With a growing number of devices adopting sensor fusion to determine the location of a sound, new approaches are needed to adapt light injection attacks to these devices. To address these problems, we propose a lightweight laser injection solution for microphone arrays called LCMA (Laser Commands for Microphone Array), which uses a single laser controller to drive multiple laser points, simultaneously targeting all the apertures of a microphone array and injecting light waves at different frequencies. Our key design is a new PWM (Pulse Width Modulation) based control signal algorithm that can be implemented on a single MCU and directly controls multiple lasers via different PWM output channels. Moreover, LCMA can be remotely configured via BLE (Bluetooth Low Energy). These features allow our solution to be deployed on a drone to covertly attack targets hidden inside a building. Using LCMA, we successfully attack 29 devices. The experimental results show that LCMA is robust on the newest devices, such as the iPhone 15 and the control panel of the Tesla Model Y.

CapSpeaker: Injecting Commands to Voice Assistants Via Capacitors

CapSpeaker: Injecting Voices to Microphones Via Capacitors

Remote Attacks on Speech Recognition Systems Using Sound from Power Supply

Echo: Reverberation-based Fast Black-Box Adversarial Attacks on Intelligent Audio Systems

The Silent Manipulator: A Practical and Inaudible Backdoor Attack against Speech Recognition Systems

Evaluation and Defense of Light Commands Attacks Against Voice Controllable Systems in Smart Cars

Marionette: Manipulate Your Touchscreen Via A Charging Cable

EarArray: Defending Against DolphinAttack Via Acoustic Attenuation

WIGHT: Wired Ghost Touch Attack on Capacitive Touchscreens

The Feasibility of Injecting Inaudible Voice Commands to Voice Assistants

MagBackdoor: Beware of Your Loudspeaker As A Backdoor for Magnetic Injection Attacks

Mmear: Push the Limit of COTS Mmwave Eavesdropping on Headphones

Audio Hotspot Attack: An Attack on Voice Assistance Systems Using Directional Sound Beams and its Feasibility

Light Commands: Laser-Based Audio Injection Attacks on Voice-Controllable Systems

BarrierBypass: Out-of-Sight Clean Voice Command Injection Attacks through Physical Barriers

DolphinAttack: Inaudible Voice Commands

Laser-Based Command Injection Attacks on Voice-Controlled Microphone Arrays

Exploiting Physical Presence Sensing to Secure Voice Assistant Systems

Voiceprint Mimicry Attack Towards Speaker Verification System in Smart Home

Using AI to Hack IA: A New Stealthy Spyware Against Voice Assistance Functions in Smart Phones