Abstract: New data processing pipelines and novel network architectures increasingly drive the success of deep learning. Consequently, the industry treats top-performing architectures as intellectual property and devotes considerable computational resources to discovering them through neural architecture search (NAS). This gives adversaries an incentive to steal these novel architectures; when the architectures are deployed in the cloud to provide Machine Learning as a Service (MLaaS), adversaries also have an opportunity to reconstruct them by exploiting a range of hardware side channels. However, it is challenging to reconstruct novel architectures and pipelines without knowing the computational graph (e.g., the layers, branches, or skip connections), the architectural parameters (e.g., the number of filters in a convolutional layer), or the specific pre-processing steps (e.g., embeddings). In this paper, we design an algorithm that reconstructs the key components of a novel deep learning system by exploiting a small amount of information leaked through a cache side-channel attack, Flush+Reload. We use Flush+Reload to infer the trace of computations and the timing of each computation. Our algorithm then generates candidate computational graphs from the trace and eliminates incompatible candidates through a parameter estimation process. We implement our algorithm in PyTorch and TensorFlow. We demonstrate experimentally that we can reconstruct MalConv, a novel data pre-processing pipeline for malware detection, and ProxylessNAS-CPU, a novel network architecture for ImageNet classification optimized to run on CPUs, without knowing the architecture family. In both cases, we achieve 0% error. These results suggest that hardware side channels are a practical attack vector against MLaaS, and that more effort should be devoted to understanding their impact on the security of deep learning systems.
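
The candidate-generation and elimination idea described in the abstract can be illustrated with a toy sketch. This is not the authors' algorithm: it assumes a purely sequential network (no branches or skip connections), a hypothetical cost model in which time is proportional to the multiply-accumulate count, and made-up names (`TRACE`, `conv_cost`, `candidates`, `time_per_mac`). It only shows how an observed trace of operations and timings can prune a space of architectural parameters.

```python
from itertools import product

# Hypothetical trace recovered from a side channel:
# (operation name, observed time in arbitrary units).
TRACE = [("conv", 27.648), ("relu", 0.5), ("conv", 294.912)]

def conv_cost(c_in, c_out, k, size):
    """Multiply-accumulate count of a stride-1, same-padded k x k convolution
    applied to a size x size feature map (assumed cost model)."""
    return c_in * c_out * k * k * size * size

def candidates(trace, c_in, size, time_per_mac, tol=0.05,
               filters=(8, 16, 32, 64), kernels=(1, 3, 5)):
    """Enumerate per-layer (filters, kernel) choices and keep only the chains
    whose predicted timings agree with the trace within a relative tolerance."""
    # Each partial candidate is (parameters chosen so far, current channels).
    partial = [([], c_in)]
    for op, observed in trace:
        if op != "conv":
            continue  # in this toy model, other ops do not change the shape
        extended = []
        for params, channels in partial:
            for c_out, k in product(filters, kernels):
                predicted = conv_cost(channels, c_out, k, size) * time_per_mac
                # Eliminate parameter choices incompatible with the timing.
                if abs(predicted - observed) <= tol * observed:
                    extended.append((params + [(c_out, k)], c_out))
        partial = extended
    return [params for params, _ in partial]

result = candidates(TRACE, c_in=3, size=8, time_per_mac=1e-3)
print(result)
```

Because each layer's input channel count is fixed by the previous layer's output, a choice that fits one timing in isolation can still be discarded when no compatible choice exists for the next layer. A real attack would also have to handle noisy measurements and non-sequential graphs.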

DeepSniffer: A DNN Model Extraction Framework Based on Learning Architectural Hints

D-DAE: Defense-Penetrating Model Extraction Attacks

A Systematic View of Leakage Risks in Deep Neural Network Systems

Neural Network Model Extraction Attacks in Edge Devices by Hearing Architectural Hints

DeepTheft: Stealing DNN Model Architectures through Power Side Channel

Ownership Verification of DNN Architectures Via Hardware Cache Side Channels

A GAN-Based Defense Framework Against Model Inversion Attacks

DeepGuiser: Learning to Disguise Neural Architectures for Impeding Adversarial Transfer Attacks

Demystifying Arch-hints for Model Extraction: An Attack in Unified Memory System

NASPY: Automated Extraction of Automated Machine Learning Models

Architecture Disentanglement for Deep Neural Networks

Cache Telepathy: Leveraging Shared Resource Attacks to Learn DNN Architectures

EZClone: Improving DNN Model Extraction Attack via Shape Distillation from GPU Execution Profiles

How to 0wn NAS in Your Spare Time

NNReArch: A Tensor Program Scheduling Framework Against Neural Network Architecture Reverse Engineering

PINCH: An Adversarial Extraction Attack Framework for Deep Learning Models

CNN architecture extraction on edge GPU

Mind Your Heart: Stealthy Backdoor Attack on Dynamic Deep Neural Network in Edge Computing

A Hard-Label Cryptanalytic Extraction of Non-Fully Connected Deep Neural Networks using Side-Channel Attacks

Leaky DNN: Stealing Deep-Learning Model Secret with GPU Context-Switching Side-Channel