Abstract:Complete verification of deep neural networks (DNNs) can exactly determine whether the DNN satisfies a desired trustworthy property (e.g., robustness, fairness) on an infinite set of inputs or not. Despite the tremendous progress to improve the scalability of complete verifiers over the years on individual DNNs, they are inherently inefficient when a deployed DNN is updated to improve its inference speed or accuracy. The inefficiency is because the expensive verifier needs to be run from scratch on the updated DNN. To improve efficiency, we propose a new, general framework for incremental and complete DNN verification based on the design of novel theory, data structure, and algorithms. Our contributions implemented in a tool named IVAN yield an overall geometric mean speedup of 2.4x for verifying challenging MNIST and CIFAR10 classifiers and a geometric mean speedup of 3.8x for the ACAS-XU classifiers over the state-of-the-art baselines.
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the inefficiency of deep neural networks (DNNs) when performing a full re - verification after an update. Specifically, when the deployed DNN is updated to improve inference speed or accuracy, the existing full - verification methods need to re - run the expensive verification process from scratch. This not only consumes a great deal of time and computing resources but also limits the use of these verification tools in practical applications.
### Background of the Paper
Deep neural networks (DNNs) have been widely used in many key areas such as autonomous driving, healthcare, and aviation. However, the black - box nature of DNNs, their vulnerability to small changes in input data, and their sensitivity to external data have raised doubts about their credibility in practical applications. In recent years, researchers have developed many verifiers to formally check whether the behavior of DNNs is trustworthy on an infinite input set, such as properties like robustness and fairness.
### Limitations of Existing Work
The existing full - verification methods for DNNs have a fundamental limitation: whenever the network is modified even slightly, the verifier needs to re - run the full verification process from scratch. This inefficient approach causes developers to still rely on the test - set accuracy as the main indicator for measuring the quality of the trained network, and they are unable to fully utilize the deeper credibility insights provided by full - verification.
### The Solution Proposed in This Paper
To solve this problem, this paper proposes IVAN, a new general - purpose framework for incremental and full DNN verification. The core idea of IVAN is to use the verification results of the original network to accelerate the verification process of the updated network. Specifically:
1. **Specification Tree**: IVAN constructs a specification tree to store the trajectory of branch decisions in the verification process of the original network. The specification tree is a binary - tree structure, and each node represents the specification of a sub - problem.
2. **Reuse and Reordering**: IVAN accelerates the verification of the updated network by reusing the specification tree of the original network and adjusting the branch strategy according to the verification effect.
3. **Pruning the Specification Tree**: IVAN also introduces a novel pruning operation to remove invalid branch decisions and construct a new compact tree, further optimizing the verification efficiency.
### Experimental Results
Experiments show that IVAN can significantly accelerate the verification process of DNNs compared to existing methods. For MNIST and CIFAR10 classifiers, IVAN achieves an average acceleration of 2.4 times; for ACAS - XU classifiers, it achieves an average acceleration of 3.8 times. In some cases, IVAN can even achieve an acceleration of up to 43 times.
### Main Contributions
1. Propose a new general - purpose framework for incremental and full DNN verification.
2. Design new algorithms and data structures that can efficiently encode and reuse branch strategies.
3. Provide theoretical bounds to determine the types of network modifications that can be efficiently verified.
4. Verify the effectiveness of IVAN on multiple benchmark datasets and demonstrate its superior performance under different branch strategies.
Through these contributions, IVAN provides a more efficient and practical tool for the credibility assessment of DNNs.