Abstract:Deep neural network (DNN) architectures, such as convolutional neural networks (CNN), involve heavy computation and require hardware, such as CPU, GPU, and AI accelerators, to provide the massive computing power. With the many varieties of AI hardware prevailing on the market, it is often hard to decide which one is the best to use. Thus, benchmarking AI hardware effectively becomes important and is of great help to select and optimize AI hardware. Unfortunately, there are few AI benchmarks available in both academia and industry. Examples are BenchNN[1], DeepBench[2], and Dawn Bench[3], which are usually a collection of typical real DNN applications. While these benchmarks provide performance comparison across different AI hardware, they suffer from a number of drawbacks. First, they cannot adapt to the emerging changes of DNN algorithms and are fixed once selected. Second, they contain tens to hundreds of applications and take very long time to finish running. Third, they are mainly selected from open sources, which are restricted by copyright and are not representable to proprietary applications. In this work, a synthetic benchmarks framework is firstly proposed to address the above drawbacks of AI benchmarks. Instead of pre-selecting a set of open-sourced benchmarks and running all of them, the synthetic approach generates only a one or few benchmarks that best represent a broad range of applications using profiled workload characteristics data of these applications. Thus, it can adapt to emerging changes of new DNN algorithms by re-profiling new applications and updating itself, greatly reduce benchmark count and running time, and strongly represent DNN applications of interests. The generated benchmarks are called AI Matrix, serving as a performance benchmarks matching the statistical workload characteristics of a combination of applications of interests.

AIbench: a Tool for Benchmarking Huawei Ascend AI Processors

AIBench: Towards Scalable and Comprehensive Datacenter AI Benchmarking

AIBench: An Agile Domain-specific Benchmarking Methodology and an AI Benchmark Suite

AIBench: an Industry Standard AI Benchmark Suite from Internet Services.

Aibench: an industry standard ai benchmark suite

P F ] 1 3 A ug 2 01 9 HPC AI 500 : A Benchmark Suite for HPC AI Systems

Benchmarking the Performance and Energy Efficiency of AI Accelerators for AI Training

HPC AI500: A Benchmark Suite for HPC AI Systems

Edge AIBench: Towards Comprehensive End-to-End Edge Computing Benchmarking.

BENCHIP： Benchmarking Intelligence Processors

AIBench Scenario: Scenario-distilling AI Benchmarking

AIBench Training: Balanced Industry-Standard AI Training Benchmarking

AI-oriented Workload Allocation for Cloud-Edge Computing.

AI Benchmark: Running Deep Neural Networks on Android Smartphones

Evaluating Emerging AI/ML Accelerators: IPU, RDU, and NVIDIA/AMD GPUs

Being-ahead: Benchmarking and Exploring Accelerators for Hardware-Efficient AI Deployment

Analysis of Performance and Optimization in MindSpore on Ascend NPUs

AIPerf: Automated machine learning as an AI-HPC benchmark

AI Matrix - Synthetic Benchmarks for DNN