Abstract:Deep neural network (DNN) architectures, such as convolutional neural networks (CNN), involve heavy computation and require hardware, such as CPU, GPU, and AI accelerators, to provide the massive computing power. With the many varieties of AI hardware prevailing on the market, it is often hard to decide which one is the best to use. Thus, benchmarking AI hardware effectively becomes important and is of great help to select and optimize AI hardware. Unfortunately, there are few AI benchmarks available in both academia and industry. Examples are BenchNN[1], DeepBench[2], and Dawn Bench[3], which are usually a collection of typical real DNN applications. While these benchmarks provide performance comparison across different AI hardware, they suffer from a number of drawbacks. First, they cannot adapt to the emerging changes of DNN algorithms and are fixed once selected. Second, they contain tens to hundreds of applications and take very long time to finish running. Third, they are mainly selected from open sources, which are restricted by copyright and are not representable to proprietary applications. In this work, a synthetic benchmarks framework is firstly proposed to address the above drawbacks of AI benchmarks. Instead of pre-selecting a set of open-sourced benchmarks and running all of them, the synthetic approach generates only a one or few benchmarks that best represent a broad range of applications using profiled workload characteristics data of these applications. Thus, it can adapt to emerging changes of new DNN algorithms by re-profiling new applications and updating itself, greatly reduce benchmark count and running time, and strongly represent DNN applications of interests. The generated benchmarks are called AI Matrix, serving as a performance benchmarks matching the statistical workload characteristics of a combination of applications of interests.

AIBench Training: Balanced Industry-Standard AI Training Benchmarking

AIBench: an Industry Standard AI Benchmark Suite from Internet Services.

AIBench Training: Balanced Industry-Standard AI Training Benchmarking

Aibench: an industry standard ai benchmark suite

AIBench: An Agile Domain-specific Benchmarking Methodology and an AI Benchmark Suite

AIBench: Towards Scalable and Comprehensive Datacenter AI Benchmarking

AIBench Scenario: Scenario-distilling AI Benchmarking

P F ] 1 3 A ug 2 01 9 HPC AI 500 : A Benchmark Suite for HPC AI Systems

HPC AI500: A Benchmark Suite for HPC AI Systems

Edge AIBench: Towards Comprehensive End-to-End Edge Computing Benchmarking.

HPC AI500: Representative, Repeatable and Simple HPC AI Benchmarking

AI-oriented Workload Allocation for Cloud-Edge Computing.

AIPerf: Automated machine learning as an AI-HPC benchmark

BENCHIP： Benchmarking Intelligence Processors

BetterBench: Assessing AI Benchmarks, Uncovering Issues, and Establishing Best Practices

AI Matrix - Synthetic Benchmarks for DNN

AIbench: a Tool for Benchmarking Huawei Ascend AI Processors

Benchmarking the Performance and Energy Efficiency of AI Accelerators for AI Training

Introducing Milabench: Benchmarking Accelerators for AI

SAIBench: Benchmarking AI for Science