Abstract:AI-based systems possess distinctive characteristics and introduce challenges in quality evaluation at the same time. Consequently, ensuring and validating AI software quality is of critical importance. In this paper, we present an effective AI software functional testing model to address this challenge. Specifically, we first present a comprehensive literature review of previous work, covering key facets of AI software testing processes. We then introduce a 3D classification model to systematically evaluate the image-based text extraction AI function, as well as test coverage criteria and complexity. To evaluate the performance of our proposed AI software quality test, we propose four evaluation metrics to cover different aspects. Finally, based on the proposed framework and defined metrics, a mobile Optical Character Recognition (OCR) case study is presented to demonstrate the framework's effectiveness and capability in assessing AI function quality.

What problem does this paper attempt to address?

### What problems does this paper attempt to solve? This paper aims to address the challenges in AI software testing, especially the quality assessment and verification issues of image - based text extraction (such as OCR) functions. Specifically, the paper attempts to solve the following key problems: 1. **Unique challenges in AI software testing**: - AI systems have large - scale unstructured input data, unpredictable scenarios, output uncertainty, and data - driven learning characteristics. These features make it difficult to directly apply traditional software testing methods to AI software. - Specifically for OCR systems, the challenges include model interpretability, lack of clear specifications and defined requirements, test input generation, defining test criteria, and managing dynamic environments. 2. **Effective test models for ensuring AI software quality**: - The paper proposes an effective AI software functional test model to address the above challenges. This model, through a systematic literature review, covers the key aspects of the AI software testing process and introduces a 3D classification model to systematically evaluate image - based text - extraction AI functions. 3. **Test coverage and complexity**: - Test coverage criteria and complexity analysis are proposed to ensure the comprehensiveness and effectiveness of testing. This helps to identify the key factors affecting OCR accuracy and reduce testing costs. 4. **Design of evaluation metrics**: - Four evaluation metrics (Flex Character Accuracy (FCA), String Segment Accuracy (SSA), Ordered String Segment Accuracy (OSSA), and Text - Line Accuracy (TLA)) are designed to evaluate the performance of OCR systems at different levels. 5. **Validation in practical applications**: - Through a case study based on mobile OCR applications (such as CamScanner and Scanner Pro), the effectiveness of the proposed test model and evaluation metrics is verified. The experimental results show that this model can effectively evaluate the performance of OCR systems in different scenarios. ### Summary In general, this paper is committed to developing a systematic AI software testing framework to ensure the quality and reliability of image - based text - extraction AI functions (such as OCR). By introducing new test models and evaluation metrics, the paper provides valuable references and practical guidance for the field of AI software testing.

Computer Vision Intelligence Test Modeling and Generation: A Case Study on Smart OCR

Human Visual Perception Based Image Quality Assessment for Video Prediction

Scene Text Detection and Recognition System for Visually Impaired People in Real World

The Computer Vision-based Tolerancing Callout Detection Model

Quality Prediction of AI Generated Images and Videos: Emerging Trends and Opportunities

Issue Based OCR Error Prediction in Video Streams

An Intelligent Remote Sensing Image Quality Inspection System

The tuberculous morbidity amongst African tuberculin positive and negative reactors.

Building Safe and Reliable AI systems for Safety Critical Tasks with Vision-Language Processing

A Computer Vision-Based Quality Assessment Technique for the automatic control of consumables for analytical laboratories

Toward Fully Automated Inspection of Critical Assets Supported by Autonomous Mobile Robots, Vision Sensors, and Artificial Intelligence

OCR is All you need: Importing Multi-Modality into Image-based Defect Detection System

AI-Compass: A Comprehensive and Effective Multi-module Testing Tool for AI Systems

Revolutionizing the Application of Automatic Inspection System for Industrial Parts Using AI Machine Vision Technology

A Visual Quality Assessment Method for Raster Images in Scanned Document

Adaptive Testing of Computer Vision Models

Vision Language Modeling of Content, Distortion and Appearance for Image Quality Assessment

Visual Verity in AI-Generated Imagery: Computational Metrics and Human-Centric Analysis

Computer Vision Based Quality Control for Additive Manufacturing Parts

A Survey of Text Detection and Recognition Algorithms Based on Deep Learning Technology

Retention of a fissure sealant six months after application