Abstract: Object detection plays a crucial role in medical diagnostics. Tasks such as organ segmentation and malignancy diagnosis typically require the corresponding anatomical structures to be localized first. Precise localization ensures that only the relevant regions are processed, which can reduce computational and storage demands. Conventional image detection approaches rely on numerous candidate boxes, resulting in redundant computation, so techniques that accurately detect objects in medical images without candidate boxes are of substantial practical significance. This paper introduces a 2D method for detecting objects in medical images that combines multi-agent deep Q-network reinforcement learning with a multi-scale image representation. The method constructs a collaborative environment for multiple agents. The agents separately control the upper-right and lower-left corners of the detection box and progressively converge toward the true endpoints through iterative interaction. To speed up detection, a multi-scale image representation divides the search into three scales. In the coarse-scale space, the agent first approaches the region containing the true endpoint and then oscillates around it. It then refines its position in the finer-scale spaces, advancing toward the true endpoint with smaller steps. The detection results show that collaborative detection among agents yields a 2.45% higher intersection over union (IoU) than non-collaborative detection. Because agents use different step sizes and fields of view in different scale spaces, detection time is reduced by 0.12 s compared with single-scale detection. Experimental results demonstrate that the proposed medical image object detection method outperforms prevailing mainstream detection algorithms.
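
The coarse-to-fine corner search and the intersection-over-union metric mentioned in the abstract can be illustrated with a minimal sketch. It is not the method of this paper: the learned deep Q-network policy is replaced by a greedy oracle that knows the target corner, and the step sizes `(8, 4, 1)`, the coordinates, and the names `iou`, `manhattan`, and `move_corner` are all assumptions chosen for illustration. The sketch only shows why large steps at a coarse scale followed by small steps at a fine scale can reach a corner in fewer moves than a single fine scale.

```python
from typing import Tuple

Point = Tuple[int, int]
Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max)

# Four unit moves available to a corner agent (hypothetical action set).
ACTIONS = [(1, 0), (-1, 0), (0, 1), (0, -1)]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def manhattan(p: Point, q: Point) -> int:
    """Manhattan distance between two grid points."""
    return abs(p[0] - q[0]) + abs(p[1] - q[1])


def move_corner(start: Point, target: Point,
                steps: Tuple[int, ...] = (8, 4, 1),
                max_iters: int = 1000) -> Tuple[Point, int]:
    """Move one corner toward `target`, coarse scale first.

    Greedy oracle stand-in for a learned policy (assumption): at each
    scale the agent takes the move that most reduces the distance to the
    target, and switches to the next finer step size once no move at the
    current scale brings it closer.
    Returns the final position and the number of moves taken.
    """
    pos, moves = start, 0
    for step in steps:
        for _ in range(max_iters):
            best = min(
                ((pos[0] + dx * step, pos[1] + dy * step) for dx, dy in ACTIONS),
                key=lambda p: manhattan(p, target),
            )
            if manhattan(best, target) >= manhattan(pos, target):
                break  # no progress possible at this scale
            pos, moves = best, moves + 1
    return pos, moves


if __name__ == "__main__":
    truth: Box = (37, 22, 90, 75)  # hypothetical ground-truth box
    # One agent per corner, as in the abstract (lower-left and upper-right).
    ll, ll_moves = move_corner((0, 0), (37, 22))
    ur, ur_moves = move_corner((128, 128), (90, 75))
    _, single_scale_moves = move_corner((0, 0), (37, 22), steps=(1,))
    predicted: Box = (ll[0], ll[1], ur[0], ur[1])
    print("predicted box:", predicted, "IoU:", iou(predicted, truth))
    print("lower-left moves, multi-scale vs single-scale:",
          ll_moves, "vs", single_scale_moves)
```

In a learned setting, the greedy choice inside `move_corner` would be replaced by the action with the highest predicted Q-value, and the switch to a finer scale would be triggered by observed oscillation, since the true target is not available at inference time.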

Enhancing Medical Image Object Detection with Collaborative Multi-Agent Deep Q-networks and Multi-Scale Representation
