Abstract:Radiation-induced acoustic (RA) imaging is a promising technique for visualizing the invisible radiation energy deposition in tissues, enabling new imaging modalities and real-time therapy monitoring. However, RA imaging signal often suffers from poor signal-to-noise ratios (SNRs), thus requiring measuring hundreds or even thousands of frames for averaging to achieve satisfactory quality. This repetitive measurement increases ionizing radiation dose and degrades the temporal resolution of RA imaging, limiting its clinical utility. In this study, we developed a general deep inception convolutional neural network (GDI-CNN) to denoise RA signals to substantially reduce the number of frames needed for averaging. The network employs convolutions with multiple dilations in each inception block, allowing it to encode and decode signal features with varying temporal characteristics. This design generalizes GDI-CNN to denoise acoustic signals resulting from different radiation sources. The performance of the proposed method was evaluated using experimental data of X-ray-induced acoustic, protoacoustic, and electroacoustic signals both qualitatively and quantitatively. Results demonstrated the effectiveness of GDI-CNN: it achieved X-ray-induced acoustic image quality comparable to 750-frame-averaged results using only 10-frame-averaged measurements, reducing the imaging dose of X-ray-acoustic computed tomography (XACT) by 98.7%; it realized proton range accuracy parallel to 1500-frame-averaged results using only 20-frame-averaged measurements, improving the range verification frequency in proton therapy from 0.5Hz to 37.5Hz; it reached electroacoustic image quality comparable to 750-frame-averaged results using only a single frame signal, increasing the electric field monitoring frequency from 1 fps to 1k fps. Compared to lowpass filter-based denoising, the proposed method demonstrated considerably lower mean-squared-errors, higher peak-SNR, and higher structural similarities with respect to the corresponding high-frame-averaged measurements. The proposed deep learning-based denoising framework is a generalized method for few-frame-averaged acoustic signal denoising, which significantly improves the RA imaging's clinical utilities for low-dose imaging and real-time therapy monitoring.

Frame-Level Signal-to-Noise Ratio Estimation Using Deep Learning

Recurrent Neural Networks and Acoustic Features for Frame-Level Signal-to-Noise Ratio Estimation.

Learning Frame-Level Recurrent Neural Networks Representations for Query-by-Example Spoken Term Detection on Mobile Devices

Recurrent Neural Network Based Link Quality Prediction for Wireless Sensor Networks

Improving Deep Neural Network Based Speech Enhancement in Low SNR Environments

Sound Levels Forecasting in an Acoustic Sensor Network Using a Deep Neural Network

Frame Stacking and Retaining for Recurrent Neural Network Acoustic Model

Deep Long Short-Term Memory Adaptive Beamforming Networks For Multichannel Robust Speech Recognition

Deep causal speech enhancement and recognition using efficient long-short term memory Recurrent Neural Network

Radiation-induced acoustic signal denoising using a supervised deep learning framework for imaging and therapy monitoring

Multiple-target Deep Learning for LSTM-RNN Based Speech Enhancement

Towards Efficient Recurrent Architectures: A Deep LSTM Neural Network Applied to Speech Enhancement and Recognition

Deep Learning-Based Signal-to-Noise Ratio Estimation for Underwater Optical Wireless Communication

Deep Learning Seismic Random Noise Attenuation via Improved Residual Convolutional Neural Network

Improving the Signal‐to‐Noise Ratio of Seismological Datasets by Unsupervised Machine Learning

Monitoring Depth of Anesthesia Based on Hybrid Features and Recurrent Neural Network

Time-Frequency Mask Aware Bi-directional LSTM: A Deep Learning Approach for Underwater Acoustic Signal Separation

Design of a Deep Learning-Based Underwater Acoustic Sensor Transceiver

Speech Enhancement with LSTM Recurrent Neural Networks and its Application to Noise-Robust ASR

A Multi-Target SNR-Progressive Learning Approach to Regression Based Speech Enhancement.