Learning Interpretable Microscopic Features of Tumor by Multi-task Adversarial CNNs To Improve Generalization

Mara Graziani,Sebastian Otalora,Stephane Marchand-Maillet,Henning Muller,Vincent Andrearczyk
DOI: https://doi.org/10.59275/j.melba.2023-3462
2023-06-21
Abstract:Adopting Convolutional Neural Networks (CNNs) in the daily routine of primary diagnosis requires not only near-perfect precision, but also a sufficient degree of generalization to data acquisition shifts and transparency. Existing CNN models act as black boxes, not ensuring to the physicians that important diagnostic features are used by the model. Building on top of successfully existing techniques such as multi-task learning, domain adversarial training and concept-based interpretability, this paper addresses the challenge of introducing diagnostic factors in the training objectives. Here we show that our architecture, by learning end-to-end an uncertainty-based weighting combination of multi-task and adversarial losses, is encouraged to focus on pathology features such as density and pleomorphism of nuclei, e.g. variations in size and appearance, while discarding misleading features such as staining differences. Our results on breast lymph node tissue show significantly improved generalization in the detection of tumorous tissue, with best average AUC 0.89 (0.01) against the baseline AUC 0.86 (0.005). By applying the interpretability technique of linearly probing intermediate representations, we also demonstrate that interpretable pathology features such as nuclei density are learned by the proposed CNN architecture, confirming the increased transparency of this model. This result is a starting point towards building interpretable multi-task architectures that are robust to data heterogeneity. Our code is available at <a class="link-external link-https" href="https://github.com/maragraziani/multitask_adversarial" rel="external noopener nofollow">this https URL</a>
Computer Vision and Pattern Recognition,Image and Video Processing
What problem does this paper attempt to address?
The paper attempts to address the challenges faced when using Convolutional Neural Networks (CNN) in pathological diagnosis, specifically including: 1. **Improving model transparency**: Existing CNN models are often viewed as "black boxes," making it difficult to ensure whether the model is using important diagnostic features. Therefore, researchers aim to develop a method that can identify and utilize key pathological features (such as nuclear density and atypia) while excluding misleading features (such as staining differences and acquisition domain changes). 2. **Enhancing model generalization**: Current CNN models are easily disturbed by the diversity in datasets (such as staining, fixation methods, slicing differences, etc.), leading to performance degradation. Thus, researchers hope to improve the model's adaptability and robustness to data from different sources by introducing multi-task learning and adversarial training techniques. 3. **Achieving interpretable deep learning**: To ensure the safety and reliability of the model, it needs to have a certain level of transparency so that clinicians can understand its decision-making process. This paper proposes a new architecture that combines multi-task learning and adversarial training, aiming to guide the model to focus on relevant diagnostic factors. Additionally, the model's ability to learn interpretable pathological features is validated through linear probing of intermediate representations. In summary, the core objective of the paper is to construct a multi-task CNN architecture that is both highly transparent and capable of effectively handling data heterogeneity, thereby improving the accuracy of tumor tissue detection and the interpretability of the model.