A framework for falsifiable explanations of machine learning models with an application in computational pathology

David Schuhmacher,Stephanie Schörner,Claus Küpper,Frederik Großerueschkamp,Carlo Sternemann,Celine Lugnier,Anna-Lena Kraeft,Hendrik Jütte,Andrea Tannapfel,Anke Reinacher-Schick,Klaus Gerwert,Axel Mosig
DOI: https://doi.org/10.1016/j.media.2022.102594
Abstract:In recent years, deep learning has been the key driver of breakthrough developments in computational pathology and other image based approaches that support medical diagnosis and treatment. The underlying neural networks as inherent black boxes lack transparency and are often accompanied by approaches to explain their output. However, formally defining explainability has been a notorious unsolved riddle. Here, we introduce a hypothesis-based framework for falsifiable explanations of machine learning models. A falsifiable explanation is a hypothesis that connects an intermediate space induced by the model with the sample from which the data originate. We instantiate this framework in a computational pathology setting using hyperspectral infrared microscopy. The intermediate space is an activation map, which is trained with an inductive bias to localize tumor. An explanation is constituted by hypothesizing that activation corresponds to tumor and associated structures, which we validate by histological staining as an independent secondary experiment.
What problem does this paper attempt to address?