AI-based analysis of super-resolution microscopy: Biological discovery in the absence of ground truth

Ivan R. Nabi,Ben Cardoen,Ismail M. Khater,Guang Gao,Timothy H. Wong,Ghassan Hamarneh
2024-05-28
Abstract:Super-resolution microscopy, or nanoscopy, enables the use of fluorescent-based molecular localization tools to study molecular structure at the nanoscale level in the intact cell, bridging the mesoscale gap to classical structural biology methodologies. Analysis of super-resolution data by artificial intelligence (AI), such as machine learning, offers tremendous potential for discovery of new biology, that, by definition, is not known and lacks ground truth. Herein, we describe the application of weakly supervised paradigms to super-resolution microscopy and its potential to enable the accelerated exploration of the nanoscale architecture of subcellular macromolecules and organelles.
Subcellular Processes,Artificial Intelligence,Computer Vision and Pattern Recognition,Machine Learning,Biological Physics,Quantitative Methods
What problem does this paper attempt to address?
This paper explores how to utilize artificial intelligence (AI) to analyze super-resolution microscopy (SRM) data for accelerating biological discoveries, especially in the absence of ground truth information. SRM allows for the study of molecular structures at the nanoscale within cells, but its analysis typically requires detailed annotations from experts, which is both time-consuming and expensive. Traditional supervised machine learning relies on fully annotated data, but in SRM, such annotations are often impractical due to limited knowledge about the biology. The paper proposes the use of a weakly supervised learning paradigm, where AI is trained using partial information such as image-level class labels instead of pixel-level labels to recognize and localize objects in the images. This approach is suitable for SRM as it aims to identify and characterize subcellular structures that vary under different experimental conditions such as cell lines, gene expressions, mutations, infections, and drug treatments. Weak supervision can reduce dependence on expert annotations while accounting for the possibility that experts may not have a complete understanding of all the biological knowledge captured in these images. The paper also discusses self-supervised learning, where AI learns from information provided by the images themselves (e.g., rotation or noise) to reduce the need for a large amount of strongly supervised data. Combining different forms of supervision, such as semi-supervised learning, is also considered as a possible strategy. The paper emphasizes that despite the unprecedented insights provided by SRM in revealing subcellular structures, it still faces a "mesoscale gap" compared to high-resolution methods like electron microscopy (EM). EM offers subnanometer resolution, but for dynamic analysis and whole-cell analysis, fluorescence microscopy remains the preferred method, particularly SRM techniques that surpass the diffraction limit of light. Finally, the paper presents two examples demonstrating how weakly supervised AI methods can uncover new biological phenomena from SRM data, such as identifying novel domains of the CAV1 protein and sub-pixel resolution membrane contact points. These examples illustrate how biological prior knowledge can be leveraged to validate and discover new structures in the absence of pixel-level or object-level ground truth information. In summary, this paper aims to address the problem of utilizing AI to analyze SRM data for accelerating biological discoveries, especially in the absence of detailed annotation information. It also investigates how to overcome this challenge through weakly supervised and self-supervised learning strategies.