IterVM: an Iterative Model for Single-Particle Cryo-EM Image Clustering Based on Variational Autoencoder and Multi-Reference Alignment

Guowei Ji,Yang,Hong-Bin Shen
DOI: https://doi.org/10.1109/bibm.2018.8621474
2018-01-01
Abstract:Cryo-electron microscopy (cryo-EM) is playing a more and more important role in single-particle structure determination. In order to achieve near-atomic resolution, 2D image analysis is the first and critical step. The main task is to cluster and align images into homogenous groups, and yield more clear images by averaging the images in the same group. The quality of the average images directly influences the reconstruction of 3D structures. The main difficulties lie in that the cryo-EM images are extremely noisy, and the orientation is randomly distributed and unknown. Therefore how to improve the clustering accuracy in such a low signal-to-noise-ratio (SNR) scenario is a key issue in cryo-EM data analysis. In this study, we present a new clustering method, named IterVM, which is an iterative model. In each iteration, it uses an unsupervised generative model, i.e. variational autoencoders (VAE), to learn the latent information contained in the images. After training the models, it obtains the decoded images from the training data and then clusters and aligns them as using a k-means based algorithm. From the experimental results, we find that the decoded images effectively reflect the real projection information such as orientation angles and structural heterogeneity. The results demonstrate that clustering on the decoded images produce better performance compared with the state-of-the-art clustering methods for cryo-EM data.
What problem does this paper attempt to address?