Improved deep learning based macromolecules structure classification from electron cryo tomograms

Chengqian Che, Ruogu Lin, Xiangrui Zeng, Karim Elmaaroufi, John Galeotti, Min Xu
DOI: https://doi.org/10.1007/s00138-018-0949-4
2017-07-16
Abstract: Cellular processes are governed by macromolecular complexes inside the cell. Studying the native structures of these complexes has been extremely difficult due to a lack of data. With recent breakthroughs in cellular electron cryo tomography (CECT) 3D imaging technology, researchers can now study macromolecular structures inside single cells. However, systematic recovery of macromolecular structures from CECT data is very difficult because of the high degree of structural complexity and practical imaging limitations. We previously proposed a deep learning based image classification approach for large-scale systematic macromolecular structure separation from CECT data. That work, however, was only an initial step towards exploring the full potential of deep learning based macromolecule separation. In this paper, we focus on improving classification performance by proposing three newly designed CNN models: an extended version of Deep Small Receptive Field (DSRF3D), denoted DSRF3D-v2; a 3D residual block based neural network, named RB3D; and a convolutional 3D (C3D) based model, CB3D. We compare them with our previously developed model (DSRF3D) on 12 datasets with different SNRs and tilt angle ranges. The experiments show that the new models achieve significantly higher classification accuracies. Accuracies are higher than 0.9 on normal datasets, and the models also show potential to operate on datasets with high levels of noise and pronounced missing wedge effects.
Quantitative Methods
What problem does this paper attempt to address?
This paper addresses the problem of systematically recovering macromolecular structures from cellular electron cryo tomography (CECT) images. Recovery is difficult because intracellular macromolecular structures are highly complex and imaging is subject to practical limitations. The cytoplasm is densely packed, which makes the cellular environment very "crowded", and dynamic interactions between macromolecules produce even more complex and heterogeneous structures. In addition, current techniques such as single-particle cryo-electron microscopy (cryo-EM) require large-scale datasets, usually containing images of thousands of macromolecules, which further increases the processing difficulty. To address these challenges, the authors propose a deep learning based image classification method for large-scale, systematic separation of macromolecular structures in CECT data. Compared with their previous work, this paper focuses on improving classification performance and proposes three new convolutional neural network (CNN) models: DSRF3D-v2, RB3D and CB3D. The models were tested on 12 datasets with different signal-to-noise ratios (SNRs) and tilt-angle ranges. The experimental results show that the new models achieve significantly higher classification accuracy and still perform well under high noise and severe missing-wedge effects. In particular, the CB3D model has an accuracy close to 0.9 on normal datasets and also shows good classification performance on datasets with an extremely low SNR (0.01).
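The two axes along which the 12 datasets vary, SNR and tilt-angle range, can be illustrated with a minimal sketch. It assumes that SNR is the ratio of signal variance to noise variance and that the data come from a single-axis tilt series covering ±θ degrees. Neither that SNR definition nor the helper names (add_noise_at_snr, empirical_snr, missing_wedge_fraction) come from the paper, and the authors' actual simulation pipeline may well differ.

```python
import random
import statistics


def add_noise_at_snr(signal, snr, seed=0):
    """Add zero-mean Gaussian noise so that var(signal) / var(noise) ~= snr.

    ASSUMPTION: SNR is a variance ratio; the paper may define it differently.
    """
    if snr <= 0:
        raise ValueError("snr must be positive")
    rng = random.Random(seed)
    noise_sigma = (statistics.pvariance(signal) / snr) ** 0.5
    return [value + rng.gauss(0.0, noise_sigma) for value in signal]


def empirical_snr(clean, noisy):
    """Measure the variance-ratio SNR of a noisy signal against its clean version."""
    noise = [n - c for c, n in zip(clean, noisy)]
    return statistics.pvariance(clean) / statistics.pvariance(noise)


def missing_wedge_fraction(max_tilt_deg):
    """Fraction of Fourier space left unsampled by a single-axis tilt series.

    ASSUMPTION: tilts span -max_tilt_deg..+max_tilt_deg around one axis,
    so the unsampled wedge covers (90 - max_tilt_deg) / 90 of the angular range.
    """
    if not 0 < max_tilt_deg <= 90:
        raise ValueError("max_tilt_deg must be in (0, 90]")
    return 1.0 - max_tilt_deg / 90.0


# Toy stand-in for a flattened subtomogram (hypothetical data, not from the paper).
toy_rng = random.Random(1)
clean = [toy_rng.random() for _ in range(20000)]

noisy = add_noise_at_snr(clean, snr=0.01)
measured = empirical_snr(clean, noisy)   # close to the requested 0.01
wedge = missing_wedge_fraction(60)       # one third of Fourier space is missing
```

At an SNR of 0.01 the noise variance is one hundred times the signal variance, which gives a sense of why classification at that level is demanding. A narrower tilt range enlarges the missing wedge: under the same assumptions, ±40° leaves more than half of Fourier space unsampled.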