| Literature DB >> 28881965 |
Min Xu1, Xiaoqi Chai2, Hariank Muthakana3, Xiaodan Liang4, Ge Yang2, Tzviya Zeev-Ben-Mordehai5, Eric P Xing4.
Abstract
MOTIVATION: Cellular Electron CryoTomography (CECT) enables 3D visualization of cellular organization at near-native state and in sub-molecular resolution, making it a powerful tool for analyzing structures of macromolecular complexes and their spatial organizations inside single cells. However, high degree of structural complexity together with practical imaging limitations makes the systematic de novo discovery of structures within cells challenging. It would likely require averaging and classifying millions of subtomograms potentially containing hundreds of highly heterogeneous structural classes. Although it is no longer difficult to acquire CECT data containing such amount of subtomograms due to advances in data acquisition automation, existing computational approaches have very limited scalability or discrimination ability, making them incapable of processing such amount of data.Entities:
Mesh:
Year: 2017 PMID: 28881965 PMCID: PMC5946875 DOI: 10.1093/bioinformatics/btx230
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1Architectures of our CNN models. These networks both stack multiple layers. Each box represents a layer in the network. The type and configuration of layer are listed in each box. For example, ‘32-5 × 5 × 5-1 Conv’ denotes a 3D convolutional layer with 32 5 × 5 × 5 filters and stride 1. ‘2 × 2 × 2-2 MaxPool’ denotes a 3D max pooling layer implementing max operation over 2 × 2 × 2 regions with stride 2. ‘FC-512’ and ‘FC-L’ denote a fully connected linear layer with 512 and L neurons respectively, where every neuron is connected to every output of the previous layer. L is the number of classes in the training dataset. ‘ReLU’ and ‘Softmax’ denote different types of activation layers
Fig. 3Left: Isosurface of density map of yeast 20S proteasome (PDB ID: 3DY4). Right: Center slices (in parallel with x–z plane) in the simulated subtomograms with different degree of SNRs and tilt angle ranges
The classification accuracy of simulated datasets of subtomograms at different levels of SNR and tilt angle range
| SNR/Tilt angle range | ±60° | ±50° | ±40° | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Inception3D | DSRF3D | RIF-SVM | Inception3D | DSRF3D | RIF-SVM | Inception3D | DSRF3D | RIF-SVM | |
| 1000 | 0.993 | 0.990 | 0.992 | 0.994 | 0.978 | 0.983 | 0.983 | 0.991 | 0.967 |
| 0.5 | 0.975 | 0.972 | 0.929 | 0.964 | 0.967 | 0.885 | 0.931 | 0.951 | 0.857 |
| 0.1 | 0.851 | 0.891 | 0.762 | 0.807 | 0.873 | 0.633 | 0.809 | 0.866 | 0.649 |
| 0.05 | 0.757 | 0.767 | 0.592 | 0.682 | 0.728 | 0.455 | 0.637 | 0.684 | 0.468 |
| 0.03 | 0.608 | 0.658 | 0.446 | 0.516 | 0.604 | 0.319 | 0.473 | 0.556 | 0.341 |
Fig. 2Subtomograms in the test set projected to the structural feature space of through structural feature extraction (Section 2.3). The projected subtomograms were further embedded to using T-SNE (Maaten and Hinton, 2008) only for visual inspection. The points were shaped and colored according to their true class labels. The region enriched with Proteasome subtomograms (PDB ID: 3DY4) was highlighted using red circle
Fig. 4The isosurfaces of true (left) and predicted (right) structures. The predicted structures were obtained by our approach (Section 2.3). The numbers in parentheses were structural discrepancy between true and predicted structures