1
|
Zhan X, Zeng X, Uddin MR, Xu M. AITom: AI-guided cryo-electron tomography image analyses toolkit. J Struct Biol 2025; 217:108207. [PMID: 40378936 DOI: 10.1016/j.jsb.2025.108207] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2024] [Revised: 04/20/2025] [Accepted: 04/28/2025] [Indexed: 05/19/2025]
Abstract
Cryo-electron tomography (cryo-ET) is an essential tool in structural biology, uniquely capable of visualizing three-dimensional macromolecular complexes within their native cellular environments, thereby providing profound molecular-level insights. Despite its significant promise, cryo-ET faces persistent challenges in the systematic localization, identification, segmentation, and structural recovery of three-dimensional subcellular components, necessitating the development of efficient and accurate large-scale image analysis methods. In response to these complexities, this paper introduces AITom, an open-source artificial intelligence platform tailored for cryo-ET researchers. AITom integrates a comprehensive suite of public and proprietary algorithms, supporting both traditional template-based and template-free approaches, alongside state-of-the-art deep learning methodologies for cryo-ET data analysis. By incorporating diverse computational strategies, AITom enables researchers to more effectively tackle the complexities inherent in cryo-ET, facilitating precise analysis and interpretation of complex biological structures. Furthermore, AITom provides extensive tutorials for each analysis module, offering valuable guidance to users in utilizing its comprehensive functionalities.
Collapse
Affiliation(s)
- Xueying Zhan
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA, United States
| | - Xiangrui Zeng
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA, United States
| | - Mostofa Rafid Uddin
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA, United States
| | - Min Xu
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA, United States.
| |
Collapse
|
2
|
Meng W, Yu X, Zhang T, Han R. A noise-robust classification method for cryo-ET subtomograms with out-of-distribution detection. BIOINFORMATICS (OXFORD, ENGLAND) 2025; 41:btaf274. [PMID: 40358513 DOI: 10.1093/bioinformatics/btaf274] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/05/2025] [Revised: 03/27/2025] [Accepted: 05/12/2025] [Indexed: 05/15/2025]
Abstract
MOTIVATION Cryogenic electron tomography (cryo-ET) enables high-resolution 3D reconstruction of biological samples, with accurate subtomogram classification critical for structural analysis. However, current subtomogram classification methods often struggle with out-of-distribution (OOD) data issue, causing misclassification and mismatched structures. RESULTS To solve this problem, we propose a unified subtomogram classification framework that incorporates OOD detection to distinguish unknown (OOD) from known (in-distribution, ID) classes and predict labels for ID data, thereby enhancing existing subtomogram classification methods. Within this framework, we develop a noise-robust classification method that integrates a 3D discrete wavelet transform-based encoder to reduce high-frequency noise and extract robust features. Additionally, we incorporate a Mahalanobis distance-based OOD detector with a reliable metric for 3D subtomograms and introduce an adaptive classifier that adjusts to accommodate datasets of varying scales. The experimental and visualization results demonstrate that our noise-robust method improves subtomogram classification accuracy and effectively models features while enhancing OOD detection. AVAILABILITY AND IMPLEMENTATION Our code is available at https://github.com/yxs1137/Subtomo-Classification-with-OOD.git. The real data used in this study can be accessed through CryoET Data Portal.
Collapse
Affiliation(s)
- Wenjia Meng
- School of Software, Shandong University, Jinan 250101, China
| | - Xueshi Yu
- School of Software, Shandong University, Jinan 250101, China
| | - Tingting Zhang
- School of Software, Shandong University, Jinan 250101, China
| | - Renmin Han
- College of Medical Information and Engineering, Ningxia Medical University, Yinchuan 750004, China
- Research Center for Mathematics and Interdisciplinary Sciences (Ministry of Education Frontiers Science Center for Nonlinear Expectations), Shandong University, Qingdao 266237, China
| |
Collapse
|
3
|
Yu X, Han R, Jiao H, Meng W. Few-shot classification of Cryo-ET subvolumes with deep Brownian distance covariance. Brief Bioinform 2024; 26:bbae643. [PMID: 39668336 PMCID: PMC11637689 DOI: 10.1093/bib/bbae643] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2024] [Revised: 11/04/2024] [Accepted: 11/27/2024] [Indexed: 12/14/2024] Open
Abstract
Few-shot learning is a crucial approach for macromolecule classification of the cryo-electron tomography (Cryo-ET) subvolumes, enabling rapid adaptation to novel tasks with a small support set of labeled data. However, existing few-shot classification methods for macromolecules in Cryo-ET consider only marginal distributions and overlook joint distributions, failing to capture feature dependencies fully. To address this issue, we propose a method for macromolecular few-shot classification using deep Brownian Distance Covariance (BDC). Our method models the joint distribution within a transfer learning framework, enhancing the modeling capabilities. We insert the BDC module after the feature extractor and only train the feature extractor during the training phase. Then, we enhance the model's generalization capability with self-distillation techniques. In the adaptation phase, we fine-tune the classifier with minimal labeled data. We conduct experiments on publicly available SHREC datasets and a small-scale synthetic dataset to evaluate our method. Results show that our method improves the classification capabilities by introducing the joint distribution.
Collapse
Affiliation(s)
- Xueshi Yu
- School of Software, Shandong University, 1500 Shunhua Road, 250101 Jinan, China
| | - Renmin Han
- Research Center for Mathematics and Interdisciplinary Sciences, Shandong University, 72 Binhai Road, 266000 Qingdao, China
| | - Haitao Jiao
- Jinan Center For Disease Control And Prevention, Shandong University, 2 Weiliu Road, 250021 Jinan, China
| | - Wenjia Meng
- School of Software, Shandong University, 1500 Shunhua Road, 250101 Jinan, China
| |
Collapse
|
4
|
Zhao C, Lu D, Zhao Q, Ren C, Zhang H, Zhai J, Gou J, Zhu S, Zhang Y, Gong X. Computational methods for in situ structural studies with cryogenic electron tomography. Front Cell Infect Microbiol 2023; 13:1135013. [PMID: 37868346 PMCID: PMC10586593 DOI: 10.3389/fcimb.2023.1135013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2022] [Accepted: 08/29/2023] [Indexed: 10/24/2023] Open
Abstract
Cryo-electron tomography (cryo-ET) plays a critical role in imaging microorganisms in situ in terms of further analyzing the working mechanisms of viruses and drug exploitation, among others. A data processing workflow for cryo-ET has been developed to reconstruct three-dimensional density maps and further build atomic models from a tilt series of two-dimensional projections. Low signal-to-noise ratio (SNR) and missing wedge are two major factors that make the reconstruction procedure challenging. Because only few near-atomic resolution structures have been reconstructed in cryo-ET, there is still much room to design new approaches to improve universal reconstruction resolutions. This review summarizes classical mathematical models and deep learning methods among general reconstruction steps. Moreover, we also discuss current limitations and prospects. This review can provide software and methods for each step of the entire procedure from tilt series by cryo-ET to 3D atomic structures. In addition, it can also help more experts in various fields comprehend a recent research trend in cryo-ET. Furthermore, we hope that more researchers can collaborate in developing computational methods and mathematical models for high-resolution three-dimensional structures from cryo-ET datasets.
Collapse
Affiliation(s)
- Cuicui Zhao
- Mathematical Intelligence Application LAB, Institute for Mathematical Sciences, Renmin University of China, Beijing, China
| | - Da Lu
- Mathematical Intelligence Application LAB, Institute for Mathematical Sciences, Renmin University of China, Beijing, China
| | - Qian Zhao
- Mathematical Intelligence Application LAB, Institute for Mathematical Sciences, Renmin University of China, Beijing, China
| | - Chongjiao Ren
- Mathematical Intelligence Application LAB, Institute for Mathematical Sciences, Renmin University of China, Beijing, China
| | - Huangtao Zhang
- Mathematical Intelligence Application LAB, Institute for Mathematical Sciences, Renmin University of China, Beijing, China
| | - Jiaqi Zhai
- Mathematical Intelligence Application LAB, Institute for Mathematical Sciences, Renmin University of China, Beijing, China
| | - Jiaxin Gou
- Mathematical Intelligence Application LAB, Institute for Mathematical Sciences, Renmin University of China, Beijing, China
| | - Shilin Zhu
- Mathematical Intelligence Application LAB, Institute for Mathematical Sciences, Renmin University of China, Beijing, China
| | - Yaqi Zhang
- Mathematical Intelligence Application LAB, Institute for Mathematical Sciences, Renmin University of China, Beijing, China
| | - Xinqi Gong
- Mathematical Intelligence Application LAB, Institute for Mathematical Sciences, Renmin University of China, Beijing, China
- Beijing Academy of Intelligence, Beijing, China
| |
Collapse
|
5
|
Isotropic reconstruction for electron tomography with deep learning. Nat Commun 2022; 13:6482. [PMID: 36309499 PMCID: PMC9617606 DOI: 10.1038/s41467-022-33957-8] [Citation(s) in RCA: 144] [Impact Index Per Article: 48.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2021] [Accepted: 10/05/2022] [Indexed: 12/25/2022] Open
Abstract
Cryogenic electron tomography (cryoET) allows visualization of cellular structures in situ. However, anisotropic resolution arising from the intrinsic "missing-wedge" problem has presented major challenges in visualization and interpretation of tomograms. Here, we have developed IsoNet, a deep learning-based software package that iteratively reconstructs the missing-wedge information and increases signal-to-noise ratio, using the knowledge learned from raw tomograms. Without the need for sub-tomogram averaging, IsoNet generates tomograms with significantly reduced resolution anisotropy. Applications of IsoNet to three representative types of cryoET data demonstrate greatly improved structural interpretability: resolving lattice defects in immature HIV particles, establishing architecture of the paraflagellar rod in Eukaryotic flagella, and identifying heptagon-containing clathrin cages inside a neuronal synapse of cultured cells. Therefore, by overcoming two fundamental limitations of cryoET, IsoNet enables functional interpretation of cellular tomograms without sub-tomogram averaging. Its application to high-resolution cellular tomograms should also help identify differently oriented complexes of the same kind for sub-tomogram averaging.
Collapse
|
6
|
Hajarolasvadi N, Sunkara V, Khavnekar S, Beck F, Brandt R, Baum D. Volumetric macromolecule identification in cryo-electron tomograms using capsule networks. BMC Bioinformatics 2022; 23:360. [PMID: 36042418 PMCID: PMC9429335 DOI: 10.1186/s12859-022-04901-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2022] [Accepted: 08/23/2022] [Indexed: 11/29/2022] Open
Abstract
Background Despite recent advances in cellular cryo-electron tomography (CET), developing automated tools for macromolecule identification in submolecular resolution remains challenging due to the lack of annotated data and high structural complexities. To date, the extent of the deep learning methods constructed for this problem is limited to conventional Convolutional Neural Networks (CNNs). Identifying macromolecules of different types and sizes is a tedious and time-consuming task. In this paper, we employ a capsule-based architecture to automate the task of macromolecule identification, that we refer to as 3D-UCaps. In particular, the architecture is composed of three components: feature extractor, capsule encoder, and CNN decoder. The feature extractor converts voxel intensities of input sub-tomograms to activities of local features. The encoder is a 3D Capsule Network (CapsNet) that takes local features to generate a low-dimensional representation of the input. Then, a 3D CNN decoder reconstructs the sub-tomograms from the given representation by upsampling. Results We performed binary and multi-class localization and identification tasks on synthetic and experimental data. We observed that the 3D-UNet and the 3D-UCaps had an \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$F_1-$$\end{document}F1-score mostly above 60% and 70%, respectively, on the test data. In both network architectures, we observed degradation of at least 40% in the \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$F_1$$\end{document}F1-score when identifying very small particles (PDB entry 3GL1) compared to a large particle (PDB entry 4D8Q). In the multi-class identification task of experimental data, 3D-UCaps had an \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$F_1$$\end{document}F1-score of 91% on the test data in contrast to 64% of the 3D-UNet. The better \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$F_1$$\end{document}F1-score of 3D-UCaps compared to 3D-UNet is obtained by a higher precision score. We speculate this to be due to the capsule network employed in the encoder. To study the effect of the CapsNet-based encoder architecture further, we performed an ablation study and perceived that the \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$F_1$$\end{document}F1-score is boosted as network depth is increased which is in contrast to the previously reported results for the 3D-UNet. To present a reproducible work, source code, trained models, data as well as visualization results are made publicly available. Conclusion Quantitative and qualitative results show that 3D-UCaps successfully perform various downstream tasks including identification and localization of macromolecules and can at least compete with CNN architectures for this task. Given that the capsule layers extract both the existence probability and the orientation of the molecules, this architecture has the potential to lead to representations of the data that are better interpretable than those of 3D-UNet. Supplementary Information The online version contains supplementary material available at 10.1186/s12859-022-04901-w.
Collapse
Affiliation(s)
- Noushin Hajarolasvadi
- Department of Visual and Data-Centric Computing, Zuse Institute Berlin, Takustraße 7, 14195, Berlin, Germany.
| | - Vikram Sunkara
- Department of Visual and Data-Centric Computing, Zuse Institute Berlin, Takustraße 7, 14195, Berlin, Germany
| | - Sagar Khavnekar
- Department of CryoEM Technology, Max Planck Institute of Biochemistry, Am Klopferspitz 18, 82152, Martinsried, Germany
| | - Florian Beck
- Department of CryoEM Technology, Max Planck Institute of Biochemistry, Am Klopferspitz 18, 82152, Martinsried, Germany
| | - Robert Brandt
- Materials and Structural Analysis, Thermo Fisher Scientific, Takustraße 7, 14195, Berlin, Germany
| | - Daniel Baum
- Department of Visual and Data-Centric Computing, Zuse Institute Berlin, Takustraße 7, 14195, Berlin, Germany
| |
Collapse
|
7
|
Treder KP, Huang C, Kim JS, Kirkland AI. Applications of deep learning in electron microscopy. Microscopy (Oxf) 2022; 71:i100-i115. [DOI: 10.1093/jmicro/dfab043] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2021] [Revised: 08/30/2021] [Accepted: 11/08/2021] [Indexed: 12/25/2022] Open
Abstract
Abstract
We review the growing use of machine learning in electron microscopy (EM) driven in part by the availability of fast detectors operating at kiloHertz frame rates leading to large data sets that cannot be processed using manually implemented algorithms. We summarize the various network architectures and error metrics that have been applied to a range of EM-related problems including denoising and inpainting. We then provide a review of the application of these in both physical and life sciences, highlighting how conventional networks and training data have been specifically modified for EM.
Collapse
Affiliation(s)
- Kevin P Treder
- Department of Materials, University of Oxford, Oxford, Oxfordshire OX1 3PH, UK
| | - Chen Huang
- Rosalind Franklin Institute, Harwell Research Campus, Didcot, Oxfordshire OX11 0FA, UK
| | - Judy S Kim
- Department of Materials, University of Oxford, Oxford, Oxfordshire OX1 3PH, UK
- Rosalind Franklin Institute, Harwell Research Campus, Didcot, Oxfordshire OX11 0FA, UK
| | - Angus I Kirkland
- Department of Materials, University of Oxford, Oxford, Oxfordshire OX1 3PH, UK
- Rosalind Franklin Institute, Harwell Research Campus, Didcot, Oxfordshire OX11 0FA, UK
| |
Collapse
|
8
|
Bandyopadhyay H, Deng Z, Ding L, Liu S, Uddin MR, Zeng X, Behpour S, Xu M. Cryo-shift: reducing domain shift in cryo-electron subtomograms with unsupervised domain adaptation and randomization. Bioinformatics 2022; 38:977-984. [PMID: 34897387 DOI: 10.1093/bioinformatics/btab794] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2021] [Revised: 10/18/2021] [Accepted: 11/17/2021] [Indexed: 02/05/2023] Open
Abstract
MOTIVATION Cryo-Electron Tomography (cryo-ET) is a 3D imaging technology that enables the visualization of subcellular structures in situ at near-atomic resolution. Cellular cryo-ET images help in resolving the structures of macromolecules and determining their spatial relationship in a single cell, which has broad significance in cell and structural biology. Subtomogram classification and recognition constitute a primary step in the systematic recovery of these macromolecular structures. Supervised deep learning methods have been proven to be highly accurate and efficient for subtomogram classification, but suffer from limited applicability due to scarcity of annotated data. While generating simulated data for training supervised models is a potential solution, a sizeable difference in the image intensity distribution in generated data as compared with real experimental data will cause the trained models to perform poorly in predicting classes on real subtomograms. RESULTS In this work, we present Cryo-Shift, a fully unsupervised domain adaptation and randomization framework for deep learning-based cross-domain subtomogram classification. We use unsupervised multi-adversarial domain adaption to reduce the domain shift between features of simulated and experimental data. We develop a network-driven domain randomization procedure with 'warp' modules to alter the simulated data and help the classifier generalize better on experimental data. We do not use any labeled experimental data to train our model, whereas some of the existing alternative approaches require labeled experimental samples for cross-domain classification. Nevertheless, Cryo-Shift outperforms the existing alternative approaches in cross-domain subtomogram classification in extensive evaluation studies demonstrated herein using both simulated and experimental data. AVAILABILITYAND IMPLEMENTATION https://github.com/xulabs/aitom. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Hmrishav Bandyopadhyay
- Department of Electronics and Telecommunication Engineering, Jadavpur University, Kolkata 700032, India
| | - Zihao Deng
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Leiting Ding
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Sinuo Liu
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Mostofa Rafid Uddin
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Xiangrui Zeng
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Sima Behpour
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Min Xu
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| |
Collapse
|
9
|
Gao S, Han R, Zeng X, Liu Z, Xu M, Zhang F. Macromolecules Structural Classification With a 3D Dilated Dense Network in Cryo-Electron Tomography. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:209-219. [PMID: 33729943 PMCID: PMC8446108 DOI: 10.1109/tcbb.2021.3065986] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
Cryo-electron tomography, combined with subtomogram averaging (STA), can reveal three-dimensional (3D) macromolecule structures in the near-native state from cells and other biological samples. In STA, to get a high-resolution 3D view of macromolecule structures, diverse macromolecules captured by the cellular tomograms need to be accurately classified. However, due to the poor signal-to-noise-ratio (SNR) and severe ray artifacts in the tomogram, it remains a major challenge to classify macromolecules with high accuracy. In this paper, we propose a new convolutional neural network, named 3D-Dilated-DenseNet, to improve the performance of macromolecule classification. In 3D-Dilated-DenseNet, there are two key strategies to guarantee macromolecule classification accuracy: 1) Using dense connections to enhance feature map utilization (corresponding to the baseline 3D-C-DenseNet); 2) Adopting dilated convolution to enrich multi-level information in feature maps. We tested 3D-Dilated-DenseNet and 3D-C-DenseNet both on synthetic data and experimental data. The results show that, on synthetic data, compared with the state-of-the-art method in the SHREC contest (SHREC-CNN), both 3D-C-DenseNet and 3D-Dilated-DenseNet outperform SHREC-CNN. In particular, 3D-Dilated-DenseNet improves 0.393 of F1 metric on tiny-size macromolecules and 0.213 on small-size macromolecules. On experimental data, compared with 3D-C-DenseNet, 3D-Dilated-DenseNet can increase classification performance by 2.1 percent.
Collapse
|
10
|
Yu L, Li R, Zeng X, Wang H, Jin J, Ge Y, Jiang R, Xu M. Few shot domain adaptation for in situ macromolecule structural classification in cryoelectron tomograms. Bioinformatics 2021; 37:185-191. [PMID: 32722755 DOI: 10.1093/bioinformatics/btaa671] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2020] [Revised: 07/06/2020] [Accepted: 07/20/2020] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Cryoelectron tomography (cryo-ET) visualizes structure and spatial organization of macromolecules and their interactions with other subcellular components inside single cells in the close-to-native state at submolecular resolution. Such information is critical for the accurate understanding of cellular processes. However, subtomogram classification remains one of the major challenges for the systematic recognition and recovery of the macromolecule structures in cryo-ET because of imaging limits and data quantity. Recently, deep learning has significantly improved the throughput and accuracy of large-scale subtomogram classification. However, often it is difficult to get enough high-quality annotated subtomogram data for supervised training due to the enormous expense of labeling. To tackle this problem, it is beneficial to utilize another already annotated dataset to assist the training process. However, due to the discrepancy of image intensity distribution between source domain and target domain, the model trained on subtomograms in source domain may perform poorly in predicting subtomogram classes in the target domain. RESULTS In this article, we adapt a few shot domain adaptation method for deep learning-based cross-domain subtomogram classification. The essential idea of our method consists of two parts: (i) take full advantage of the distribution of plentiful unlabeled target domain data, and (ii) exploit the correlation between the whole source domain dataset and few labeled target domain data. Experiments conducted on simulated and real datasets show that our method achieves significant improvement on cross domain subtomogram classification compared with baseline methods. AVAILABILITY AND IMPLEMENTATION Software is available online https://github.com/xulabs/aitom. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Liangyong Yu
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Ran Li
- Department of Automation, Tsinghua University, Beijing 100084, China
| | - Xiangrui Zeng
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Hongyi Wang
- Department of Electronic Engineering, Tsinghua University, Beijing 100084, China
| | - Jie Jin
- National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Science, Beijing 100190, China
| | - Yang Ge
- National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Science, Beijing 100190, China
| | - Rui Jiang
- Department of Automation, Tsinghua University, Beijing 100084, China
| | - Min Xu
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| |
Collapse
|
11
|
Du X, Wang H, Zhu Z, Zeng X, Chang YW, Zhang J, Xing E, Xu M. Active Learning to Classify Macromolecular Structures in situ for Less Supervision in Cryo-Electron Tomography. Bioinformatics 2021; 37:2340-2346. [PMID: 33620460 DOI: 10.1093/bioinformatics/btab123] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2020] [Revised: 01/14/2021] [Accepted: 02/22/2021] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Cryo-Electron Tomography (cryo-ET) is a 3D bioimaging tool that visualizes the structural and spatial organization of macromolecules at a near-native state in single cells, which has broad applications in life science. However, the systematic structural recognition and recovery of macromolecules captured by cryo-ET are difficult due to high structural complexity and imaging limits. Deep learning based subtomogram classification have played critical roles for such tasks. As supervised approaches, however, their performance relies on sufficient and laborious annotation on a large training dataset. RESULTS To alleviate this major labeling burden, we proposed a Hybrid Active Learning (HAL) framework for querying subtomograms for labelling from a large unlabeled subtomogram pool. Firstly, HAL adopts uncertainty sampling to select the subtomograms that have the most uncertain predictions. This strategy enforces the model to be aware of the inductive bias during classification and subtomogram selection, which satisfies the discriminativeness principle in AL literature. Moreover, to mitigate the sampling bias caused by such strategy, a discriminator is introduced to judge if a certain subtomogram is labeled or unlabeled and subsequently the model queries the subtomogram that have higher probabilities to be unlabeled. Such query strategy encourages to match the data distribution between the labeled and unlabeled subtomogram samples, which essentially encodes the representativeness criterion into the subtomogram selection process. Additionally, HAL introduces a subset sampling strategy to improve the diversity of the query set, so that the information overlap is decreased between the queried batches and the algorithmic efficiency is improved. Our experiments on subtomogram classification tasks using both simulated and real data demonstrate that we can achieve comparable testing performance (on average only 3% accuracy drop) by using less than 30% of the labeled subtomograms, which shows a very promising result for subtomogram classification task with limited labeling resources. AVAILABILITY https://github.com/xulabs/aitom.
Collapse
Affiliation(s)
- Xuefeng Du
- Department of Computer Science, University of Wisconsin-Madison, Madison, 53706, USA
| | - Haohan Wang
- Language Technologies Institute, Carnegie Mellon University, Pittsburgh, 15213, USA
| | - Zhenxi Zhu
- Department of Computer Science, Beijing University of Posts and Telecommunications, 100876, China
| | - Xiangrui Zeng
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, 15213, USA
| | - Yi-Wei Chang
- Department of Biochemistry and Biophysics, University of Pennsylvania, Philadelphia, 19104, USA
| | - Jing Zhang
- Department of Computer Science, University of California - Irvine, Irvine, 92697, USA
| | - Eric Xing
- Machine Learning Department, Carnegie Mellon University, Pittsburgh, 15213, USA
| | - Min Xu
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, 15213, USA
| |
Collapse
|
12
|
Zhou B, Yu H, Zeng X, Yang X, Zhang J, Xu M. One-Shot Learning With Attention-Guided Segmentation in Cryo-Electron Tomography. Front Mol Biosci 2021; 7:613347. [PMID: 33511158 PMCID: PMC7835881 DOI: 10.3389/fmolb.2020.613347] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2020] [Accepted: 12/09/2020] [Indexed: 11/13/2022] Open
Abstract
Cryo-electron Tomography (cryo-ET) generates 3D visualization of cellular organization that allows biologists to analyze cellular structures in a near-native state with nano resolution. Recently, deep learning methods have demonstrated promising performance in classification and segmentation of macromolecule structures captured by cryo-ET, but training individual deep learning models requires large amounts of manually labeled and segmented data from previously observed classes. To perform classification and segmentation in the wild (i.e., with limited training data and with unseen classes), novel deep learning model needs to be developed to classify and segment unseen macromolecules captured by cryo-ET. In this paper, we develop a one-shot learning framework, called cryo-ET one-shot network (COS-Net), for simultaneous classification of macromolecular structure and generation of the voxel-level 3D segmentation, using only one training sample per class. Our experimental results on 22 macromolecule classes demonstrated that our COS-Net could efficiently classify macromolecular structures with small amounts of samples and produce accurate 3D segmentation at the same time.
Collapse
Affiliation(s)
- Bo Zhou
- Department of Biomedical Engineering, Yale University, New Haven, CT, United States
| | - Haisu Yu
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA, United States
| | - Xiangrui Zeng
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA, United States
| | - Xiaoyan Yang
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA, United States
| | - Jing Zhang
- Computer Science Department, University of California, Irvine, Irvine, CA, United States
| | - Min Xu
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA, United States
| |
Collapse
|
13
|
Few-shot learning for classification of novel macromolecular structures in cryo-electron tomograms. PLoS Comput Biol 2020; 16:e1008227. [PMID: 33175839 PMCID: PMC7682871 DOI: 10.1371/journal.pcbi.1008227] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2020] [Revised: 11/23/2020] [Accepted: 08/08/2020] [Indexed: 01/25/2023] Open
Abstract
Cryo-electron tomography (cryo-ET) provides 3D visualization of subcellular components in the near-native state and at sub-molecular resolutions in single cells, demonstrating an increasingly important role in structural biology in situ. However, systematic recognition and recovery of macromolecular structures in cryo-ET data remain challenging as a result of low signal-to-noise ratio (SNR), small sizes of macromolecules, and high complexity of the cellular environment. Subtomogram structural classification is an essential step for such task. Although acquisition of large amounts of subtomograms is no longer an obstacle due to advances in automation of data collection, obtaining the same number of structural labels is both computation and labor intensive. On the other hand, existing deep learning based supervised classification approaches are highly demanding on labeled data and have limited ability to learn about new structures rapidly from data containing very few labels of such new structures. In this work, we propose a novel approach for subtomogram classification based on few-shot learning. With our approach, classification of unseen structures in the training data can be conducted given few labeled samples in test data through instance embedding. Experiments were performed on both simulated and real datasets. Our experimental results show that we can make inference on new structures given only five labeled samples for each class with a competitive accuracy (> 0.86 on the simulated dataset with SNR = 0.1), or even one sample with an accuracy of 0.7644. The results on real datasets are also promising with accuracy > 0.9 on both conditions and even up to 1 on one of the real datasets. Our approach achieves significant improvement compared with the baseline method and has strong capabilities of generalizing to other cellular components.
Collapse
|
14
|
Abstract
The complex environment of biological cells and tissues has motivated development of three-dimensional (3D) imaging in both light and electron microscopies. To this end, one of the primary tools in fluorescence microscopy is that of computational deconvolution. Wide-field fluorescence images are often corrupted by haze due to out-of-focus light, i.e., to cross-talk between different object planes as represented in the 3D image. Using prior understanding of the image formation mechanism, it is possible to suppress the cross-talk and reassign the unfocused light to its proper source post facto. Electron tomography based on tilted projections also exhibits a cross-talk between distant planes due to the discrete angular sampling and limited tilt range. By use of a suitably synthesized 3D point spread function, we show here that deconvolution leads to similar improvements in volume data reconstructed from cryoscanning transmission electron tomography (CSTET), namely a dramatic in-plane noise reduction and improved representation of features in the axial dimension. Contrast enhancement is demonstrated first with colloidal gold particles and then in representative cryotomograms of intact cells. Deconvolution of CSTET data collected from the periphery of an intact nucleus revealed partially condensed, extended structures in interphase chromatin.
Collapse
|
15
|
Lin R, Zeng X, Kitani K, Xu M. Adversarial domain adaptation for cross data source macromolecule in situ structural classification in cellular electron cryo-tomograms. Bioinformatics 2019; 35:i260-i268. [PMID: 31510673 PMCID: PMC6612867 DOI: 10.1093/bioinformatics/btz364] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
MOTIVATION Since 2017, an increasing amount of attention has been paid to the supervised deep learning-based macromolecule in situ structural classification (i.e. subtomogram classification) in cellular electron cryo-tomography (CECT) due to the substantially higher scalability of deep learning. However, the success of such supervised approach relies heavily on the availability of large amounts of labeled training data. For CECT, creating valid training data from the same data source as prediction data is usually laborious and computationally intensive. It would be beneficial to have training data from a separate data source where the annotation is readily available or can be performed in a high-throughput fashion. However, the cross data source prediction is often biased due to the different image intensity distributions (a.k.a. domain shift). RESULTS We adapt a deep learning-based adversarial domain adaptation (3D-ADA) method to timely address the domain shift problem in CECT data analysis. 3D-ADA first uses a source domain feature extractor to extract discriminative features from the training data as the input to a classifier. Then it adversarially trains a target domain feature extractor to reduce the distribution differences of the extracted features between training and prediction data. As a result, the same classifier can be directly applied to the prediction data. We tested 3D-ADA on both experimental and realistically simulated subtomogram datasets under different imaging conditions. 3D-ADA stably improved the cross data source prediction, as well as outperformed two popular domain adaptation methods. Furthermore, we demonstrate that 3D-ADA can improve cross data source recovery of novel macromolecular structures. AVAILABILITY AND IMPLEMENTATION https://github.com/xulabs/projects. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Ruogu Lin
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Xiangrui Zeng
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Kris Kitani
- Robotics Institute, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Min Xu
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA, USA
| |
Collapse
|
16
|
Castaño-Díez D, Zanetti G. In situ structure determination by subtomogram averaging. Curr Opin Struct Biol 2019; 58:68-75. [PMID: 31233977 DOI: 10.1016/j.sbi.2019.05.011] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2019] [Revised: 05/06/2019] [Accepted: 05/12/2019] [Indexed: 02/02/2023]
Abstract
Cryo-tomography and subtomogram averaging are increasingly popular techniques for structural determination of macromolecular complexes in situ. They have the potential to achieve high-resolution views of native complexes, together with the details of their location relative to interacting molecules. The subtomogram averaging (StA) pipelines are well-established, with current developments aiming to optimise each step by reducing manual intervention and user decisions, following similar trends in single-particle approaches that have dramatically increased their popularity. Here, we review the main steps of typical StA workflows. We focus on considerations arising from the fact that the objects of study are embedded within unique crowded environments, and we emphasise those steps where careful decisions need to be made by the user.
Collapse
Affiliation(s)
- Daniel Castaño-Díez
- BioEM Lab, Center for Cellular Imaging and Nanoanalytics, Biozentrum, University of Basel, Mattenstrasse 26, CH-4058, Basel, Switzerland.
| | - Giulia Zanetti
- Institute of Structural and Molecular Biology, Birkbeck College, Malet St., London, WC1E 7HX, UK.
| |
Collapse
|
17
|
Abstract
Cryo-electron tomography (cryo-ET) allows three-dimensional (3D) visualization of frozen-hydrated biological samples, such as protein complexes and cell organelles, in near-native environments at nanometer scale. Protein complexes that are present in multiple copies in a set of tomograms can be extracted, mutually aligned, and averaged to yield a signal-enhanced 3D structure up to sub-nanometer or even near-atomic resolution. This technique, called subtomogram averaging (StA), is powered by improvements in EM hardware and image processing software. Importantly, StA provides unique biological insights into the structure and function of cellular machinery in close-to-native contexts. In this chapter, we describe the principles and key steps of StA. We briefly cover sample preparation and data collection with an emphasis on image processing procedures related to tomographic reconstruction, subtomogram alignment, averaging, and classification. We conclude by summarizing current limitations and future directions of this technique with a focus on high-resolution StA.
Collapse
|
18
|
Xu M, Singla J, Tocheva EI, Chang YW, Stevens RC, Jensen GJ, Alber F. De Novo Structural Pattern Mining in Cellular Electron Cryotomograms. Structure 2019; 27:679-691.e14. [PMID: 30744995 DOI: 10.1016/j.str.2019.01.005] [Citation(s) in RCA: 32] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2018] [Revised: 07/27/2018] [Accepted: 01/14/2019] [Indexed: 11/16/2022]
Abstract
Electron cryotomography enables 3D visualization of cells in a near-native state at molecular resolution. The produced cellular tomograms contain detailed information about a plethora of macromolecular complexes, their structures, abundances, and specific spatial locations in the cell. However, extracting this information in a systematic way is very challenging, and current methods usually rely on individual templates of known structures. Here, we propose a framework called "Multi-Pattern Pursuit" for de novo discovery of different complexes from highly heterogeneous sets of particles extracted from entire cellular tomograms without using information of known structures. These initially detected structures can then serve as input for more targeted refinement efforts. Our tests on simulated and experimental tomograms show that our automated method is a promising tool for supporting large-scale template-free visual proteomics analysis.
Collapse
Affiliation(s)
- Min Xu
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA 15213, USA.
| | - Jitin Singla
- Institute for Quantitative and Computational Biosciences, Department of Microbiology, Immunology, and Molecular Genetics, University of California, Los Angeles, CA 90095, USA; Quantitative and Computational Biology, Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA
| | - Elitza I Tocheva
- Department of Microbiology and Immunology, Life Sciences Institute, The University of British Columbia, Vancouver, BC V6T 1Z3, Canada
| | - Yi-Wei Chang
- Department of Biochemistry and Biophysics, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Raymond C Stevens
- Department of Biological Sciences and Department of Chemistry, Bridge Institute, University of Southern California, Los Angeles, CA 90089, USA
| | - Grant J Jensen
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125, USA; Howard Hughes Medical Institute, Pasadena, CA 91125, USA
| | - Frank Alber
- Institute for Quantitative and Computational Biosciences, Department of Microbiology, Immunology, and Molecular Genetics, University of California, Los Angeles, CA 90095, USA; Quantitative and Computational Biology, Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA.
| |
Collapse
|
19
|
Zhou B, Guo Q, Zeng X, Xu M. Feature Decomposition Based Saliency Detection in Electron Cryo-Tomograms. PROCEEDINGS. IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE 2019; 2018:2467-2473. [PMID: 31205800 DOI: 10.1109/bibm.2018.8621363] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Electron Cryo-Tomography (ECT) allows 3D visualization of subcellular structures at the submolecular resolution in close to the native state. However, due to the high degree of structural complexity and imaging limits, the automatic segmentation of cellular components from ECT images is very difficult. To complement and speed up existing segmentation methods, it is desirable to develop a generic cell component segmentation method that is 1) not specific to particular types of cellular components, 2) able to segment unknown cellular components, 3) fully unsupervised and does not rely on the availability of training data. As an important step towards this goal, in this paper, we propose a saliency detection method that computes the likelihood that a subregion in a tomogram stands out from the background. Our method consists of four steps: supervoxel over-segmentation, feature extraction, feature matrix decomposition, and computation of saliency. The method produces a distribution map that represents the regions' saliency in tomograms. Our experiments show that our method can successfully label most salient regions detected by a human observer, and able to filter out regions not containing cellular components. Therefore, our method can remove the majority of the background region, and significantly speed up the subsequent processing of segmentation and recognition of cellular components captured by ECT.
Collapse
Affiliation(s)
- Bo Zhou
- Robotics Institute, Carnegie Mellon University, Pittsburgh, USA
| | - Qiang Guo
- Max Planck Institute for Biochemistry, Martinsried, Germany
| | - Xiangrui Zeng
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, USA
| | - Min Xu
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, USA
| |
Collapse
|
20
|
Che C, Lin R, Zeng X, Elmaaroufi K, Galeotti J, Xu M. Improved deep learning-based macromolecules structure classification from electron cryo-tomograms. MACHINE VISION AND APPLICATIONS 2018; 29:1227-1236. [PMID: 31511756 PMCID: PMC6738941 DOI: 10.1007/s00138-018-0949-4] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/14/2017] [Revised: 01/16/2018] [Accepted: 05/18/2018] [Indexed: 05/30/2023]
Abstract
Cellular processes are governed by macromolecular complexes inside the cell. Study of the native structures of macromolecular complexes has been extremely difficult due to lack of data. With recent breakthroughs in Cellular Electron Cryo-Tomography (CECT) 3D imaging technology, it is now possible for researchers to gain accesses to fully study and understand the macro-molecular structures single cells. However, systematic recovery of macromolecular structures from CECT is very difficult due to high degree of structural complexity and practical imaging limitations. Specifically, we proposed a deep learning-based image classification approach for large-scale systematic macromolecular structure separation from CECT data. However, our previous work was only a very initial step toward exploration of the full potential of deep learning-based macromolecule separation. In this paper, we focus on improving classification performance by proposing three newly designed individual CNN models: an extended version of (Deep Small Receptive Field) DSRF3D, donated as DSRF3D-v2, a 3D residual block-based neural network, named as RB3D, and a convolutional 3D (C3D)-based model, CB3D. We compare them with our previously developed model (DSRF3D) on 12 datasets with different SNRs and tilt angle ranges. The experiments show that our new models achieved significantly higher classification accuracies. The accuracies are not only higher than 0.9 on normal datasets, but also demonstrate potentials to operate on datasets with high levels of noises and missing wedge effects presented.
Collapse
Affiliation(s)
- Chengqian Che
- The Robotics Institute, Carnegie Mellon University,Pittsburgh, USA
| | - Ruogu Lin
- Department of Automation, Tsinghua University, Beijing, China
| | - Xiangrui Zeng
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, USA
| | - Karim Elmaaroufi
- Department of Electrical and Computer Engineering, Carnegie Mellon University, Pittsburgh, USA
| | - John Galeotti
- The Robotics Institute, Carnegie Mellon University,Pittsburgh, USA
| | - Min Xu
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, USA
| |
Collapse
|
21
|
Liu C, Zeng X, Wang KW, Guo Q, Xu M. Multi-task Learning for Macromolecule Classification, Segmentation and Coarse Structural Recovery in Cryo-Tomography. BMVC : PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE. BRITISH MACHINE VISION CONFERENCE 2018; 2018:1007. [PMID: 36951799 PMCID: PMC10028434] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
Cellular Electron Cryo-Tomography (CECT) is a powerful 3D imaging tool for studying the native structure and organization of macromolecules inside single cells. For systematic recognition and recovery of macromolecular structures captured by CECT, methods for several important tasks such as subtomogram classification and semantic segmentation have been developed. However, the recognition and recovery of macromolecular structures are still very difficult due to high molecular structural diversity, crowding molecular environment, and the imaging limitations of CECT. In this paper, we propose a novel multi-task 3D convolutional neural network model for simultaneous classification, segmentation, and coarse structural recovery of macromolecules of interest in subtomograms. In our model, the learned image features of one task are shared and thereby mutually reinforce the learning of other tasks. Evaluated on realistically simulated and experimental CECT data, our multi-task learning model outperformed all single-task learning methods for classification and segmentation. In addition, we demonstrate that our model can generalize to discover, segment and recover novel structures that do not exist in the training data.
Collapse
Affiliation(s)
- Chang Liu
- School of Computer Science, Carnegie Mellon University Pittsburgh, PA, USA
| | - Xiangrui Zeng
- School of Computer Science, Carnegie Mellon University Pittsburgh, PA, USA
| | - Kai Wen Wang
- School of Computer Science, Carnegie Mellon University Pittsburgh, PA, USA
| | - Qiang Guo
- Max Planck Institute for Biochemistry Martinsried, Germany
| | - Min Xu
- School of Computer Science, Carnegie Mellon University Pittsburgh, PA, USA
| |
Collapse
|
22
|
Zhao G, Zhou B, Wang K, Jiang R, Xu M. Respond-CAM: Analyzing Deep Models for 3D Imaging Data by Visualizations. MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION : MICCAI ... INTERNATIONAL CONFERENCE ON MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION 2018; 11070:485-492. [PMID: 36951805 PMCID: PMC10028588 DOI: 10.1007/978-3-030-00928-1_55] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
The convolutional neural network (CNN) has become a powerful tool for various biomedical image analysis tasks, but there is a lack of visual explanation for the machinery of CNNs. In this paper, we present a novel algorithm, Respond-weighted Class Activation Mapping (Respond-CAM), for making CNN-based models interpretable by visualizing input regions that are important for predictions, especially for biomedical 3D imaging data inputs. Our method uses the gradients of any target concept (e.g. the score of target class) that flow into a convolutional layer. The weighted feature maps are combined to produce a heatmap that highlights the important regions in the image for predicting the target concept. We prove a preferable sum-to-score property of the Respond-CAM and verify its significant improvement on 3D images from the current state-of-the-art approach. Our tests on Cellular Electron Cryo-Tomography 3D images show that Respond-CAM achieves superior performance on visualizing the CNNs with 3D biomedical image inputs, and is able to get reasonably good results on visualizing the CNNs with natural image inputs. The Respond-CAM is an efficient and reliable approach for visualizing the CNN machinery, and is applicable to a wide variety of CNN model families and image analysis tasks. Our code is available at: https://github.com/xulabs/projects/tree/master/respond_cam.
Collapse
Affiliation(s)
- Guannan Zhao
- Department of Automation, Tsinghua University, Beijing, China
| | - Bo Zhou
- School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Kaiwen Wang
- School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Rui Jiang
- Department of Automation, Tsinghua University, Beijing, China
| | - Min Xu
- School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| |
Collapse
|
23
|
Guo J, Zhou B, Zeng X, Freyberg Z, Xu M. Model Compression for Faster Structural Separation of Macromolecules Captured by Cellular Electron Cryo-Tomography. IMAGE ANALYSIS AND RECOGNITION: INTERNATIONAL CONFERENCE, ICIAR ... : PROCEEDINGS. ICIAR 2018; 10882:144-152. [PMID: 31231722 PMCID: PMC6588193 DOI: 10.1007/978-3-319-93000-8_17] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/13/2023]
Abstract
Electron Cryo-Tomography (ECT) enables 3D visualization of macromolecule structure inside single cells. Macromolecule classification approaches based on convolutional neural networks (CNN) were developed to separate millions of macromolecules captured from ECT systematically. However, given the fast accumulation of ECT data, it will soon become necessary to use CNN models to efficiently and accurately separate substantially more macromolecules at the prediction stage, which requires additional computational costs. To speed up the prediction, we compress classification models into compact neural networks with little in accuracy for deployment. Specifically, we propose to perform model compression through knowledge distillation. Firstly, a complex teacher network is trained to generate soft labels with better classification feasibility followed by training of customized student networks with simple architectures using the soft label to compress model complexity. Our tests demonstrate that our compressed models significantly reduce the number of parameters and time cost while maintaining similar classification accuracy.
Collapse
Affiliation(s)
| | - Bo Zhou
- School of Computer Science, Carnegie Mellon University, Pittsburgh, USA
| | - Xiangrui Zeng
- School of Computer Science, Carnegie Mellon University, Pittsburgh, USA
| | - Zachary Freyberg
- Departments of Psychiatry and Cell Biology, University of Pittsburgh, Pittsburgh, USA
| | - Min Xu
- School of Computer Science, Carnegie Mellon University, Pittsburgh, USA
| |
Collapse
|
24
|
Zeng X, Leung MR, Zeev-Ben-Mordehai T, Xu M. A convolutional autoencoder approach for mining features in cellular electron cryo-tomograms and weakly supervised coarse segmentation. J Struct Biol 2018; 202:150-160. [PMID: 29289599 PMCID: PMC6661905 DOI: 10.1016/j.jsb.2017.12.015] [Citation(s) in RCA: 36] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2017] [Revised: 12/24/2017] [Accepted: 12/27/2017] [Indexed: 01/08/2023]
Abstract
Cellular electron cryo-tomography enables the 3D visualization of cellular organization in the near-native state and at submolecular resolution. However, the contents of cellular tomograms are often complex, making it difficult to automatically isolate different in situ cellular components. In this paper, we propose a convolutional autoencoder-based unsupervised approach to provide a coarse grouping of 3D small subvolumes extracted from tomograms. We demonstrate that the autoencoder can be used for efficient and coarse characterization of features of macromolecular complexes and surfaces, such as membranes. In addition, the autoencoder can be used to detect non-cellular features related to sample preparation and data collection, such as carbon edges from the grid and tomogram boundaries. The autoencoder is also able to detect patterns that may indicate spatial interactions between cellular components. Furthermore, we demonstrate that our autoencoder can be used for weakly supervised semantic segmentation of cellular components, requiring a very small amount of manual annotation.
Collapse
Affiliation(s)
- Xiangrui Zeng
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh 15213, USA
| | - Miguel Ricardo Leung
- Division of Structural Biology, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, UK; Cryo-electron Microscopy, Bijvoet Center for Biomolecular Research, Utrecht University, Utrecht, Netherlands
| | - Tzviya Zeev-Ben-Mordehai
- Division of Structural Biology, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, UK; Cryo-electron Microscopy, Bijvoet Center for Biomolecular Research, Utrecht University, Utrecht, Netherlands
| | - Min Xu
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh 15213, USA.
| |
Collapse
|
25
|
Baldwin PR, Tan YZ, Eng ET, Rice WJ, Noble AJ, Negro CJ, Cianfrocco MA, Potter CS, Carragher B. Big data in cryoEM: automated collection, processing and accessibility of EM data. Curr Opin Microbiol 2017; 43:1-8. [PMID: 29100109 DOI: 10.1016/j.mib.2017.10.005] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2017] [Revised: 09/27/2017] [Accepted: 10/09/2017] [Indexed: 11/24/2022]
Abstract
The scope and complexity of cryogenic electron microscopy (cryoEM) data has greatly increased, and will continue to do so, due to recent and ongoing technical breakthroughs that have led to much improved resolutions for macromolecular structures solved using this method. This big data explosion includes single particle data as well as tomographic tilt series, both generally acquired as direct detector movies of ∼10-100 frames per image or per tilt-series. We provide a brief survey of the developments leading to the current status, and describe existing cryoEM pipelines, with an emphasis on the scope of data acquisition, methods for automation, and use of cloud storage and computing.
Collapse
Affiliation(s)
- Philip R Baldwin
- The National Resource for Automated Molecular Microscopy, Simons Electron Microscopy Center, New York Structural Biology Center, 89 Convent Ave, New York, NY 10027, USA
| | - Yong Zi Tan
- The National Resource for Automated Molecular Microscopy, Simons Electron Microscopy Center, New York Structural Biology Center, 89 Convent Ave, New York, NY 10027, USA; Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY 10032, USA
| | - Edward T Eng
- The National Resource for Automated Molecular Microscopy, Simons Electron Microscopy Center, New York Structural Biology Center, 89 Convent Ave, New York, NY 10027, USA
| | - William J Rice
- The National Resource for Automated Molecular Microscopy, Simons Electron Microscopy Center, New York Structural Biology Center, 89 Convent Ave, New York, NY 10027, USA
| | - Alex J Noble
- The National Resource for Automated Molecular Microscopy, Simons Electron Microscopy Center, New York Structural Biology Center, 89 Convent Ave, New York, NY 10027, USA
| | - Carl J Negro
- The National Resource for Automated Molecular Microscopy, Simons Electron Microscopy Center, New York Structural Biology Center, 89 Convent Ave, New York, NY 10027, USA
| | - Michael A Cianfrocco
- Life Sciences Institute and Department of Biological Chemistry, University of Michigan, Ann Arbor, MI 48109, USA
| | - Clinton S Potter
- The National Resource for Automated Molecular Microscopy, Simons Electron Microscopy Center, New York Structural Biology Center, 89 Convent Ave, New York, NY 10027, USA; Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY 10032, USA
| | - Bridget Carragher
- The National Resource for Automated Molecular Microscopy, Simons Electron Microscopy Center, New York Structural Biology Center, 89 Convent Ave, New York, NY 10027, USA; Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY 10032, USA.
| |
Collapse
|