1
|
Liu J, Lu Y, Zhu L. A kinetic model for solving a combination optimization problem in ab-initio Cryo-EM 3D reconstruction. Brief Bioinform 2024; 25:bbad473. [PMID: 38261343 PMCID: PMC10805181 DOI: 10.1093/bib/bbad473] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Revised: 10/22/2023] [Accepted: 11/28/2023] [Indexed: 01/24/2024] Open
Abstract
Cryo-Electron Microscopy (cryo-EM) is a widely used and effective method for determining the three-dimensional (3D) structure of biological molecules. For ab-initio Cryo-EM 3D reconstruction using single particle analysis (SPA), estimating the projection direction of the projection image is a crucial step. However, the existing SPA methods based on common lines are sensitive to noise. The error in common line detection will lead to a poor estimation of the projection directions and thus may greatly affect the final reconstruction results. To improve the reconstruction results, multiple candidate common lines are estimated for each pair of projection images. The key problem then becomes a combination optimization problem of selecting consistent common lines from multiple candidates. To solve the problem efficiently, a physics-inspired method based on a kinetic model is proposed in this work. More specifically, hypothetical attractive forces between each pair of candidate common lines are used to calculate a hypothetical torque exerted on each projection image in the 3D reconstruction space, and the rotation under the hypothetical torque is used to optimize the projection direction estimation of the projection image. This way, the consistent common lines along with the projection directions can be found directly without enumeration of all the combinations of the multiple candidate common lines. Compared with the traditional methods, the proposed method is shown to be able to produce more accurate 3D reconstruction results from high noise projection images. Besides the practical value, the proposed method also serves as a good reference for solving similar combinatorial optimization problems.
Collapse
Affiliation(s)
- Jiaxuan Liu
- School of Information Science and Engineering, Lanzhou
| | - Yonggang Lu
- School of Information Science and Engineering, Lanzhou
| | - Li Zhu
- School of Life Sciences, Lanzhou University
| |
Collapse
|
2
|
Toader B, Brubaker MA, Lederman RR. Efficient high-resolution refinement in cryo-EM with stochastic gradient descent. ArXiv 2023:arXiv:2311.16100v1. [PMID: 38076514 PMCID: PMC10705587] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/21/2023]
Abstract
Electron cryomicroscopy (cryo-EM) is an imaging technique widely used in structural biology to determine the three-dimensional structure of biological molecules from noisy two-dimensional projections with unknown orientations. As the typical pipeline involves processing large amounts of data, efficient algorithms are crucial for fast and reliable results. The stochastic gradient descent (SGD) algorithm has been used to improve the speed of ab initio reconstruction, which results in a first, low-resolution estimation of the volume representing the molecule of interest, but has yet to be applied successfully in the high-resolution regime, where expectation-maximization algorithms achieve state-of-the-art results, at a high computational cost. In this article, we investigate the conditioning of the optimization problem and show that the large condition number prevents the successful application of gradient descent-based methods at high resolution. Our results include a theoretical analysis of the condition number of the optimization problem in a simplified setting where the individual projection directions are known, an algorithm based on computing a diagonal preconditioner using Hutchinson's diagonal estimator, and numerical experiments showing the improvement in the convergence speed when using the estimated preconditioner with SGD. The preconditioned SGD approach can potentially enable a simple and unified approach to ab initio reconstruction and high-resolution refinement with faster convergence speed and higher flexibility, and our results are a promising step in this direction.
Collapse
Affiliation(s)
- Bogdan Toader
- Department of Statistics and Data Science, Yale University
| | - Marcus A Brubaker
- Department of Electrical Engineering and Computer Science, York University
| | - Roy R Lederman
- Department of Statistics and Data Science, Yale University
| |
Collapse
|
3
|
Harpaz Y, Shkolnisky Y. Three-dimensional alignment of density maps in cryo-electron microscopy. Biol Imaging 2023; 3:e8. [PMID: 38487687 PMCID: PMC10936424 DOI: 10.1017/s2633903x23000089] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Revised: 02/26/2023] [Accepted: 02/28/2023] [Indexed: 03/17/2024]
Abstract
A common task in cryo-electron microscopy data processing is to compare three-dimensional density maps of macromolecules. In this paper, we propose an algorithm for aligning three-dimensional density maps, which exploits common lines between projection images of the maps. The algorithm is fully automatic and handles rotations, reflections (handedness), and translations between the maps. In addition, the algorithm is applicable to any type of molecular symmetry without requiring any information regarding the symmetry of the maps. We evaluate our alignment algorithm on publicly available density maps, demonstrating its accuracy and efficiency. The algorithm is available at https://github.com/ShkolniskyLab/emalign.
Collapse
Affiliation(s)
- Yael Harpaz
- Department of Applied Mathematics, School of Mathematical Sciences, Tel-Aviv University, Tel-Aviv, Israel
| | - Yoel Shkolnisky
- Department of Applied Mathematics, School of Mathematical Sciences, Tel-Aviv University, Tel-Aviv, Israel
| |
Collapse
|
4
|
Abstract
Cryo-electron microscopy (CryoEM) has become a vital technique in structural biology. It is an interdisciplinary field that takes advantage of advances in biochemistry, physics, and image processing, among other disciplines. Innovations in these three basic pillars have contributed to the boosting of CryoEM in the past decade. This work reviews the main contributions in image processing to the current reconstruction workflow of single particle analysis (SPA) by CryoEM. Our review emphasizes the time evolution of the algorithms across the different steps of the workflow differentiating between two groups of approaches: analytical methods and deep learning algorithms. We present an analysis of the current state of the art. Finally, we discuss the emerging problems and challenges still to be addressed in the evolution of CryoEM image processing methods in SPA.
Collapse
Affiliation(s)
- Jose Luis Vilas
- Biocomputing Unit, Centro Nacional de Biotecnologia (CNB-CSIC), Darwin, 3, Campus Universidad Autonoma, 28049 Cantoblanco, Madrid, Spain
| | - Jose Maria Carazo
- Biocomputing Unit, Centro Nacional de Biotecnologia (CNB-CSIC), Darwin, 3, Campus Universidad Autonoma, 28049 Cantoblanco, Madrid, Spain
| | - Carlos Oscar S Sorzano
- Biocomputing Unit, Centro Nacional de Biotecnologia (CNB-CSIC), Darwin, 3, Campus Universidad Autonoma, 28049 Cantoblanco, Madrid, Spain
| |
Collapse
|
5
|
Lu Y, Liu J, Zhu L, Zhang B, He J. 3D reconstruction from cryo-EM projection images using two spherical embeddings. Commun Biol 2022; 5:304. [PMID: 35379919 DOI: 10.1038/s42003-022-03255-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2021] [Accepted: 03/11/2022] [Indexed: 11/08/2022] Open
Abstract
Single-particle analysis (SPA) in cryo-electron microscopy has become a powerful tool for determining and studying the macromolecular structure at an atomic level. However, since the SPA problem is a non-convex optimization problem with enormous search space and there is high level of noise in the input images, the existing methods may produce biased or even wrong final models. In this work, to deal with the problem, consistent constraints from the input data are explored in an embedding space, a 3D spherical surface. More specifically, the orientation of a projection image is represented by two intersection points of the normal vector and the local X-axis vector of the projection image on the unit spherical surface. To determine the orientations of the projection images, the global consistency constraints of the relative orientations of all the projection images are satisfied by two spherical embeddings which estimate the normal vectors and the local X-axis vectors of the projection images respectively. Compared to the traditional methods, the proposed method is shown to be able to rectify the initial computation errors and produce a more accurate estimation of the projection angles, which results in a better final model reconstruction from the noisy image data. A 3D reconstruction method using two spherical embeddings to resolve projection angles of the cryo-EM images is shown to improve the initial model reconstruction for single-particle analysis.
Collapse
|
6
|
Wang X, Lu Y, Liu J. A Fast Image Alignment Approach for 2D Classification of Cryo-EM Images Using Spectral Clustering. Curr Issues Mol Biol 2021; 43:1652-68. [PMID: 34698131 DOI: 10.3390/cimb43030117] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2021] [Revised: 10/14/2021] [Accepted: 10/14/2021] [Indexed: 01/22/2023] Open
Abstract
Three-dimensional (3D) reconstruction in single-particle cryo-electron microscopy (cryo-EM) is a significant technique for recovering the 3D structure of proteins or other biological macromolecules from their two-dimensional (2D) noisy projection images taken from unknown random directions. Class averaging in single-particle cryo-EM is an important procedure for producing high-quality initial 3D structures, where image alignment is a fundamental step. In this paper, an efficient image alignment algorithm using 2D interpolation in the frequency domain of images is proposed to improve the estimation accuracy of alignment parameters of rotation angles and translational shifts between the two projection images, which can obtain subpixel and subangle accuracy. The proposed algorithm firstly uses the Fourier transform of two projection images to calculate a discrete cross-correlation matrix and then performs the 2D interpolation around the maximum value in the cross-correlation matrix. The alignment parameters are directly determined according to the position of the maximum value in the cross-correlation matrix after interpolation. Furthermore, the proposed image alignment algorithm and a spectral clustering algorithm are used to compute class averages for single-particle 3D reconstruction. The proposed image alignment algorithm is firstly tested on a Lena image and two cryo-EM datasets. Results show that the proposed image alignment algorithm can estimate the alignment parameters accurately and efficiently. The proposed method is also used to reconstruct preliminary 3D structures from a simulated cryo-EM dataset and a real cryo-EM dataset and to compare them with RELION. Experimental results show that the proposed method can obtain more high-quality class averages than RELION and can obtain higher reconstruction resolution than RELION even without iteration.
Collapse
|
7
|
Ronen R, Attias Y, Schechner YY, Jaffe JS, Orenstein E. Plankton reconstruction through robust statistical optical tomography. J Opt Soc Am A Opt Image Sci Vis 2021; 38:1320-1331. [PMID: 34613139 DOI: 10.1364/josaa.423037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/15/2021] [Accepted: 07/07/2021] [Indexed: 06/13/2023]
Abstract
Plankton interact with the environment according to their size and three-dimensional (3D) structure. To study them outdoors, these translucent specimens are imaged in situ. Light projects through a specimen in each image. The specimen has a random scale, drawn from the population's size distribution and random unknown pose. The specimen appears only once before drifting away. We achieve 3D tomography using such a random ensemble to statistically estimate an average volumetric distribution of the plankton type and specimen size. To counter errors due to non-rigid deformations, we weight the data, drawing from advanced models developed for cryo-electron microscopy. The weights convey the confidence in the quality of each datum. This confidence relies on a statistical error model. We demonstrate the approach on live plankton using an underwater field microscope.
Collapse
|
8
|
Abstract
Factor models are a class of powerful statistical models that have been widely used to deal with dependent measurements that arise frequently from various applications from genomics and neuroscience to economics and finance. As data are collected at an ever-growing scale, statistical machine learning faces some new challenges: high dimensionality, strong dependence among observed variables, heavy-tailed variables and heterogeneity. High-dimensional robust factor analysis serves as a powerful toolkit to conquer these challenges. This paper gives a selective overview on recent advance on high-dimensional factor models and their applications to statistics including Factor-Adjusted Robust Model selection (FarmSelect) and Factor-Adjusted Robust Multiple testing (FarmTest). We show that classical methods, especially principal component analysis (PCA), can be tailored to many new problems and provide powerful tools for statistical estimation and inference. We highlight PCA and its connections to matrix perturbation theory, robust statistics, random projection, false discovery rate, etc., and illustrate through several applications how insights from these fields yield solutions to modern challenges. We also present far-reaching connections between factor models and popular statistical learning problems, including network analysis and low-rank matrix recovery.
Collapse
Affiliation(s)
- Jianqing Fan
- Department of Operations Research and Financial Engineering, Princeton University, Princeton, 08540, NJ, USA
| | - Kaizheng Wang
- Department of Industrial Engineering and Operations Research, Columbia University, New York, 10027, NY, USA
| | - Yiqiao Zhong
- Packard 239, Stanford University, Stanford, 94305, CA, USA
| | - Ziwei Zhu
- Department of Statistics, University of Michigan, Ann Arbor, 48109, MI, USA
| |
Collapse
|
9
|
Caspy I, Borovikova-Sheinker A, Klaiman D, Shkolnisky Y, Nelson N. The structure of a triple complex of plant photosystem I with ferredoxin and plastocyanin. Nat Plants 2020. [PMID: 33020607 DOI: 10.1038/s41477-020-00779-779] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]
Abstract
The ability of photosynthetic organisms to use sunlight as a sole source of energy is endowed by two large membrane complexes-photosystem I (PSI) and photosystem II (PSII). PSI and PSII are the fundamental components of oxygenic photosynthesis, providing oxygen, food and an energy source for most living organisms on Earth. Currently, high-resolution crystal structures of these complexes from various organisms are available. The crystal structures of megadalton complexes have revealed excitation transfer and electron-transport pathways within the various complexes. PSI is defined as plastocyanin-ferredoxin oxidoreductase but a high-resolution structure of the entire triple supercomplex is not available. Here, using a new cryo-electron microscopy technique, we solve the structure of native plant PSI in complex with its electron donor plastocyanin and the electron acceptor ferredoxin. We reveal all of the contact sites and the modes of interaction between the interacting electron carriers and PSI.
Collapse
Affiliation(s)
- Ido Caspy
- Department of Biochemistry and Molecular Biology, The George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Anna Borovikova-Sheinker
- Department of Biochemistry and Molecular Biology, The George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Daniel Klaiman
- Department of Biochemistry and Molecular Biology, The George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Yoel Shkolnisky
- School of Mathematical Sciences, Faculty of Exact Sciences, Tel Aviv University, Tel Aviv, Israel.
| | - Nathan Nelson
- Department of Biochemistry and Molecular Biology, The George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel.
| |
Collapse
|
10
|
Caspy I, Borovikova-Sheinker A, Klaiman D, Shkolnisky Y, Nelson N. The structure of a triple complex of plant photosystem I with ferredoxin and plastocyanin. Nat Plants 2020; 6:1300-1305. [PMID: 33020607 DOI: 10.1038/s41477-020-00779-9] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/01/2020] [Accepted: 09/03/2020] [Indexed: 06/11/2023]
Abstract
The ability of photosynthetic organisms to use sunlight as a sole source of energy is endowed by two large membrane complexes-photosystem I (PSI) and photosystem II (PSII). PSI and PSII are the fundamental components of oxygenic photosynthesis, providing oxygen, food and an energy source for most living organisms on Earth. Currently, high-resolution crystal structures of these complexes from various organisms are available. The crystal structures of megadalton complexes have revealed excitation transfer and electron-transport pathways within the various complexes. PSI is defined as plastocyanin-ferredoxin oxidoreductase but a high-resolution structure of the entire triple supercomplex is not available. Here, using a new cryo-electron microscopy technique, we solve the structure of native plant PSI in complex with its electron donor plastocyanin and the electron acceptor ferredoxin. We reveal all of the contact sites and the modes of interaction between the interacting electron carriers and PSI.
Collapse
Affiliation(s)
- Ido Caspy
- Department of Biochemistry and Molecular Biology, The George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Anna Borovikova-Sheinker
- Department of Biochemistry and Molecular Biology, The George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Daniel Klaiman
- Department of Biochemistry and Molecular Biology, The George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Yoel Shkolnisky
- School of Mathematical Sciences, Faculty of Exact Sciences, Tel Aviv University, Tel Aviv, Israel.
| | - Nathan Nelson
- Department of Biochemistry and Molecular Biology, The George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel.
| |
Collapse
|
11
|
Jiménez A, Jonic S, Majtner T, Otón J, Vilas JL, Maluenda D, Mota J, Ramírez-Aportela E, Martínez M, Rancel Y, Segura J, Sánchez-García R, Melero R, Del Cano L, Conesa P, Skjaerven L, Marabini R, Carazo JM, Sorzano COS. Validation of electron microscopy initial models via small angle X-ray scattering curves. Bioinformatics 2020; 35:2427-2433. [PMID: 30500892 DOI: 10.1093/bioinformatics/bty985] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2018] [Revised: 10/29/2018] [Accepted: 11/29/2018] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Cryo electron microscopy (EM) is currently one of the main tools to reveal the structural information of biological macromolecules. The re-construction of three-dimensional (3D) maps is typically carried out following an iterative process that requires an initial estimation of the 3D map to be refined in subsequent steps. Therefore, its determination is key in the quality of the final results, and there are cases in which it is still an open issue in single particle analysis (SPA). Small angle X-ray scattering (SAXS) is a well-known technique applied to structural biology. It is useful from small nanostructures up to macromolecular ensembles for its ability to obtain low resolution information of the biological sample measuring its X-ray scattering curve. These curves, together with further analysis, are able to yield information on the sizes, shapes and structures of the analyzed particles. RESULTS In this paper, we show how the low resolution structural information revealed by SAXS is very useful for the validation of EM initial 3D models in SPA, helping the following refinement process to obtain more accurate 3D structures. For this purpose, we approximate the initial map by pseudo-atoms and predict the SAXS curve expected for this pseudo-atomic structure. The match between the predicted and experimental SAXS curves is considered as a good sign of the correctness of the EM initial map. AVAILABILITY AND IMPLEMENTATION The algorithm is freely available as part of the Scipion 1.2 software at http://scipion.i2pc.es/.
Collapse
Affiliation(s)
- Amaya Jiménez
- Biocomputing Unit, Centro Nac. Biotecnología (CSIC), Cantoblanco, Madrid, Spain
| | - Slavica Jonic
- UMR CNRS 7590, Muséum National d ´Histoire Naturelle, IRD, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Sorbonne Université, Paris, France
| | - Tomas Majtner
- Biocomputing Unit, Centro Nac. Biotecnología (CSIC), Cantoblanco, Madrid, Spain
| | - Joaquín Otón
- Biocomputing Unit, Centro Nac. Biotecnología (CSIC), Cantoblanco, Madrid, Spain
| | - Jose Luis Vilas
- Biocomputing Unit, Centro Nac. Biotecnología (CSIC), Cantoblanco, Madrid, Spain
| | - David Maluenda
- Biocomputing Unit, Centro Nac. Biotecnología (CSIC), Cantoblanco, Madrid, Spain
| | - Javier Mota
- Biocomputing Unit, Centro Nac. Biotecnología (CSIC), Cantoblanco, Madrid, Spain
| | | | - Marta Martínez
- Biocomputing Unit, Centro Nac. Biotecnología (CSIC), Cantoblanco, Madrid, Spain
| | - Yaiza Rancel
- Biocomputing Unit, Centro Nac. Biotecnología (CSIC), Cantoblanco, Madrid, Spain
| | - Joan Segura
- Biocomputing Unit, Centro Nac. Biotecnología (CSIC), Cantoblanco, Madrid, Spain
| | | | - Roberto Melero
- Biocomputing Unit, Centro Nac. Biotecnología (CSIC), Cantoblanco, Madrid, Spain
| | - Laura Del Cano
- Biocomputing Unit, Centro Nac. Biotecnología (CSIC), Cantoblanco, Madrid, Spain
| | - Pablo Conesa
- Biocomputing Unit, Centro Nac. Biotecnología (CSIC), Cantoblanco, Madrid, Spain
| | - Lars Skjaerven
- Department of Biomedicine, University of Bergen, Bergen, Norway
| | - Roberto Marabini
- Biocomputing Unit, Centro Nac. Biotecnología (CSIC), Cantoblanco, Madrid, Spain.,Department of Computer Science, University Autónoma de Madrid, Cantoblanco, Madrid, Spain
| | - Jose M Carazo
- Biocomputing Unit, Centro Nac. Biotecnología (CSIC), Cantoblanco, Madrid, Spain
| | - Carlos Oscar S Sorzano
- Biocomputing Unit, Centro Nac. Biotecnología (CSIC), Cantoblanco, Madrid, Spain.,Department of Engineering of Electronic and Telecommunication System, University San Pablo-CEU, Campus Urb. Montepríncipe, Boadilla del Monte, Madrid, Spain
| |
Collapse
|
12
|
Abstract
Recovering low-rank structures via eigenvector perturbation analysis is a common problem in statistical machine learning, such as in factor analysis, community detection, ranking, matrix completion, among others. While a large variety of bounds are available for average errors between empirical and population statistics of eigenvectors, few results are tight for entrywise analyses, which are critical for a number of problems such as community detection. This paper investigates entrywise behaviors of eigenvectors for a large class of random matrices whose expectations are low-rank, which helps settle the conjecture in Abbe et al. (2014b) that the spectral algorithm achieves exact recovery in the stochastic block model without any trimming or cleaning steps. The key is a first-order approximation of eigenvectors under the ℓ ∞ norm:u k ≈ A u k * λ k * , where {u k } and{ u k * } are eigenvectors of a random matrix A and its expectation E A , respectively. The fact that the approximation is both tight and linear in A facilitates sharp comparisons between u k andu k * . In particular, it allows for comparing the signs of u k andu k * even if‖ u k - u k * ‖ ∞ is large. The results are further extended to perturbations of eigenspaces, yielding new ℓ ∞-type bounds for synchronization (ℤ 2 -spiked Wigner model) and noisy matrix completion.
Collapse
Affiliation(s)
- Emmanuel Abbe
- PACM and Department of EE, Princeton University, Princeton, NJ 08544, USA
| | - Jianqing Fan
- Department of ORFE, Princeton University, Princeton, NJ 08544, USA
| | - Kaizheng Wang
- Department of ORFE, Princeton University, Princeton, NJ 08544, USA
| | - Yiqiao Zhong
- Department of ORFE, Princeton University, Princeton, NJ 08544, USA
| |
Collapse
|
13
|
Abstract
Let 𝒢 be a compact group and let f i j ∈ C ( 𝒢 ) . We define the Non-Unique Games (NUG) problem as finding g 1 , … , g n ∈ 𝒢 to minimize ∑ i , j = 1 n f i j g i g j - 1 . We introduce a convex relaxation of the NUG problem to a semidefinite program (SDP) by taking the Fourier transform of f i j over 𝒢 . The NUG framework can be seen as a generalization of the little Grothendieck problem over the orthogonal group and the Unique Games problem and includes many practically relevant problems, such as the maximum likelihood estimator to registering bandlimited functions over the unit sphere in d -dimensions and orientation estimation of noisy cryo-Electron Microscopy (cryo-EM) projection images. We implement a SDP solver for the NUG cryo-EM problem using the alternating direction method of multipliers (ADMM). Numerical study with synthetic datasets indicate that while our ADMM solver is slower than existing methods, it can estimate the rotations more accurately, especially at low signal-to-noise ratio (SNR).
Collapse
Affiliation(s)
| | - ETH Zürich
- Ramistrasse 101, 8092 Zürich, Switzerland
| | - Yutong Chen
- Systems Trading, Tudor Investment Corporation, New York, NY USA
| | - Roy R. Lederman
- Department of Statistics and Data Science, Yale University, 24 Hillhouse Avenue, New Haven, CT 06511 USA
| | - Amit Singer
- Department of Mathematics, Program in Applied and Computational Mathematics, Princeton University, Fine Hall, Washington Road, Princeton, NJ 08544 USA
| |
Collapse
|
14
|
Lederman RR, Andén J, Singer A. Hyper-Molecules: on the Representation and Recovery of Dynamical Structures for Applications in Flexible Macro-Molecules in Cryo-EM. Inverse Probl 2020; 36:044005. [PMID: 38304203 PMCID: PMC10831863 DOI: 10.1088/1361-6420/ab5ede] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/03/2024]
Abstract
Cryo-electron microscopy (cryo-EM), the subject of the 2017 Nobel Prize in Chemistry, is a technology for obtaining 3-D reconstructions of macromolecules from many noisy 2-D projections of instances of these macromolecules, whose orientations and positions are unknown. These molecules are not rigid objects, but flexible objects involved in dynamical processes. The different conformations are exhibited by different instances of the macromolecule observed in a cryo-EM experiment, each of which is recorded as a particle image. The range of conformations and the conformation of each particle are not known a priori; one of the great promises of cryo-EM is to map this conformation space. Remarkable progress has been made in reconstructing rigid molecules based on homogeneous samples of molecules in spite of the unknown orientation of each particle image and significant progress has been made in recovering a few distinct states from mixtures of rather distinct conformations, but more complex heterogeneous samples remain a major challenge. We introduce the "hyper-molecule" theoretical framework for modeling structures across different states of heterogeneous molecules, including continuums of states. The key idea behind this framework is representing heterogeneous macromolecules as high-dimensional objects, with the additional dimensions representing the conformation space. This idea is then refined to model properties such as localized heterogeneity. In addition, we introduce an algorithmic framework for reconstructing such heterogeneous objects from experimental data using a Bayesian formulation of the problem and Markov chain Monte Carlo (MCMC) algorithms to address the computational challenges in recovering these high dimensional hyper-molecules. We demonstrate these ideas in a preliminary prototype implementation, applied to synthetic data.
Collapse
Affiliation(s)
- Roy R Lederman
- The Department of Statistics and Data Science, Yale University, New Haven, CT
| | - Joakim Andén
- Center for Computational Mathematics, Flatiron Institute, New York, NY
| | - Amit Singer
- Department of Mathematics and Program in Applied and Computational Mathematics, Princeton University, Princeton, NJ
| |
Collapse
|
15
|
Ciak R, Melching M, Scherzer O. Regularization with Metric Double Integrals of Functions with Values in a Set of Vectors. J Math Imaging Vis 2019; 61:824-848. [PMID: 31396002 PMCID: PMC6647495 DOI: 10.1007/s10851-018-00869-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/29/2018] [Accepted: 12/22/2018] [Indexed: 06/10/2023]
Abstract
We present an approach for variational regularization of inverse and imaging problems for recovering functions with values in a set of vectors. We introduce regularization functionals, which are derivative-free double integrals of such functions. These regularization functionals are motivated from double integrals, which approximate Sobolev semi-norms of intensity functions. These were introduced in Bourgain et al. (Another look at Sobolev spaces. In: Menaldi, Rofman, Sulem (eds) Optimal control and partial differential equations-innovations and applications: in honor of professor Alain Bensoussan's 60th anniversary, IOS Press, Amsterdam, pp 439-455, 2001). For the proposed regularization functionals, we prove existence of minimizers as well as a stability and convergence result for functions with values in a set of vectors.
Collapse
Affiliation(s)
- René Ciak
- Computational Science Center, University of Vienna, Oskar-Morgenstern-Platz 1, 1090 Vienna, Austria
| | - Melanie Melching
- Computational Science Center, University of Vienna, Oskar-Morgenstern-Platz 1, 1090 Vienna, Austria
| | - Otmar Scherzer
- Computational Science Center, University of Vienna, Oskar-Morgenstern-Platz 1, 1090 Vienna, Austria
- Johann Radon Institute for Computational and Applied Mathematics (RICAM), Altenbergstraße 69, 4040 Linz, Austria
| |
Collapse
|
16
|
Vilas JL, Tabassum N, Mota J, Maluenda D, Jiménez-Moreno A, Majtner T, Carazo JM, Acton ST, Sorzano COS. Advances in image processing for single-particle analysis by electron cryomicroscopy and challenges ahead. Curr Opin Struct Biol 2018; 52:127-45. [PMID: 30509756 DOI: 10.1016/j.sbi.2018.11.004] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2018] [Revised: 10/26/2018] [Accepted: 11/17/2018] [Indexed: 12/20/2022]
Abstract
Electron cryomicroscopy (cryoEM) is essential for the study and functional understanding of non-crystalline macromolecules such as proteins. These molecules cannot be imaged using X-ray crystallography or other popular methods. CryoEM has been successfully used to visualize macromolecular complexes such as ribosomes, viruses, and ion channels. Determination of structural models of these at various conformational states leads to insight on how these molecules function. Recent advances in imaging technology have given cryoEM a scientific rebirth. As a result of these technological advances image processing and analysis have yielded molecular structures at atomic resolution. Nevertheless there continue to be challenges in image processing, and in this article we will touch on the most essential in order to derive an accurate three-dimensional model from noisy projection images. Traditional approaches, such as k-means clustering for class averaging, will be provided as background. We will then highlight new approaches for each image processing subproblem, including a 3D reconstruction method for asymmetric molecules using just two projection images and deep learning algorithms for automated particle picking.
Collapse
|
17
|
Abstract
In cryo-electron microscopy, the three-dimensional (3D) electric potentials of an ensemble of molecules are projected along arbitrary viewing directions to yield noisy two-dimensional images. The volume maps representing these potentials typically exhibit a great deal of structural variability, which is described by their 3D covariance matrix. Typically, this covariance matrix is approximately low rank and can be used to cluster the volumes or estimate the intrinsic geometry of the conformation space. We formulate the estimation of this covariance matrix as a linear inverse problem, yielding a consistent least-squares estimator. For n images of size N-by-N pixels, we propose an algorithm for calculating this covariance estimator with computational complexity O ( n N 4 + κ N 6 log N ) , where the condition number κ is empirically in the range 10-200. Its efficiency relies on the observation that the normal equations are equivalent to a deconvolution problem in six dimensions. This is then solved by the conjugate gradient method with an appropriate circulant preconditioner. The result is the first computationally efficient algorithm for consistent estimation of the 3D covariance from noisy projections. It also compares favorably in runtime with respect to previously proposed nonconsistent estimators. Motivated by the recent success of eigenvalue shrinkage procedures for high-dimensional covariance matrix estimation, we incorporate a shrinkage procedure that improves accuracy at lower signal-to-noise ratios. We evaluate our methods on simulated datasets and achieve classification results comparable to state-of-the-art methods in shorter running time. We also present results on clustering volumes in an experimental dataset, illustrating the power of the proposed algorithm for practical determination of structural variability.
Collapse
Affiliation(s)
- Joakim Andén
- Center for Computational Biology, Flatiron Institute, New York, NY 10100
| | - Amit Singer
- Department of Mathematics and Program in Applied and Computational Mathematics, Princeton University, Princeton, NJ 08544
| |
Collapse
|
18
|
Abstract
We consider the problem of estimating a signal from noisy circularly-translated versions of itself, called multireference alignment (MRA). One natural approach to MRA could be to estimate the shifts of the observations first, and infer the signal by aligning and averaging the data. In contrast, we consider a method based on estimating the signal directly, using features of the signal that are invariant under translations. Specifically, we estimate the power spectrum and the bispectrum of the signal from the observations. Under mild assumptions, these invariant features contain enough information to infer the signal. In particular, the bispectrum can be used to estimate the Fourier phases. To this end, we propose and analyze a few algorithms. Our main methods consist of non-convex optimization over the smooth manifold of phases. Empirically, in the absence of noise, these non-convex algorithms appear to converge to the target signal with random initialization. The algorithms are also robust to noise. We then suggest three additional methods. These methods are based on frequency marching, semidefinite relaxation and integer programming. The first two methods provably recover the phases exactly in the absence of noise. In the high noise level regime, the invariant features approach for MRA results in stable estimation if the number of measurements scales like the cube of the noise variance, which is the information-theoretic rate. Additionally, it requires only one pass over the data which is important at low signal-to-noise ratio when the number of observations must be large.
Collapse
Affiliation(s)
- Tamir Bendory
- Princeton University in PACM and the Mathematics Department
| | - Nicolas Boumal
- Princeton University in PACM and the Mathematics Department
| | - Chao Ma
- Princeton University in PACM and the Mathematics Department
| | - Zhizhen Zhao
- University of Illinois at Urbana-Champaign in the Department of Electrical and Computer Engineering
| | - Amit Singer
- Princeton University in PACM and the Mathematics Department
| |
Collapse
|
19
|
Greenberg I, Shkolnisky Y. Common lines modeling for reference free Ab-initio reconstruction in cryo-EM. J Struct Biol 2017; 200:106-117. [PMID: 28943480 DOI: 10.1016/j.jsb.2017.09.007] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2017] [Revised: 09/17/2017] [Accepted: 09/20/2017] [Indexed: 10/18/2022]
Abstract
We consider the problem of estimating an unbiased and reference-free ab initio model for non-symmetric molecules from images generated by single-particle cryo-electron microscopy. The proposed algorithm finds the globally optimal assignment of orientations that simultaneously respects all common lines between all images. The contribution of each common line to the estimated orientations is weighted according to a statistical model for common lines' detection errors. The key property of the proposed algorithm is that it finds the global optimum for the orientations given the common lines. In particular, any local optima in the common lines energy landscape do not affect the proposed algorithm. As a result, it is applicable to thousands of images at once, very robust to noise, completely reference free, and not biased towards any initial model. A byproduct of the algorithm is a set of measures that allow to asses the reliability of the obtained ab initio model. We demonstrate the algorithm using class averages from two experimental data sets, resulting in ab initio models with resolutions of 20Å or better, even from class averages consisting of as few as three raw images per class.
Collapse
Affiliation(s)
- Ido Greenberg
- Department of Applied Mathematics, School of Mathematical Sciences, Tel-Aviv University, Israel.
| | - Yoel Shkolnisky
- Department of Applied Mathematics, School of Mathematical Sciences, Tel-Aviv University, Israel.
| |
Collapse
|
20
|
Abstract
Cryo-electron microscopy (EM) and small angle X-ray scattering (SAXS) are two different data acquisition modalities often used to glean information about the structure of large biomolecular complexes in their native states. A SAXS experiment is generally considered fast and easy but unveils the structure at very low resolution, whereas a cryo-EM experiment needs more extensive preparation and postacquisition computation to yield a three-dimensional (3D) density map at higher resolution. In certain applications, we may need to verify whether the data acquired in the SAXS and cryo-EM experiments correspond to the same structure (e.g., before reconstructing the 3D density map in EM). In this article, a simple and fast method is proposed to verify the compatibility of the SAXS and EM experimental data. The method is based on averaging the two-dimensional correlation of EM images and the Abel transform of the SAXS data. Orientational preferences are known to exist in cryo-EM experiments, and we also consider these effects on our method. The results are verified on simulations of conformational states of large biomolecular complexes.
Collapse
Affiliation(s)
- Jin Seob Kim
- Department of Mechanical Engineering, Johns Hopkins University, Baltimore, Maryland
| | - Bijan Afsari
- Center for Imaging Science, Johns Hopkins University, Baltimore, Maryland
| | | |
Collapse
|
21
|
Abstract
One of the primary challenges in single particle reconstruction with cryo-electron microscopy is to find a three-dimensional model of a molecule using its noisy two-dimensional projection-images. As the imaging orientations of the projection-images are unknown, we suggest a common-lines-based method to simultaneously estimate the imaging orientations of all images that is independent of the distribution of the orientations. Since the relative orientation of each pair of images may only be estimated up to a two-way handedness ambiguity, we suggest an efficient procedure to consistently assign the same handedness to all relative orientations. This is achieved by casting the handedness assignment problem as a graph-partitioning problem. Once a consistent handedness of all relative orientations is determined, the orientations corresponding to all projection-images are determined simultaneously, thus rendering the method robust to noise. Our proposed method has also the advantage of allowing one to incorporate confidence information regarding the trustworthiness of each relative orientation in a natural manner. We demonstrate the efficacy of our approach using simulated clean and noisy data.
Collapse
Affiliation(s)
- Gabi Pragier
- Department of Applied Mathematics, School of Mathematical Sciences, Tel-Aviv University, Tel Aviv 6997801, Israel
| | - Ido Greenberg
- Department of Applied Mathematics, School of Mathematical Sciences, Tel-Aviv University, Tel Aviv 6997801, Israel
| | - Xiuyuan Cheng
- Applied Mathematics Program, Yale University, New Haven, CT 06520 USA
| | - Yoel Shkolnisky
- Department of Applied Mathematics, School of Mathematical Sciences, Tel-Aviv University, Tel Aviv 6997801, Israel
| |
Collapse
|
22
|
Abstract
A major challenge in single particle reconstruction from cryo-electron microscopy is to establish a reliable ab initio three-dimensional model using two-dimensional projection images with unknown orientations. Common-lines-based methods estimate the orientations without additional geometric information. However, such methods fail when the detection rate of common-lines is too low due to the high level of noise in the images. An approximation to the least squares global self-consistency error was obtained in [A. Singer and Y. Shkolnisky, SIAM J. Imaging Sci., 4 (2011), pp. 543-572] using convex relaxation by semidefinite programming. In this paper we introduce a more robust global self-consistency error and show that the corresponding optimization problem can be solved via semidefinite relaxation. In order to prevent artificial clustering of the estimated viewing directions, we further introduce a spectral norm term that is added as a constraint or as a regularization term to the relaxed minimization problem. The resulting problems are solved using either the alternating direction method of multipliers or an iteratively reweighted least squares procedure. Numerical experiments with both simulated and real images demonstrate that the proposed methods significantly reduce the orientation estimation error when the detection rate of common-lines is low.
Collapse
Affiliation(s)
- Lanhui Wang
- Corresponding author: The Program in Applied and Computational Mathematics (PACM), Princeton University, Princeton, NJ 08544-1000 ()
| | - Amit Singer
- Department of Mathematics and PACM, Princeton University, Princeton, NJ 08544-1000
| | - Zaiwen Wen
- Beijng International Center for Mathematical Research, Peking University, Beijing, China
| |
Collapse
|