1
|
Baldwin PR, Tan YZ, Eng ET, Rice WJ, Noble AJ, Negro CJ, Cianfrocco MA, Potter CS, Carragher B. Big data in cryoEM: automated collection, processing and accessibility of EM data. Curr Opin Microbiol 2017; 43:1-8. [PMID: 29100109 DOI: 10.1016/j.mib.2017.10.005] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2017] [Revised: 09/27/2017] [Accepted: 10/09/2017] [Indexed: 11/24/2022]
Abstract
The scope and complexity of cryogenic electron microscopy (cryoEM) data has greatly increased, and will continue to do so, due to recent and ongoing technical breakthroughs that have led to much improved resolutions for macromolecular structures solved using this method. This big data explosion includes single particle data as well as tomographic tilt series, both generally acquired as direct detector movies of ∼10-100 frames per image or per tilt-series. We provide a brief survey of the developments leading to the current status, and describe existing cryoEM pipelines, with an emphasis on the scope of data acquisition, methods for automation, and use of cloud storage and computing.
Collapse
Affiliation(s)
- Philip R Baldwin
- The National Resource for Automated Molecular Microscopy, Simons Electron Microscopy Center, New York Structural Biology Center, 89 Convent Ave, New York, NY 10027, USA
| | - Yong Zi Tan
- The National Resource for Automated Molecular Microscopy, Simons Electron Microscopy Center, New York Structural Biology Center, 89 Convent Ave, New York, NY 10027, USA; Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY 10032, USA
| | - Edward T Eng
- The National Resource for Automated Molecular Microscopy, Simons Electron Microscopy Center, New York Structural Biology Center, 89 Convent Ave, New York, NY 10027, USA
| | - William J Rice
- The National Resource for Automated Molecular Microscopy, Simons Electron Microscopy Center, New York Structural Biology Center, 89 Convent Ave, New York, NY 10027, USA
| | - Alex J Noble
- The National Resource for Automated Molecular Microscopy, Simons Electron Microscopy Center, New York Structural Biology Center, 89 Convent Ave, New York, NY 10027, USA
| | - Carl J Negro
- The National Resource for Automated Molecular Microscopy, Simons Electron Microscopy Center, New York Structural Biology Center, 89 Convent Ave, New York, NY 10027, USA
| | - Michael A Cianfrocco
- Life Sciences Institute and Department of Biological Chemistry, University of Michigan, Ann Arbor, MI 48109, USA
| | - Clinton S Potter
- The National Resource for Automated Molecular Microscopy, Simons Electron Microscopy Center, New York Structural Biology Center, 89 Convent Ave, New York, NY 10027, USA; Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY 10032, USA
| | - Bridget Carragher
- The National Resource for Automated Molecular Microscopy, Simons Electron Microscopy Center, New York Structural Biology Center, 89 Convent Ave, New York, NY 10027, USA; Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY 10032, USA.
| |
Collapse
|
2
|
Zheng Q, Lu Z, Zhang M, Xu L, Ma H, Song S, Feng Q, Feng Y, Chen W, He T. Automatic segmentation of myocardium from black-blood MR images using entropy and local neighborhood information. PLoS One 2015; 10:e0120018. [PMID: 25811976 PMCID: PMC4374880 DOI: 10.1371/journal.pone.0120018] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2014] [Accepted: 01/26/2015] [Indexed: 11/19/2022] Open
Abstract
By using entropy and local neighborhood information, we present in this study a robust adaptive Gaussian regularizing Chan-Vese (CV) model to segment the myocardium from magnetic resonance images with intensity inhomogeneity. By utilizing the circular Hough transformation (CHT) our model is able to detect epicardial and endocardial contours of the left ventricle (LV) as circles automatically, and the circles are used as the initialization. In the cost functional of our model, the interior and exterior energies are weighted by the entropy to improve the robustness of the evolving curve. Local neighborhood information is used to evolve the level set function to reduce the impact of the heterogeneity inside the regions and to improve the segmentation accuracy. An adaptive window is utilized to reduce the sensitivity to initialization. The Gaussian kernel is used to regularize the level set function, which can not only ensure the smoothness and stability of the level set function, but also eliminate the traditional Euclidean length term and re-initialization. Extensive validation of the proposed method on patient data demonstrates its superior performance over other state-of-the-art methods.
Collapse
Affiliation(s)
- Qian Zheng
- Zhengzhou University of Light Industry, Zhengzhou 450002, China
| | - Zhentai Lu
- School of Biomedical Engineering, Southern Medical University, Guangzhou 510515, China
| | - Minghui Zhang
- School of Biomedical Engineering, Southern Medical University, Guangzhou 510515, China
| | - Lin Xu
- University of Electronic Science and Technology of China, Chengdu 611731, China
| | - Huan Ma
- Zhengzhou University of Light Industry, Zhengzhou 450002, China
| | - Shengli Song
- Zhengzhou University of Light Industry, Zhengzhou 450002, China
| | - Qianjin Feng
- School of Biomedical Engineering, Southern Medical University, Guangzhou 510515, China
| | - Yanqiu Feng
- School of Biomedical Engineering, Southern Medical University, Guangzhou 510515, China
| | - Wufan Chen
- School of Biomedical Engineering, Southern Medical University, Guangzhou 510515, China
| | - Taigang He
- Cardiovascular Sciences Research Centre, St George’s, University of London, London SW17 0RE, United Kingdom
- Biomedical Research Unit, Royal Brompton Hospital and Imperial College London, London SW7 2AZ, United Kingdom
| |
Collapse
|
3
|
Particle quality assessment and sorting for automatic and semiautomatic particle-picking techniques. J Struct Biol 2013; 183:342-353. [PMID: 23933392 DOI: 10.1016/j.jsb.2013.07.015] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2013] [Revised: 07/10/2013] [Accepted: 07/31/2013] [Indexed: 11/22/2022]
Abstract
Three-dimensional reconstruction of biological specimens using electron microscopy by single particle methodologies requires the identification and extraction of the imaged particles from the acquired micrographs. Automatic and semiautomatic particle selection approaches can localize these particles, minimizing the user interaction, but at the cost of selecting a non-negligible number of incorrect particles, which can corrupt the final three-dimensional reconstruction. In this work, we present a novel particle quality assessment and sorting method that can separate most erroneously picked particles from correct ones. The proposed method is based on multivariate statistical analysis of a particle set that has been picked previously using any automatic or manual approach. The new method uses different sets of particle descriptors, which are morphology-based, histogram-based and signal to noise analysis based. We have tested our proposed algorithm with experimental data obtaining very satisfactory results. The algorithm is freely available as a part of the Xmipp 3.0 package [http://xmipp.cnb.csic.es].
Collapse
|
4
|
Zheng Q, Lu Z, Yang W, Zhang M, Feng Q, Chen W. A robust medical image segmentation method using KL distance and local neighborhood information. Comput Biol Med 2013; 43:459-70. [PMID: 23566392 DOI: 10.1016/j.compbiomed.2013.01.002] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2011] [Revised: 11/20/2012] [Accepted: 01/02/2013] [Indexed: 11/15/2022]
Abstract
In this paper, we propose an improved Chan-Vese (CV) model that uses Kullback-Leibler (KL) distances and local neighborhood information (LNI). Due to the effects of heterogeneity and complex constructions, the performance of level set segmentation is subject to confounding by the presence of nearby structures of similar intensity, preventing it from discerning the exact boundary of the object. Moreover, the CV model cannot usually obtain accurate results in medical image segmentation in cases of optimal configuration of controlling parameters, which requires substantial manual intervention. To overcome the above deficiency, we improve the segmentation accuracy by the usage of KL distance and LNI, thereby introducing the image local characteristics. Performance evaluation of the present method was achieved through experiments on the synthetic images and a series of real medical images. The extensive experimental results showed the superior performance of the proposed method over the state-of-the-art methods, in terms of both robustness and efficiency.
Collapse
Affiliation(s)
- Qian Zheng
- School of Biomedical Engineering, Southern Medical University, Guangzhou 510515, China
| | | | | | | | | | | |
Collapse
|
5
|
Langlois R, Pallesen J, Frank J. Reference-free particle selection enhanced with semi-supervised machine learning for cryo-electron microscopy. J Struct Biol 2011; 175:353-61. [PMID: 21708269 PMCID: PMC3205936 DOI: 10.1016/j.jsb.2011.06.004] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2011] [Revised: 06/03/2011] [Accepted: 06/11/2011] [Indexed: 10/18/2022]
Abstract
Reference-based methods have dominated the approaches to the particle selection problem, proving fast, and accurate on even the most challenging micrographs. A reference volume, however, is not always available and compiling a set of reference projections from the micrographs themselves requires significant effort to attain the same level of accuracy. We propose a reference-free method to quickly extract particles from the micrograph. The method is augmented with a new semi-supervised machine-learning algorithm to accurately discriminate particles from contaminants and noise.
Collapse
Affiliation(s)
- Robert Langlois
- Howard Hughes Medical Institute, Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY, 10032
| | - Jesper Pallesen
- Howard Hughes Medical Institute, Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY, 10032
| | - Joachim Frank
- Howard Hughes Medical Institute, Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY, 10032
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY, 10032
| |
Collapse
|
6
|
Langlois R, Frank J. A clarification of the terms used in comparing semi-automated particle selection algorithms in cryo-EM. J Struct Biol 2011; 175:348-52. [PMID: 21420497 PMCID: PMC3164847 DOI: 10.1016/j.jsb.2011.03.009] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2011] [Revised: 03/02/2011] [Accepted: 03/12/2011] [Indexed: 11/22/2022]
Abstract
Many cyro-EM datasets are heterogeneous stemming from molecules undergoing conformational changes. The need to characterize each of the substrates with sufficient resolution entails a large increase in the data flow and motivates the development of more effective automated particle selection algorithms. Concepts and procedures from the machine-learning field are increasingly employed toward this end. However, a review of recent literature has revealed a discrepancy in terminology of the performance scores used to compare particle selection algorithms, and this has subsequently led to ambiguities in the meaning of claimed performance. In an attempt to curtail the perpetuation of this confusion and to disentangle past mistakes, we review the performance of published particle selection efforts with a set of explicitly defined performance scores using the terminology established and accepted within the field of machine learning.
Collapse
Affiliation(s)
- Robert Langlois
- Howard Hughes Medical Institute, Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY, 10032
| | - Joachim Frank
- Howard Hughes Medical Institute, Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY, 10032
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY, 10032
| |
Collapse
|
7
|
Lyumkis D, Moeller A, Cheng A, Herold A, Hou E, Irving C, Jacovetty EL, Lau PW, Mulder AM, Pulokas J, Quispe JD, Voss NR, Potter CS, Carragher B. Automation in single-particle electron microscopy connecting the pieces. Methods Enzymol 2010; 483:291-338. [PMID: 20888480 DOI: 10.1016/s0076-6879(10)83015-0] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
Throughout the history of single-particle electron microscopy (EM), automated technologies have seen varying degrees of emphasis and development, usually depending upon the contemporary demands of the field. We are currently faced with increasingly sophisticated devices for specimen preparation, vast increases in the size of collected data sets, comprehensive algorithms for image processing, sophisticated tools for quality assessment, and an influx of interested scientists from outside the field who might lack the skills of experienced microscopists. This situation places automated techniques in high demand. In this chapter, we provide a generic definition of and discuss some of the most important advances in automated approaches to specimen preparation, grid handling, robotic screening, microscope calibrations, data acquisition, image processing, and computational infrastructure. Each section describes the general problem and then provides examples of how that problem has been addressed through automation, highlighting available processing packages, and sometimes describing the particular approach at the National Resource for Automated Molecular Microscopy (NRAMM). We contrast the more familiar manual procedures with automated approaches, emphasizing breakthroughs as well as current limitations. Finally, we speculate on future directions and improvements in automated technologies. Our overall goal is to present automation as more than simply a tool to save time. Rather, we aim to illustrate that automation is a comprehensive and versatile strategy that can deliver biological information on an unprecedented scale beyond the scope available with classical manual approaches.
Collapse
Affiliation(s)
- Dmitry Lyumkis
- National Resource for Automated Molecular Microscopy, Department of Cell Biology, The Scripps Research Institute, La Jolla, California, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
8
|
Sorzano COS, Recarte E, Alcorlo M, Bilbao-Castro JR, San-Martín C, Marabini R, Carazo JM. Automatic particle selection from electron micrographs using machine learning techniques. J Struct Biol 2009; 167:252-60. [PMID: 19555764 PMCID: PMC2777658 DOI: 10.1016/j.jsb.2009.06.011] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2009] [Revised: 06/15/2009] [Accepted: 06/16/2009] [Indexed: 10/20/2022]
Abstract
The 3D reconstruction of biological specimens using Electron Microscopy is currently capable of achieving subnanometer resolution. Unfortunately, this goal requires gathering tens of thousands of projection images that are frequently selected manually from micrographs. In this paper we introduce a new automatic particle selection that learns from the user which particles are of interest. The training phase is semi-supervised so that the user can correct the algorithm during picking and specifically identify incorrectly picked particles. By treating such errors specially, the algorithm attempts to minimize the number of false positives. We show that our algorithm is able to produce datasets with fewer wrongly selected particles than previously reported methods. Another advantage is that we avoid the need for an initial reference volume from which to generate picking projections by instead learning which particles to pick from the user. This package has been made publicly available in the open-source package Xmipp.
Collapse
Affiliation(s)
- C O S Sorzano
- Unidad de Biocomputación, Centro Nacional de Biotecnología (CSIC), Campus Universidad Autónoma s/n, 28049 Cantoblanco, Madrid, Spain.
| | | | | | | | | | | | | |
Collapse
|
9
|
Prust CJ, Doerschuk PC, Lander GC, Johnson JE. Ab initio maximum likelihood reconstruction from cryo electron microscopy images of an infectious virion of the tailed bacteriophage P22 and maximum likelihood versions of Fourier Shell Correlation appropriate for measuring resolution of spherical or cylindrical objects. J Struct Biol 2009; 167:185-99. [PMID: 19457456 PMCID: PMC2803348 DOI: 10.1016/j.jsb.2009.04.013] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2008] [Revised: 03/04/2009] [Accepted: 04/28/2009] [Indexed: 11/25/2022]
Abstract
A maximum likelihood reconstruction method for an asymmetric reconstruction of the infectious P22 bacteriophage virion is described and demonstrated on a subset of the images used in [Lander, G.C., Tang, L., Casjens, S.R., Gilcrease, E.B., Prevelige, P., Poliakov, A., Potter, C.S., Carragher, B., Johnson, J.E., 2006. The structure of an infectious P22 virion shows the signal for headful DNA packaging. Science 312(5781), 1791-1795]. The method makes no assumptions at any stage regarding the structure of the phage tail or the relative rotational orientation of the phage tail and capsid but rather the structure and the rotation angle are determined as a part of the analysis. A statistical method for determining resolution consistent with maximum likelihood principles based on ideas for cylinders analogous to the ideas for spheres that are embedded in the Fourier Shell Correlation method is described and demonstrated on the P22 reconstruction. With a correlation threshold of .95, the resolution in the tail measured radially is greater than 0.0301A(-1) (33.3A) and measured axially is greater than 0.0142A(-1) (70.6A) both with probability p=0.02.
Collapse
Affiliation(s)
- Cory J. Prust
- Department of Electrical Engineering and Computer Science, Milwaukee School of Engineering, 1025 N. Broadway, Milwaukee, WI 53202-3109, USA
| | - Peter C. Doerschuk
- Department of Biomedical Engineering and School of Electrical and Computer Engineering, Cornell University, 305 Phillips Hall, Ithaca, NY 14853-5401, USA
| | - Gabriel C. Lander
- Department of Molecular Biology, The Scripps Research Institute, 10550 N. Torrey Pines Road, La Jolla, CA 92037, USA
| | - John E. Johnson
- Department of Molecular Biology, The Scripps Research Institute, 10550 N. Torrey Pines Road, La Jolla, CA 92037, USA
| |
Collapse
|