1
|
Rahmani V, Nawaz S, Pennicard D, Graafsma H. Robust image descriptor for machine learning based data reduction in serial crystallography. J Appl Crystallogr 2024; 57:413-430. [PMID: 38596725 PMCID: PMC11001400 DOI: 10.1107/s160057672400147x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Accepted: 02/13/2024] [Indexed: 04/11/2024] Open
Abstract
Serial crystallography experiments at synchrotron and X-ray free-electron laser (XFEL) sources are producing crystallographic data sets of ever-increasing volume. While these experiments have large data sets and high-frame-rate detectors (around 3520 frames per second), only a small percentage of the data are useful for downstream analysis. Thus, an efficient and real-time data classification pipeline is essential to differentiate reliably between useful and non-useful images, typically known as 'hit' and 'miss', respectively, and keep only hit images on disk for further analysis such as peak finding and indexing. While feature-point extraction is a key component of modern approaches to image classification, existing approaches require computationally expensive patch preprocessing to handle perspective distortion. This paper proposes a pipeline to categorize the data, consisting of a real-time feature extraction algorithm called modified and parallelized FAST (MP-FAST), an image descriptor and a machine learning classifier. For parallelizing the primary operations of the proposed pipeline, central processing units, graphics processing units and field-programmable gate arrays are implemented and their performances compared. Finally, MP-FAST-based image classification is evaluated using a multi-layer perceptron on various data sets, including both synthetic and experimental data. This approach demonstrates superior performance compared with other feature extractors and classifiers.
Collapse
Affiliation(s)
- Vahid Rahmani
- Deutsches Elektronen-Synchrotron (DESY), Notkestraße 85, Hamburg, 22607, Germany
| | - Shah Nawaz
- Deutsches Elektronen-Synchrotron (DESY), Notkestraße 85, Hamburg, 22607, Germany
| | - David Pennicard
- Deutsches Elektronen-Synchrotron (DESY), Notkestraße 85, Hamburg, 22607, Germany
| | - Heinz Graafsma
- Deutsches Elektronen-Synchrotron (DESY), Notkestraße 85, Hamburg, 22607, Germany
- Mid-Sweden University, Sundsvall, Sweden
| |
Collapse
|
2
|
Xie J, Liu J, Zhang C, Chen X, Huai P, Zheng J, Zhang X. Weakly supervised learning for pattern classification in serial femtosecond crystallography. OPTICS EXPRESS 2023; 31:32909-32924. [PMID: 37859083 DOI: 10.1364/oe.492311] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/03/2023] [Accepted: 07/30/2023] [Indexed: 10/21/2023]
Abstract
Serial femtosecond crystallography at X-ray free electron laser facilities opens a new era for the determination of crystal structure. However, the data processing of those experiments is facing unprecedented challenge, because the total number of diffraction patterns needed to determinate a high-resolution structure is huge. Machine learning methods are very likely to play important roles in dealing with such a large volume of data. Convolutional neural networks have made a great success in the field of pattern classification, however, training of the networks need very large datasets with labels. This heavy dependence on labeled datasets will seriously restrict the application of networks, because it is very costly to annotate a large number of diffraction patterns. In this article we present our job on the classification of diffraction pattern by weakly supervised algorithms, with the aim of reducing as much as possible the size of the labeled dataset required for training. Our result shows that weakly supervised methods can significantly reduce the need for the number of labeled patterns while achieving comparable accuracy to fully supervised methods.
Collapse
|
3
|
Sun Y, Brockhauser S, Hegedűs P, Plückthun C, Gelisio L, Ferreira de Lima DE. Application of self-supervised approaches to the classification of X-ray diffraction spectra during phase transitions. Sci Rep 2023; 13:9370. [PMID: 37296300 PMCID: PMC10256752 DOI: 10.1038/s41598-023-36456-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2023] [Accepted: 06/04/2023] [Indexed: 06/12/2023] Open
Abstract
Spectroscopy and X-ray diffraction techniques encode ample information on investigated samples. The ability of rapidly and accurately extracting these enhances the means to steer the experiment, as well as the understanding of the underlying processes governing the experiment. It improves the efficiency of the experiment, and maximizes the scientific outcome. To address this, we introduce and validate three frameworks based on self-supervised learning which are capable of classifying 1D spectral curves using data transformations preserving the scientific content and only a small amount of data labeled by domain experts. In particular, in this work we focus on the identification of phase transitions in samples investigated by x-ray powder diffraction. We demonstrate that the three frameworks, based either on relational reasoning, contrastive learning, or a combination of the two, are capable of accurately identifying phase transitions. Furthermore, we discuss in detail the selection of data augmentation techniques, crucial to ensure that scientifically meaningful information is retained.
Collapse
Affiliation(s)
- Yue Sun
- Software Engineering Department, Institute of Informatics, University of Szeged, Dugonics tér 13, Szeged, 6720, Hungary.
- European XFEL GmbH, Holzkoppel 4, 22869, Schenefeld, Germany.
| | - Sandor Brockhauser
- Software Engineering Department, Institute of Informatics, University of Szeged, Dugonics tér 13, Szeged, 6720, Hungary
- Center for Materials Science Data, Humboldt-Universität zu Berlin, Zum Großen Windkanal 2, 12489, Berlin, Germany
| | - Péter Hegedűs
- Software Engineering Department, Institute of Informatics, University of Szeged, Dugonics tér 13, Szeged, 6720, Hungary.
| | - Christian Plückthun
- European XFEL GmbH, Holzkoppel 4, 22869, Schenefeld, Germany
- Deutsches Elektronen-Synchrotron (DESY), 22607, Hamburg, Germany
| | - Luca Gelisio
- European XFEL GmbH, Holzkoppel 4, 22869, Schenefeld, Germany
| | | |
Collapse
|
4
|
Zimmermann J, Beguet F, Guthruf D, Langbehn B, Rupp D. Finding the semantic similarity in single-particle diffraction images using self-supervised contrastive projection learning. NPJ COMPUTATIONAL MATERIALS 2023; 9:24. [PMID: 38666059 PMCID: PMC11041688 DOI: 10.1038/s41524-023-00966-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/23/2022] [Accepted: 01/10/2023] [Indexed: 04/28/2024]
Abstract
Single-shot coherent diffraction imaging of isolated nanosized particles has seen remarkable success in recent years, yielding in-situ measurements with ultra-high spatial and temporal resolution. The progress of high-repetition-rate sources for intense X-ray pulses has further enabled recording datasets containing millions of diffraction images, which are needed for the structure determination of specimens with greater structural variety and dynamic experiments. The size of the datasets, however, represents a monumental problem for their analysis. Here, we present an automatized approach for finding semantic similarities in coherent diffraction images without relying on human expert labeling. By introducing the concept of projection learning, we extend self-supervised contrastive learning to the context of coherent diffraction imaging and achieve a dimensionality reduction producing semantically meaningful embeddings that align with physical intuition. The method yields substantial improvements compared to previous approaches, paving the way toward real-time and large-scale analysis of coherent diffraction experiments at X-ray free-electron lasers.
Collapse
Affiliation(s)
| | | | | | | | - Daniela Rupp
- ETH Zürich, Zürich, Switzerland
- Max-Born-Institut, Berlin, Germany
| |
Collapse
|
5
|
Coherent correlation imaging for resolving fluctuating states of matter. Nature 2023; 614:256-261. [PMID: 36653456 PMCID: PMC9908557 DOI: 10.1038/s41586-022-05537-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2021] [Accepted: 11/08/2022] [Indexed: 01/19/2023]
Abstract
Fluctuations and stochastic transitions are ubiquitous in nanometre-scale systems, especially in the presence of disorder. However, their direct observation has so far been impeded by a seemingly fundamental, signal-limited compromise between spatial and temporal resolution. Here we develop coherent correlation imaging (CCI) to overcome this dilemma. Our method begins by classifying recorded camera frames in Fourier space. Contrast and spatial resolution emerge by averaging selectively over same-state frames. Temporal resolution down to the acquisition time of a single frame arises independently from an exceptionally low misclassification rate, which we achieve by combining a correlation-based similarity metric1,2 with a modified, iterative hierarchical clustering algorithm3,4. We apply CCI to study previously inaccessible magnetic fluctuations in a highly degenerate magnetic stripe domain state with nanometre-scale resolution. We uncover an intricate network of transitions between more than 30 discrete states. Our spatiotemporal data enable us to reconstruct the pinning energy landscape and to thereby explain the dynamics observed on a microscopic level. CCI massively expands the potential of emerging high-coherence X-ray sources and paves the way for addressing large fundamental questions such as the contribution of pinning5-8 and topology9-12 in phase transitions and the role of spin and charge order fluctuations in high-temperature superconductivity13,14.
Collapse
|
6
|
Colombo A, Zimmermann J, Langbehn B, Möller T, Peltz C, Sander K, Kruse B, Tümmler P, Barke I, Rupp D, Fennel T. The Scatman: an approximate method for fast wide-angle scattering simulations. J Appl Crystallogr 2022; 55:1232-1246. [PMID: 36249495 PMCID: PMC9533759 DOI: 10.1107/s1600576722008068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Accepted: 08/11/2022] [Indexed: 11/17/2022] Open
Abstract
Single-shot coherent diffraction imaging (CDI) is a powerful approach to characterize the structure and dynamics of isolated nanoscale objects such as single viruses, aerosols, nanocrystals and droplets. Using X-ray wavelengths, the diffraction images in CDI experiments usually cover only small scattering angles of a few degrees. These small-angle patterns represent the magnitude of the Fourier transform of the 2D projection of the sample's electron density, which can be reconstructed efficiently but lacks any depth information. In cases where the diffracted signal can be measured up to scattering angles exceeding ∼10°, i.e. in the wide-angle regime, some 3D morphological information of the target is contained in a single-shot diffraction pattern. However, the extraction of the 3D structural information is no longer straightforward and defines the key challenge in wide-angle CDI. So far, the most convenient approach relies on iterative forward fitting of the scattering pattern using scattering simulations. Here the Scatman is presented, an approximate and fast numerical tool for the simulation and iterative fitting of wide-angle scattering images of isolated samples. Furthermore, the open-source software implementation of the Scatman algorithm, PyScatman, is published and described in detail. The Scatman approach, which has already been applied in previous work for forward-fitting-based shape retrieval, adopts the multi-slice Fourier transform method. The effects of optical properties are partially included, yielding quantitative results for small, isolated and weakly interacting samples. PyScatman is capable of computing wide-angle scattering patterns in a few milliseconds even on consumer-level computing hardware, potentially enabling new data analysis schemes for wide-angle coherent diffraction experiments.
Collapse
Affiliation(s)
- Alessandro Colombo
- Laboratory for Solid State Physics, ETH Zürich, 8093 Zürich, Switzerland
| | - Julian Zimmermann
- Laboratory for Solid State Physics, ETH Zürich, 8093 Zürich, Switzerland
| | - Bruno Langbehn
- Institute for Optics and Atomic Physics, Technical University Berlin, 10623 Berlin, Germany
| | - Thomas Möller
- Institute for Optics and Atomic Physics, Technical University Berlin, 10623 Berlin, Germany
| | - Christian Peltz
- Institute for Physics, University of Rostock, 18059 Rostock, Germany
| | - Katharina Sander
- Institute for Physics, University of Rostock, 18059 Rostock, Germany
| | - Björn Kruse
- Institute for Physics, University of Rostock, 18059 Rostock, Germany
| | - Paul Tümmler
- Institute for Physics, University of Rostock, 18059 Rostock, Germany
| | - Ingo Barke
- Institute for Physics, University of Rostock, 18059 Rostock, Germany
- Department of Life, Light and Matter, University of Rostock, 18059 Rostock, Germany
| | - Daniela Rupp
- Laboratory for Solid State Physics, ETH Zürich, 8093 Zürich, Switzerland
| | - Thomas Fennel
- Institute for Physics, University of Rostock, 18059 Rostock, Germany
- Department of Life, Light and Matter, University of Rostock, 18059 Rostock, Germany
| |
Collapse
|
7
|
Assalauova D, Ignatenko A, Isensee F, Trofimova D, Vartanyants IA. Classification of diffraction patterns using a convolutional neural network in single-particle-imaging experiments performed at X-ray free-electron lasers. J Appl Crystallogr 2022; 55:444-454. [PMID: 35719305 PMCID: PMC9172041 DOI: 10.1107/s1600576722002667] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2021] [Accepted: 03/08/2022] [Indexed: 11/10/2022] Open
Abstract
Single particle imaging (SPI) at X-ray free-electron lasers is particularly well suited to determining the 3D structure of particles at room temperature. For a successful reconstruction, diffraction patterns originating from a single hit must be isolated from a large number of acquired patterns. It is proposed that this task could be formulated as an image-classification problem and solved using convolutional neural network (CNN) architectures. Two CNN configurations are developed: one that maximizes the F1 score and one that emphasizes high recall. The CNNs are also combined with expectation-maximization (EM) selection as well as size filtering. It is observed that the CNN selections have lower contrast in power spectral density functions relative to the EM selection used in previous work. However, the reconstruction of the CNN-based selections gives similar results. Introducing CNNs into SPI experiments allows the reconstruction pipeline to be streamlined, enables researchers to classify patterns on the fly, and, as a consequence, enables them to tightly control the duration of their experiments. Incorporating non-standard artificial-intelligence-based solutions into an existing SPI analysis workflow may be beneficial for the future development of SPI experiments.
Collapse
Affiliation(s)
- Dameli Assalauova
- Deutsches Elektronen-Synchrotron DESY, Notkestraße 85, 22607 Hamburg, Germany
| | - Alexandr Ignatenko
- Deutsches Elektronen-Synchrotron DESY, Notkestraße 85, 22607 Hamburg, Germany
| | - Fabian Isensee
- Applied Computer Vision Lab, Helmholtz Imaging, German Cancer Research Center (DKFZ), Im Neuenheimer Feld 280, 69120 Heidelberg, Germany
- Division of Medical Image Computing, German Cancer Research Center (DKFZ), Im Neuenheimer Feld 280, 69120 Heidelberg, Germany
| | - Darya Trofimova
- Applied Computer Vision Lab, Helmholtz Imaging, German Cancer Research Center (DKFZ), Im Neuenheimer Feld 280, 69120 Heidelberg, Germany
- Division of Medical Image Computing, German Cancer Research Center (DKFZ), Im Neuenheimer Feld 280, 69120 Heidelberg, Germany
| | - Ivan A. Vartanyants
- Deutsches Elektronen-Synchrotron DESY, Notkestraße 85, 22607 Hamburg, Germany
| |
Collapse
|
8
|
Zhuang Y, Awel S, Barty A, Bean R, Bielecki J, Bergemann M, Daurer BJ, Ekeberg T, Estillore AD, Fangohr H, Giewekemeyer K, Hunter MS, Karnevskiy M, Kirian RA, Kirkwood H, Kim Y, Koliyadu J, Lange H, Letrun R, Lübke J, Mall A, Michelat T, Morgan AJ, Roth N, Samanta AK, Sato T, Shen Z, Sikorski M, Schulz F, Spence JCH, Vagovic P, Wollweber T, Worbs L, Xavier PL, Yefanov O, Maia FRNC, Horke DA, Küpper J, Loh ND, Mancuso AP, Chapman HN, Ayyer K. Unsupervised learning approaches to characterizing heterogeneous samples using X-ray single-particle imaging. IUCRJ 2022; 9:204-214. [PMID: 35371510 PMCID: PMC8895023 DOI: 10.1107/s2052252521012707] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/15/2021] [Accepted: 11/30/2021] [Indexed: 06/12/2023]
Abstract
One of the outstanding analytical problems in X-ray single-particle imaging (SPI) is the classification of structural heterogeneity, which is especially difficult given the low signal-to-noise ratios of individual patterns and the fact that even identical objects can yield patterns that vary greatly when orientation is taken into consideration. Proposed here are two methods which explicitly account for this orientation-induced variation and can robustly determine the structural landscape of a sample ensemble. The first, termed common-line principal component analysis (PCA), provides a rough classification which is essentially parameter free and can be run automatically on any SPI dataset. The second method, utilizing variation auto-encoders (VAEs), can generate 3D structures of the objects at any point in the structural landscape. Both these methods are implemented in combination with the noise-tolerant expand-maximize-compress (EMC) algorithm and its utility is demonstrated by applying it to an experimental dataset from gold nanoparticles with only a few thousand photons per pattern. Both discrete structural classes and continuous deformations are recovered. These developments diverge from previous approaches of extracting reproducible subsets of patterns from a dataset and open up the possibility of moving beyond the study of homogeneous sample sets to addressing open questions on topics such as nanocrystal growth and dynamics, as well as phase transitions which have not been externally triggered.
Collapse
Affiliation(s)
- Yulong Zhuang
- Max Planck Institute for the Structure and Dynamics of Matter, 22761 Hamburg, Germany
- Center for Free-Electron Laser Science, 22761 Hamburg, Germany
| | - Salah Awel
- Center for Free-Electron Laser Science CFEL, Deutsches Elektronen-Synchrotron DESY, 22607 Hamburg, Germany
| | - Anton Barty
- Center for Free-Electron Laser Science CFEL, Deutsches Elektronen-Synchrotron DESY, 22607 Hamburg, Germany
| | | | | | | | - Benedikt J. Daurer
- Center for Bio-Imaging Sciences, National University of Singapore, 117557, Singapore
| | - Tomas Ekeberg
- Department of Cell and Molecular Biology, Uppsala University, 75124 Uppsala, Sweden
| | - Armando D. Estillore
- Center for Free-Electron Laser Science CFEL, Deutsches Elektronen-Synchrotron DESY, 22607 Hamburg, Germany
| | - Hans Fangohr
- Max Planck Institute for the Structure and Dynamics of Matter, 22761 Hamburg, Germany
- Center for Free-Electron Laser Science, 22761 Hamburg, Germany
- European XFEL, 22869 Schenefeld, Germany
- University of Southampton, Southampton SO17 1BJ, United Kingdom
| | | | - Mark S. Hunter
- Linac Coherent Light Source, SLAC National Accelerator Laboratory, Menlo Park, CA 94025, USA
| | | | - Richard A. Kirian
- Department of Physics, Arizona State University, Tempe, AZ 85287, USA
| | | | | | | | - Holger Lange
- Hamburg Center for Ultrafast Imaging, Universität Hamburg, 22761 Hamburg, Germany
- Institute of Physical Chemistry, Universität Hamburg, 20146 Hamburg, Germany
| | | | - Jannik Lübke
- Center for Free-Electron Laser Science CFEL, Deutsches Elektronen-Synchrotron DESY, 22607 Hamburg, Germany
- Hamburg Center for Ultrafast Imaging, Universität Hamburg, 22761 Hamburg, Germany
- Department of Physics, Universität Hamburg, 22761 Hamburg, Germany
| | - Abhishek Mall
- Max Planck Institute for the Structure and Dynamics of Matter, 22761 Hamburg, Germany
- Center for Free-Electron Laser Science, 22761 Hamburg, Germany
| | | | - Andrew J. Morgan
- Department of Physics, University of Melbourne, Victoria 3010, Australia
| | - Nils Roth
- Center for Free-Electron Laser Science CFEL, Deutsches Elektronen-Synchrotron DESY, 22607 Hamburg, Germany
- Department of Physics, Universität Hamburg, 22761 Hamburg, Germany
| | - Amit K. Samanta
- Center for Free-Electron Laser Science CFEL, Deutsches Elektronen-Synchrotron DESY, 22607 Hamburg, Germany
| | | | - Zhou Shen
- Center for Bio-Imaging Sciences, National University of Singapore, 117557, Singapore
| | - Marcin Sikorski
- Center for Free-Electron Laser Science CFEL, Deutsches Elektronen-Synchrotron DESY, 22607 Hamburg, Germany
| | - Florian Schulz
- Hamburg Center for Ultrafast Imaging, Universität Hamburg, 22761 Hamburg, Germany
- Institute of Nanostructure and Solid State Physics, University of Hamburg, Luruper Chaussee 149, 22761 Hamburg, Germany
| | - John C. H. Spence
- Department of Physics, Arizona State University, Tempe, AZ 85287, USA
| | - Patrik Vagovic
- Center for Free-Electron Laser Science CFEL, Deutsches Elektronen-Synchrotron DESY, 22607 Hamburg, Germany
- European XFEL, 22869 Schenefeld, Germany
| | - Tamme Wollweber
- Max Planck Institute for the Structure and Dynamics of Matter, 22761 Hamburg, Germany
- Center for Free-Electron Laser Science, 22761 Hamburg, Germany
- Hamburg Center for Ultrafast Imaging, Universität Hamburg, 22761 Hamburg, Germany
| | - Lena Worbs
- Center for Free-Electron Laser Science CFEL, Deutsches Elektronen-Synchrotron DESY, 22607 Hamburg, Germany
- Department of Physics, Universität Hamburg, 22761 Hamburg, Germany
| | - P. Lourdu Xavier
- Max Planck Institute for the Structure and Dynamics of Matter, 22761 Hamburg, Germany
- Center for Free-Electron Laser Science CFEL, Deutsches Elektronen-Synchrotron DESY, 22607 Hamburg, Germany
- Hamburg Center for Ultrafast Imaging, Universität Hamburg, 22761 Hamburg, Germany
| | - Oleksandr Yefanov
- Center for Free-Electron Laser Science CFEL, Deutsches Elektronen-Synchrotron DESY, 22607 Hamburg, Germany
| | - Filipe R. N. C. Maia
- Department of Cell and Molecular Biology, Uppsala University, 75124 Uppsala, Sweden
- NERSC, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Daniel A. Horke
- Center for Free-Electron Laser Science CFEL, Deutsches Elektronen-Synchrotron DESY, 22607 Hamburg, Germany
- Hamburg Center for Ultrafast Imaging, Universität Hamburg, 22761 Hamburg, Germany
- Institute for Molecules and Materials, Radboud University, 6525 AJ Nijmegen, Netherlands
| | - Jochen Küpper
- Center for Free-Electron Laser Science CFEL, Deutsches Elektronen-Synchrotron DESY, 22607 Hamburg, Germany
- Hamburg Center for Ultrafast Imaging, Universität Hamburg, 22761 Hamburg, Germany
- Department of Physics, Universität Hamburg, 22761 Hamburg, Germany
- Department of Chemistry, Universität Hamburg, 20146 Hamburg, Germany
| | - N. Duane Loh
- Center for Bio-Imaging Sciences, National University of Singapore, 117557, Singapore
- Department of Physics, National University of Singapore, 117551, Singapore
| | - Adrian P. Mancuso
- European XFEL, 22869 Schenefeld, Germany
- Department of Chemistry and Physics, La Trobe Institute for Molecular Science, La Trobe University, Melbourne, Victoria 3086, Australia
| | - Henry N. Chapman
- Center for Free-Electron Laser Science CFEL, Deutsches Elektronen-Synchrotron DESY, 22607 Hamburg, Germany
- Hamburg Center for Ultrafast Imaging, Universität Hamburg, 22761 Hamburg, Germany
- Department of Physics, Universität Hamburg, 22761 Hamburg, Germany
| | - Kartik Ayyer
- Max Planck Institute for the Structure and Dynamics of Matter, 22761 Hamburg, Germany
- Center for Free-Electron Laser Science, 22761 Hamburg, Germany
- Hamburg Center for Ultrafast Imaging, Universität Hamburg, 22761 Hamburg, Germany
| |
Collapse
|
9
|
Bellisario A, Maia FRNC, Ekeberg T. Noise reduction and mask removal neural network for X-ray single-particle imaging. J Appl Crystallogr 2022; 55:122-132. [PMID: 35145358 PMCID: PMC8805166 DOI: 10.1107/s1600576721012371] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2021] [Accepted: 11/22/2021] [Indexed: 12/03/2022] Open
Abstract
Free-electron lasers could enable X-ray imaging of single biological macromolecules and the study of protein dynamics, paving the way for a powerful new imaging tool in structural biology, but a low signal-to-noise ratio and missing regions in the detectors, colloquially termed 'masks', affect data collection and hamper real-time evaluation of experimental data. In this article, the challenges posed by noise and masks are tackled by introducing a neural network pipeline that aims to restore diffraction intensities. For training and testing of the model, a data set of diffraction patterns was simulated from 10 900 different proteins with molecular weights within the range of 10-100 kDa and collected at a photon energy of 8 keV. The method is compared with a simple low-pass filtering algorithm based on autocorrelation constraints. The results show an improvement in the mean-squared error of roughly two orders of magnitude in the presence of masks compared with the noisy data. The algorithm was also tested at increasing mask width, leading to the conclusion that demasking can achieve good results when the mask is smaller than half of the central speckle of the pattern. The results highlight the competitiveness of this model for data processing and the feasibility of restoring diffraction intensities from unknown structures in real time using deep learning methods. Finally, an example is shown of this preprocessing making orientation recovery more reliable, especially for data sets containing very few patterns, using the expansion-maximization-compression algorithm.
Collapse
Affiliation(s)
- Alfredo Bellisario
- Laboratory of Molecular Biophysics, Department of Cell and Molecular Biology, Uppsala University, Husargatan 3 (Box 596), SE-751 24 Uppsala, Sweden
| | - Filipe R. N. C. Maia
- Laboratory of Molecular Biophysics, Department of Cell and Molecular Biology, Uppsala University, Husargatan 3 (Box 596), SE-751 24 Uppsala, Sweden
| | - Tomas Ekeberg
- Laboratory of Molecular Biophysics, Department of Cell and Molecular Biology, Uppsala University, Husargatan 3 (Box 596), SE-751 24 Uppsala, Sweden
| |
Collapse
|
10
|
Stielow T, Scheel S. Reconstruction of nanoscale particles from single-shot wide-angle free-electron-laser diffraction patterns with physics-informed neural networks. Phys Rev E 2021; 103:053312. [PMID: 34134223 DOI: 10.1103/physreve.103.053312] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2021] [Accepted: 05/11/2021] [Indexed: 11/07/2022]
Abstract
Single-shot wide-angle diffraction imaging is a widely used method to investigate the structure of noncrystallizing objects such as nanoclusters, large proteins, or even viruses. Its main advantage is that information about the three-dimensional structure of the object is already contained in a single image. This makes it useful for the reconstruction of fragile and nonreproducible particles without the need for tomographic measurements. However, currently there is no efficient numerical inversion algorithm available that is capable of determining the object's structure in real time. Neural networks, on the other hand, excel in image processing tasks suited for such purpose. Here we show how a physics-informed deep neural network can be used to reconstruct complete three-dimensional object models of uniform, convex particles on a voxel grid from single two-dimensional wide-angle scattering patterns. We demonstrate its universal reconstruction capabilities for silver nanoclusters, where the network uncovers novel geometric structures that reproduce the experimental scattering data with very high precision.
Collapse
Affiliation(s)
- Thomas Stielow
- Institut für Physik, Universität Rostock, D-18059 Rostock, Germany
| | - Stefan Scheel
- Institut für Physik, Universität Rostock, D-18059 Rostock, Germany
| |
Collapse
|
11
|
Ignatenko A, Assalauova D, Bobkov SA, Gelisio L, Teslyuk AB, Ilyin VA, Vartanyants IA. Classification of diffraction patterns in single particle imaging experiments performed at x-ray free-electron lasers using a convolutional neural network. MACHINE LEARNING: SCIENCE AND TECHNOLOGY 2021. [DOI: 10.1088/2632-2153/abd916] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Abstract
Single particle imaging (SPI) is a promising method of native structure determination, which has undergone fast progress with the development of x-ray free-electron lasers. Large amounts of data are collected during SPI experiments, driving the need for automated data analysis. The necessary data analysis pipeline has a number of steps including binary object classification (single versus non-single hits). Classification and object detection are areas where deep neural networks currently outperform other approaches. In this work, we use the fast object detector networks YOLOv2 and YOLOv3. By exploiting transfer learning, a moderate amount of data is sufficient to train the neural network. We demonstrate here that a convolutional neural network can be successfully used to classify data from SPI experiments. We compare the results of classification for the two different networks, with different depth and architecture, by applying them to the same SPI data with different data representation. The best results are obtained for diffracted intensity represented by color images on a linear scale using YOLOv2 for classification. It shows an accuracy of about 95% with precision and recall of about 50% and 60%, respectively, in comparison to manual data classification.
Collapse
|
12
|
Cruz-Chú ER, Hosseinizadeh A, Mashayekhi G, Fung R, Ourmazd A, Schwander P. Selecting XFEL single-particle snapshots by geometric machine learning. STRUCTURAL DYNAMICS (MELVILLE, N.Y.) 2021; 8:014701. [PMID: 33644252 PMCID: PMC7902084 DOI: 10.1063/4.0000060] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/04/2020] [Accepted: 01/21/2021] [Indexed: 05/05/2023]
Abstract
A promising new route for structural biology is single-particle imaging with an X-ray Free-Electron Laser (XFEL). This method has the advantage that the samples do not require crystallization and can be examined at room temperature. However, high-resolution structures can only be obtained from a sufficiently large number of diffraction patterns of individual molecules, so-called single particles. Here, we present a method that allows for efficient identification of single particles in very large XFEL datasets, operates at low signal levels, and is tolerant to background. This method uses supervised Geometric Machine Learning (GML) to extract low-dimensional feature vectors from a training dataset, fuse test datasets into the feature space of training datasets, and separate the data into binary distributions of "single particles" and "non-single particles." As a proof of principle, we tested simulated and experimental datasets of the Coliphage PR772 virus. We created a training dataset and classified three types of test datasets: First, a noise-free simulated test dataset, which gave near perfect separation. Second, simulated test datasets that were modified to reflect different levels of photon counts and background noise. These modified datasets were used to quantify the predictive limits of our approach. Third, an experimental dataset collected at the Stanford Linear Accelerator Center. The single-particle identification for this experimental dataset was compared with previously published results and it was found that GML covers a wide photon-count range, outperforming other single-particle identification methods. Moreover, a major advantage of GML is its ability to retrieve single particles in the presence of structural variability.
Collapse
Affiliation(s)
- Eduardo R. Cruz-Chú
- Department of Physics, University of Wisconsin-Milwaukee, 3135 N. Maryland Ave, Milwaukee, Wisconsin 53211, USA
| | - Ahmad Hosseinizadeh
- Department of Physics, University of Wisconsin-Milwaukee, 3135 N. Maryland Ave, Milwaukee, Wisconsin 53211, USA
| | - Ghoncheh Mashayekhi
- Department of Physics, University of Wisconsin-Milwaukee, 3135 N. Maryland Ave, Milwaukee, Wisconsin 53211, USA
| | - Russell Fung
- Department of Physics, University of Wisconsin-Milwaukee, 3135 N. Maryland Ave, Milwaukee, Wisconsin 53211, USA
| | - Abbas Ourmazd
- Department of Physics, University of Wisconsin-Milwaukee, 3135 N. Maryland Ave, Milwaukee, Wisconsin 53211, USA
| | - Peter Schwander
- Department of Physics, University of Wisconsin-Milwaukee, 3135 N. Maryland Ave, Milwaukee, Wisconsin 53211, USA
| |
Collapse
|
13
|
Stielow T, Schmidt R, Peltz C, Fennel T, Scheel S. Fast reconstruction of single-shot wide-angle diffraction images through deep learning. MACHINE LEARNING: SCIENCE AND TECHNOLOGY 2020. [DOI: 10.1088/2632-2153/abb213] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open
Abstract
Abstract
Single-shot x-ray imaging of short-lived nanostructures such as clusters and nanoparticles near a phase transition or non-crystalizing objects such as large proteins and viruses is currently the most elegant method for characterizing their structure. Using hard x-ray radiation provides scattering images that encode two-dimensional projections, which can be combined to identify the full three-dimensional object structure from multiple identical samples. Wide-angle scattering using XUV or soft x-rays, despite yielding lower resolution, provides three-dimensional structural information in a single shot and has opened routes towards the characterization of non-reproducible objects in the gas phase. The retrieval of the structural information contained in wide-angle scattering images is highly non-trivial, and currently no efficient rigorous algorithm is known. Here we show that deep learning networks, trained with simulated scattering data, allow for fast and accurate reconstruction of shape and orientation of nanoparticles from experimental images. The gain in speed compared to conventional retrieval techniques opens the route for automated structure reconstruction algorithms capable of real-time discrimination and pre-identification of nanostructures in scattering experiments with high repetition rate—thus representing the enabling technology for fast femtosecond nanocrystallography.
Collapse
|
14
|
Chen H, He X, Teng Q, Sheriff RE, Feng J, Xiong S. Super-resolution of real-world rock microcomputed tomography images using cycle-consistent generative adversarial networks. Phys Rev E 2020; 101:023305. [PMID: 32168576 DOI: 10.1103/physreve.101.023305] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2019] [Accepted: 01/23/2020] [Indexed: 11/07/2022]
Abstract
Digital rock imaging plays an important role in studying the microstructure and macroscopic properties of rocks, where microcomputed tomography (MCT) is widely used. Due to the inherent limitations of MCT, a balance should be made between the field of view (FOV) and resolution of rock MCT images-a large FOV at low resolution (LR) or a small FOV at high resolution (HR). However, large FOV and HR are both expected for reliable analysis results in practice. Super-resolution (SR) is an effective solution to break through the mutual restriction between the FOV and resolution of rock MCT images, for it can reconstruct an HR image from a LR observation. Most of the existing SR methods cannot produce satisfactory HR results on real-world rock MCT images. One of the main reasons for this is that paired images are usually needed to learn the relationship between LR and HR rock images. However, it is challenging to collect such a dataset in a real scenario. Meanwhile, the simulated datasets may be unable to accurately reflect the model in actual applications. To address these problems, we propose a cycle-consistent generative adversarial network (CycleGAN)-based SR approach for real-world rock MCT images, namely, SRCycleGAN. In the off-line training phase, a set of unpaired rock MCT images is used to train the proposed SRCycleGAN, which can model the mapping between rock MCT images at different resolutions. In the on-line testing phase, the resolution of the LR input is enhanced via the learned mapping by SRCycleGAN. Experimental results show that the proposed SRCycleGAN can greatly improve the quality of simulated and real-world rock MCT images. The HR images reconstructed by SRCycleGAN show good agreement with the targets in terms of both the visual quality and the statistical parameters, including the porosity, the local porosity distribution, the two-point correlation function, the lineal-path function, the two-point cluster function, the chord-length distribution function, and the pore size distribution. Large FOV and HR rock MCT images can be obtained with the help of SRCycleGAN. Hence, this work makes it possible to generate HR rock MCT images that exceed the limitations of imaging systems on FOV and resolution.
Collapse
Affiliation(s)
- Honggang Chen
- College of Electronics and Information Engineering, Sichuan University, Chengdu 610065, China
| | - Xiaohai He
- College of Electronics and Information Engineering, Sichuan University, Chengdu 610065, China.,Key Laboratory of Wireless Power Transmission of Ministry of Education, Chengdu 610065, China
| | - Qizhi Teng
- College of Electronics and Information Engineering, Sichuan University, Chengdu 610065, China
| | - Raymond E Sheriff
- School of Engineering, University of Bolton, Bolton BL35AB, United Kingdom
| | - Junxi Feng
- College of Electronics and Information Engineering, Sichuan University, Chengdu 610065, China
| | - Shuhua Xiong
- College of Electronics and Information Engineering, Sichuan University, Chengdu 610065, China
| |
Collapse
|