1
|
Roberts EJ, Chavez T, Hexemer A, Zwart PH. DLSIA: Deep Learning for Scientific Image Analysis. J Appl Crystallogr 2024; 57:392-402. [PMID: 38596727 PMCID: PMC11001410 DOI: 10.1107/s1600576724001390] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Accepted: 02/12/2024] [Indexed: 04/11/2024] Open
Abstract
DLSIA (Deep Learning for Scientific Image Analysis) is a Python-based machine learning library that empowers scientists and researchers across diverse scientific domains with a range of customizable convolutional neural network (CNN) architectures for a wide variety of tasks in image analysis to be used in downstream data processing. DLSIA features easy-to-use architectures, such as autoencoders, tunable U-Nets and parameter-lean mixed-scale dense networks (MSDNets). Additionally, this article introduces sparse mixed-scale networks (SMSNets), generated using random graphs, sparse connections and dilated convolutions connecting different length scales. For verification, several DLSIA-instantiated networks and training scripts are employed in multiple applications, including inpainting for X-ray scattering data using U-Nets and MSDNets, segmenting 3D fibers in X-ray tomographic reconstructions of concrete using an ensemble of SMSNets, and leveraging autoencoder latent spaces for data compression and clustering. As experimental data continue to grow in scale and complexity, DLSIA provides accessible CNN construction and abstracts CNN complexities, allowing scientists to tailor their machine learning approaches, accelerate discoveries, foster interdisciplinary collaboration and advance research in scientific image analysis.
Collapse
Affiliation(s)
- Eric J. Roberts
- Center for Advanced Mathematics for Energy Research Applications, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Tanny Chavez
- Advanced Light Source, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Alexander Hexemer
- Center for Advanced Mathematics for Energy Research Applications, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
- Advanced Light Source, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Petrus H. Zwart
- Center for Advanced Mathematics for Energy Research Applications, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
- Berkeley Synchrotron Infrared Structural Biology Program, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| |
Collapse
|
2
|
Ekeberg T. Introduction to the virtual collection of papers on Artificial neural networks: applications in X-ray photon science and crystallography. J Appl Crystallogr 2024; 57:1-2. [PMID: 38322721 PMCID: PMC10840311 DOI: 10.1107/s1600576723010476] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2024] Open
Abstract
Artificial intelligence is more present than ever, both in our society in general and in science. At the center of this development has been the concept of deep learning, the use of artificial neural networks that are many layers deep and can often reproduce human-like behavior much better than other machine-learning techniques. The articles in this collection are some recent examples of its application for X-ray photon science and crystallography that have been published in Journal of Applied Crystallography.
Collapse
Affiliation(s)
- Tomas Ekeberg
- Department of Cell and Molecular Biology, Uppsala University, Husargatan 3 (Box 596), SE-751 24 Uppsala, Sweden
| |
Collapse
|
3
|
Fuller CA, Rudden LSP. Unravelling the components of diffuse scattering using deep learning. IUCRJ 2024; 11:34-44. [PMID: 37962471 PMCID: PMC10833394 DOI: 10.1107/s2052252523009521] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/21/2023] [Accepted: 10/31/2023] [Indexed: 11/15/2023]
Abstract
Many technologically important material properties are underpinned by disorder and short-range structural correlations; therefore, elucidating structure-property relationships in functional materials requires understanding both the average and the local structures. The latter information is contained within diffuse scattering but is challenging to exploit, particularly in single-crystal systems. Separation of the diffuse scattering into its constituent components can greatly simplify analysis and allows for quantitative parameters describing the disorder to be extracted directly. Here, a deep-learning method, DSFU-Net, is presented based on the Pix2Pix generative adversarial network, which takes a plane of diffuse scattering as input and factorizes it into the contributions from the molecular form factor and the chemical short-range order. DSFU-Net was trained on 198 421 samples of simulated diffuse scattering data and performed extremely well on the unseen simulated validation dataset in this work. On a real experimental example, DSFU-Net successfully reproduced the two components with a quality sufficient to distinguish between similar structural models based on the form factor and to refine short-range-order parameters, achieving values comparable to other established methods. This new approach could streamline the analysis of diffuse scattering as it requires minimal prior knowledge of the system, allows access to both components in seconds and is able to compensate for small regions with missing data. DSFU-Net is freely available for use and represents a first step towards an automated workflow for the analysis of single-crystal diffuse scattering.
Collapse
|
4
|
Hao G, Roberts EJ, Chavez T, Zhao Z, Holman EA, Yanxon H, Green A, Krishnan H, Ushizima D, McReynolds D, Schwarz N, Zwart PH, Hexemer A, Parkinson DY. Deploying Machine Learning Based Segmentation for Scientific Imaging Analysis at Synchrotron Facilities. IS&T INTERNATIONAL SYMPOSIUM ON ELECTRONIC IMAGING 2023; 35:IPAS-290. [PMID: 38130938 PMCID: PMC10735246 DOI: 10.2352/ei.2023.35.9.ipas-290] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/23/2023]
Abstract
Scientific user facilities present a unique set of challenges for image processing due to the large volume of data generated from experiments and simulations. Furthermore, developing and implementing algorithms for real-time processing and analysis while correcting for any artifacts or distortions in images remains a complex task, given the computational requirements of the processing algorithms. In a collaborative effort across multiple Department of Energy national laboratories, the "MLExchange" project is focused on addressing these challenges. MLExchange is a Machine Learning framework deploying interactive web interfaces to enhance and accelerate data analysis. The platform allows users to easily upload, visualize, label, and train networks. The resulting models can be deployed on real data while both results and models could be shared with the scientists. The MLExchange web-based application for image segmentation allows for training, testing, and evaluating multiple machine learning models on hand-labeled tomography data. This environment provides users with an intuitive interface for segmenting images using a variety of machine learning algorithms and deep-learning neural networks. Additionally, these tools have the potential to overcome limitations in traditional image segmentation techniques, particularly for complex and low-contrast images.
Collapse
Affiliation(s)
- Guanhua Hao
- Advanced Light Source (ALS), Lawrence Berkeley National Laboratory; Berkeley, CA 94720
| | - Eric J. Roberts
- Center for Advanced Mathematics for Energy Research Applications (CAMERA), Lawrence Berkeley National Laboratory; Berkeley, CA 94720
- Molecular Biophysics and Integrated Bioimaging (MBIB), Lawrence Berkeley National Laboratory; Berkeley, CA 94720
| | - Tanny Chavez
- Advanced Light Source (ALS), Lawrence Berkeley National Laboratory; Berkeley, CA 94720
| | - Zhuowen Zhao
- Advanced Light Source (ALS), Lawrence Berkeley National Laboratory; Berkeley, CA 94720
| | - Elizabeth A. Holman
- Advanced Light Source (ALS), Lawrence Berkeley National Laboratory; Berkeley, CA 94720
| | - Howard Yanxon
- Advanced Photon Source (APS), Argonne National Laboratory; Lemont, IL 60439
| | - Adam Green
- Advanced Light Source (ALS), Lawrence Berkeley National Laboratory; Berkeley, CA 94720
| | - Harinarayan Krishnan
- Advanced Light Source (ALS), Lawrence Berkeley National Laboratory; Berkeley, CA 94720
- Center for Advanced Mathematics for Energy Research Applications (CAMERA), Lawrence Berkeley National Laboratory; Berkeley, CA 94720
| | - Daniela Ushizima
- Center for Advanced Mathematics for Energy Research Applications (CAMERA), Lawrence Berkeley National Laboratory; Berkeley, CA 94720
- Computational Research Division (CRD), Lawrence Berkeley National Laboratory; Berkeley, CA 94720
| | - Dylan McReynolds
- Advanced Light Source (ALS), Lawrence Berkeley National Laboratory; Berkeley, CA 94720
| | - Nicholas Schwarz
- Advanced Photon Source (APS), Argonne National Laboratory; Lemont, IL 60439
| | - Petrus H. Zwart
- Center for Advanced Mathematics for Energy Research Applications (CAMERA), Lawrence Berkeley National Laboratory; Berkeley, CA 94720
- Molecular Biophysics and Integrated Bioimaging (MBIB), Lawrence Berkeley National Laboratory; Berkeley, CA 94720
| | - Alexander Hexemer
- Advanced Light Source (ALS), Lawrence Berkeley National Laboratory; Berkeley, CA 94720
| | - Dilworth Y. Parkinson
- Advanced Light Source (ALS), Lawrence Berkeley National Laboratory; Berkeley, CA 94720
| |
Collapse
|
5
|
Zhao Z, Chavez T, Holman EA, Hao G, Green A, Krishnan H, McReynolds D, Pandolfi RJ, Roberts EJ, Zwart PH, Yanxon H, Schwarz N, Sankaranarayanan S, Kalinin SV, Mehta A, Campbell SI, Hexemer A. MLExchange: A web-based platform enabling exchangeable machine learning workflows for scientific studies. ANNUAL WORKSHOP ON EXTREME-SCALE EXPERIMENT-IN-THE-LOOP COMPUTING : XLOOP. ANNUAL WORKSHOP ON EXTREME-SCALE EXPERIMENT-IN-THE-LOOP COMPUTING 2022; 2022:10-15. [PMID: 38131031 PMCID: PMC10733127 DOI: 10.1109/xloop56614.2022.00007] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/23/2023]
Abstract
Machine learning (ML) algorithms are showing a growing trend in helping the scientific communities across different disciplines and institutions to address large and diverse data problems. However, many available ML tools are programmatically demanding and computationally costly. The MLExchange project aims to build a collaborative platform equipped with enabling tools that allow scientists and facility users who do not have a profound ML background to use ML and computational resources in scientific discovery. At the high level, we are targeting a full user experience where managing and exchanging ML algorithms, workflows, and data are readily available through web applications. Since each component is an independent container, the whole platform or its individual service(s) can be easily deployed at servers of different scales, ranging from a personal device (laptop, smart phone, etc.) to high performance clusters (HPC) accessed (simultaneously) by many users. Thus, MLExchange renders flexible using scenarios-users could either access the services and resources from a remote server or run the whole platform or its individual service(s) within their local network.
Collapse
Affiliation(s)
- Zhuowen Zhao
- Advanced Light Source (ALS) Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720
| | - Tanny Chavez
- Advanced Light Source (ALS) Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720
| | - Elizabeth A. Holman
- Advanced Light Source (ALS) Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720
| | - Guanhua Hao
- Advanced Light Source (ALS) Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720
| | - Adam Green
- Advanced Light Source (ALS) Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720
| | - Harinarayan Krishnan
- Advanced Light Source (ALS) Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720
- Center for Advanced Mathematics for Energy Research Applications (CAMERA), Lawrence Berkeley National Laboratory, Berkeley, CA 94720
| | - Dylan McReynolds
- Advanced Light Source (ALS) Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720
| | - Ronald J. Pandolfi
- Center for Advanced Mathematics for Energy Research Applications (CAMERA), Lawrence Berkeley National Laboratory, Berkeley, CA 94720
| | - Eric J. Roberts
- Center for Advanced Mathematics for Energy Research Applications (CAMERA), Lawrence Berkeley National Laboratory, Berkeley, CA 94720
- Molecular Biophysics and Integrated Bioimaging Division (MBIB), Lawrence Berkeley National Laboratory, Berkeley, CA 94720
| | - Petrus H. Zwart
- Center for Advanced Mathematics for Energy Research Applications (CAMERA), Lawrence Berkeley National Laboratory, Berkeley, CA 94720
- Molecular Biophysics and Integrated Bioimaging Division (MBIB), Lawrence Berkeley National Laboratory, Berkeley, CA 94720
| | - Howard Yanxon
- Advanced Photon Source, Argonne National Laboratory, Lemont, IL 60439
| | - Nicholas Schwarz
- Advanced Photon Source, Argonne National Laboratory, Lemont, IL 60439
| | - Subramanian Sankaranarayanan
- Center for Nanoscale Materials (CNM), Argonne National Laboratory, Lemont, IL 60439
- Department of Mechanical and Industrial Engineering, University of Illinois Chicago, Chicago, IL 60607
| | - Sergei V. Kalinin
- Center for Nanophase Materials Sciences, Oak Ridge National Laboratory, Oak Ridge, TN 37830
| | - Apurva Mehta
- SLAC National Accelerator Laboratory, Menlo Park, CA 94025
| | - Stuart I. Campbell
- National Synchrotron Light Source II, Brookhaven National Laboratory, Upton, NY 11973
| | - Alexander Hexemer
- Advanced Light Source (ALS) Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720
| |
Collapse
|