1
|
Applications of multivariate analysis and unsupervised machine learning to ToF-SIMS images of organic, bioorganic, and biological systems. Biointerphases 2022; 17:020802. [DOI: 10.1116/6.0001590] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Time-of-flight secondary ion mass spectrometry (ToF-SIMS) imaging offers a powerful, label-free method for exploring organic, bioorganic, and biological systems. The technique is capable of very high spatial resolution, while also producing an enormous amount of information about the chemical and molecular composition of a surface. However, this information is inherently complex, making interpretation and analysis of the vast amount of data produced by a single ToF-SIMS experiment a considerable challenge. Much research over the past few decades has focused on the application and development of multivariate analysis (MVA) and machine learning (ML) techniques that find meaningful patterns and relationships in these datasets. Here, we review the unsupervised algorithms—that is, algorithms that do not require ground truth labels—that have been applied to ToF-SIMS images, as well as other algorithms and approaches that have been used in the broader family of mass spectrometry imaging (MSI) techniques. We first give a nontechnical overview of several commonly used classes of unsupervised algorithms, such as matrix factorization, clustering, and nonlinear dimensionality reduction. We then review the application of unsupervised algorithms to various organic, bioorganic, and biological systems including cells and tissues, organic films, residues and coatings, and spatially structured systems such as polymer microarrays. We then cover several novel algorithms employed for other MSI techniques that have received little attention from ToF-SIMS imaging researchers. We conclude with a brief outline of potential future directions for the application of MVA and ML algorithms to ToF-SIMS images.
Collapse
|
2
|
Taylor M, Lukowski JK, Anderton CR. Spatially Resolved Mass Spectrometry at the Single Cell: Recent Innovations in Proteomics and Metabolomics. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2021; 32:872-894. [PMID: 33656885 PMCID: PMC8033567 DOI: 10.1021/jasms.0c00439] [Citation(s) in RCA: 182] [Impact Index Per Article: 45.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/02/2020] [Revised: 01/20/2021] [Accepted: 01/25/2021] [Indexed: 05/02/2023]
Abstract
Biological systems are composed of heterogeneous populations of cells that intercommunicate to form a functional living tissue. Biological function varies greatly across populations of cells, as each single cell has a unique transcriptome, proteome, and metabolome that translates to functional differences within single species and across kingdoms. Over the past decade, substantial advancements in our ability to characterize omic profiles on a single cell level have occurred, including in multiple spectroscopic and mass spectrometry (MS)-based techniques. Of these technologies, spatially resolved mass spectrometry approaches, including mass spectrometry imaging (MSI), have shown the most progress for single cell proteomics and metabolomics. For example, reporter-based methods using heavy metal tags have allowed for targeted MS investigation of the proteome at the subcellular level, and development of technologies such as laser ablation electrospray ionization mass spectrometry (LAESI-MS) now mean that dynamic metabolomics can be performed in situ. In this Perspective, we showcase advancements in single cell spatial metabolomics and proteomics over the past decade and highlight important aspects related to high-throughput screening, data analysis, and more which are vital to the success of achieving proteomic and metabolomic profiling at the single cell scale. Finally, using this broad literature summary, we provide a perspective on how the next decade may unfold in the area of single cell MS-based proteomics and metabolomics.
Collapse
Affiliation(s)
- Michael
J. Taylor
- Environmental Molecular Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Jessica K. Lukowski
- Environmental Molecular Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Christopher R. Anderton
- Environmental Molecular Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| |
Collapse
|
3
|
Verbeeck N, Caprioli RM, Van de Plas R. Unsupervised machine learning for exploratory data analysis in imaging mass spectrometry. MASS SPECTROMETRY REVIEWS 2020; 39:245-291. [PMID: 31602691 PMCID: PMC7187435 DOI: 10.1002/mas.21602] [Citation(s) in RCA: 142] [Impact Index Per Article: 28.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/01/2017] [Accepted: 08/27/2018] [Indexed: 05/20/2023]
Abstract
Imaging mass spectrometry (IMS) is a rapidly advancing molecular imaging modality that can map the spatial distribution of molecules with high chemical specificity. IMS does not require prior tagging of molecular targets and is able to measure a large number of ions concurrently in a single experiment. While this makes it particularly suited for exploratory analysis, the large amount and high-dimensional nature of data generated by IMS techniques make automated computational analysis indispensable. Research into computational methods for IMS data has touched upon different aspects, including spectral preprocessing, data formats, dimensionality reduction, spatial registration, sample classification, differential analysis between IMS experiments, and data-driven fusion methods to extract patterns corroborated by both IMS and other imaging modalities. In this work, we review unsupervised machine learning methods for exploratory analysis of IMS data, with particular focus on (a) factorization, (b) clustering, and (c) manifold learning. To provide a view across the various IMS modalities, we have attempted to include examples from a range of approaches including matrix assisted laser desorption/ionization, desorption electrospray ionization, and secondary ion mass spectrometry-based IMS. This review aims to be an entry point for both (i) analytical chemists and mass spectrometry experts who want to explore computational techniques; and (ii) computer scientists and data mining specialists who want to enter the IMS field. © 2019 The Authors. Mass Spectrometry Reviews published by Wiley Periodicals, Inc. Mass SpecRev 00:1-47, 2019.
Collapse
Affiliation(s)
- Nico Verbeeck
- Delft Center for Systems and ControlDelft University of Technology ‐ TU DelftDelftThe Netherlands
- Aspect Analytics NVGenkBelgium
- STADIUS Center for Dynamical Systems, Signal Processing, and Data Analytics, Department of Electrical Engineering (ESAT)KU LeuvenLeuvenBelgium
| | - Richard M. Caprioli
- Mass Spectrometry Research CenterVanderbilt UniversityNashvilleTN
- Department of BiochemistryVanderbilt UniversityNashvilleTN
- Department of ChemistryVanderbilt UniversityNashvilleTN
- Department of PharmacologyVanderbilt UniversityNashvilleTN
- Department of MedicineVanderbilt UniversityNashvilleTN
| | - Raf Van de Plas
- Delft Center for Systems and ControlDelft University of Technology ‐ TU DelftDelftThe Netherlands
- Mass Spectrometry Research CenterVanderbilt UniversityNashvilleTN
- Department of BiochemistryVanderbilt UniversityNashvilleTN
| |
Collapse
|
4
|
Burnett TL, Withers PJ. Completing the picture through correlative characterization. NATURE MATERIALS 2019; 18:1041-1049. [PMID: 31209389 DOI: 10.1038/s41563-019-0402-8] [Citation(s) in RCA: 32] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/01/2018] [Accepted: 05/15/2019] [Indexed: 05/28/2023]
Abstract
Natural and manufactured materials rely on complex hierarchical microstructures to deliver a suite of interesting properties. To predict and tailor their performance requires a joined-up knowledge of their multiphase microstructure, interfaces, chemistry and crystallography from the nanoscale to the macroscale. This Perspective reflects on how recent developments in correlative characterization can bring together multiple image modalities and maps of the local chemistry, structure and functionality to form rich multimodal and multiscale correlated datasets. The automated collection and digitization of multidimensional data is an essential part of the picture for developing multiscale modelling and 'big data'-driven machine learning approaches. These are needed to both improve our understanding of existing materials and exploit high-throughput combinatorial synthesis, processing and testing methods to develop materials with bespoke properties.
Collapse
Affiliation(s)
- T L Burnett
- Henry Royce Institute for Advanced Materials, School of Materials, The University of Manchester, Manchester, UK
| | - P J Withers
- Henry Royce Institute for Advanced Materials, School of Materials, The University of Manchester, Manchester, UK.
| |
Collapse
|
5
|
Madiona RMT, Bamford SE, Winkler DA, Muir BW, Pigram PJ. Distinguishing Chemically Similar Polyamide Materials with ToF-SIMS Using Self-Organizing Maps and a Universal Data Matrix. Anal Chem 2018; 90:12475-12484. [DOI: 10.1021/acs.analchem.8b01951] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Affiliation(s)
- Robert M. T. Madiona
- Centre for Materials and Surface Science and Department of Chemistry and Physics, School of Molecular Sciences, La Trobe University, Melbourne, VIC 3086, Australia
- CSIRO Manufacturing, Clayton, VIC 3168, Australia
| | - Sarah E. Bamford
- Centre for Materials and Surface Science and Department of Chemistry and Physics, School of Molecular Sciences, La Trobe University, Melbourne, VIC 3086, Australia
| | - David A. Winkler
- La Trobe Institute for Molecular Sciences, School of Molecular Sciences, La Trobe University, Melbourne, VIC 3086, Australia
- CSIRO Manufacturing, Clayton, VIC 3168, Australia
- Monash Institute of Pharmaceutical Sciences, Monash University, Parkville 3052, Australia
- School of Pharmacy, University of Nottingham, Nottingham NG7 2RD, U.K
| | | | - Paul J. Pigram
- Centre for Materials and Surface Science and Department of Chemistry and Physics, School of Molecular Sciences, La Trobe University, Melbourne, VIC 3086, Australia
| |
Collapse
|
6
|
Trindade GF, Abel ML, Lowe C, Tshulu R, Watts JF. A Time-of-Flight Secondary Ion Mass Spectrometry/Multivariate Analysis (ToF-SIMS/MVA) Approach To Identify Phase Segregation in Blends of Incompatible but Extremely Similar Resins. Anal Chem 2018; 90:3936-3941. [PMID: 29488747 DOI: 10.1021/acs.analchem.7b04877] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
This work presents a data analysis extension to a well-established methodology for the assessment of organic coatings using imaging time-of-flight secondary ion mass spectrometry (ToF-SIMS). Such an approach produced results that can be analyzed using a multivariate analysis (MVA) procedure that performs the simultaneous processing of spatially and chemically related datasets. The coatings consist of two commercial resins that yield extremely similar spectra, and there are no peaks of sufficient intensity that are uniquely diagnostic of either material to provide an unambiguous identification of each. In order to resolve the problem, in addition to microtome-based sample preparation steps of tapers for the analysis through sample thickness, standard samples in cured and uncured conditions are introduced and measured in the same fashion as the specimens under investigation. The resulting ToF-SIMS imaging datasets have been processed using non-negative matrix factorization (NMF), which enabled identification of phase separation in the cured coatings.
Collapse
Affiliation(s)
- Gustavo F Trindade
- The Surface Analysis Laboratory, Department of Mechanical Engineering Sciences , University of Surrey , Guildford , Surrey , GU2 7XH , United Kingdom
| | - Marie-Laure Abel
- The Surface Analysis Laboratory, Department of Mechanical Engineering Sciences , University of Surrey , Guildford , Surrey , GU2 7XH , United Kingdom
| | - Chris Lowe
- Becker Industrial Coatings, Ltd. , Goodlass Road , Speke , Liverpool , L24 9HJ , United Kingdom
| | - Rene Tshulu
- The Surface Analysis Laboratory, Department of Mechanical Engineering Sciences , University of Surrey , Guildford , Surrey , GU2 7XH , United Kingdom
| | - John F Watts
- The Surface Analysis Laboratory, Department of Mechanical Engineering Sciences , University of Surrey , Guildford , Surrey , GU2 7XH , United Kingdom
| |
Collapse
|
7
|
Tuccitto N, Capizzi G, Torrisi A, Licciardello A. Unsupervised Analysis of Big ToF-SIMS Data Sets: a Statistical Pattern Recognition Approach. Anal Chem 2018; 90:2860-2866. [DOI: 10.1021/acs.analchem.7b05003] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Affiliation(s)
- Nunzio Tuccitto
- Dipartimento di Scienze Chimiche and ‡Dipartimento di Ingegneria Elettrica, Elettronica
e Informatica, Università di Catania, viale A. Doria, 6 - 95125 Catania, Italy
| | - Giacomo Capizzi
- Dipartimento di Scienze Chimiche and ‡Dipartimento di Ingegneria Elettrica, Elettronica
e Informatica, Università di Catania, viale A. Doria, 6 - 95125 Catania, Italy
| | - Alberto Torrisi
- Dipartimento di Scienze Chimiche and ‡Dipartimento di Ingegneria Elettrica, Elettronica
e Informatica, Università di Catania, viale A. Doria, 6 - 95125 Catania, Italy
| | - Antonino Licciardello
- Dipartimento di Scienze Chimiche and ‡Dipartimento di Ingegneria Elettrica, Elettronica
e Informatica, Università di Catania, viale A. Doria, 6 - 95125 Catania, Italy
| |
Collapse
|
8
|
Trindade GF, Williams DF, Abel ML, Watts JF. Analysis of atmospheric plasma-treated polypropylene by large area ToF-SIMS imaging and NMF. SURF INTERFACE ANAL 2018. [DOI: 10.1002/sia.6378] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Affiliation(s)
- Gustavo F. Trindade
- The Surface Analysis Laboratory; Department of Mechanical Engineering Sciences, University of Surrey; Guildford Surrey GU2 7XH UK
| | - David F. Williams
- The Surface Analysis Laboratory; Department of Mechanical Engineering Sciences, University of Surrey; Guildford Surrey GU2 7XH UK
- TWI Ltd; Granta Park Great Abington; Cambridge CB21 6AL UK
| | - Marie-Laure Abel
- The Surface Analysis Laboratory; Department of Mechanical Engineering Sciences, University of Surrey; Guildford Surrey GU2 7XH UK
| | - John F. Watts
- The Surface Analysis Laboratory; Department of Mechanical Engineering Sciences, University of Surrey; Guildford Surrey GU2 7XH UK
| |
Collapse
|
9
|
Bedia C, Tauler R, Jaumot J. Analysis of multiple mass spectrometry images from different Phaseolus vulgaris samples by multivariate curve resolution. Talanta 2017; 175:557-565. [DOI: 10.1016/j.talanta.2017.07.087] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2017] [Revised: 07/26/2017] [Accepted: 07/28/2017] [Indexed: 10/19/2022]
|