Moens RAR, Migas LG, Van Ardenne JM, Skaar EP, Spraggins JM, Van de Plas R. Preserving full spectrum information in imaging mass spectrometry data reduction.
Bioinformatics 2025;
41:btaf247. [PMID:
40341809 PMCID:
PMC12119130 DOI:
10.1093/bioinformatics/btaf247]
[Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2024] [Revised: 01/06/2025] [Accepted: 05/07/2025] [Indexed: 05/11/2025] Open
Abstract
MOTIVATION
Imaging mass spectrometry (IMS) has become an important tool for molecular characterization of biological tissue. However, IMS experiments tend to yield large datasets, routinely recording over 200 000 ion intensity values per mass spectrum and more than 100 000 pixels, i.e. spectra, per dataset. Traditionally, IMS data size challenges have been addressed by feature selection or extraction, such as by peak picking and peak integration. Selective data reduction techniques such as peak picking only retain certain parts of a mass spectrum, and often these describe only medium-to-high-abundance species. Since lower-intensity peaks and, for example, near-isobar species are sometimes missed, selective methods can potentially bias downstream analysis toward a subset of species in the data rather than considering all species measured.
RESULTS
We present an alternative to selective data reduction of IMS data that achieves similar data size reduction while better conserving the ion intensity profiles across all recorded m/z-bins, thereby preserving full spectrum information. Our method utilizes a low-rank matrix completion model combined with a randomized sparse-format-aware algorithm to approximate IMS datasets. This representation offers reduced dimensionality and a data footprint comparable to peak picking but also captures complete spectral profiles, enabling comprehensive analysis and compression. We demonstrate improved preservation of lower signal-to-noise ratio signals and near-isobars, mitigation of selection bias, and reduced information loss compared to current state-of-the-art data reduction methods in IMS.
AVAILABILITY AND IMPLEMENTATION
The source code is available at https://github.com/vandeplaslab/full_profile and data are available at https://doi.org/10.4121/a6efd47a-b4ec-493e-a742-70e8a369f788.
Collapse