1
|
Fuentes AM, Milligan K, Wiebe M, Narayan A, Lum JJ, Brolo AG, Andrews JL, Jirasek A. Stratification of tumour cell radiation response and metabolic signatures visualization with Raman spectroscopy and explainable convolutional neural network. Analyst 2024; 149:1645-1657. [PMID: 38312026 DOI: 10.1039/d3an01797d] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2024]
Abstract
Reprogramming of cellular metabolism is a driving factor of tumour progression and radiation therapy resistance. Identifying biochemical signatures associated with tumour radioresistance may assist with the development of targeted treatment strategies to improve clinical outcomes. Raman spectroscopy (RS) can monitor post-irradiation biomolecular changes and signatures of radiation response in tumour cells in a label-free manner. Convolutional Neural Networks (CNN) perform feature extraction directly from data in an end-to-end learning manner, with high classification performance. Furthermore, recently developed CNN explainability techniques help visualize the critical discriminative features captured by the model. In this work, a CNN is developed to characterize tumour response to radiotherapy based on its degree of radioresistance. The model was trained to classify Raman spectra of three human tumour cell lines as radiosensitive (LNCaP) or radioresistant (MCF7, H460) over a range of treatment doses and data collection time points. Additionally, a method based on Gradient-Weighted Class Activation Mapping (Grad-CAM) was used to determine response-specific salient Raman peaks influencing the CNN predictions. The CNN effectively classified the cell spectra, with accuracy, sensitivity, specificity, and F1 score exceeding 99.8%. Grad-CAM heatmaps of H460 and MCF7 cell spectra (radioresistant) exhibited high contributions from Raman bands tentatively assigned to glycogen, amino acids, and nucleic acids. Conversely, heatmaps of LNCaP cells (radiosensitive) revealed activations at lipid and phospholipid bands. Finally, Grad-CAM variable importance scores were derived for glycogen, asparagine, and phosphatidylcholine, and we show that their trends over cell line, dose, and acquisition time agreed with previously established models. Thus, the CNN can accurately detect biomolecular differences in the Raman spectra of tumour cells of varying radiosensitivity without requiring manual feature extraction. Finally, Grad-CAM may help identify metabolic signatures associated with the observed categories, offering the potential for automated clinical tumour radiation response characterization.
Collapse
Affiliation(s)
- Alejandra M Fuentes
- Department of Physics, The University of British Columbia Okanagan Campus, Kelowna, Canada.
| | - Kirsty Milligan
- Department of Physics, The University of British Columbia Okanagan Campus, Kelowna, Canada.
| | - Mitchell Wiebe
- Department of Physics, The University of British Columbia Okanagan Campus, Kelowna, Canada.
| | - Apurva Narayan
- Department of Computer Science, Western University, London, Canada
- Department of Computer Science, The University of British Columbia Okanagan Campus, Kelowna, Canada
| | - Julian J Lum
- Department of Biochemistry and Microbiology, The University of Victoria, Victoria, Canada
- Trev and Joyce Deeley Research Centre, BC Cancer, Victoria, Canada
| | - Alexandre G Brolo
- Department of Chemistry, The University of Victoria, Victoria, Canada
| | - Jeffrey L Andrews
- Department of Statistics, The University of British Columbia Okanagan Campus, Kelowna, Canada
| | - Andrew Jirasek
- Department of Physics, The University of British Columbia Okanagan Campus, Kelowna, Canada.
| |
Collapse
|
2
|
Milligan K, Scarrott K, Andrews JL, Brolo AG, Lum JJ, Jirasek A. Reconstruction of Raman Spectra of Biochemical Mixtures Using Group and Basis Restricted Non-Negative Matrix Factorization. APPLIED SPECTROSCOPY 2023:37028231169971. [PMID: 37097829 DOI: 10.1177/00037028231169971] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Raman spectroscopy is a useful tool for obtaining biochemical information from biological samples. However, interpretation of Raman spectroscopy data in order to draw meaningful conclusions related to the biochemical make up of cells and tissues is often difficult and could be misleading if care is not taken in the deconstruction of the spectral data. Our group has previously demonstrated the implementation of a group- and basis-restricted non-negative matrix factorization (GBR-NMF) framework as an alternative to more widely used dimensionality reduction techniques such as principal component analysis (PCA) for the deconstruction of Raman spectroscopy data as related to radiation response monitoring in both cellular and tissue data. While this method provides better biological interpretability of the Raman spectroscopy data, there are some important factors which must be considered in order to provide the most robust GBR-NMF model. We here evaluate and compare the accuracy of a GBR-NMF model in the reconstruction of three mixture solutions of known concentrations. The factors assessed include the effect of solid versus solutions bases spectra, the number of unconstrained components used in the model, the tolerance of different signal to noise thresholds, and how different groups of biochemicals compare to each other. The robustness of the model was assessed by how well the relative concentration of each individual biochemical in the solution mixture is reflected in the GBR-NMF scores obtained. We also evaluated how well the model can reconstruct original data, both with and without the inclusion of an unconstrained component. Overall, we found that solid bases spectra were generally comparable to solution bases spectra in the GBR-NMF model for all groups of biochemicals. The model was found to be relatively tolerant of high levels of noise in the mixture solutions using solid bases spectra. Additionally, the inclusion of an unconstrained component did not have a significant effect on the deconstruction, on the condition that all biochemicals in the mixture were included as bases chemicals in the model. We also report that some groups of biochemicals achieve a more accurate deconstruction using GBR-NMF than others, likely due to similarity in the individual bases spectra.
Collapse
Affiliation(s)
- Kirsty Milligan
- Department of Physics, The University of British Columbia-Okanagan, Kelowna, BC, Canada
| | - Kendra Scarrott
- Southern Medical Program, Faculty of Medicine, The University of British Columbia-Okanagan, Kelowna, BC, Canada
| | - Jeffrey L Andrews
- Department of Statistics, The University of British Columbia-Okanagan, Kelowna, BC, Canada
| | - Alexandre G Brolo
- Department of Chemistry, University of Victoria, Victoria, BC, Canada
| | - Julian J Lum
- Department of Biochemistry and Microbiology, University of Victoria, Victoria, BC, Canada
| | - Andrew Jirasek
- Department of Physics, The University of British Columbia-Okanagan, Kelowna, BC, Canada
| |
Collapse
|
3
|
Fuentes AM, Narayan A, Milligan K, Lum JJ, Brolo AG, Andrews JL, Jirasek A. Raman spectroscopy and convolutional neural networks for monitoring biochemical radiation response in breast tumour xenografts. Sci Rep 2023; 13:1530. [PMID: 36707535 PMCID: PMC9883395 DOI: 10.1038/s41598-023-28479-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2022] [Accepted: 01/19/2023] [Indexed: 01/29/2023] Open
Abstract
Tumour cells exhibit altered metabolic pathways that lead to radiation resistance and disease progression. Raman spectroscopy (RS) is a label-free optical modality that can monitor post-irradiation biomolecular signatures in tumour cells and tissues. Convolutional Neural Networks (CNN) perform automated feature extraction directly from data, with classification accuracy exceeding that of traditional machine learning, in cases where data is abundant and feature extraction is challenging. We are interested in developing a CNN-based predictive model to characterize clinical tumour response to radiation therapy based on their degree of radiosensitivity or radioresistance. In this work, a CNN architecture is built for identifying post-irradiation spectral changes in Raman spectra of tumour tissue. The model was trained to classify irradiated versus non-irradiated tissue using Raman spectra of breast tumour xenografts. The CNN effectively classified the tissue spectra, with accuracies exceeding 92.1% for data collected 3 days post-irradiation, and 85.0% at day 1 post-irradiation. Furthermore, the CNN was evaluated using a leave-one-out- (mouse, section or Raman map) validation approach to investigate its generalization to new test subjects. The CNN retained good predictive accuracy (average accuracies 83.7%, 91.4%, and 92.7%, respectively) when little to no information for a specific subject was given during training. Finally, the classification performance of the CNN was compared to that of a previously developed model based on group and basis restricted non-negative matrix factorization and random forest (GBR-NMF-RF) classification. We found that CNN yielded higher classification accuracy, sensitivity, and specificity in mice assessed 3 days post-irradiation, as compared with the GBR-NMF-RF approach. Overall, the CNN can detect biochemical spectral changes in tumour tissue at an early time point following irradiation, without the need for previous manual feature extraction. This study lays the foundation for developing a predictive framework for patient radiation response monitoring.
Collapse
Affiliation(s)
- Alejandra M Fuentes
- Department of Physics, The University of British Columbia Okanagan Campus, Kelowna, Canada
| | - Apurva Narayan
- Department of Computer Science, Western University, London, Canada.,Department of Computer Science, The University of British Columbia Okanagan Campus, Kelowna, Canada
| | - Kirsty Milligan
- Department of Physics, The University of British Columbia Okanagan Campus, Kelowna, Canada
| | - Julian J Lum
- Department of Biochemistry and Microbiology, The University of Victoria, Victoria, Canada
| | - Alex G Brolo
- Department of Chemistry, The University of Victoria, Victoria, Canada
| | - Jeffrey L Andrews
- Department of Statistics, The University of British Columbia Okanagan Campus, Kelowna, Canada
| | - Andrew Jirasek
- Department of Physics, The University of British Columbia Okanagan Campus, Kelowna, Canada.
| |
Collapse
|
4
|
Ikponmwoba E, Ukorigho O, Moitra P, Pan D, Gartia MR, Owoyele O. A Machine Learning Framework for Detecting COVID-19 Infection Using Surface-Enhanced Raman Scattering. BIOSENSORS 2022; 12:bios12080589. [PMID: 36004985 PMCID: PMC9405612 DOI: 10.3390/bios12080589] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/29/2022] [Revised: 07/27/2022] [Accepted: 07/28/2022] [Indexed: 05/04/2023]
Abstract
In this study, we explored machine learning approaches for predictive diagnosis using surface-enhanced Raman scattering (SERS), applied to the detection of COVID-19 infection in biological samples. To do this, we utilized SERS data collected from 20 patients at the University of Maryland Baltimore School of Medicine. As a preprocessing step, the positive-negative labels are obtained using Polymerase Chain Reaction (PCR) testing. First, we compared the performance of linear and nonlinear dimensionality techniques for projecting the high-dimensional Raman spectra to a low-dimensional space where a smaller number of variables defines each sample. The appropriate number of reduced features used was obtained by comparing the mean accuracy from a 10-fold cross-validation. Finally, we employed Gaussian process (GP) classification, a probabilistic machine learning approach, to correctly predict the occurrence of a negative or positive sample as a function of the low-dimensional space variables. As opposed to providing rigid class labels, the GP classifier provides a probability (ranging from zero to one) that a given sample is positive or negative. In practice, the proposed framework can be used to provide high-throughput rapid testing, and a follow-up PCR can be used for confirmation in cases where the model's uncertainty is unacceptably high.
Collapse
Affiliation(s)
- Eloghosa Ikponmwoba
- Department of Mechanical and Industrial Engineering, Louisiana State University, Baton Rouge, LA 70803, USA; (E.I.); (O.U.)
| | - Okezzi Ukorigho
- Department of Mechanical and Industrial Engineering, Louisiana State University, Baton Rouge, LA 70803, USA; (E.I.); (O.U.)
| | - Parikshit Moitra
- Department of Pediatrics, Center for Blood Oxygen Transport and Hemostasis, University of Maryland Baltimore School of Medicine, Baltimore, MD 21201, USA; (P.M.); (D.P.)
- Department of Nuclear Engineering, The Pennsylvania State University, University Park, PA 16802, USA
| | - Dipanjan Pan
- Department of Pediatrics, Center for Blood Oxygen Transport and Hemostasis, University of Maryland Baltimore School of Medicine, Baltimore, MD 21201, USA; (P.M.); (D.P.)
- Department of Nuclear Engineering, The Pennsylvania State University, University Park, PA 16802, USA
| | - Manas Ranjan Gartia
- Department of Mechanical and Industrial Engineering, Louisiana State University, Baton Rouge, LA 70803, USA; (E.I.); (O.U.)
- Correspondence: (M.R.G.); (O.O.)
| | - Opeoluwa Owoyele
- Department of Mechanical and Industrial Engineering, Louisiana State University, Baton Rouge, LA 70803, USA; (E.I.); (O.U.)
- Correspondence: (M.R.G.); (O.O.)
| |
Collapse
|
5
|
Flores E, Ouyang J, Lapointe F, Finnie P. Nanotube abundance from non-negative matrix factorization of Raman spectra as an example of chemical purity from open source machine learning. Sci Rep 2022; 12:11666. [PMID: 35803993 PMCID: PMC9270454 DOI: 10.1038/s41598-022-15359-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Accepted: 06/17/2022] [Indexed: 11/09/2022] Open
Abstract
The chemical purity of materials is important for semiconductors, including the carbon nanotube material system, which is emerging in semiconductor applications. One approach to get statistically meaningful abundances and/or concentrations is to measure a large number of small samples. Automated multivariate classification algorithms can be used to draw conclusions from such large data sets. Here, we use spatially-mapped Raman spectra of mixtures of chirality-sorted single walled carbon nanotubes dispersed sparsely on flat silicon/silicon oxide substrates. We use non-negative matrix factorization (NMF) decomposition in scikit-learn, an open-source, python language “machine learning” package, to extract spectral components and derive weighting factors. We extract the abundance of minority species (7,5) nanotubes in mixtures by testing both synthetic data, and real samples prepared by dilution. We show how noise limits the purity level that can be evaluated. We determine real situations where this approach works well, and identify situations where it fails.
Collapse
Affiliation(s)
- Elijah Flores
- National Research Council Canada, 1200 Montreal Road, Ottawa, ON, K1A 0R6, Canada.,University of Waterloo, 200 University Avenue West, Waterloo, ON, N2L 3G1, Canada
| | - Jianying Ouyang
- National Research Council Canada, 1200 Montreal Road, Ottawa, ON, K1A 0R6, Canada
| | - François Lapointe
- National Research Council Canada, 1200 Montreal Road, Ottawa, ON, K1A 0R6, Canada
| | - Paul Finnie
- National Research Council Canada, 1200 Montreal Road, Ottawa, ON, K1A 0R6, Canada.
| |
Collapse
|
6
|
Melitto AS, Arias VEA, Shida JY, Gebrim LH, Silveira L. Diagnosing molecular subtypes of breast cancer by means of Raman spectroscopy. Lasers Surg Med Suppl 2022; 54:1143-1156. [PMID: 35789102 DOI: 10.1002/lsm.23580] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2021] [Revised: 06/13/2022] [Accepted: 06/17/2022] [Indexed: 12/24/2022]
Abstract
OBJECTIVES Raman spectroscopy has been used to discriminate human breast cancer and its different tumor molecular subtypes (luminal A, luminal B, HER2, and triple-negative) from normal tissue in surgical specimens. MATERIALS AND METHODS Breast cancer and normal tissue samples from 31 patients were obtained by surgical resection and submitted for histopathology. Before anatomopathological processing, the samples had been submitted to Raman spectroscopy (830 nm, 25 mW excitation laser parameters). In total, 424 Raman spectra were obtained. Principal component analysis (PCA) was used in an exploratory analysis to unveil the compositional differences between the tumors and normal tissues. Discriminant models were developed to distinguish the different cancer subtypes by means of partial least squares (PLS) regression. RESULTS PCA vectors showed spectral features referred to the biochemical constitution of breast tissues, such as lipids, proteins, amino acids, and carotenoids, where lipids were decreased and proteins were increased in breast tumors. Despite the small spectral differences between the different subtypes of tumor and normal tissues, the discriminant model based on PLS was able to discriminate the spectra of the breast tumors from normal tissues with an accuracy of 97.3%, between luminal and nonluminal subtypes with an accuracy of 89.9%, between nontriple-negative and triple-negative with an accuracy of 94.7%, and each molecular subtype with an accuracy of 73.0%. CONCLUSION PCA could reveal the compositional difference between tumors and normal tissues, and PLS could discriminate the Raman spectra of breast tissues regarding the molecular subtypes of cancer, being a useful tool for cancer diagnosis.
Collapse
Affiliation(s)
| | - Victor E A Arias
- Biomedical Engineering Program, Universidade Anhembi Morumbi-UAM, São Paulo, SP, Brazil
| | - Jorge Y Shida
- Biomedical Engineering Program, Universidade Anhembi Morumbi-UAM, São Paulo, SP, Brazil
| | - Luiz H Gebrim
- Biomedical Engineering Program, Universidade Anhembi Morumbi-UAM, São Paulo, SP, Brazil
| | - Landulfo Silveira
- Mastology Department, CRSM-Hospital Pérola Byington, São Paulo, SP, Brazil.,Biomedical Engineering Institute, Center for Innovation, Technology and Education-CITÉ, São José dos Camp, SP, Brazil
| |
Collapse
|