1
|
Zhang X, Yang F, Xiao J, Qu H, Jocelin NF, Ren L, Guo Y. Analysis and comparison of machine learning methods for species identification utilizing ATR-FTIR spectroscopy. SPECTROCHIMICA ACTA. PART A, MOLECULAR AND BIOMOLECULAR SPECTROSCOPY 2024; 308:123713. [PMID: 38056185 DOI: 10.1016/j.saa.2023.123713] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 10/26/2023] [Accepted: 11/28/2023] [Indexed: 12/08/2023]
Abstract
Accurate identification of insect species holds paramount significance in diverse fields as it facilitates a comprehensive understanding of their ecological habits, distribution range, and impact on both the environment and humans. While morphological characteristics have traditionally been employed for species identification, the utilization of empty pupariums for this purpose remains relatively limited. In this study, ATR-FTIR was employed to acquire spectral information from empty pupariums of five fly species, subjecting the data to spectral pre-processing to obtain average spectra for preliminary analysis. Subsequently, PCA and OPLS-DA were utilized for clustering and classification. Notably, two wavebands (3000-2800 cm-1 and 1800-1300 cm-1) were found to be significant in distinguishing A. grahami. Further, we established three machine learning models, including SVM, KNN, and RF, to analyze spectra from different waveband groups. The biological fingerprint region (1800-1300 cm-1) demonstrated a substantial advantage in identifying empty puparium species. Remarkably, the SVM model exhibited an impressive accuracy of 100 % in identifying all five fly species. This study represents the first instance of employing infrared spectroscopy and machine learning methods for identifying insect species using empty pupariums, providing a robust research foundation for future investigations in this area.
Collapse
Affiliation(s)
- Xiangyan Zhang
- Department of Forensic Science, School of Basic Medical Sciences, Central South University, Changsha 410013, Hunan, China
| | - Fengqin Yang
- Department of Forensic Science, School of Basic Medical Sciences, Central South University, Changsha 410013, Hunan, China
| | - Jiao Xiao
- Department of Forensic Science, School of Basic Medical Sciences, Central South University, Changsha 410013, Hunan, China
| | - Hongke Qu
- Key Laboratory of Carcinogenesis and Cancer Invasion of the Chinese Ministry of Education, Cancer Research Institute and School of Basic Medicine Sciences, Central South University, Changsha, Hunan, China
| | - Ngando Fernand Jocelin
- Department of Forensic Science, School of Basic Medical Sciences, Central South University, Changsha 410013, Hunan, China
| | - Lipin Ren
- Department of Forensic Science, School of Basic Medical Sciences, Central South University, Changsha 410013, Hunan, China.
| | - Yadong Guo
- Department of Forensic Science, School of Basic Medical Sciences, Central South University, Changsha 410013, Hunan, China.
| |
Collapse
|
2
|
Pacher G, Franca T, Lacerda M, Alves NO, Piranda EM, Arruda C, Cena C. Diagnosis of Cutaneous Leishmaniasis Using FTIR Spectroscopy and Machine Learning: An Animal Model Study. ACS Infect Dis 2024; 10:467-474. [PMID: 38189234 DOI: 10.1021/acsinfecdis.3c00430] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2024]
Abstract
Cutaneous leishmaniasis (CL) is a polymorphic and spectral skin disease caused by Leishmania spp. protozoan parasites. CL is difficult to diagnose because conventional methods are time-consuming, expensive, and low-sensitive. Fourier transform infrared spectroscopy (FTIR) with machine learning (ML) algorithms has been explored as an alternative to achieve fast and accurate results for many disease diagnoses. Besides the high accuracy demonstrated in numerous studies, the spectral variations between infected and noninfected groups are too subtle to be noticed. Since variability in sample set characteristics (such as sex, age, and diet) often leads to significant data variance and limits the comprehensive understanding of spectral characteristics and immune responses, we investigate a novel methodology for diagnosing CL in an animal model study. Blood serum, skin lesions, and draining popliteal lymph node samples were collected from Leishmania (Leishmania) amazonensis-infected BALB/C mice under experimental conditions. The FTIR method and ML algorithms accurately differentiated between infected (CL group) and noninfected (control group) samples. The best overall accuracy (∼72%) was obtained in an external validation test using principal component analysis and support vector machine algorithms in the 1800-700 cm-1 range for blood serum samples. The accuracy achieved in analyzing skin lesions and popliteal lymph node samples was satisfactory; however, notable disparities emerged in the validation tests compared to results obtained from blood samples. This discrepancy is likely attributed to the elevated sample variability resulting from molecular compositional differences. According to the findings, the successful functioning of prediction models is mainly related to data analysis rather than the differences in the molecular composition of the samples.
Collapse
Affiliation(s)
- Gabriela Pacher
- Laboratório de Parasitologia Humana, Instituto de Biociências, Universidade Federal de Mato Grosso do Sul, Campo Grande, Mato Grosso do Sul 79070-900, Brazil
| | - Thiago Franca
- Laboratório de Óptica e Fotônica (SISFOTON-UFMS), Instituto de Física, Universidade Federal de Mato Grosso do Sul, Campo Grande, Mato Grosso do Sul 79070-900, Brazil
| | - Miller Lacerda
- Laboratório de Óptica e Fotônica (SISFOTON-UFMS), Instituto de Física, Universidade Federal de Mato Grosso do Sul, Campo Grande, Mato Grosso do Sul 79070-900, Brazil
| | - Natália O Alves
- Laboratório de Parasitologia Humana, Instituto de Biociências, Universidade Federal de Mato Grosso do Sul, Campo Grande, Mato Grosso do Sul 79070-900, Brazil
| | - Eliane M Piranda
- Laboratório de Parasitologia Humana, Instituto de Biociências, Universidade Federal de Mato Grosso do Sul, Campo Grande, Mato Grosso do Sul 79070-900, Brazil
| | - Carla Arruda
- Laboratório de Parasitologia Humana, Instituto de Biociências, Universidade Federal de Mato Grosso do Sul, Campo Grande, Mato Grosso do Sul 79070-900, Brazil
| | - Cícero Cena
- Laboratório de Óptica e Fotônica (SISFOTON-UFMS), Instituto de Física, Universidade Federal de Mato Grosso do Sul, Campo Grande, Mato Grosso do Sul 79070-900, Brazil
| |
Collapse
|
3
|
Marangoni-Ghoreyshi YG, Franca T, Esteves J, Maranni A, Pereira Portes KD, Cena C, Leal CRB. Multi-resistant diarrheagenic Escherichia coli identified by FTIR and machine learning: a feasible strategy to improve the group classification. RSC Adv 2023; 13:24909-24917. [PMID: 37608796 PMCID: PMC10440836 DOI: 10.1039/d3ra03518b] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Accepted: 08/14/2023] [Indexed: 08/24/2023] Open
Abstract
The identification of multidrug-resistant strains from E. coli species responsible for diarrhea in calves still faces many laboratory limitations and is necessary for adequately monitoring the microorganism spread and control. Then, there is a need to develop a screening tool for bacterial strain identification in microbiology laboratories, which must show easy implementation, fast response, and accurate results. The use of FTIR spectroscopy to identify microorganisms has been successfully demonstrated in the literature, including many bacterial strains; here, we explored the FTIR potential for multi-resistant E. coli identification. First, we applied principal component analysis to observe the group formation tendency; the first results showed no clustering tendency with a messy sample score distribution; then, we improved these results by adequately selecting the main principal components which most contribute to group separation. Finally, using machine learning algorithms, a predicting model showed 75% overall accuracy, demonstrating the method's viability as a screaming test for microorganism identification.
Collapse
Affiliation(s)
| | - Thiago Franca
- UFMS - Universidade Federal de Mato Grosso do Sul, Optics and Photonic Lab (SISFOTON-UFMS) Campo Grande MS Brazil
| | - José Esteves
- UFMS - Universidade Federal de Mato Grosso do Sul, Optics and Photonic Lab (SISFOTON-UFMS) Campo Grande MS Brazil
| | - Ana Maranni
- UFMS - Universidade Federal de Mato Grosso do Sul, Optics and Photonic Lab (SISFOTON-UFMS) Campo Grande MS Brazil
| | | | - Cicero Cena
- UFMS - Universidade Federal de Mato Grosso do Sul, Optics and Photonic Lab (SISFOTON-UFMS) Campo Grande MS Brazil
| | - Cassia R B Leal
- UFMS - Universidade Federal de Mato Grosso do Sul, Graduate Program in Veterinary Science (CIVET) Campo Grande MS Brazil
| |
Collapse
|
4
|
Coelho ML, França T, Fontoura Mateus NL, da Costa Lima Junior MS, Cena C, do Nascimento Ramos CA. Canine visceral leishmaniasis diagnosis by UV spectroscopy of blood serum and machine learning algorithms. Photodiagnosis Photodyn Ther 2023; 42:103575. [PMID: 37080349 DOI: 10.1016/j.pdpdt.2023.103575] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2023] [Revised: 04/14/2023] [Accepted: 04/17/2023] [Indexed: 04/22/2023]
Abstract
Visceral leishmaniasis (VL) is a zoonotic disease caused by the protozoan Leishmania infantum, and dogs are considered the main urban hosts for future disease transmission. The first and most effective control against the spread of disease relies on identifying infected animals, followed by their treatment or sacrifice, to reduce the protozoan reservoirs. Despite the availability of various diagnostic tests for VL in dogs the development of a quick and accurate diagnosis is essential from a public health and ethical point of view. Here we analyze the use of UV-Vis spectroscopy as an alternative diagnostic method for VL diagnosis by using the antigen-antibody interaction in canine blood serum and machine learning algorithms. The main UV spectra in the 220 to 280 nm range exhibit nine electronic absorption bands, but no significative difference could be identified between the positive and negative group spectra. Finally, UV pre-proceed spectra by SNV (standard normal variate) were submitted to principal component analysis followed by Linear SVM algorithm, the prediction model was tested in a leave-one-out cross-validation and external validation test reaching an overall accuracy of 75%.
Collapse
Affiliation(s)
- Mateus Lotério Coelho
- Faculty of Veterinary Medicine and Animal Husbandry (FAMEZ), Universidade Federal de Mato Grosso do Sul - UFMS, 79070-900, Campo Grande, Brazil
| | - Thiago França
- SISFOTON-UFMS, Optics and Photonics Group, Institute of Physics, Universidade Federal de Mato Grosso do Sul - UFMS, 79070-900, Campo Grande, Brazil
| | - Nathália Lopes Fontoura Mateus
- Faculty of Veterinary Medicine and Animal Husbandry (FAMEZ), Universidade Federal de Mato Grosso do Sul - UFMS, 79070-900, Campo Grande, Brazil
| | | | - Cicero Cena
- SISFOTON-UFMS, Optics and Photonics Group, Institute of Physics, Universidade Federal de Mato Grosso do Sul - UFMS, 79070-900, Campo Grande, Brazil.
| | - Carlos Alberto do Nascimento Ramos
- Faculty of Veterinary Medicine and Animal Husbandry (FAMEZ), Universidade Federal de Mato Grosso do Sul - UFMS, 79070-900, Campo Grande, Brazil.
| |
Collapse
|
5
|
de Brito EC, Franca T, Canassa T, Weber SS, Paniago AM, Cena C. Paracoccidioidomycosis screening diagnosis by FTIR spectroscopy and multivariate analysis. Photodiagnosis Photodyn Ther 2022; 39:102921. [DOI: 10.1016/j.pdpdt.2022.102921] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2022] [Revised: 05/17/2022] [Accepted: 05/18/2022] [Indexed: 12/13/2022]
|
6
|
Gomes Rios T, Larios G, Marangoni B, Oliveira SL, Cena C, Alberto do Nascimento Ramos C. FTIR spectroscopy with machine learning: A new approach to animal DNA polymorphism screening. SPECTROCHIMICA ACTA. PART A, MOLECULAR AND BIOMOLECULAR SPECTROSCOPY 2021; 261:120036. [PMID: 34116415 DOI: 10.1016/j.saa.2021.120036] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/15/2020] [Revised: 05/26/2021] [Accepted: 05/28/2021] [Indexed: 06/12/2023]
Abstract
Technological advances in recent decades, especially in molecular genetics, have enabled the detection of genetic DNA markers associated with productive characteristics in animals. However, the prospection of polymorphisms based on DNA sequencing is still expensive for the reality of many food-producing regions around the world, such as Brazil, demanding more accessible prospecting methods. In the present study, the Fourier transform infrared spectroscopy (FTIR) and machine learning algorithms were used to identify single nucleotide polymorphism (SNP) in animal DNA. The fragments of bovine DNA with well-known polymorphisms were used as a model. The DNA fragments were produced and genotyped by PCR-RFLP and classified according to the genotype (homozygous or heterozygous). FTIR spectra of DNA fragments were analyzed by principal component analysis (PCA) and machine learning algorithms. The best results exhibited 75-95% accuracy in the classification of bovine genotypes. Therefore, FTIR spectroscopy and multivariate analysis can be used as an alternative tool for prospecting polymorphisms in animal DNA. The method can contribute with studies to identify genetic markers associated with animal production and indirectly with food production itself, and reduce pressure on available natural resources.
Collapse
Affiliation(s)
- Thaynádia Gomes Rios
- Faculdade de Medicina Veterinária e Zootecnia, Universidade Federal de Mato Grosso do Sul, 79070-900 Campo Grande, MS, Brazil
| | - Gustavo Larios
- Grupo de Óptica e Fotônica, Instituto de Física, Universidade Federal de Mato Grosso do Sul, 79070-900 Campo Grande, MS, Brazil
| | - Bruno Marangoni
- Grupo de Óptica e Fotônica, Instituto de Física, Universidade Federal de Mato Grosso do Sul, 79070-900 Campo Grande, MS, Brazil
| | - Samuel L Oliveira
- Grupo de Óptica e Fotônica, Instituto de Física, Universidade Federal de Mato Grosso do Sul, 79070-900 Campo Grande, MS, Brazil
| | - Cícero Cena
- Grupo de Óptica e Fotônica, Instituto de Física, Universidade Federal de Mato Grosso do Sul, 79070-900 Campo Grande, MS, Brazil
| | | |
Collapse
|