1
|
Vrtělka O, Králová K, Fousková M, Setnička V. Comprehensive assessment of the role of spectral data pre-processing in spectroscopy-based liquid biopsy. SPECTROCHIMICA ACTA. PART A, MOLECULAR AND BIOMOLECULAR SPECTROSCOPY 2025; 339:126261. [PMID: 40273765 DOI: 10.1016/j.saa.2025.126261] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/03/2025] [Revised: 04/05/2025] [Accepted: 04/16/2025] [Indexed: 04/26/2025]
Abstract
Spectroscopic data often contain artifacts or noise related to the sample characteristics, instrumental variations, or experimental design flaws. Therefore, classifying the raw data is not recommended and might lead to biased results. Nevertheless, most issues may be addressed through appropriate data pre-processing. Effective pre-processing is particularly crucial in critical applications like liquid biopsy for disease detection, where even minor performance improvements may impact patient outcomes. Unfortunately, there is no consensus regarding optimal pre-processing, complicating cross-study comparisons. This study presents a comprehensive evaluation of various pre-processing methods and their combinations to assess their influence on classification results. The goal was to identify whether some pre-processing methods are associated with higher classification outcomes and find an optimal strategy for the given data. Data from Raman optical activity and infrared and Raman spectroscopy were processed, applying tens of thousands of possible pre-processing pipelines. The resulting data were classified using three algorithms to distinguish between subjects with liver cirrhosis and those who had developed hepatocellular carcinoma. Results highlighted that some specific pre-processing methods often ranked among the best classification results, such as the Rolling Ball for correcting the baseline of Raman spectra or the Doubly Reweighted Penalized Least Squares and Mixture model in the case of Raman optical activity. On the other hand, the selection of filtering and/or normalization approach usually did not have a significant impact. Nonetheless, the pre-processing of top-scoring pipelines also depended on the classifier utilized. The best pipelines yielded an AUROC of 0.775-0.823, varying with the evaluated spectroscopic data and classifier.
Collapse
Affiliation(s)
- Ondřej Vrtělka
- Department of Analytical Chemistry, Faculty of Chemical Engineering, University of Chemistry and Technology, Prague, Technická 5, 166 28 Prague 6, Czech Republic.
| | - Kateřina Králová
- Department of Analytical Chemistry, Faculty of Chemical Engineering, University of Chemistry and Technology, Prague, Technická 5, 166 28 Prague 6, Czech Republic
| | - Markéta Fousková
- Department of Analytical Chemistry, Faculty of Chemical Engineering, University of Chemistry and Technology, Prague, Technická 5, 166 28 Prague 6, Czech Republic
| | - Vladimír Setnička
- Department of Analytical Chemistry, Faculty of Chemical Engineering, University of Chemistry and Technology, Prague, Technická 5, 166 28 Prague 6, Czech Republic.
| |
Collapse
|
2
|
Alcaraz MR, Montemurro M, Pisano PL, Lombardi JM, Bortolato SA. Functional data analysis, a comprehensive framework for processing non-quadrilinear and low-selective data provided by four-way liquid chromatography analysis. Anal Chim Acta 2025; 1356:344044. [PMID: 40288877 DOI: 10.1016/j.aca.2025.344044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2024] [Revised: 04/03/2025] [Accepted: 04/08/2025] [Indexed: 04/29/2025]
Abstract
The chemometric treatment of higher-order chromatographic (LC) data often crashes due to two main effects: the chromatographic band shifts/warpings across samples and quasi-full overlaps between component signals, affecting their analytical selectivities. From a chemometric point of view, these phenomena have independent effects, although they can jointly contribute to the failure of the algorithms: 1) data multilinearity breaking, leading to the poor performance of multilinear decomposition algorithms, and 2) linear dependence between the analytes signals, causing the failure of folded models. Under this scenario, making chemometric processing feasible involves defining specific experimental conditions that minimize these effects or increasing the number of instrumental ways to deal with selectivity lost. This work presents the Functional Aligned of Pure Vectors (FAPV) algorithm for restoring four-way chromatographic data multilinearity and bearing the spectral overlap trouble. Simulated and experimental four-way data were used to test the FAPV analytical efficiency, covering a wide range of chromatographic artifacts. Based on a multi-injection procedure, the experimental case implied the chromatographic determination of two analytes with uncalibrated interferents in aqueous samples. Both data systems were subjected to FAPV and then processed by PARAFAC. Therefore, a comprehensive comparison was made with the most widely used chemometric models for non-multilinear chromatographic data (MCR-ALS and PARAFAC2). Moreover, the performance of the FAPV approach was compared with commonly used alignment procedures, e.g., correlation-optimized warping. The results (c.a. REPs of 10 % in both analytes from the experimental case) show the efficiency of the FAPV algorithm in solving the troubles observed in chromatographic/spectral data.
Collapse
Affiliation(s)
- Mirta R Alcaraz
- Laboratorio de Desarrollo Analítico y Quimiometría (LADAQ), Cátedra de Química Analítica I, Facultad de Bioquímica y Ciencias Biológicas, Universidad Nacional del Litoral, Ciudad Universitaria, Santa Fe, S3000ZAA, Argentina; Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Godoy Cruz 2290, CABA, C1425FQB, Argentina
| | - Milagros Montemurro
- Laboratorio de Desarrollo Analítico y Quimiometría (LADAQ), Cátedra de Química Analítica I, Facultad de Bioquímica y Ciencias Biológicas, Universidad Nacional del Litoral, Ciudad Universitaria, Santa Fe, S3000ZAA, Argentina; Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Godoy Cruz 2290, CABA, C1425FQB, Argentina
| | - Pablo L Pisano
- Departamentos de Química Analítica y Química Orgánica, Facultad de Ciencias Bioquímicas y Farmacéuticas-Universidad Nacional de Rosario, Suipacha 570 Rosario, Santa Fe, S2002LRK, Argentina; Investigación y Desarrollo de Estrategias Analíticas Sostenibles (IDEAS), Instituto de Química Rosario (IQUIR-CONICET), Rosario, Santa Fe, S2002LRK, Argentina
| | - Juan M Lombardi
- Instituto de Física Rosario (IFIR-CONICET-UNR), Ocampo y Esmeralda, Rosario, Santa Fe, S2002LRK, Argentina; Fritz-Haber-Institut der Max-Planck-Gesellschaft (FHI), Faradayweg 4-6, 14195, Berlin, Germany
| | - Santiago A Bortolato
- Departamentos de Química Analítica y Matemática-Estadística, Facultad de Ciencias Bioquímicas y Farmacéuticas-Universidad Nacional de Rosario, Suipacha 570 Rosario, Santa Fe, S2002LRK, Argentina; Investigación y Desarrollo de Estrategias Analíticas Sostenibles (IDEAS), Instituto de Química Rosario (IQUIR-CONICET), Rosario, Santa Fe, S2002LRK, Argentina.
| |
Collapse
|
3
|
Shi X, Wang X, Zhang Y, Zhang X, Yu M, Zhang L. Innovative novel regularized memory graph attention capsule network for financial fraud detection. PLoS One 2025; 20:e0317893. [PMID: 40435125 PMCID: PMC12118919 DOI: 10.1371/journal.pone.0317893] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2024] [Accepted: 12/01/2024] [Indexed: 06/01/2025] Open
Abstract
Financial fraud detection (FFD) is crucial for ensuring the safety and efficiency of financial transactions. This article presents the Regularised Memory Graph Attention Capsule Network (RMGACNet), an original architecture aiming at improving fraud detection using Bidirectional Long Short-Term Memory (BiLSTM) networks combined with advanced feature extraction and classification algorithms. The model is tested on two reliable datasets: the European Cardholder (ECH) transactions dataset, which contains 284,807 transactions and 492 fraud instances, and the IEEE-CIS dataset, which has more than 1 million transactions. Our approach enhances comparison to existing methods of feature selection and classification accuracy. On the ECH dataset, RMGACNet achieves an accuracy of 0.9772, a precision of 0.9768, and an F1 score of 0.9770 measures; on the IEEE-CIS dataset, it achieves an accuracy of 0.9882, a precision of 0.9876 and an F1 score of 0.9879. The findings indicate that RMGACNet routinely surpasses existing models' efficiency and accuracy while ensuring strong execution time performance, especially when handling large-scale datasets. The suggested model demonstrates scalability and stability, making it suitable for real-time financial systems.
Collapse
Affiliation(s)
- Xiangting Shi
- Industrial Engineering and Operations Research Department, Columbia University, New York, New York, United States of America
| | - Xiaochen Wang
- Department of Finance, The London School of Economics and Political Science, London, United Kingdom
| | - Yakang Zhang
- Industrial Engineering and Operations Research Department, Columbia University, New York, New York, United States of America
| | - Xiaoyi Zhang
- College of Liberal Arts and Science, University of Illinois Urbana-Champaign, Illinois, United States of America
| | - Manning Yu
- Department of Statistics, Columbia University, New York, New York, United States of America
| | - Lihao Zhang
- Department of Information Engineering, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong
| |
Collapse
|
4
|
Steuer C, Raeber J. A story of falsification and authentication: Authenticity control of phytopharmaceuticals and herbal remedies. Eur J Pharm Sci 2025; 211:107136. [PMID: 40409514 DOI: 10.1016/j.ejps.2025.107136] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2025] [Revised: 05/15/2025] [Accepted: 05/20/2025] [Indexed: 05/25/2025]
Abstract
Natural remedies and phytopharmaceuticals are gaining popularity among both consumers and patients who seek a healthier and more environmentally friendly lifestyle. Healthcare professionals are also increasingly incorporating phytopharmaceuticals into therapeutic practices, recognizing their potential benefits for overall health and well-being. This trend is further driven by a growing distrust of synthetic compounds and dissatisfaction with conventional medical treatments, prompting many patients to explore natural alternatives. As a result, the demand for phytopharmaceuticals, essential oils, and nutraceuticals continues to rise steadily. However, falsified natural remedies can pose serious risks to human health, making quality assurance a critical priority. Sophisticated and reliable analytical strategies are essential to ensure product authenticity and to protect consumers and patients from potentially harmful adulteration. This review provides an overview of documented cases of adulteration and the corresponding countermeasures developed to detect such incidents. Commonly used analytical techniques for authenticity verification of phytopharmaceuticals, essential oils, and other natural remedies are presented. The review discusses the application of spectroscopic methods, mass spectrometry, chromatographic separation techniques, and DNA-based profiling, with reference to current scientific literature. In addition, selected chemometric tools are introduced to illustrate how meaningful information can be extracted from complex analytical data. The potential of multidimensional data analysis, combining complementary insights from various analytical platforms, is also explored. Finally, key considerations for generating and analysing reliable data are highlighted.
Collapse
Affiliation(s)
- Christian Steuer
- Institute of Pharmaceutical Sciences, Department of Chemistry and Applied Biosciences, ETH Zurich, Switzerland.
| | - Justine Raeber
- Institute of Pharmaceutical Sciences, Department of Chemistry and Applied Biosciences, ETH Zurich, Switzerland
| |
Collapse
|
5
|
Song S, Wu X, Li M, Wu B. Identifying the Geographical Origin of Wolfberry Using Near-Infrared Spectroscopy and Stacking-Orthogonal Linear Discriminant Analysis. Foods 2025; 14:1684. [PMID: 40428464 PMCID: PMC12111272 DOI: 10.3390/foods14101684] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2025] [Revised: 05/02/2025] [Accepted: 05/07/2025] [Indexed: 05/29/2025] Open
Abstract
The geographical origin identification of wolfberry is key to ensuring its medicinal and edible quality. To accurately identify the geographical origin, the Stacking-Orthogonal Linear Discriminant Analysis (OLDA) algorithm was proposed by combining OLDA with the Stacking ensemble learning framework. In this study, Savitzky-Golay (SG) + Multiplicative Scatter Correction (MSC) served as the optimal preprocessing method. Four classifiers-K-Nearest Neighbors (KNN), Decision Tree, Support Vector Machine (SVM), and Naive Bayes-were used to explore 12 stacked combinations on 400 samples from five regions in Gansu: Zhangye, Yumen, Wuwei, Baiyin, and Dunhuang. When Principal Component Analysis (PCA), PCA + Linear Discriminant Analysis (LDA), and OLDA were used for feature extraction, Stacking-OLDA achieved the highest average identification accuracy of 99%. The overall accuracy of stacked combinations was generally higher than that of single-classifier models. This study also assessed the role of different classifiers in different combinations, finding that Stacking-OLDA combined with KNN as the meta-classifier achieved the highest accuracy. Experimental results demonstrate that Stacking-OLDA has excellent classification performance, providing an effective approach for the accurate classification of wolfberry origins and offering an innovative solution for quality control in the food industry.
Collapse
Affiliation(s)
- Shijie Song
- Mengxi Honors College, Jiangsu University, Zhenjiang 212013, China; (S.S.); (M.L.)
| | - Xiaohong Wu
- School of Electrical and Information Engineering, Jiangsu University, Zhenjiang 212013, China
- High-Tech Key Laboratory of Agricultural Equipment and Intelligence of Jiangsu Province, Jiangsu University, Zhenjiang 212013, China
| | - Mingyu Li
- Mengxi Honors College, Jiangsu University, Zhenjiang 212013, China; (S.S.); (M.L.)
| | - Bin Wu
- Department of Information Engineering, Chuzhou Polytechnic, Chuzhou 239000, China
- School of Computer Science and Engineering, Southeast University, Nanjing 211189, China
| |
Collapse
|
6
|
Wahab MF, Handlovic TT. Digital Restoration of Separations Data. J Sep Sci 2025; 48:e70139. [PMID: 40326520 DOI: 10.1002/jssc.70139] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2024] [Revised: 03/13/2025] [Accepted: 04/02/2025] [Indexed: 05/07/2025]
Abstract
In experimental separations, the acquired signal can have four common but unwanted elements: (i) high-frequency noise, (ii) drift, (iii) occasional periodic and pink noise, and (iv) a mixture of components traveling with close velocities. Additionally, separation scientists observe partially resolved peaks in chromatographic or capillary electrophoresis detector output. A protocol for digitally recovering the "true data" is provided with clear-cut details and open-access codes for formulating denoising, baseline correction, and peak resolution problems as linear or nonlinear optimizations. The smoother of Bohlmann-Whittaker denoises and preserves the total area of the signal. Fourier transform methods to detect and remove periodic noise are also proposed for erratic periodic noise problems. A peak width preserving filter, called the bilateral filter, is also given. This procedure is followed by two reliable baseline correction algorithms based on asymmetric reweighted least squares or curvature-based weights to deal with low-frequency signal variations. Once low- and high-frequency noise components are removed, a convenient peak mode location method is introduced, relying upon numerically stable second derivative calculation from Savitzky-Golay (SG) filter. Peak locations, area, height, and higher statistical moments can be derived from iterative curve fitting. The protocol shows two peak models that can model left or right skewed peaks. The new peak models are (i) a numerically stable version of the bi-directional exponentially modified Gaussian (EMG), suitable for a wide variety of real chromatograms, and (ii) the twice generalized normal model with third- and fourth-moment parameters built into it. The nonlinear least squares regression approach uses the trust-region reflective optimization method. The algorithm converges to a solution even with multiple peaks and approximate guesses of peak parameters. The protocol can be generalized to fit more than 200 peak functions in separation sciences available in literature by minor adaptations of the provided codes in the MATLAB environment.
Collapse
Affiliation(s)
- M Farooq Wahab
- Department of Chemistry & Biochemistry, The University of Texas at Arlington, Arlington, Texas, USA
| | - Troy T Handlovic
- Department of Chemistry & Biochemistry, The University of Texas at Arlington, Arlington, Texas, USA
| |
Collapse
|
7
|
Alsubaei FS, Almazroi AA, Atwa WS, Almazroi AA, Ayub N, Jhanjhi NZ. BERT ensemble based MBR framework for android malware detection. Sci Rep 2025; 15:14027. [PMID: 40269058 PMCID: PMC12019352 DOI: 10.1038/s41598-025-98596-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2024] [Accepted: 04/14/2025] [Indexed: 04/25/2025] Open
Abstract
Predicting attacks in Android Malware (AM) devices within recommender systems-based IoT is challenging. A novel framework is presented in this study for AM Detection (AMD) using BERT Ensemble (MBR) and MobileNetV2. The MBR model uses a threat analysis technique to assess Android apps by using a subset of 100 permissions from 329 Android application-based permissions, together with a refined feature set. Using MCADS, DroidRL, CNN, FAGnet, GAN, and FEDriod, the MBR model performs exceptionally well, achieving 98% accuracy, 96% precision, 98% recall, 97% F1-score, and a log loss of 0.058. By leveraging their strengths, the MBR model introduces significant innovation. By using ensemble methods on static data, the MBR framework not only provides a reliable malware detection solution but also presents a novel strategy. This research highlights the potential for significant applications in this dynamic and evolving field by addressing user privacy and system security issues, despite the growing Android malware risks in IoT.
Collapse
Affiliation(s)
- Faisal S Alsubaei
- Department of Cybersecurity, College of Computer Science and Engineering, University of Jeddah, Jeddah, 21959, Saudi Arabia.
| | - Abdulwahab Ali Almazroi
- College of Computing and Information Technology at Khulais, Department of Information Technology, University of Jeddah, Jeddah, 21959, Saudi Arabia
| | - Walid Said Atwa
- College of Computing and Information Technology at Khulais, Department of Information Technology, University of Jeddah, Jeddah, 21959, Saudi Arabia
- Computer Science Department, Faculty of Computers and Information, Menoufia University, Menoufia, Egypt
| | - Abdulaleem Ali Almazroi
- Department of Information Technology, Faculty of Computing and Information Technology in Rabigh, King Abdulaziz University, Rabigh, 21911, Saudi Arabia
| | - Nasir Ayub
- Department of Creative Technologies, Air University Islamabad, Islamabad, 44000, Pakistan
| | - Noor Zaman Jhanjhi
- School of Computer Science SCS, Taylor's University SDN BHD, Subang Jaya, 47500, Selangor, Malaysia
| |
Collapse
|
8
|
Su-An X, Yan-Dong Z, Lu-Shuai Q, Kai-Xing H, Yaqiong F. Quantitative analysis of safflower seed oil adulteration based on near-infrared spectroscopy combined with improved sparrow algorithm optimization model ISSA-ELM. ANALYTICAL METHODS : ADVANCING METHODS AND APPLICATIONS 2025; 17:3045-3057. [PMID: 40166813 DOI: 10.1039/d4ay02252a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/02/2025]
Abstract
Safflower seed oil is expensive, and there are issues in the market such as the adulteration with cheaper edible oils, leading to inferior products. Near-infrared spectroscopy (NIR) is a non-destructive analytical method with the advantages of being fast, non-destructive, and pollution-free. However, in the case of low-concentration adulterated oil, the main components and their contents are nearly identical to those in pure oil, and the spectral feature peaks of the samples are similar, making it difficult for conventional analysis methods to select effective characteristic variables. Extreme Learning Machine (ELM), as a single-hidden-layer feedforward neural network model, possesses strong feature extraction and model representation capabilities. However, its random initialization of weights and biases leads to the issue of blind training. The Sparrow Search Algorithm (SSA) can effectively optimize the random initialization problem in ELM, but it is prone to issues such as being trapped in local optima and slower convergence. This study proposes a novel quantitative adulteration analysis model for safflower seed oil (ISSA-ELM), which combines ELM with an improved Sparrow Search Algorithm (ISSA). In the experiment, safflower seed oil was used as the base oil, and peanut oil, corn oil, and soybean oil were gradually added to prepare adulterated oil samples. To ensure the resolution of experimental samples and accurately capture spectral variations in the low-concentration range, a concentration gradient of 2% was used for adulteration levels between 2% and 70%, while a 5% gradient was applied for concentrations exceeding 70%. Each type of adulterant had 42 gradient concentrations, with 6 samples per concentration, totaling 252 samples. Original spectral data were collected, and the samples were randomly divided into a training set (70%) and a testing set (30%). Different preprocessing methods and feature extraction techniques were combined to discuss the results of three adulteration analysis models: PLS, SSA-ELM and ISSA-ELM. The final experimental results show that compared to the safflower seed oil adulteration prediction model based on SSA-ELM (R2 = 0.8938, RMSE = 0.0835, RPD = 2.9018)and PLS(RP2 = 0.9015, RMSEP = 0.0876, RPD = 3.0128), the model based on ISSA-ELM (RP2 = 0.9934, RMSEP = 0.0207, RPD = 10.1457) offers higher prediction accuracy and stability, and the error detection limit of ISSA-ELM can reach 2%. The ISSA optimized the SSA's tendency to reduce population diversity, slow convergence, and get stuck in local optima in the later stages of iteration, greatly enhancing the global optimization ability of the algorithm. Therefore, ISSA-ELM can effectively identify adulterated safflower seed oil, providing a technological pathway and basis for research into the adulteration of safflower seed oil.
Collapse
Affiliation(s)
- Xu Su-An
- College of Mechanical and Electrical Engineering, China Jiliang University, Hangzhou, China
| | - Zhu Yan-Dong
- College of Mechanical and Electrical Engineering, China Jiliang University, Hangzhou, China
| | - Qian Lu-Shuai
- College of Mechanical and Electrical Engineering, China Jiliang University, Hangzhou, China
| | - Hong Kai-Xing
- College of Mechanical and Electrical Engineering, China Jiliang University, Hangzhou, China
| | - Fu Yaqiong
- College of Mechanical and Electrical Engineering, China Jiliang University, Hangzhou, China
| |
Collapse
|
9
|
Nanni L, Brahnam S, Ruta M, Fabris D, Boscolo Bacheto M, Milanello T. Deep Ensembling of Multiband Images for Earth Remote Sensing and Foramnifera Data. SENSORS (BASEL, SWITZERLAND) 2025; 25:2231. [PMID: 40218743 PMCID: PMC11991470 DOI: 10.3390/s25072231] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/21/2024] [Revised: 03/25/2025] [Accepted: 03/29/2025] [Indexed: 04/14/2025]
Abstract
The classification of multiband images captured by advanced sensors, such as satellite-mounted imaging systems, is a critical task in remote sensing and environmental monitoring. These sensors provide high-dimensional data that encapsulate a wealth of spectral and spatial information, enabling detailed analyses of the Earth's surface features. However, the complexity of these data poses significant challenges for accurate and efficient classification. Our study describes and highlights methods for creating ensembles of neural networks for handling multiband images. Two applications are illustrated in this work: (1) satellite image classification tested on the EuroSAT and LCZ42 datasets and (2) a species-level identification of planktic foraminifera. Multichannel images are fed into an ensemble of Convolutional Neural Networks (CNNs) (ResNet50, MobileNetV2, and DenseNet201), where each network is trained using three channels obtained from the multichannel images, and two custom networks (one based on ResNet50 and the other one based on attention) where the input is a multiband image. The ensemble learning framework harnesses these variations to improve classification accuracy, surpassing other advanced methods. The proposed system, implemented in MATLAB 2024b and PyTorch 2.6, is shown to achieve higher classification accuracy than those of human experts for species-level identification of planktic foraminifera (>92% vs. 83%) and state-of-the-art performance on the tested planktic foraminifera, the EuroSAT and LCZ42 datasets.
Collapse
Affiliation(s)
- Loris Nanni
- Department of Information Engineering, University of Padova, 35139 Padova, Italy; (M.R.); (D.F.); (M.B.B.); (T.M.)
| | - Sheryl Brahnam
- Information Technology and Cybersecurity, Missouri State University, 901 S. National, Springfield, MO 65804, USA;
| | - Matteo Ruta
- Department of Information Engineering, University of Padova, 35139 Padova, Italy; (M.R.); (D.F.); (M.B.B.); (T.M.)
| | - Daniele Fabris
- Department of Information Engineering, University of Padova, 35139 Padova, Italy; (M.R.); (D.F.); (M.B.B.); (T.M.)
| | - Martina Boscolo Bacheto
- Department of Information Engineering, University of Padova, 35139 Padova, Italy; (M.R.); (D.F.); (M.B.B.); (T.M.)
| | - Tommaso Milanello
- Department of Information Engineering, University of Padova, 35139 Padova, Italy; (M.R.); (D.F.); (M.B.B.); (T.M.)
| |
Collapse
|
10
|
Zhou H, Zhang S, Chen Y, Zhang S, Xu Z, Cui D, Guo W. Research on Pine Wilt Disease Spread Prediction Based on an Improved Light Gradient Boosting Machine Model. PHYTOPATHOLOGY 2025; 115:410-421. [PMID: 39745355 DOI: 10.1094/phyto-07-24-0202-r] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/26/2025]
Abstract
Pine wilt disease has caused significant damage to China's ecological and financial resources. To prevent its further spread across the country, proactive control measures are necessary. Given the low accuracy of traditional models, we have employed an enhanced light gradient boosting machine (LGBM) model to predict the development trend of pine wilt disease in China, providing a theoretical basis for its monitoring and prevention. We collected and organized data on the occurrence points of pine wilt disease at the county level in China. By incorporating anthropogenic factors such as the volume of pine wood imports from 2017 to 2022, the density of graded roads, the number of adjacent counties, and the presence of wood processing factories, as well as natural factors such as temperature, humidity, and wind speed, we employed Pearson correlation and LGBM model's feature importance analysis to select the 17 most significant influencing factors. Spatial analysis was conducted on the epidemic subcompartments (a divisional unit smaller than a township) of pine wilt disease for 2022 and 2023, revealing the distribution patterns of epidemic subcompartments within 2 km of roads and the spatial relationships between new and old epidemic subcompartments. We improved the LGBM model using a Bayesian algorithm, sparrow search algorithm, and hunter-prey optimization algorithm. By comparison, the enhanced model was validated to outperform in terms of accuracy, precision, recall, sensitivity, and specificity. Based on the results of correlation analysis and spatial analysis, an enhanced model was used to predict the emergence of pine wilt disease in new counties and districts in the future. Currently, pine wilt disease is primarily concentrated in the central-southern and northeastern provinces of China. Predictions indicate that the disease will further spread to the northeastern and southern regions of the country in the future.
Collapse
Affiliation(s)
- Hongwei Zhou
- College of Computer and Control Engineering, Northeast Forestry University, Harbin 150040, China
| | - Siyan Zhang
- College of Computer and Control Engineering, Northeast Forestry University, Harbin 150040, China
| | - Yifan Chen
- Center for Biological Disaster Prevention and Control, National Forestry and Grassland Administration, Shenyang 110034, China
| | - Shibo Zhang
- College of Computer and Control Engineering, Northeast Forestry University, Harbin 150040, China
| | - Zihan Xu
- College of Computer and Control Engineering, Northeast Forestry University, Harbin 150040, China
| | - Di Cui
- Heilongjiang Forestry Technology Service Center, Harbin 150010, China
| | - Wenhui Guo
- Center for Biological Disaster Prevention and Control, National Forestry and Grassland Administration, Shenyang 110034, China
| |
Collapse
|
11
|
Jo K, Lee S, Jeong SKC, Kim HB, Seong PN, Jung S, Lee DH. Cooking loss estimation of semispinalis capitis muscle of pork butt using a deep neural network on hyperspectral data. Meat Sci 2025; 222:109754. [PMID: 39799874 DOI: 10.1016/j.meatsci.2025.109754] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2024] [Revised: 11/22/2024] [Accepted: 01/08/2025] [Indexed: 01/15/2025]
Abstract
This study evaluated the performance of a deep-learning-based model that predicted cooking loss in the semispinalis capitis (SC) muscle of pork butts using hyperspectral images captured 24 h postmortem. To overcome low-scale samples, 70 pork butts were used with pixel-based data augmentation. Principal component regression (PCR) and partial least squares regression (PLSR) models for predicting cooking loss in SC muscle showed higher R2 values with multiplicative signal correction, while the first derivative resulted in a lower root mean square error (RMSE). The deep learning-based model outperformed the PCR and PLSR models. The classification accuracy of the models for cooking loss grade classification decreased as the number of grades increased, with the models with three grades achieving the highest classification accuracy. The deep learning model exhibited the highest classification accuracy (0.82). Cooking loss in the SC muscle was visualized using a deep learning model. The pH and cooking loss of the SC muscle were significantly correlated with the cooking loss of pork butt slices (-0.54 and 0.69, respectively). Therefore, a deep learning model using hyperspectral images can predict the cooking loss grade of SC muscle. This suggests that nondestructive prediction of the quality properties of pork butts can be achieved using hyperspectral images obtained from the SC muscle.
Collapse
Affiliation(s)
- Kyung Jo
- Department of Animal Science and Biotechnology, Chungnam National University, Daejeon 34134, Republic of Korea
| | - Seonmin Lee
- Department of Animal Science and Biotechnology, Chungnam National University, Daejeon 34134, Republic of Korea
| | - Seul-Ki-Chan Jeong
- Department of Animal Science and Biotechnology, Chungnam National University, Daejeon 34134, Republic of Korea
| | - Hyeun Bum Kim
- Department of Animal Resources Science, Dankook University, Cheonan 16890, Republic of Korea
| | - Pil Nam Seong
- National Institute of Animal Science, Rural Development Administration, Wanju 55365, Republic of Korea
| | - Samooel Jung
- Department of Animal Science and Biotechnology, Chungnam National University, Daejeon 34134, Republic of Korea.
| | - Dae-Hyun Lee
- Department of Biosystems Machinery Engineering, Chungnam National University, Daejeon 34134, Republic of Korea.
| |
Collapse
|
12
|
Barrera Morelli J, McGoverin C, Nieuwoudt M, Holroyd SE, Pilkington LI. Chemometric techniques for the prediction of milk composition from MIR spectral data: A review. Food Chem 2025; 469:142465. [PMID: 39724702 DOI: 10.1016/j.foodchem.2024.142465] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2024] [Revised: 11/14/2024] [Accepted: 12/11/2024] [Indexed: 12/28/2024]
Abstract
Chemometrics; use of statistical models to characterise and understand complex chemical systems/samples, is an advancing field. In the dairy industry, the accurate prediction of milk composition involves combining mid-infrared spectroscopy with chemometric techniques for the evaluation of major constituents of milk. The increased interest in determination of detailed composition of dairy products, alongside emerging and more-widespread use of chemometric methodologies, have generated continuous improvement in predictive models for this application. Herein the main chemometric techniques employed for the study of milk composition are described, compared and discussed. The capability of emerging technologies to improve predictive accuracy of models, over the gold standard technique Partial Least-Squares Regression, to provide recommendation for future directions in this area are particularly emphasised. This review will be of particular interest to researchers involved in milk and dairy product composition, alongside all working in application of statistical methodologies to complex chemical systems.
Collapse
Affiliation(s)
- Josefina Barrera Morelli
- School of Chemical Sciences, The University of Auckland, 23 Symonds St., Auckland 1142, New Zealand; Te Pūnaha Matatini, Auckland, 1142, New Zealand; MacDiarmid Institute for Advanced Materials and Nanotechnology, New Zealand; The Dodd Walls Centre for Photonic and Quantum Technologies, New Zealand.
| | - Cushla McGoverin
- The Dodd Walls Centre for Photonic and Quantum Technologies, New Zealand; Department of Physics, The University of Auckland, 23 Symonds St., Auckland 1142, New Zealand
| | - Michel Nieuwoudt
- School of Chemical Sciences, The University of Auckland, 23 Symonds St., Auckland 1142, New Zealand; MacDiarmid Institute for Advanced Materials and Nanotechnology, New Zealand
| | - Stephen E Holroyd
- Fonterra Research & Development Centre, Private Bag, 11029, Palmerston North, New Zealand
| | - Lisa I Pilkington
- School of Chemical Sciences, The University of Auckland, 23 Symonds St., Auckland 1142, New Zealand; Te Pūnaha Matatini, Auckland, 1142, New Zealand.
| |
Collapse
|
13
|
Li Z, Miao Z, Li C, Zhou Y, Qiu Y, Liu C, Teng X, Tan Y. Characterization of rice starch changes in saline and alkaline area under different fertilization conditions based on Raman spectral recognition technology. Sci Rep 2025; 15:9299. [PMID: 40102544 PMCID: PMC11920442 DOI: 10.1038/s41598-025-89102-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2024] [Accepted: 02/03/2025] [Indexed: 03/20/2025] Open
Abstract
Starch content in rice is one of the important parameters in characterizing the nutritional quality of rice, and the starch content of rice produced in saline soils under different fertilization conditions varies. In this study, Raman spectroscopy combined with three machine learning models, support vector machine (SVM), feedforward neural network, and k-nearest neighbor classification, was used to classify and evaluate the effect of different fertilizer treatments on rice. The collected rice spectral data were normalized before machine learning, then preprocessed with multiple scattering correction (MSC), standard normal variable, and Savitzky-Golay filtering algorithms to improve the quality and reliability of the data. The evaluation indexes such as the confusion matrix and the receiver operating characteristic curve comprehensively analyzed the model's performance. The research shows that the MSC preprocessing method significantly improves the classification accuracy and prediction ability in all three models, and the classification accuracy was close to 100%, while the overall performance of the SVM models after various preprocessing is the best among the three machine learning methods. The predictive coefficient of determination, predictive root mean square error, and predictive average relative error of the starch content detection model built by the SVM model after MSC preprocessing were 0.93, 0.04%, and 0.20%, respectively, which indicated that its prediction had high accuracy and low error. The results of this study used Raman spectroscopy to carry out the identification of different fertilization techniques and rice starch quality correlation characteristics, providing theoretical and experimental support for the rapid identification of rice quality.
Collapse
Affiliation(s)
- Zhipeng Li
- Key Laboratory of Spectral Detection Science and Technology, School of Physics, Changchun University of Science and Technology, Changchun, 130000, China
| | - Zhuang Miao
- Key Laboratory of Spectral Detection Science and Technology, School of Physics, Changchun University of Science and Technology, Changchun, 130000, China
| | - Changming Li
- Key Laboratory of Spectral Detection Science and Technology, School of Physics, Changchun University of Science and Technology, Changchun, 130000, China
| | - Yingying Zhou
- Key Laboratory of Spectral Detection Science and Technology, School of Physics, Changchun University of Science and Technology, Changchun, 130000, China
| | - Yixin Qiu
- Key Laboratory of Spectral Detection Science and Technology, School of Physics, Changchun University of Science and Technology, Changchun, 130000, China
| | - Chunyu Liu
- Key Laboratory of Spectral Detection Science and Technology, School of Physics, Changchun University of Science and Technology, Changchun, 130000, China.
| | - Xing Teng
- Jilin Academy of Agricultural Sciences (Northeast Agricultural Research Center of China), Changchun, 130000, China.
| | - Yong Tan
- Key Laboratory of Spectral Detection Science and Technology, School of Physics, Changchun University of Science and Technology, Changchun, 130000, China.
| |
Collapse
|
14
|
Singh G, Sharma S. Enhancing precision agriculture through cloud based transformative crop recommendation model. Sci Rep 2025; 15:9138. [PMID: 40097589 PMCID: PMC11914076 DOI: 10.1038/s41598-025-93417-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2024] [Accepted: 03/06/2025] [Indexed: 03/19/2025] Open
Abstract
Modern agriculture relies more on technology to boost food production. It aims to improve both the quality and quantity of food. This paper introduces a novel TCRM (Transformative Crop Recommendation Model). It uses advanced machine learning and cloud platforms to give personalized crop recommendations. Unlike traditional methods, TCRM uses real-time data. It includes environmental and agronomic factors to optimize recommendations. The system has SMS alerts for remote farmers. It outperforms baseline algorithms like Logistic Regression, KNN(k-nearest neighbor), and AdaBoost. TCRM empowers farmers with actionable insights, reducing resource wastage while boosting yield. By offering region-specific recommendations, it enhances profitability and promotes sustainable agricultural practices. The model has 94% accuracy, 94.46% precision, and 94% recall. Its F1 score is 93.97%. The fivefold cross-validation score is 97.67%. These findings show that the model can improve precision farming. It can make agriculture more sustainable and efficient.
Collapse
Affiliation(s)
- Gurpreet Singh
- Department of Computer Engineering and Technology, Guru Nanak Dev University, Amritsar, Punjab, India.
| | - Sandeep Sharma
- Department of Computer Engineering and Technology, Guru Nanak Dev University, Amritsar, Punjab, India
| |
Collapse
|
15
|
Fan Z, Zhang J, Ma C, Cong B, Huang P. The application of vibrational spectroscopy in forensic analysis of biological evidence. Forensic Sci Med Pathol 2025; 21:406-416. [PMID: 39180652 DOI: 10.1007/s12024-024-00866-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/30/2024] [Indexed: 08/26/2024]
Abstract
Vibrational spectroscopy is a powerful analytical domain, within which Fourier transform infrared spectroscopy (FTIR) and Raman spectroscopy stand as exemplars, offering high chemical specificity and sensitivity. These methodologies have been instrumental in the characterization of chemical compounds for an extensive period. They are particularly adept at the identification and analysis of minute sample quantities. Both FTIR and Raman spectroscopy are proficient in elucidating small liquid samples and detecting nuanced molecular alterations. The application of chemometrics further augments their analytical prowess. Currently, these techniques are in the research phase within forensic medicine and have yet to be broadly implemented in examination and identification processes. Nonetheless, studies have indicated that a combined classification model utilizing FTIR and Raman spectroscopy yields exceptional results for the identification of biological fluid-related information and the determination of causes of death. The objective of this review is to delineate the current research trajectory and potential applications of these two vibrational spectroscopic techniques in the detection of body fluids and the ascertainment of causes of death within the context of forensic medicine.
Collapse
Affiliation(s)
- Zehua Fan
- Department of Forensic Pathology, Institute of Forensic Science, Shanghai Key Laboratory of Forensic Medicine, Academy of Forensic Science, Shanghai, 200063, People's Republic of China
- College of Forensic Medicine, Hebei Key Laboratory of Forensic Medicine, Hebei Medical University, Shijiazhuang, 050000, People's Republic of China
| | - Ji Zhang
- Department of Forensic Pathology, Institute of Forensic Science, Shanghai Key Laboratory of Forensic Medicine, Academy of Forensic Science, Shanghai, 200063, People's Republic of China
| | - Chunling Ma
- College of Forensic Medicine, Hebei Key Laboratory of Forensic Medicine, Hebei Medical University, Shijiazhuang, 050000, People's Republic of China
| | - Bin Cong
- College of Forensic Medicine, Hebei Key Laboratory of Forensic Medicine, Hebei Medical University, Shijiazhuang, 050000, People's Republic of China.
| | - Ping Huang
- Institute of Forensic Science, Fudan University, Shanghai, 200032, People's Republic of China.
| |
Collapse
|
16
|
Giussani B, Monti M, Riu J. From spectroscopic data variability to optimal preprocessing: leveraging multivariate error in almond powder adulteration of different grain size. Anal Bioanal Chem 2025; 417:1393-1405. [PMID: 39710779 DOI: 10.1007/s00216-024-05710-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2024] [Revised: 12/09/2024] [Accepted: 12/11/2024] [Indexed: 12/24/2024]
Abstract
Analysing samples in their original form is increasingly crucial in analytical chemistry due to the need for efficient and sustainable practices. Analytical chemists face the dual challenge of achieving accuracy while detecting minute analyte quantities in complex matrices, often requiring sample pretreatment. This necessitates the use of advanced techniques with low detection limits, but the emphasis on sensitivity can conflict with efforts to simplify procedures and reduce solvent use. This article discusses the shift towards green analytical methods, focusing on portable spectroscopic techniques in the near-infrared (NIR) region. A case study involving the prediction of adulteration in almond flour with bitter almond flour illustrates the importance of particle size and the integration between the sample and the instrument. The study emphasizes the necessity of investigating the multivariate error associated with raw data to enhance data preprocessing strategies. This research provides valuable insights for professionals in the field, presenting a methodology applicable to a broad range of analytical applications while underscoring the critical role of raw data analysis in achieving accurate and reliable results.
Collapse
Affiliation(s)
- Barbara Giussani
- Dipartimento Di Scienza e Alta Tecnologia, Università Degli Studi Dell'Insubria, Via Valleggio 9, 22100, Como, Italy.
| | - Manuel Monti
- Dipartimento Di Scienza e Alta Tecnologia, Università Degli Studi Dell'Insubria, Via Valleggio 9, 22100, Como, Italy
| | - Jordi Riu
- Department of Analytical Chemistry and Organic Chemistry, Universitat Rovira i Virgili, Carrer Marcel·lí Domingo 1, 43007, Tarragona, Spain
| |
Collapse
|
17
|
Gao P, Li W, Hashim SBH, Liang J, Xu J, Huang X, Zou X, Shi J. Distributional uniformity quantification in heterogeneous prepared dishes combined the hyperspectral imaging technology with Moran's I: A case study of pizza. Food Chem 2025; 466:141511. [PMID: 39608109 DOI: 10.1016/j.foodchem.2024.141511] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2024] [Revised: 09/13/2024] [Accepted: 09/30/2024] [Indexed: 11/30/2024]
Abstract
Quality detection is critical in the development of prepared dishes, with distributional uniformity playing a significant role. This study used hyperspectral imaging (HSI) and Moran's I to quantify distributional uniformity, employing pizza as case. Pizza ingredients' spectra were collected, pre-processed with Detrended Fluctuation Analysis (DFA), Savitzky-Golay (SG) and Standard Normal Variate (SNV), and down-scaled with Principal Component Analysis (PCA). Subsequently, the classifiers Fine Tree, Support Vector Machine (SVM), and K-Nearest Neighbors (KNN) were utilized, where KNN based on the DFA-processed data had the greatest accuracy of 99.2 %. This best-fit model was used to create visualization maps. At last, image analysis methods containing regional statistics, Grey Level Co-occurrence Matrix (GLCM) and Moran's I were used to measure distributional uniformity. Moran's I demonstrated great distinctiveness and accuracy, making it the best tool. Therefore, HSI and Moran's I combination proved feasible to indicate distributional uniformity, ensuring the high quality of prepared dishes.
Collapse
Affiliation(s)
- Peipei Gao
- Agricultural Product Processing and Storage Lab, School of Food and Biological Engineering, Jiangsu University, Zhenjiang, Jiangsu 212013, China; International Joint Research Laboratory of Intelligent Agriculture and Agri-products Processing (Jiangsu University), Jiangsu Education Department, Zhenjiang 212013, China; China Light Industry Key Laboratory of Food Intelligent Detection & Processing, School of Food and Biological Engineering, Jiangsu University, Zhenjiang 212013, China
| | - Wenlong Li
- Agricultural Product Processing and Storage Lab, School of Food and Biological Engineering, Jiangsu University, Zhenjiang, Jiangsu 212013, China; International Joint Research Laboratory of Intelligent Agriculture and Agri-products Processing (Jiangsu University), Jiangsu Education Department, Zhenjiang 212013, China; China Light Industry Key Laboratory of Food Intelligent Detection & Processing, School of Food and Biological Engineering, Jiangsu University, Zhenjiang 212013, China
| | - Sulafa B H Hashim
- Agricultural Product Processing and Storage Lab, School of Food and Biological Engineering, Jiangsu University, Zhenjiang, Jiangsu 212013, China; International Joint Research Laboratory of Intelligent Agriculture and Agri-products Processing (Jiangsu University), Jiangsu Education Department, Zhenjiang 212013, China; China Light Industry Key Laboratory of Food Intelligent Detection & Processing, School of Food and Biological Engineering, Jiangsu University, Zhenjiang 212013, China
| | - Jing Liang
- Agricultural Product Processing and Storage Lab, School of Food and Biological Engineering, Jiangsu University, Zhenjiang, Jiangsu 212013, China; International Joint Research Laboratory of Intelligent Agriculture and Agri-products Processing (Jiangsu University), Jiangsu Education Department, Zhenjiang 212013, China; China Light Industry Key Laboratory of Food Intelligent Detection & Processing, School of Food and Biological Engineering, Jiangsu University, Zhenjiang 212013, China
| | - Jialong Xu
- Agricultural Product Processing and Storage Lab, School of Food and Biological Engineering, Jiangsu University, Zhenjiang, Jiangsu 212013, China; International Joint Research Laboratory of Intelligent Agriculture and Agri-products Processing (Jiangsu University), Jiangsu Education Department, Zhenjiang 212013, China; China Light Industry Key Laboratory of Food Intelligent Detection & Processing, School of Food and Biological Engineering, Jiangsu University, Zhenjiang 212013, China
| | - Xiaowei Huang
- Agricultural Product Processing and Storage Lab, School of Food and Biological Engineering, Jiangsu University, Zhenjiang, Jiangsu 212013, China; International Joint Research Laboratory of Intelligent Agriculture and Agri-products Processing (Jiangsu University), Jiangsu Education Department, Zhenjiang 212013, China; China Light Industry Key Laboratory of Food Intelligent Detection & Processing, School of Food and Biological Engineering, Jiangsu University, Zhenjiang 212013, China
| | - Xiaobo Zou
- Agricultural Product Processing and Storage Lab, School of Food and Biological Engineering, Jiangsu University, Zhenjiang, Jiangsu 212013, China; International Joint Research Laboratory of Intelligent Agriculture and Agri-products Processing (Jiangsu University), Jiangsu Education Department, Zhenjiang 212013, China; China Light Industry Key Laboratory of Food Intelligent Detection & Processing, School of Food and Biological Engineering, Jiangsu University, Zhenjiang 212013, China
| | - Jiyong Shi
- Agricultural Product Processing and Storage Lab, School of Food and Biological Engineering, Jiangsu University, Zhenjiang, Jiangsu 212013, China; International Joint Research Laboratory of Intelligent Agriculture and Agri-products Processing (Jiangsu University), Jiangsu Education Department, Zhenjiang 212013, China; China Light Industry Key Laboratory of Food Intelligent Detection & Processing, School of Food and Biological Engineering, Jiangsu University, Zhenjiang 212013, China.
| |
Collapse
|
18
|
Mei C, Wang Z, Jiang H. Determination of aflatoxin B1 in wheat using Raman spectroscopy combined with chemometrics. SPECTROCHIMICA ACTA. PART A, MOLECULAR AND BIOMOLECULAR SPECTROSCOPY 2025; 327:125384. [PMID: 39500203 DOI: 10.1016/j.saa.2024.125384] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/21/2024] [Revised: 09/08/2024] [Accepted: 11/01/2024] [Indexed: 12/08/2024]
Abstract
Aflatoxin B1 (AFB1) is carcinogenic and highly susceptible to production in wheat. In this study, the quantitative detection of contaminant AFB1 in wheat was investigated by Raman spectroscopy combined with chemometric method realization. Firstly, Savitzky-Golay smoothing (SG) and baseline calibration methods were used to perform the necessary preprocessing of the collected raw Raman spectra. Then, three variable optimization methods, i.e., competitive adaptive reweighted sampling (CARS), iteratively variable subset optimization (IVSO), and bootstrap soft shrinkage (BOSS), were applied to the preprocessed wheat Raman spectra. Finally, partial least squares regression (PLSR) models were developed to determine AFB1 in wheat samples. The results showed that all three variable optimization algorithms significantly improved the predictive performance of the models. The BOSS-PLSR model has strong generalization performance and robustness. Its prediction coefficient of determination (RP2) was 0.9927, the root mean square error of prediction (RMSEP) was 2.4260 μg/kg, and the relative prediction deviation (RPD) was 11.5250, respectively. In conclusion, the combination of Raman spectroscopy and chemometrics can realize the rapid quantitative detection of AFB1 in wheat.
Collapse
Affiliation(s)
- Congli Mei
- College of Electrical Engineering, Zhejiang University of Water Resources and Electric Power, Hangzhou 310048, PR China.
| | - Ziyu Wang
- School of Electrical and Information Engineering, Jiangsu University, Zhenjiang 212013, PR China
| | - Hui Jiang
- School of Electrical and Information Engineering, Jiangsu University, Zhenjiang 212013, PR China.
| |
Collapse
|
19
|
Kan J, Deng J, Ding Z, Jiang H, Chen Q. Feasibility study on non-destructive detection of microplastic content in flour based on portable Raman spectroscopy system combined with mixed variable selection method. SPECTROCHIMICA ACTA. PART A, MOLECULAR AND BIOMOLECULAR SPECTROSCOPY 2025; 326:125195. [PMID: 39340947 DOI: 10.1016/j.saa.2024.125195] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/04/2024] [Revised: 09/07/2024] [Accepted: 09/22/2024] [Indexed: 09/30/2024]
Abstract
Microplastics, as emerging environmental pollutants, have garnered considerable attention due to their contamination of both the environment and food. Microplastics can infiltrate the human food chain through multiple pathways, potentially posing health risks to humans. Currently, non-destructive testing of microplastics in food is considered challenging. This study aims to investigate the feasibility of employing a portable Raman spectroscopy system for non-destructive detection of microplastic content (polystyrene, PS; polyethylene, PE) in flour. In this study, a portable spectrometer was used to collect flour spectra of different abundances of microplastics. To enhance the predictive performance of the partial least squares (PLS) model, a mixed variable selection strategy that combined the wavelength interval selection method (Synergy interval partial least squares, siPLS) and the wavelength point selection method (Least absolute shrinkage and selection operator, LASSO; Multiple feature-spaces ensemble by least absolute shrinkage and selection operator, MFE-LASSO) was proposed. Four regression models (PLS, siPLS, siPLS-LASSO, siPLS-MFE-LASSO) were developed and compared for detecting PS and PE content in flour. The siPLS-MFE-LASSO model exhibited the best generalization performance in the prediction set, and was considered to have the best generalization performance (PS: RP2 = 0.9889, RMSEP=0.0344 %; PE: RP2 = 0.9878, RMSEP=0.0361 %). In conclusion, this study has demonstrated the potential of using a portable Raman spectrometer in conjunction with a mixed variable selection algorithm for non-destructive detection of PS and PE content in flour, providing more possibilities for non-destructive detection of microplastic content in food.
Collapse
Affiliation(s)
- Jiaming Kan
- School of Electrical and Information Engineering, Jiangsu University, Zhenjiang 212013, PR China
| | - Jihong Deng
- School of Electrical and Information Engineering, Jiangsu University, Zhenjiang 212013, PR China
| | - Zhidong Ding
- Product Quality Supervision and Inspection Center of Zhenjiang City, Zhenjiang 212132, PR China
| | - Hui Jiang
- School of Electrical and Information Engineering, Jiangsu University, Zhenjiang 212013, PR China.
| | - Quansheng Chen
- College of Ocean Food and Biological Engineering, Jimei University, Xiamen 361021, PR China.
| |
Collapse
|
20
|
Feng Y, Zhu X, Wang Y. Application of spectroscopic technology with machine learning in Chinese herbs from seeds to medicinal materials: The case of genus Paris. J Pharm Anal 2025; 15:101103. [PMID: 40034863 PMCID: PMC11874543 DOI: 10.1016/j.jpha.2024.101103] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2024] [Revised: 08/11/2024] [Accepted: 09/10/2024] [Indexed: 03/05/2025] Open
Abstract
To ensure the safety and efficacy of Chinese herbs, it is of great significance to conduct rapid quality detection of Chinese herbs at every link of their supply chain. Spectroscopic technology can reflect the overall chemical composition and structural characteristics of Chinese herbs, with the multi-component and multitarget characteristics of Chinese herbs. This review took the genus Paris as an example, and applications of spectroscopic technology with machine learning (ML) in supply chain of the genus Paris from seeds to medicinal materials were introduced. The specific contents included the confirmation of germplasm resources, identification of growth years, cultivar, geographical origin, and original processing and processing methods. The potential application of spectroscopic technology in genus Paris was pointed out, and the prospects of combining spectroscopic technology with blockchain were proposed. The summary and prospects presented in this paper will be beneficial to the quality control of the genus Paris in all links of its supply chain, so as to rationally use the genus Paris resources and ensure the safety and efficacy of medication.
Collapse
Affiliation(s)
- Yangna Feng
- Medicinal Plants Research Institute, Yunnan Academy of Agricultural Science, Kunming, 650200, China
- College of Traditional Chinese Medicine, Yunnan University of Chinese Medicine, Kunming, 650500, China
| | - Xinyan Zhu
- Medicinal Plants Research Institute, Yunnan Academy of Agricultural Science, Kunming, 650200, China
| | - Yuanzhong Wang
- Medicinal Plants Research Institute, Yunnan Academy of Agricultural Science, Kunming, 650200, China
| |
Collapse
|
21
|
Zhang X, Wang C, Pi X, Li B, Ding Y, Yu H, Sun J, Wang P, Chen Y, Wang Q, Zhang C, Meng X, Chen G, Wang D, Wang Z, Mu Z, Song H, Zhang J, Niu S, Han Z, Ren L. Bionic Recognition Technologies Inspired by Biological Mechanosensory Systems. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2025:e2418108. [PMID: 39838736 DOI: 10.1002/adma.202418108] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/21/2024] [Revised: 12/23/2024] [Indexed: 01/23/2025]
Abstract
Mechanical information is a medium for perceptual interaction and health monitoring of organisms or intelligent mechanical equipment, including force, vibration, sound, and flow. Researchers are increasingly deploying mechanical information recognition technologies (MIRT) that integrate information acquisition, pre-processing, and processing functions and are expected to enable advanced applications. However, this also poses significant challenges to information acquisition performance and information processing efficiency. The novel and exciting mechanosensory systems of organisms in nature have inspired us to develop superior mechanical information bionic recognition technologies (MIBRT) based on novel bionic materials, structures, and devices to address these challenges. Herein, first bionic strategies for information pre-processing are presented and their importance for high-performance information acquisition is highlighted. Subsequently, design strategies and considerations for high-performance sensors inspired by mechanoreceptors of organisms are described. Then, the design concepts of the neuromorphic devices are summarized in order to replicate the information processing functions of a biological nervous system. Additionally, the ability of MIBRT is investigated to recognize basic mechanical information. Furthermore, further potential applications of MIBRT in intelligent robots, healthcare, and virtual reality are explored with a view to solve a range of complex tasks. Finally, potential future challenges and opportunities for MIBRT are identified from multiple perspectives.
Collapse
Affiliation(s)
- Xiangxiang Zhang
- Key Laboratory of Bionic Engineering (Ministry of Education), Jilin University, Changchun, Jilin, 130022, China
| | - Changguang Wang
- Key Laboratory of Bionic Engineering (Ministry of Education), Jilin University, Changchun, Jilin, 130022, China
| | - Xiang Pi
- Key Laboratory of Bionic Engineering (Ministry of Education), Jilin University, Changchun, Jilin, 130022, China
| | - Bo Li
- Key Laboratory of Bionic Engineering (Ministry of Education), Jilin University, Changchun, Jilin, 130022, China
- The National Key Laboratory of Automotive Chassis Integration and Bionics (ACIB), College of Biological and Agricultural Engineering, Jilin University, Changchun, 130022, China
| | - Yuechun Ding
- Key Laboratory of Bionic Engineering (Ministry of Education), Jilin University, Changchun, Jilin, 130022, China
| | - Hexuan Yu
- Key Laboratory of Bionic Engineering (Ministry of Education), Jilin University, Changchun, Jilin, 130022, China
| | - Jialue Sun
- Key Laboratory of Bionic Engineering (Ministry of Education), Jilin University, Changchun, Jilin, 130022, China
| | - Pinkun Wang
- Key Laboratory of Bionic Engineering (Ministry of Education), Jilin University, Changchun, Jilin, 130022, China
| | - You Chen
- Key Laboratory of Bionic Engineering (Ministry of Education), Jilin University, Changchun, Jilin, 130022, China
| | - Qun Wang
- Key Laboratory of Bionic Engineering (Ministry of Education), Jilin University, Changchun, Jilin, 130022, China
| | - Changchao Zhang
- Key Laboratory of Bionic Engineering (Ministry of Education), Jilin University, Changchun, Jilin, 130022, China
| | - Xiancun Meng
- Key Laboratory of Bionic Engineering (Ministry of Education), Jilin University, Changchun, Jilin, 130022, China
| | - Guangjun Chen
- Key Laboratory of Bionic Engineering (Ministry of Education), Jilin University, Changchun, Jilin, 130022, China
| | - Dakai Wang
- Key Laboratory of Bionic Engineering (Ministry of Education), Jilin University, Changchun, Jilin, 130022, China
| | - Ze Wang
- Key Laboratory of Bionic Engineering (Ministry of Education), Jilin University, Changchun, Jilin, 130022, China
| | - Zhengzhi Mu
- Key Laboratory of Bionic Engineering (Ministry of Education), Jilin University, Changchun, Jilin, 130022, China
| | - Honglie Song
- Key Laboratory of Bionic Engineering (Ministry of Education), Jilin University, Changchun, Jilin, 130022, China
| | - Junqiu Zhang
- Key Laboratory of Bionic Engineering (Ministry of Education), Jilin University, Changchun, Jilin, 130022, China
- The National Key Laboratory of Automotive Chassis Integration and Bionics (ACIB), College of Biological and Agricultural Engineering, Jilin University, Changchun, 130022, China
- Institute of Structured and Architected Materials, Liaoning Academy of Materials, Shenyang, 110167, China
| | - Shichao Niu
- Key Laboratory of Bionic Engineering (Ministry of Education), Jilin University, Changchun, Jilin, 130022, China
- The National Key Laboratory of Automotive Chassis Integration and Bionics (ACIB), College of Biological and Agricultural Engineering, Jilin University, Changchun, 130022, China
- Institute of Structured and Architected Materials, Liaoning Academy of Materials, Shenyang, 110167, China
| | - Zhiwu Han
- Key Laboratory of Bionic Engineering (Ministry of Education), Jilin University, Changchun, Jilin, 130022, China
- The National Key Laboratory of Automotive Chassis Integration and Bionics (ACIB), College of Biological and Agricultural Engineering, Jilin University, Changchun, 130022, China
- Institute of Structured and Architected Materials, Liaoning Academy of Materials, Shenyang, 110167, China
| | - Luquan Ren
- Key Laboratory of Bionic Engineering (Ministry of Education), Jilin University, Changchun, Jilin, 130022, China
- The National Key Laboratory of Automotive Chassis Integration and Bionics (ACIB), College of Biological and Agricultural Engineering, Jilin University, Changchun, 130022, China
- Institute of Structured and Architected Materials, Liaoning Academy of Materials, Shenyang, 110167, China
| |
Collapse
|
22
|
Zheng Y, Luo X, Gao Y, Sun Z, Huang K, Gao W, Xu H, Xie L. Lycopene detection in cherry tomatoes with feature enhancement and data fusion. Food Chem 2025; 463:141183. [PMID: 39278075 DOI: 10.1016/j.foodchem.2024.141183] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2024] [Revised: 08/15/2024] [Accepted: 09/05/2024] [Indexed: 09/17/2024]
Abstract
Lycopene, a biologically active phytochemical with health benefits, is a key quality indicator for cherry tomatoes. While ultraviolet/visible/near-infrared (UV/Vis/NIR) spectroscopy holds promise for large-scale online lycopene detection, capturing its characteristic signals is challenging due to the low lycopene concentration in cherry tomatoes. This study improved the prediction accuracy of lycopene by supplementing spectral data with image information through spectral feature enhancement and spectra-image fusion. The feasibility of using UV/Vis/NIR spectra and image features to predict lycopene content was validated. By enhancing spectral bands corresponding to colors correlated with lycopene, the performance of the spectral model was improved. Additionally, direct spectra-image fusion further enhanced the prediction accuracy, achieving RP2, RMSEP, and RPD as 0.95, 8.96 mg/kg, and 4.25, respectively. Overall, this research offers valuable insights into supplementing spectral data with image information to improve the accuracy of non-destructive lycopene detection, providing practical implications for online fruit quality prediction.
Collapse
Affiliation(s)
- Yuanhao Zheng
- College of Biosystems Engineering and Food Science, Zhejiang University, Hangzhou 310058, PR China; Key Laboratory of Intelligent Equipment and Robotics for Agriculture of Zhejiang Province, Hangzhou 310058, PR China
| | - Xuan Luo
- College of Biosystems Engineering and Food Science, Zhejiang University, Hangzhou 310058, PR China; Key Laboratory of On-Site Processing Equipment for Agricultural Products, Ministry of Agriculture and Rural Affairs, Hangzhou 310058, PR China
| | - Yuan Gao
- College of Biosystems Engineering and Food Science, Zhejiang University, Hangzhou 310058, PR China; Key Laboratory of Intelligent Equipment and Robotics for Agriculture of Zhejiang Province, Hangzhou 310058, PR China; Key Laboratory of On-Site Processing Equipment for Agricultural Products, Ministry of Agriculture and Rural Affairs, Hangzhou 310058, PR China
| | - Zhizhong Sun
- College of Chemistry and Materials Engineering, Zhejiang A&F University, Hangzhou 311300, PR China
| | - Kang Huang
- Department of Biological Systems Engineering, Washington State University, Pullman, WA 99164, USA
| | - Weilu Gao
- Department of Electrical and Computer Engineering, The University of Utah, 201 Presidents' Cir, Salt Lake City, UT 84112, USA
| | - Huirong Xu
- College of Biosystems Engineering and Food Science, Zhejiang University, Hangzhou 310058, PR China; Key Laboratory of Intelligent Equipment and Robotics for Agriculture of Zhejiang Province, Hangzhou 310058, PR China; Key Laboratory of On-Site Processing Equipment for Agricultural Products, Ministry of Agriculture and Rural Affairs, Hangzhou 310058, PR China
| | - Lijuan Xie
- College of Biosystems Engineering and Food Science, Zhejiang University, Hangzhou 310058, PR China; Key Laboratory of Intelligent Equipment and Robotics for Agriculture of Zhejiang Province, Hangzhou 310058, PR China.
| |
Collapse
|
23
|
Yu Y, Chai Y, Yan Y, Li Z, Huang Y, Chen L, Dong H. Near-infrared spectroscopy combined with support vector machine for the identification of Tartary buckwheat (Fagopyrum tataricum (L.) Gaertn) adulteration using wavelength selection algorithms. Food Chem 2025; 463:141548. [PMID: 39388874 DOI: 10.1016/j.foodchem.2024.141548] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2024] [Revised: 09/29/2024] [Accepted: 10/03/2024] [Indexed: 10/12/2024]
Abstract
The frequent occurrence of adulterating Tartary buckwheat powder with crop flours in the market necessitates an urgent need for a simple analysis method to ensure the quality of Tartary buckwheat. This study employed near-infrared spectroscopy (NIRS) for the collection of spectral data from Tartary buckwheat samples adulterated with whole wheat, oat, soybean, barley, and sorghum flours. The competitive adaptive reweighted sampling (CARS) and successive projection algorithm (SPA) were deployed to identify informative wavelengths. By integrating support vector machine (SVM) and partial least squares discriminant analysis (PLS-DA), we constructed qualitative models to discern Tartary buckwheat adulteration. The PLS-DA model exhibited prediction accuracies between 89.78 % and 94.22 %, while the mean-centering (MC)-PLS-DA model showcased impressive predictive accuracy of 93.33 %. Notably, the feature-based Autoscales-CARS-CV-SVM model achieved more excellent identification accuracy. These findings exhibit the excellent potential of chemometrics as a powerful tool for detecting food product adulteration.
Collapse
Affiliation(s)
- Yue Yu
- School of Grain Science and Technology, Jiangsu University of Science and Technology, Zhenjiang 212100, China
| | - Yinghui Chai
- School of Grain Science and Technology, Jiangsu University of Science and Technology, Zhenjiang 212100, China
| | - Yujie Yan
- School of Grain Science and Technology, Jiangsu University of Science and Technology, Zhenjiang 212100, China
| | - Zhanming Li
- School of Grain Science and Technology, Jiangsu University of Science and Technology, Zhenjiang 212100, China.
| | - Yue Huang
- College of Food Science and Nutritional Engineering, China Agricultural University, Beijing 100083, China
| | - Lin Chen
- School of Chemistry, Chemical Engineering and Biotechnology, Nanyang Technological University, Singapore 637459, Singapore
| | - Hao Dong
- College of Light Industry and Food Sciences, Zhongkai University of Agriculture and Engineering, Guangzhou 510225, China.
| |
Collapse
|
24
|
Zou X, Wang Q, Chen Y, Wang J, Xu S, Zhu Z, Yan C, Shan P, Wang S, Fu Y. Fusion of convolutional neural network with XGBoost feature extraction for predicting multi-constituents in corn using near infrared spectroscopy. Food Chem 2025; 463:141053. [PMID: 39241414 DOI: 10.1016/j.foodchem.2024.141053] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2024] [Revised: 07/31/2024] [Accepted: 08/21/2024] [Indexed: 09/09/2024]
Abstract
Near-infrared (NIR) spectroscopy has been widely utilized to predict multi-constituents of corn in agriculture. However, directly extracting constituent information from the NIR spectra is challenging due to many issues such as broad absorption band, overlapping and non-specific nature. To solve these problems and extract implicit features from the raw data of NIR spectra to improve performance of quantitative models, a one-dimensional shallow convolutional neural network (CNN) model based on an eXtreme Gradient Boosting (XGBoost) feature extraction method was proposed in this paper. The leaf node feature information in the XGBoost was encoded and reconstructed to obtain the implicit features of raw data in the NIR spectra. A two-parametric Swish (TSwish or TS) activation function was proposed to improve the performance of CNN, and the elastic net (EN) was also applied to avoid the overfitting problem of the CNN model. Performance of the developed XGBoost-CNN-TS-EN model was evaluated using two public NIR spectroscopy datasets of corn and soil, and the obtained determination coefficients (R2) for moisture, oil, protein, and starch of the corn on test set were 0.993, 0.991, 0.998, and 0.992, respectively, with that of the soil organic matter being 0.992. The XGBoost-CNN-TS-EN model exhibits superior stability, good prediction accuracy, and generalization ability, demonstrating its great potentials for quantitative analysis of multi-constituents in spectroscopic applications.
Collapse
Affiliation(s)
- Xin Zou
- College of Information Science and Engineering, Northeastern University, Shenyang, Liaoning Province 110819, China
| | - Qiaoyun Wang
- College of Information Science and Engineering, Northeastern University, Shenyang, Liaoning Province 110819, China; Hebei Key Laboratory of Micro-Nano Precision Optical Sensing and Measurement Technology, Qinhuangdao 066004, China.
| | - Yinji Chen
- College of Information Science and Engineering, Northeastern University, Shenyang, Liaoning Province 110819, China
| | - Jilong Wang
- College of Information Science and Engineering, Northeastern University, Shenyang, Liaoning Province 110819, China
| | - Shunyuan Xu
- College of Information Science and Engineering, Northeastern University, Shenyang, Liaoning Province 110819, China
| | - Ziheng Zhu
- College of Information Science and Engineering, Northeastern University, Shenyang, Liaoning Province 110819, China
| | - Chongyue Yan
- College of Information Science and Engineering, Northeastern University, Shenyang, Liaoning Province 110819, China
| | - Peng Shan
- College of Information Science and Engineering, Northeastern University, Shenyang, Liaoning Province 110819, China
| | - Shuyu Wang
- College of Information Science and Engineering, Northeastern University, Shenyang, Liaoning Province 110819, China
| | - YongQing Fu
- Faculty of Engineering & Environment, Northumbria University, Newcastle upon Tyne NE1 8ST, UK
| |
Collapse
|
25
|
Wei S, Huang J, Niu Y, Tong H, Su L, Zhang X, Wu M, Yang Y. Monitoring the Concentrations of Na, Mg, Ca, Cu, Fe, and K in Sargassum fusiforme at Different Growth Stages by NIR Spectroscopy Coupled with Chemometrics. Foods 2025; 14:122. [PMID: 39796412 PMCID: PMC11719690 DOI: 10.3390/foods14010122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2024] [Revised: 12/27/2024] [Accepted: 12/31/2024] [Indexed: 01/13/2025] Open
Abstract
Sargassum fusiforme, an edible seaweed, plays a crucial role in our daily lives by providing essential nutrients, including minerals, to the human body. The detection of mineral content during different growth stages of S. fusiforme benefits the goals of ensuring product quality, meeting diverse consumer needs, and achieving quality classification. Currently, the determination of minerals in S. fusiforme primarily relies on inductively coupled plasma mass spectrometry and other methods, which are time-consuming and labor-intensive. Thus, a rapid and convenient method was developed for the determination of six minerals (i.e., Na, Mg, Ca, Cu, Fe, and K) in S. fusiforme via near-infrared (NIR) spectroscopy based on chemometrics. This study investigated the variations in minerals in S. fusiforme from different growth stages. The effects of four spectral pretreatment methods and three wavelength selection methods, including the synergy interval partial least squares (SI-PLS) algorithm, genetic algorithm (GA), and competitive adaptive reweighted sampling method (CARS) on the model optimization, were evaluated. Superior CARS-PLS models were established for Na, Mg, Ca, Cu, Fe, and K with root mean square error of prediction (RMSEP) values of 0.8196 × 103 mg kg-1, 0.4370 × 103 mg kg-1, 1.544 × 103 mg kg-1, 0.9745 mg kg-1, 49.88 mg kg-1, and 7.762 × 103 mg kg-1, respectively, and coefficient of determination of prediction (RP2) values of 0.9787, 0.9371, 0.9913, 0.9909, 0.9874, and 0.9265, respectively. S. fusiforme demonstrated higher levels of Mg and Ca at the seedling stage and lower levels of Cu and Fe at the maturation stage. Additionally, S. fusiforme exhibited higher Na and lower K at the growth stage. NIR combined with CARS-PLS is a potential alternative for monitoring the concentrations of minerals in S. fusiforme at different growth stages, aiding in the convenient evaluation and further grading of the quality of S. fusiforme.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Yue Yang
- Zhejiang Provincial Key Laboratory for Water Environment and Marine Biological Resources Protection, College of Life and Environmental Science, Wenzhou University, Wenzhou 325035, China; (S.W.); (J.H.); (Y.N.); (H.T.); (L.S.); (X.Z.); (M.W.)
| |
Collapse
|
26
|
Li J, Qian J, Chen J, Ruiz-Garcia L, Dong C, Chen Q, Liu Z, Xiao P, Zhao Z. Recent advances of machine learning in the geographical origin traceability of food and agro-products: A review. Compr Rev Food Sci Food Saf 2025; 24:e70082. [PMID: 39680486 DOI: 10.1111/1541-4337.70082] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2024] [Revised: 11/02/2024] [Accepted: 11/21/2024] [Indexed: 12/18/2024]
Abstract
The geographical origin traceability of food and agro-products has been attracted worldwide. Especially with the rise of machine learning (ML) technology, it provides cutting-edge solutions to erstwhile intractable issues to identify the origin of food and agro-products. By utilizing advanced algorithms, ML can extract feature information of food and agro-products that is closely related to origin and, more accurately, identify and trace their origins, which is of great significance to the entire food and agriculture industry. This paper provides a comprehensive overview of the state-of-the-art applications of ML in the geographical origin traceability of food and agro-products. First, commonly used ML methods are summarized. The paper then outlines the whole process of preparation for modeling, model training as well as model evaluation for building traceability models-based ML. Finally, recent applications of ML combined with different traceability techniques in the field of food and agro-products are revisited. Although ML has made many achievements in solving the geographical origin traceability problem of food and agro-products, it still has great development potential. For example, the application of ML is yet insufficient in the geographical origin traceability using DNA or computer vision techniques. The ability of ML to predict the geographical origin of food and agro-products can be further improved, for example, by increasing model interpretability, incorporating data fusion strategies, and others.
Collapse
Affiliation(s)
- Jiali Li
- State Key Laboratory of Efficient Utilization of Arid and Semi-Arid Arable Land in Northern China/Institute of Agricultural Resources and Regional Planning, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Jianping Qian
- State Key Laboratory of Efficient Utilization of Arid and Semi-Arid Arable Land in Northern China/Institute of Agricultural Resources and Regional Planning, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Jinyong Chen
- Zhengzhou Fruit Research Institute, Chinese Academy of Agricultural Sciences, Zhengzhou, China
| | - Luis Ruiz-Garcia
- Department of Agroforestry Engineering, Universidad Politécnica de Madrid, Madrid, Spain
| | - Chen Dong
- College of Mathematics and Computer Science, Zhejiang A&F University, Hangzhou, China
| | - Qian Chen
- State Key Laboratory of Efficient Utilization of Arid and Semi-Arid Arable Land in Northern China/Institute of Agricultural Resources and Regional Planning, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Zihan Liu
- School of Computer and Artificial Intelligence, Beijing Technology and Business University, Beijing, China
| | - Pengnan Xiao
- State Key Laboratory of Efficient Utilization of Arid and Semi-Arid Arable Land in Northern China/Institute of Agricultural Resources and Regional Planning, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Zhiyao Zhao
- School of Computer and Artificial Intelligence, Beijing Technology and Business University, Beijing, China
| |
Collapse
|
27
|
Felizzato G, Sabo M, Petrìk M, Romolo FS. Laser Desorption-Ion Mobility Spectrometry of Explosives for Forensic and Security Applications. Molecules 2025; 30:138. [PMID: 39795197 PMCID: PMC11722068 DOI: 10.3390/molecules30010138] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2024] [Revised: 12/10/2024] [Accepted: 12/24/2024] [Indexed: 01/13/2025] Open
Abstract
BACKGROUND The detection of explosives in crime scene investigations is critical for forensic science. This study explores the application of laser desorption (LD) ion mobility spectrometry (IMS) as a novel method for this purpose utilising a new IMS prototype developed by MaSaTECH. METHODS The LD sampling technique employs a laser diode module to vaporise explosive traces on surfaces, allowing immediate analysis by IMS without sample preparation. Chemometric approaches, including multivariate data analysis, were utilised for data processing and interpretation, including pre-processing of raw IMS plasmagrams and various pattern recognition techniques, such as linear discriminant analysis (LDA) and support vector machines (SVMs). RESULTS The IMS prototype was validated through experiments with pure explosives (TNT, RDX, PETN) and explosive products (SEMTEX 1A, C4) on different materials. The study found that the pre-processing method significantly impacts classification accuracy, with the PCA-LDA model demonstrating the best performance for real-world applications. CONCLUSIONS The LD-IMS prototype, coupled with effective chemometric techniques, presents a promising methodology for the detection of explosives in forensic investigations, enhancing the reliability of field applications.
Collapse
Affiliation(s)
- Giorgio Felizzato
- Department of Law, University of Bergamo, Via Moroni 255, 24127 Bergamo, Italy;
- Department of Drug Science and Technology, University of Turin, Via Giuria 9, 10125 Torino, Italy
| | - Martin Sabo
- MaSa Tech, s.r.o., Sadová 3018/10, 916 01 Stará Turá, Slovakia; (M.S.)
- Faculty of Informatics and Information Technologies, Slovak University of Technology in Bratislava, Ilkovičova 2, Bratislava 4, 842 16 Bratislava, Slovakia
| | - Matej Petrìk
- MaSa Tech, s.r.o., Sadová 3018/10, 916 01 Stará Turá, Slovakia; (M.S.)
- Faculty of Informatics and Information Technologies, Slovak University of Technology in Bratislava, Ilkovičova 2, Bratislava 4, 842 16 Bratislava, Slovakia
| | | |
Collapse
|
28
|
Wen Q, Wei W, Li Y, Chen D, Zhang J, Li Z, Guo DA. Combination ATR-FTIR with Multiple Classification Algorithms for Authentication of the Four Medicinal Plants from Curcuma L. in Rhizomes and Tuberous Roots. SENSORS (BASEL, SWITZERLAND) 2024; 25:50. [PMID: 39796841 PMCID: PMC11722871 DOI: 10.3390/s25010050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/12/2024] [Revised: 12/09/2024] [Accepted: 12/20/2024] [Indexed: 01/13/2025]
Abstract
Curcumae Longae Rhizoma (CLRh), Curcumae Radix (CRa), and Curcumae Rhizoma (CRh), derived from the different medicinal parts of the Curcuma species, are blood-activating analgesics commonly used for promoting blood circulation and relieving pain. Due to their certain similarities in chemical composition and pharmacological effects, these three herbs exhibit a high risk associated with mixing and indiscriminate use. The diverse methods used for distinguishing the medicinal origins are complex, time-consuming, and limited to intraspecific differentiation, which are not suitable for rapid and systematic identification. We developed a rapid analysis method for identification of affinis and different medicinal materials using attenuated total reflection-Fourier-transform infrared spectroscopy (ATR-FTIR) combined with machine learning algorithms. The original spectroscopic data were pretreated using derivatives, standard normal variate (SNV), multiplicative scatter correction (MSC), and smoothing (S) methods. Among them, 1D + MSC + 13S emerged as the best pretreatment method. Then, t-distributed stochastic neighbor embedding (t-SNE) was applied to visualize the results, and seven kinds of classification models were constructed. The results showed that support vector machine (SVM) modeling was superior to other models and the accuracy of validation and prediction was preferable, with a modeling time of 127.76 s. The established method could be employed to rapidly and effectively distinguish the different origins and parts of Curcuma species and thus provides a technique for rapid quality evaluation of affinis species.
Collapse
Affiliation(s)
- Qiuyi Wen
- School of Pharmacy, Guangdong Pharmaceutical University, Guangzhou 510006, China;
- Zhongshan Institute for Drug Discovery, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Zhongshan 528400, China; (W.W.); (Y.L.); (J.Z.); (Z.L.)
| | - Wenlong Wei
- Zhongshan Institute for Drug Discovery, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Zhongshan 528400, China; (W.W.); (Y.L.); (J.Z.); (Z.L.)
| | - Yun Li
- Zhongshan Institute for Drug Discovery, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Zhongshan 528400, China; (W.W.); (Y.L.); (J.Z.); (Z.L.)
| | - Dan Chen
- School of Chinese Materia Medica, Nanjing University of Chinese Medicine, Nanjing 210023, China;
| | - Jianqing Zhang
- Zhongshan Institute for Drug Discovery, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Zhongshan 528400, China; (W.W.); (Y.L.); (J.Z.); (Z.L.)
| | - Zhenwei Li
- Zhongshan Institute for Drug Discovery, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Zhongshan 528400, China; (W.W.); (Y.L.); (J.Z.); (Z.L.)
| | - De-an Guo
- School of Pharmacy, Guangdong Pharmaceutical University, Guangzhou 510006, China;
- Zhongshan Institute for Drug Discovery, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Zhongshan 528400, China; (W.W.); (Y.L.); (J.Z.); (Z.L.)
| |
Collapse
|
29
|
Fan C, Liu Y, Cui T, Qiao M, Yu Y, Xie W, Huang Y. Quantitative Prediction of Protein Content in Corn Kernel Based on Near-Infrared Spectroscopy. Foods 2024; 13:4173. [PMID: 39767115 PMCID: PMC11675611 DOI: 10.3390/foods13244173] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2024] [Revised: 12/19/2024] [Accepted: 12/20/2024] [Indexed: 01/11/2025] Open
Abstract
Rapid and accurate detection of protein content is essential for ensuring the quality of maize. Near-infrared spectroscopy (NIR) technology faces limitations due to surface effects and sample homogeneity issues when measuring the protein content of whole maize grains. Focusing on maize grain powder can significantly improve the quality of data and the accuracy of model predictions. This study aims to explore a rapid detection method for protein content in maize grain powder based on near-infrared spectroscopy. A method for determining protein content in maize grain powder was established using near-infrared (NIR) reflectance spectra in the 940-1660 nm range. Various preprocessing techniques, including Savitzky-Golay (S-G), multiplicative scatter correction (MSC), standard normal variate (SNV), and the first derivative (1D), were employed to preprocess the raw spectral data. Near-infrared spectral data from different varieties of maize grain powder were collected, and quantitative analysis of protein content was conducted using Partial Least Squares Regression (PLSR), Support Vector Machine (SVM), and Extreme Learning Machine (ELM) models. Feature wavelengths were selected to enhance model accuracy further using the Successive Projections Algorithm (SPA) and Uninformative Variable Elimination (UVE). Experimental results indicated that the PLSR model, preprocessed with 1D + MSC, yielded the best performance, achieving a root mean square error of prediction (RMSEP) of 0.3 g/kg, a correlation coefficient (Rp) of 0.93, and a residual predictive deviation (RPD) of 3. The associated methods and theoretical foundation provide a scientific basis for the quality control and processing of maize.
Collapse
Affiliation(s)
- Chenlong Fan
- College of Mechanical and Electronic Engineering, Nanjing Forestry University, Nanjing 210037, China; (C.F.); (Y.L.); (W.X.); (Y.H.)
| | - Ying Liu
- College of Mechanical and Electronic Engineering, Nanjing Forestry University, Nanjing 210037, China; (C.F.); (Y.L.); (W.X.); (Y.H.)
| | - Tao Cui
- College of Engineering, China Agricultural University, Beijing 100083, China;
| | - Mengmeng Qiao
- College of Mechanical and Electronic Engineering, Nanjing Forestry University, Nanjing 210037, China; (C.F.); (Y.L.); (W.X.); (Y.H.)
| | - Yang Yu
- Key Laboratory for Theory and Technology of Intelligent Agricultural Machinery and Equipment, Jiangsu University, Zhenjiang 212013, China;
| | - Weijun Xie
- College of Mechanical and Electronic Engineering, Nanjing Forestry University, Nanjing 210037, China; (C.F.); (Y.L.); (W.X.); (Y.H.)
| | - Yuping Huang
- College of Mechanical and Electronic Engineering, Nanjing Forestry University, Nanjing 210037, China; (C.F.); (Y.L.); (W.X.); (Y.H.)
| |
Collapse
|
30
|
Li ZK, Li HL, Gong XW, Wang HF, Hao GY. Prediction and mapping of leaf water content in Populus alba var. pyramidalis using hyperspectral imagery. PLANT METHODS 2024; 20:184. [PMID: 39695861 DOI: 10.1186/s13007-024-01312-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/05/2024] [Accepted: 12/03/2024] [Indexed: 12/20/2024]
Abstract
Leaf water content (LWC) encapsulates critical aspects of tree physiology and is considered a proxy for assessing tree drought stress and the risk of forest decline; however, its measurement relies on destructive sampling and is thus less efficient. Advancements in hyperspectral imaging technology present new prospects for noninvasively evaluating LWC and mapping drought severity across forested regions. In this study, leaf samples were obtained from Populus alba var. pyramidalis, a species widely employed for constructing farmland shelterbelts in water-limited regions of northern China but notably susceptible to drought. These samples were dehydrated to varying degrees to generate concurrent LWC measurements and hyperspectral images, enabling the development of narrow-band and multivariate spectral prediction models for LWC estimation. Two visible-spectrum narrow-band indices identified, the single-band index (R627) and the band subtraction index (R437 - R444), demonstrated a strong correlation with LWC. Despite certain influences of variable preprocessing and selection on multivariate model performance, most models exhibited robust predictive accuracy for LWC. The FDRL-UVE-PLSR combination emerged as the optimal multivariate model, with R2 values reaching 0.9925 and 0.9853 and RMSE values below 0.0124 and 0.0264 for the calibration and validation datasets, respectively. Using this optimal model, along with localized spectral smoothing, moisture distribution across leaf surfaces was visualized, revealing lower water retention at the leaf margins compared to central regions. These methodologies provide critical insights into subtle water-associated physiological processes at the leaf scale and facilitate high-frequency, large-scale assessments and monitoring of drought stress levels and the risk of drought-induced tree mortality and forest degradation in drylands.
Collapse
Affiliation(s)
- Zhao-Kui Li
- School of Computer Science, Shenyang Aerospace University, Shenyang, 110136, China
| | - Hong-Li Li
- School of Computer Science, Shenyang Aerospace University, Shenyang, 110136, China
| | - Xue-Wei Gong
- CAS Key Laboratory of Forest Ecology and Silviculture, Institute of Applied Ecology, Chinese Academy of Sciences, Shenyang, 110016, China.
| | - Heng-Fang Wang
- Key Laboratory of Oasis Ecology of Education Ministry, College of Ecology and Environment, Xinjiang University, Urumqi, 830017, China
| | - Guang-You Hao
- CAS Key Laboratory of Forest Ecology and Silviculture, Institute of Applied Ecology, Chinese Academy of Sciences, Shenyang, 110016, China.
| |
Collapse
|
31
|
Wang Q, Zou X, Chen Y, Zhu Z, Yan C, Shan P, Wang S, Fu Y. XGBoost algorithm assisted multi-component quantitative analysis with Raman spectroscopy. SPECTROCHIMICA ACTA. PART A, MOLECULAR AND BIOMOLECULAR SPECTROSCOPY 2024; 323:124917. [PMID: 39094267 DOI: 10.1016/j.saa.2024.124917] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/06/2024] [Revised: 07/22/2024] [Accepted: 07/29/2024] [Indexed: 08/04/2024]
Abstract
To improve prediction performance and reduce artifacts in Raman spectra, we developed an eXtreme Gradient Boosting (XGBoost) preprocessing method to preprocess the Raman spectra of glucose, glycerol and ethanol mixtures. To ensure the robustness and reliability of the XGBoost preprocessing method, an X-LR model (which combined XGBoost preprocessing and a linear regression (LR) model) and a X-MLP model (which combined XGBoost preprocessing and a multilayer perceptron (MLP) model) were developed. These two models were used to quantitatively analyze the concentrations of glucose, glycerol and ethanol in the Raman spectra of mixed solutions. The proportion map of hyperparameters was firstly used to narrow down the search range of hyperparameters in the X-LR and the X-MLP models. Then the correlation coefficients (R2), root mean square of calibration (RMSEC), and root mean square error of prediction (RMSEP) were used to evaluate the models' performance. Experimental results indicated that the XGBoost preprocessing method achieved higher accuracy and generalization capability, with better performance than those of other preprocessing methods for both LR and MLP models.
Collapse
Affiliation(s)
- Qiaoyun Wang
- College of Information Science and Engineering, Northeastern University, Shenyang, Liaoning Province 110819, China; Hebei Key Laboratory of Micro-Nano Precision Optical Sensing and Measurement Technology, Qinhuangdao 066004, China.
| | - Xin Zou
- College of Information Science and Engineering, Northeastern University, Shenyang, Liaoning Province 110819, China
| | - Yinji Chen
- College of Information Science and Engineering, Northeastern University, Shenyang, Liaoning Province 110819, China
| | - Ziheng Zhu
- College of Information Science and Engineering, Northeastern University, Shenyang, Liaoning Province 110819, China
| | - Chongyue Yan
- College of Information Science and Engineering, Northeastern University, Shenyang, Liaoning Province 110819, China
| | - Peng Shan
- College of Information Science and Engineering, Northeastern University, Shenyang, Liaoning Province 110819, China
| | - Shuyu Wang
- College of Information Science and Engineering, Northeastern University, Shenyang, Liaoning Province 110819, China
| | - Yongqing Fu
- Faculty of Engineering & Environment, Northumbria University, Newcastle upon Tyne NE1 8ST, UK
| |
Collapse
|
32
|
Zhang Z, Cheng H, Chen M, Zhang L, Cheng Y, Geng W, Guan J. Detection of Pear Quality Using Hyperspectral Imaging Technology and Machine Learning Analysis. Foods 2024; 13:3956. [PMID: 39683028 DOI: 10.3390/foods13233956] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2024] [Revised: 12/04/2024] [Accepted: 12/06/2024] [Indexed: 12/18/2024] Open
Abstract
The non-destructive detection of fruit quality is indispensable in the agricultural and food industries. This study aimed to explore the application of hyperspectral imaging (HSI) technology, combined with machine learning, for a quality assessment of pears, so as to provide an efficient technical method. Six varieties of pears were used for inspection, including 'Sucui No.1', 'Zaojinxiang', 'Huangguan', 'Akizuki', 'Yali', and 'Hongli No.1'. Spectral data within the 398~1004 nm wavelength range were analyzed to compare the predictive performance of the Least Squares Support Vector Machine (LS-SVM) models on various quality parameters, using different preprocessing methods and the selected feature wavelengths. The results indicated that the combination of Fast Detrend-Standard Normal Variate (FD-SNV) preprocessing and Competitive Adaptive Reweighted Sampling (CARS)-selected feature wavelengths yielded the best improvement in model predictive ability for forecasting key quality parameters such as firmness, soluble solids content (SSC), pH, color, and maturity degree. They could enhance the predictive capability and reduce computational complexity. Furthermore, in order to construct a quality prediction model, integrating hyperspectral data from six pear varieties resulted in an RPD (Ratio of Performance to Deviation) exceeding 2.0 for all the quality parameters, indicating that increasing the fruit sample size and variety number further strengthened the robustness of the model. The Backpropagation Neural Network (BPNN) model could accurately distinguish six distinct pear varieties, achieving prediction accuracies of above 99% for both the calibration and test sets. In summary, the combination of HSI and machine learning models enabled an efficient, rapid, and non-destructive detection of pear quality and provided a practical value for quality control and the commercial processing of pears.
Collapse
Affiliation(s)
- Zishen Zhang
- College of Horticulture, Xinjiang Agricultural University, Urumqi 830052, China
- Institute of Biotechnology and Food Science, Hebei Academy of Agricultural and Forestry Sciences, Shijiazhuang 050051, China
- Hebei Key Laboratory of Plant Genetic Engineering, Shijiazhuang 050051, China
| | - Hong Cheng
- Institute of Biotechnology and Food Science, Hebei Academy of Agricultural and Forestry Sciences, Shijiazhuang 050051, China
- Hebei Key Laboratory of Plant Genetic Engineering, Shijiazhuang 050051, China
| | - Meiyu Chen
- Institute of Biotechnology and Food Science, Hebei Academy of Agricultural and Forestry Sciences, Shijiazhuang 050051, China
- Hebei Key Laboratory of Plant Genetic Engineering, Shijiazhuang 050051, China
- College of Life Science and Food Engineering, Hebei University of Engineering, Handan 056000, China
| | - Lixin Zhang
- Institute of Biotechnology and Food Science, Hebei Academy of Agricultural and Forestry Sciences, Shijiazhuang 050051, China
- Hebei Key Laboratory of Plant Genetic Engineering, Shijiazhuang 050051, China
| | - Yudou Cheng
- Institute of Biotechnology and Food Science, Hebei Academy of Agricultural and Forestry Sciences, Shijiazhuang 050051, China
- Hebei Key Laboratory of Plant Genetic Engineering, Shijiazhuang 050051, China
| | - Wenjuan Geng
- College of Horticulture, Xinjiang Agricultural University, Urumqi 830052, China
| | - Junfeng Guan
- Institute of Biotechnology and Food Science, Hebei Academy of Agricultural and Forestry Sciences, Shijiazhuang 050051, China
- Hebei Key Laboratory of Plant Genetic Engineering, Shijiazhuang 050051, China
| |
Collapse
|
33
|
León-Ecay S, López-Campos Ó, López-Maestresalas A, Insausti K, Schmidt B, Prieto N. Using portable visible and near-infrared spectroscopy to authenticate beef from grass, barley, and corn-fed cattle. Food Res Int 2024; 198:115327. [PMID: 39643339 DOI: 10.1016/j.foodres.2024.115327] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2024] [Revised: 10/17/2024] [Accepted: 11/07/2024] [Indexed: 12/09/2024]
Abstract
Meat product labels including information on livestock production systems are increasingly demanded, as consumers request total traceability of the products. The aim of this study was to explore the potential of visible and near-infrared spectroscopy (Vis-NIRS) to authenticate meat and fat from steers raised under different feeding systems (barley, corn, grass-fed). In total, spectra from 45 steers were collected (380-2,500 nm) on the subcutaneous fat and intact longissimus thoracis (LT) at 72 h postmortem and, after fabrication, on the frozen-thawed ground longissimus lumborum (LL). In subcutaneous fat samples, excellent results were obtained using partial least squares-discriminant analysis (PLS-DA) with the 100 % of the samples in external Test correctly classified (Vis, NIR or Vis-NIR regions); whereas linear-support vector machine (L-SVM) discriminated 75-100 % in Test (Vis-NIR range). In intact meat samples, PLS-DA segregated 100 % of the samples in Test (Vis-NIR region). A slightly lower percentage of meat samples were correctly classified by L-SVM using the NIR region (75-100 % in Train and Test). For ground meat, 100 % of correctly classified samples in Test was achieved using Vis, NIR or Vis-NIR spectral regions with PLS-DA and the Vis with L-SVM. Variable importance in projection (VIP) reported the influence of fat and meat pigments as well as fat, fatty acids, protein, and moisture absorption for the discriminant analyses. From the results obtained with the animals and diets used in this study, NIRS technology stands out as a reliable and green analytical tool to authenticate fat and meat from different livestock production systems.
Collapse
Affiliation(s)
- Sara León-Ecay
- Agriculture and Agri-Food Canada, Lacombe Research and Development Centre, Lacombe, Alberta T4L 1W1, Canada; Institute on Innovation and Sustainable Development in Food Chain (IS-FOOD), Universidad Pública de Navarra (UPNA), Campus de Arrosadia, Pamplona 31006, Spain
| | - Óscar López-Campos
- Agriculture and Agri-Food Canada, Lacombe Research and Development Centre, Lacombe, Alberta T4L 1W1, Canada
| | - Ainara López-Maestresalas
- Institute on Innovation and Sustainable Development in Food Chain (IS-FOOD), Universidad Pública de Navarra (UPNA), Campus de Arrosadia, Pamplona 31006, Spain
| | - Kizkitza Insausti
- Institute on Innovation and Sustainable Development in Food Chain (IS-FOOD), Universidad Pública de Navarra (UPNA), Campus de Arrosadia, Pamplona 31006, Spain
| | - Bryden Schmidt
- Agriculture and Agri-Food Canada, Lacombe Research and Development Centre, Lacombe, Alberta T4L 1W1, Canada
| | - Nuria Prieto
- Agriculture and Agri-Food Canada, Lacombe Research and Development Centre, Lacombe, Alberta T4L 1W1, Canada.
| |
Collapse
|
34
|
Yuan T, Zhao Y, Zhang J, Li S, Hou Y, Yang Y, Wang Y. Characterization of volatile profiles and markers prediction of eleven popular edible boletes using SDE-GC-MS and FT-NIR combined with chemometric analysis. Food Res Int 2024; 196:115077. [PMID: 39614501 DOI: 10.1016/j.foodres.2024.115077] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2024] [Revised: 08/24/2024] [Accepted: 09/10/2024] [Indexed: 12/01/2024]
Abstract
Wild edible boletes mushrooms are regarded as a delicacy in many countries and regions due to their rich nutritional contents and strong aromatic compounds. This study aimed to identify 445 samples of 11 boletes species collected from Yunnan and Sichuan provinces through molecular analysis. Using simultaneous distillation-extraction (SDE) combined with gas chromatography-mass spectrometry (GC-MS), 97 volatile compounds were identified. Chemometric methods were then applied to analyze the heterogeneity of these volatile compounds among the different species. The results showed that, 22 and 21 volatile compounds were selected using variable importance in projection (VIP > 1) and relative odor activity values (ROAV > 0.1), respectively. Partial least squares discrimnatint analysis (PLS-DA) was then employed to develop pattern recognition models for 11 species, which demonstrated strong identification performance. Furthermore, correlation heat maps, volcano plots, and Fisher linear discriminant analysis identified five volatile organic compounds, including methyl (9E)-9-octadecenoate, 2, 6-dimethylpyrazine, 1-decen-3-one, furfural, and methional as markers for distinguishing 11 boletes species. Ultimately, the rapid content prediction models of partial least squares regression (PLSR) were established by combining Fourier Transform Near-Infrared Spectroscopy (FT-NIR) with the concentrations of these five marker compounds. These findings provide a methodological strategy for the effective species identification of wild edible mushrooms and the rapid prediction of their characteristic aroma compounds.
Collapse
Affiliation(s)
- Tianjun Yuan
- School of Science, Mae Fah Luang University, Chiang Rai 57100, Thailand; Medicinal Plants Research Institute, Yunnan Academy of Agricultural Sciences, Kunming 650200, China; Biotechnology and Germplasm Resources Institute, Yunnan Academy of Agricultural Sciences, Kunming 650205, China
| | - Yanli Zhao
- Medicinal Plants Research Institute, Yunnan Academy of Agricultural Sciences, Kunming 650200, China
| | - Ji Zhang
- Medicinal Plants Research Institute, Yunnan Academy of Agricultural Sciences, Kunming 650200, China
| | - Shuhong Li
- Biotechnology and Germplasm Resources Institute, Yunnan Academy of Agricultural Sciences, Kunming 650205, China
| | - Ying Hou
- Southwest Forestry University Materials and Chemical Engineering College, Kunming 650224, China
| | - Yan Yang
- Yunnan Forestry Technological College, Kunming 650224, China
| | - Yuanzhong Wang
- Medicinal Plants Research Institute, Yunnan Academy of Agricultural Sciences, Kunming 650200, China.
| |
Collapse
|
35
|
Jiang H, Wang Z, Deng J, Ding Z, Chen Q. Quantitative detection of heavy metal Cd in vegetable oils: A nondestructive method based on Raman spectroscopy combined with chemometrics. J Food Sci 2024; 89:8054-8065. [PMID: 39366770 DOI: 10.1111/1750-3841.17436] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2024] [Revised: 09/03/2024] [Accepted: 09/16/2024] [Indexed: 10/06/2024]
Abstract
Heavy metal contaminants in vegetable oils can cause irreversible damage to human health. In this study, the quantitative detection of Cd in vegetable oils was investigated based on Raman spectroscopy combined with chemometric methods. The necessary preprocessing of the Raman signal was performed using baseline calibration and the Savitzky-Golay method. Three variable optimization methods were applied to the preprocessed Raman spectra. Namely, bootstrap soft shrinkage, multiple feature spaces ensemble strategy with least absolute shrinkage and selection operator, and competitive adaptive reweighted sampling (CARS), respectively. Partial least squares regression (PLSR) modeling for the determination of Cd in vegetable oils. The results show that three variable optimization algorithms improved the predictive performance of the model. Among them, the CARS-PLSR model has strong generalization performance and robustness. Its prediction coefficient of determination (R P 2 $R_{\mathrm{P}}^2$ ) was 0.9995, the root mean square error of prediction was 0.3533 mg/kg, and the relative prediction deviation was 44.3748, respectively. In summary, rapid quantitative analysis of Cd contamination in vegetable oils can be realized based on Raman spectroscopy combined with chemometrics.
Collapse
Affiliation(s)
- Hui Jiang
- School of Electrical and Information Engineering, Jiangsu University, Zhenjiang, P. R. China
| | - Ziyu Wang
- School of Electrical and Information Engineering, Jiangsu University, Zhenjiang, P. R. China
| | - Jihong Deng
- School of Electrical and Information Engineering, Jiangsu University, Zhenjiang, P. R. China
| | - Zhidong Ding
- Product Quality Supervision and Inspection Center of Zhenjiang City, Zhenjiang, P. R. China
| | - Quansheng Chen
- College of Ocean Food and Biological Engineering, Jimei University, Xiamen, P. R. China
| |
Collapse
|
36
|
Kim SS, Yun DY, Lee G, Park SK, Lim JH, Choi JH, Park KJ, Cho JS. Prediction and Visualization of Total Volatile Basic Nitrogen in Yellow Croaker ( Larimichthys polyactis) Using Shortwave Infrared Hyperspectral Imaging. Foods 2024; 13:3228. [PMID: 39456290 PMCID: PMC11507500 DOI: 10.3390/foods13203228] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2024] [Revised: 10/04/2024] [Accepted: 10/08/2024] [Indexed: 10/28/2024] Open
Abstract
In the present investigation, we have devised a hyperspectral imaging (HSI) apparatus to assess the chemical characteristics and freshness of the yellow croaker (Larimichthys polyactis) throughout its storage period. This system operates within the shortwave infrared spectrum, specifically ranging from 900 to 1700 nm. A variety of spectral pre-processing techniques, including standard normal variate (SNV), multiple scatter correction, and Savitzky-Golay (SG) derivatives, were employed to augment the predictive accuracy of total volatile basic nitrogen (TVB-N)-which serves as a critical freshness parameter. Among the assessed methodologies, SG-1 pre-processing demonstrated superior predictive accuracy (Rp2 = 0.8166). Furthermore, this investigation visualized freshness indicators as concentration images to elucidate the spatial distribution of TVB-N across the samples. These results indicate that HSI, in conjunction with chemometric analysis, constitutes an efficacious instrument for the surveillance of quality and safety in yellow croakers during its storage phase. Moreover, this methodology guarantees the freshness and safety of seafood products within the aquatic food sector.
Collapse
Affiliation(s)
- Sang Seop Kim
- Food Safety and Distribution Research Group, Korea Food Research Institute, Wanju 55365, Republic of Korea; (S.S.K.)
| | - Dae-Yong Yun
- Food Safety and Distribution Research Group, Korea Food Research Institute, Wanju 55365, Republic of Korea; (S.S.K.)
| | - Gyuseok Lee
- Smart Food Manufacturing Project Group, Korea Food Research Institute, Wanju 55365, Republic of Korea
| | - Seul-Ki Park
- Smart Food Manufacturing Project Group, Korea Food Research Institute, Wanju 55365, Republic of Korea
| | - Jeong-Ho Lim
- Food Safety and Distribution Research Group, Korea Food Research Institute, Wanju 55365, Republic of Korea; (S.S.K.)
- Smart Food Manufacturing Project Group, Korea Food Research Institute, Wanju 55365, Republic of Korea
| | - Jeong-Hee Choi
- Food Safety and Distribution Research Group, Korea Food Research Institute, Wanju 55365, Republic of Korea; (S.S.K.)
- Smart Food Manufacturing Project Group, Korea Food Research Institute, Wanju 55365, Republic of Korea
| | - Kee-Jai Park
- Food Safety and Distribution Research Group, Korea Food Research Institute, Wanju 55365, Republic of Korea; (S.S.K.)
- Smart Food Manufacturing Project Group, Korea Food Research Institute, Wanju 55365, Republic of Korea
| | - Jeong-Seok Cho
- Food Safety and Distribution Research Group, Korea Food Research Institute, Wanju 55365, Republic of Korea; (S.S.K.)
- Smart Food Manufacturing Project Group, Korea Food Research Institute, Wanju 55365, Republic of Korea
| |
Collapse
|
37
|
Li C, Fan Y, Wang D, Chu C, Shen X, Wang H, Luo X, Nan L, Ren X, Chen S, Yan Q, Ni J, Li J, Ma Y, Zhang S. The Genetic Characteristics of FT-MIRS-Predicted Milk Fatty Acids in Chinese Holstein Cows. Animals (Basel) 2024; 14:2901. [PMID: 39409850 PMCID: PMC11476120 DOI: 10.3390/ani14192901] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2024] [Revised: 10/03/2024] [Accepted: 10/04/2024] [Indexed: 10/20/2024] Open
Abstract
Fourier Transform Mid-Infrared Spectroscopy (FT-MIRS) can be used for quantitative detection of milk components. Here, milk samples of 458 Chinese Holstein cows from 11 provinces in China were collected and we established a total of 22 quantitative prediction models in milk fatty acids by FT-MIRS. The coefficient of determination of the validation set ranged from 0.59 (C18:0) to 0.76 (C4:0). The models were adopted to predict the milk fatty acids from 2138 cows and a new high-throughput computing software HiBLUP was employed to construct a multi-trait model to estimate and analyze genetic parameters in dairy cows. Finally, genome-wide association analysis was performed and seven novel SNPs significantly associated with fatty acid content were selected, investigated, and verified with the FarmCPU method, which stands for "Fixed and random model Circulating Probability Unification". The findings of this study lay a foundation and offer technical support for the study of fatty acid trait breeding and the screening and grouping of characteristic dairy cows in China with rich, high-quality fatty acids. It is hoped that in the future, the method established in this study will be able to screen milk sources rich in high-quality fatty acids.
Collapse
Affiliation(s)
- Chunfang Li
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, Huazhong Agricultural University, Wuhan 430070, China; (C.L.); (Y.F.); (D.W.); (C.C.); (X.S.); (H.W.); (X.L.); (L.N.); (X.R.)
- Frontiers Science Center for Animal Breeding and Sustainable Production, Huazhong Agricultural University, Wuhan 430070, China
- Hebei Livestock Breeding Station, Shijiazhuang 050060, China; (J.N.); (J.L.)
| | - Yikai Fan
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, Huazhong Agricultural University, Wuhan 430070, China; (C.L.); (Y.F.); (D.W.); (C.C.); (X.S.); (H.W.); (X.L.); (L.N.); (X.R.)
- Frontiers Science Center for Animal Breeding and Sustainable Production, Huazhong Agricultural University, Wuhan 430070, China
| | - Dongwei Wang
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, Huazhong Agricultural University, Wuhan 430070, China; (C.L.); (Y.F.); (D.W.); (C.C.); (X.S.); (H.W.); (X.L.); (L.N.); (X.R.)
- Frontiers Science Center for Animal Breeding and Sustainable Production, Huazhong Agricultural University, Wuhan 430070, China
| | - Chu Chu
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, Huazhong Agricultural University, Wuhan 430070, China; (C.L.); (Y.F.); (D.W.); (C.C.); (X.S.); (H.W.); (X.L.); (L.N.); (X.R.)
- Frontiers Science Center for Animal Breeding and Sustainable Production, Huazhong Agricultural University, Wuhan 430070, China
| | - Xiong Shen
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, Huazhong Agricultural University, Wuhan 430070, China; (C.L.); (Y.F.); (D.W.); (C.C.); (X.S.); (H.W.); (X.L.); (L.N.); (X.R.)
- Frontiers Science Center for Animal Breeding and Sustainable Production, Huazhong Agricultural University, Wuhan 430070, China
| | - Haitong Wang
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, Huazhong Agricultural University, Wuhan 430070, China; (C.L.); (Y.F.); (D.W.); (C.C.); (X.S.); (H.W.); (X.L.); (L.N.); (X.R.)
- Frontiers Science Center for Animal Breeding and Sustainable Production, Huazhong Agricultural University, Wuhan 430070, China
| | - Xuelu Luo
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, Huazhong Agricultural University, Wuhan 430070, China; (C.L.); (Y.F.); (D.W.); (C.C.); (X.S.); (H.W.); (X.L.); (L.N.); (X.R.)
- Frontiers Science Center for Animal Breeding and Sustainable Production, Huazhong Agricultural University, Wuhan 430070, China
| | - Liangkang Nan
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, Huazhong Agricultural University, Wuhan 430070, China; (C.L.); (Y.F.); (D.W.); (C.C.); (X.S.); (H.W.); (X.L.); (L.N.); (X.R.)
- Frontiers Science Center for Animal Breeding and Sustainable Production, Huazhong Agricultural University, Wuhan 430070, China
| | - Xiaoli Ren
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, Huazhong Agricultural University, Wuhan 430070, China; (C.L.); (Y.F.); (D.W.); (C.C.); (X.S.); (H.W.); (X.L.); (L.N.); (X.R.)
- Frontiers Science Center for Animal Breeding and Sustainable Production, Huazhong Agricultural University, Wuhan 430070, China
| | - Shaohu Chen
- Dairy Association of China, Beijing 100192, China; (S.C.); (Q.Y.)
| | - Qingxia Yan
- Dairy Association of China, Beijing 100192, China; (S.C.); (Q.Y.)
| | - Junqing Ni
- Hebei Livestock Breeding Station, Shijiazhuang 050060, China; (J.N.); (J.L.)
| | - Jianming Li
- Hebei Livestock Breeding Station, Shijiazhuang 050060, China; (J.N.); (J.L.)
| | - Yabin Ma
- Hebei Livestock Breeding Station, Shijiazhuang 050060, China; (J.N.); (J.L.)
| | - Shujun Zhang
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, Huazhong Agricultural University, Wuhan 430070, China; (C.L.); (Y.F.); (D.W.); (C.C.); (X.S.); (H.W.); (X.L.); (L.N.); (X.R.)
- Frontiers Science Center for Animal Breeding and Sustainable Production, Huazhong Agricultural University, Wuhan 430070, China
| |
Collapse
|
38
|
Chen L, Hu J, Wang H, He Y, Deng Q, Wu F. Predicting Cd(II) adsorption capacity of biochar materials using typical machine learning models for effective remediation of aquatic environments. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 944:173955. [PMID: 38879031 DOI: 10.1016/j.scitotenv.2024.173955] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/06/2024] [Revised: 05/12/2024] [Accepted: 06/10/2024] [Indexed: 06/18/2024]
Abstract
The screening and design of "green" biochar materials with high adsorption capacity play a pivotal role in promoting the sustainable treatment of Cd(II)-containing wastewater. In this study, six typical machine learning (ML) models, namely Linear Regression, Random Forest, Gradient Boosting Decision Tree, CatBoost, K-Nearest Neighbors, and Backpropagation Neural Network, were employed to accurately predict the adsorption capacity of Cd(II) onto biochars. A large dataset with 1051 data points was generated using 21 input variables obtained from batch adsorption experiments, including preparation conditions for biochar (2 features), physical properties of biochar (4 features), chemical composition of biochar (9 features), and adsorption experiment conditions (6 features). The rigorous evaluation and comparison of the ML models revealed that the CatBoost model exhibited the highest test R2 value (0.971) and the lowest RMSE (20.54 mg/g), significantly outperforming all other models. The feature importance analysis using Shapley Additive Explanations (SHAP) indicated that biochar chemical compositions had the greatest impact on model predictions of adsorption capacity (42.2 %), followed by adsorption conditions (37.57 %), biochar physical characteristics (12.38 %), and preparation conditions (7.85 %). The optimal experimental conditions optimized by partial dependence plots (PDP) are as follows: as high Cd(II) concentration as possible, C(%) of 33 %, N(%) of 0.3 %, adsorption time of 600 min, pyrolysis time of 50 min, biochar dosage of less than 2 g/L, O(%) of 42 %, biochar pH value of 11.2, and DBE of 1.15. This study unveils novel insights into the adsorption of Cd(II) and provides a comprehensive reference for the sustainable engineering of biochars in Cd(II) wastewater treatment.
Collapse
Affiliation(s)
- Long Chen
- School of Chemistry and Materials Science, Hunan Engineering Research Center for Biochar, Hunan Agricultural University, Changsha, Hunan 410128, China; State Key Laboratory of Pollution Control and Resource Reuse, College of Environmental Science and Engineering, Tongji University, 1239 Siping Road, Shanghai 200092, China
| | - Jian Hu
- School of Chemistry and Materials Science, Hunan Engineering Research Center for Biochar, Hunan Agricultural University, Changsha, Hunan 410128, China
| | - Hong Wang
- State Key Laboratory of Pollution Control and Resource Reuse, College of Environmental Science and Engineering, Tongji University, 1239 Siping Road, Shanghai 200092, China
| | - Yanying He
- School of Chemistry and Materials Science, Hunan Engineering Research Center for Biochar, Hunan Agricultural University, Changsha, Hunan 410128, China
| | - Qianyi Deng
- School of Chemistry and Materials Science, Hunan Engineering Research Center for Biochar, Hunan Agricultural University, Changsha, Hunan 410128, China
| | - Fangfang Wu
- School of Chemistry and Materials Science, Hunan Engineering Research Center for Biochar, Hunan Agricultural University, Changsha, Hunan 410128, China.
| |
Collapse
|
39
|
Diaz-Olivares JA, Bendoula R, Saeys W, Ryckewaert M, Adriaens I, Fu X, Pastell M, Roger JM, Aernouts B. PROSAC as a selection tool for SO-PLS regression: A strategy for multi-block data fusion. Anal Chim Acta 2024; 1319:342965. [PMID: 39122277 DOI: 10.1016/j.aca.2024.342965] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2024] [Revised: 06/08/2024] [Accepted: 07/09/2024] [Indexed: 08/12/2024]
Abstract
BACKGROUND Spectral data from multiple sources can be integrated into multi-block fusion chemometric models, such as sequentially orthogonalized partial-least squares (SO-PLS), to improve the prediction of sample quality features. Pre-processing techniques are often applied to mitigate extraneous variability, unrelated to the response variables. However, the selection of suitable pre-processing methods and identification of informative data blocks becomes increasingly complex and time-consuming when dealing with a large number of blocks. The problem addressed in this work is the efficient pre-processing, selection, and ordering of data blocks for targeted applications in SO-PLS. RESULTS We introduce the PROSAC-SO-PLS methodology, which employs pre-processing ensembles with response-oriented sequential alternation calibration (PROSAC). This approach identifies the best pre-processed data blocks and their sequential order for specific SO-PLS applications. The method uses a stepwise forward selection strategy, facilitated by the rapid Gram-Schmidt process, to prioritize blocks based on their effectiveness in minimizing prediction error, as indicated by the lowest prediction residuals. To validate the efficacy of our approach, we showcase the outcomes of three empirical near-infrared (NIR) datasets. Comparative analyses were performed against partial-least-squares (PLS) regressions on single-block pre-processed datasets and a methodology relying solely on PROSAC. The PROSAC-SO-PLS approach consistently outperformed these methods, yielding significantly lower prediction errors. This has been evidenced by a reduction in the root-mean-squared error of prediction (RMSEP) ranging from 5 to 25 % across seven out of the eight response variables analyzed. SIGNIFICANCE The PROSAC-SO-PLS methodology offers a versatile and efficient technique for ensemble pre-processing in NIR data modeling. It enables the use of SO-PLS minimizing concerns about pre-processing sequence or block order and effectively manages a large number of data blocks. This innovation significantly streamlines the data pre-processing and model-building processes, enhancing the accuracy and efficiency of chemometric models.
Collapse
Affiliation(s)
- Jose A Diaz-Olivares
- KU Leuven, Department of Biosystems, Division of Animal and Human Health Engineering, Campus Geel, Kleinhoefstraat 4, 2440, Geel, Belgium.
| | - Ryad Bendoula
- ITAP, Univ. Montpellier, INRAE, Institute Agro, Montpellier, France
| | - Wouter Saeys
- KU Leuven, Department of Biosystems, MeBioS unit, Kasteelpark Arenberg 30, 3001, Leuven, Belgium
| | | | - Ines Adriaens
- KU Leuven, Department of Biosystems, Division of Animal and Human Health Engineering, Campus Geel, Kleinhoefstraat 4, 2440, Geel, Belgium; Department of Data Analysis and Mathematical Modelling, Division BioVism, Campus Coupure, Coupure Links 653, 9000, Ghent, Belgium
| | - Xinyue Fu
- KU Leuven, Department of Biosystems, Division of Animal and Human Health Engineering, Campus Geel, Kleinhoefstraat 4, 2440, Geel, Belgium
| | - Matti Pastell
- Production Systems, Natural Resources Institute Finland (Luke), Latokartanonkaari 9, 00790, Helsinki, Finland
| | - Jean-Michel Roger
- ITAP, Univ. Montpellier, INRAE, Institute Agro, Montpellier, France; ChemHouse Research Group, Montpellier, France
| | - Ben Aernouts
- KU Leuven, Department of Biosystems, Division of Animal and Human Health Engineering, Campus Geel, Kleinhoefstraat 4, 2440, Geel, Belgium.
| |
Collapse
|
40
|
Wang K. Optimized ensemble deep learning for predictive analysis of student achievement. PLoS One 2024; 19:e0309141. [PMID: 39186491 PMCID: PMC11346664 DOI: 10.1371/journal.pone.0309141] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2024] [Accepted: 08/07/2024] [Indexed: 08/28/2024] Open
Abstract
Education is essential for individuals to lead fulfilling lives and attain greatness by enhancing their value. It improves self-assurance and enables individuals to navigate the complexities of modern society effectively. Despite the obstacles it faces, education continues to develop. The objective of numerous pedagogical approaches is to enhance academic performance. The development of technology, especially artificial intelligence, has caused a significant change in learning. This has made instructional materials available anytime and wherever easily accessible. Higher education institutions are adding technology to conventional teaching strategies to improve learning. This work presents an innovative approach to student performance prediction in educational settings. The strategy combines the DistilBERT with LSTM (DBTM) hybrid approach with the Spotted Hyena Optimizer (SHO) to change parameters. Regarding accuracy, log loss, and execution time, the model significantly improved over earlier models. The challenges presented by the increasing volume of data in graduate and postgraduate programs are effectively addressed by the proposed method. It produces exceptional performance metrics, including a 15-25% decrease in processing time through optimization, 98.7% accuracy, and 0.03% log loss. This work additionally demonstrates the effectiveness of DBTM-SHO in administering extensive datasets and makes an important improvement to educational data mining. It provides a robust foundation for organizations facing the challenges of evaluating student achievement in the era of vast data.
Collapse
Affiliation(s)
- Kaitong Wang
- Student Affairs Department, Institute of Science and Technology, Luoyang, Henan Province, China
- Department of Education, Keimyung University, Daegu, Korea
| |
Collapse
|
41
|
Huang H, Fang Z, Xu Y, Lu G, Feng C, Zeng M, Tian J, Ping Y, Han Z, Zhao Z. Stacking and ridge regression-based spectral ensemble preprocessing method and its application in near-infrared spectral analysis. Talanta 2024; 276:126242. [PMID: 38761656 DOI: 10.1016/j.talanta.2024.126242] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2024] [Revised: 05/08/2024] [Accepted: 05/09/2024] [Indexed: 05/20/2024]
Abstract
Spectral preprocessing techniques can, to a certain extent, eliminate irrelevant information, such as current noise and stray light from spectral data, thereby enhancing the performance of prediction models. However, current preprocessing techniques mostly attempt to find the best single preprocessing method or their combination, overlooking the complementary information among different preprocessing methods. These preprocessing techniques fail to maximize the utilization of useful information in spectral data and restrict the performance of prediction models. This study proposed a spectral ensemble preprocessing method based on the rapidly developing ensemble learning methods in recent years and the ridge regression (RR) model, named stacking preprocessing ridge regression (SPRR), to address the aforementioned issues. Different from conventional ensemble learning methods, the proposed SPRR method applied multiple different preprocessing techniques to the original spectral data, generating multiple preprocessed datasets. These datasets were then individually inputted into RR base models for training. Ultimately, RR still served as the meta-model, integrating the output results of each RR base model through stacking. This approach not only produced diversity in base models but also achieved higher accuracy and lower computational complexity by using a single type of base model. On the apple spectral dataset collected by our team, correlation analysis showed significant complementary information among the data produced by different preprocessing techniques. This provided robust theoretical support for the proposed SPRR method. By introducing the currently popular averaging ensemble preprocessing method in a comparative experiment, the results of applying the proposed SPRR method to six datasets (apple, meat, wheat, olive oil, tablet, and corn) demonstrated that compared to the single preprocessing method and averaging ensemble preprocessing method, SPRR yielded the best accuracy and reliability for all six datasets. Furthermore, under the same conditions of the training and test datasets, the proposed SPRR method demonstrated better performance than the four commonly used ensemble preprocessing methods.
Collapse
Affiliation(s)
- Haowen Huang
- College of New Materials and New Energies, Shenzhen Technology University, Shenzhen, 518118, PR China
| | - Zile Fang
- College of New Materials and New Energies, Shenzhen Technology University, Shenzhen, 518118, PR China
| | - Yuelong Xu
- College of New Materials and New Energies, Shenzhen Technology University, Shenzhen, 518118, PR China
| | - Guosheng Lu
- College of New Materials and New Energies, Shenzhen Technology University, Shenzhen, 518118, PR China
| | - Can Feng
- College of New Materials and New Energies, Shenzhen Technology University, Shenzhen, 518118, PR China
| | - Min Zeng
- College of New Materials and New Energies, Shenzhen Technology University, Shenzhen, 518118, PR China
| | - Jiaju Tian
- College of New Materials and New Energies, Shenzhen Technology University, Shenzhen, 518118, PR China
| | - Yongfu Ping
- College of New Materials and New Energies, Shenzhen Technology University, Shenzhen, 518118, PR China
| | - Zhuolin Han
- College of New Materials and New Energies, Shenzhen Technology University, Shenzhen, 518118, PR China
| | - Zhigang Zhao
- College of New Materials and New Energies, Shenzhen Technology University, Shenzhen, 518118, PR China.
| |
Collapse
|
42
|
Luo B, Sun H, Zhang L, Chen F, Wu K. Advances in the tea plants phenotyping using hyperspectral imaging technology. FRONTIERS IN PLANT SCIENCE 2024; 15:1442225. [PMID: 39148615 PMCID: PMC11324491 DOI: 10.3389/fpls.2024.1442225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/01/2024] [Accepted: 07/15/2024] [Indexed: 08/17/2024]
Abstract
Rapid detection of plant phenotypic traits is crucial for plant breeding and cultivation. Traditional measurement methods are carried out by rich-experienced agronomists, which are time-consuming and labor-intensive. However, with the increasing demand for rapid and high-throughput testing in tea plants traits, digital breeding and smart cultivation of tea plants rely heavily on precise plant phenotypic trait measurement techniques, among which hyperspectral imaging (HSI) technology stands out for its ability to provide real-time and rich-information. In this paper, we provide a comprehensive overview of the principles of hyperspectral imaging technology, the processing methods of cubic data, and relevant algorithms in tea plant phenomics, reviewing the progress of applying hyperspectral imaging technology to obtain information on tea plant phenotypes, growth conditions, and quality indicators under environmental stress. Lastly, we discuss the challenges faced by HSI technology in the detection of tea plant phenotypic traits from different perspectives, propose possible solutions, and envision the potential development prospects of HSI technology in the digital breeding and smart cultivation of tea plants. This review aims to provide theoretical and technical support for the application of HSI technology in detecting tea plant phenotypic information, further promoting the trend of developing high quality and high yield tea leaves.
Collapse
Affiliation(s)
- Baidong Luo
- College of Artificial Intelligence, Hangzhou Dianzi University, Hangzhou, China
| | - Hongwei Sun
- College of Artificial Intelligence, Hangzhou Dianzi University, Hangzhou, China
| | - Leilei Zhang
- Key Laboratory of Specialty Agri-Products Quality and Hazard Controlling Technology of Zhejiang Province, College of Life Sciences, China Jiliang University, Hangzhou, China
| | - Fengnong Chen
- College of Artificial Intelligence, Hangzhou Dianzi University, Hangzhou, China
| | - Kaihua Wu
- College of Artificial Intelligence, Hangzhou Dianzi University, Hangzhou, China
| |
Collapse
|
43
|
Zhu Y, Chen L, Chen X, Chen J, Zhang H. Near-infrared spectroscopy identification method of cashmere and wool fibers based on an optimized wavelength selection algorithm. Heliyon 2024; 10:e34537. [PMID: 39149029 PMCID: PMC11324818 DOI: 10.1016/j.heliyon.2024.e34537] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2024] [Accepted: 07/11/2024] [Indexed: 08/17/2024] Open
Abstract
Cashmere and wool fibers have similar chemical compositions, making them difficult to distinguish based on their absorption peaks and band positions in near-infrared spectroscopy. Existing studies commonly use wavelength selection or feature extraction algorithms to obtain significant spectral features, but traditional algorithms often overlook the correlations between wavelengths, resulting in weak adaptability and local optimum issues. To address this problem, this paper proposes a recognition algorithm based on optimal wavelength selection, which can remove redundant information and make the model effective in capturing patterns and key features of the data. The wavelengths are rearranged by computing the information gain ratio for each wavelength. Then, the sorted wavelengths are grouped based on equal density, which ensures that all wavelengths within each group have equal information and avoids over-focusing on individual groups. Meanwhile, the group genetic algorithm is used to find the wavelengths with highly informative and search optimal grouped combinations, in order to explore the entire spectrum wavelength. Finally, combined with a partial least squares discriminant analysis(PLS-DA) model, the recognition accuracy reached 97.3 %. The results indicate that, compared to traditional methods such as CARS, SPA, and GA, our method effectively reduces redundant information, selects fewer but more informative wavelengths, and improves classification accuracy and model adaptability.
Collapse
Affiliation(s)
- Yaolin Zhu
- School of Electronics and Information, Xi'an Polytechnic University, Xi'an, 710000, China
| | - Long Chen
- School of Electronics and Information, Xi'an Polytechnic University, Xi'an, 710000, China
| | - Xin Chen
- School of Electronics and Information, Xi'an Polytechnic University, Xi'an, 710000, China
| | - Jinni Chen
- School of Electronics and Information, Xi'an Polytechnic University, Xi'an, 710000, China
| | - Hongsong Zhang
- Shanghai Ranzi Industrial Co., Ltd, Shanghai, 201800, China
| |
Collapse
|
44
|
Ozen B, Cavdaroglu C, Tokatli F. Trends in authentication of edible oils using vibrational spectroscopic techniques. ANALYTICAL METHODS : ADVANCING METHODS AND APPLICATIONS 2024; 16:4216-4233. [PMID: 38899503 DOI: 10.1039/d4ay00562g] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/21/2024]
Abstract
The authentication of edible oils has become increasingly important for ensuring product quality, safety, and compliance with regulatory standards. Some prevalent authenticity issues found in edible oils include blending expensive oils with cheaper substitutes or lower-grade oils, incorrect labeling regarding the oil's source or type, and falsely stating the oil's origin. Vibrational spectroscopy techniques, such as infrared (IR) and Raman spectroscopy, have emerged as effective tools for rapidly and non-destructively analyzing edible oils. This review paper offers a comprehensive overview of recent advancements in using vibrational spectroscopy for authenticating edible oils. The fundamental principles underlying vibrational spectroscopy are introduced and chemometric approaches that enhance the accuracy and reliability of edible oil authentication are summarized. Recent research trends highlighted in the review include authenticating newly introduced oils, identifying oils based on their specific origins, adopting handheld/portable spectrometers and hyperspectral imaging, and integrating modern data handling techniques into the use of vibrational spectroscopic techniques for edible oil authentication. Overall, this review provides insights into the current state-of-the-art techniques and prospects for utilizing vibrational spectroscopy in the authentication of edible oils, thereby facilitating quality control and consumer protection in the food industry.
Collapse
Affiliation(s)
- Banu Ozen
- Izmir Institute of Technology, Department of Food Engineering, Urla, Izmir, Turkiye.
| | - Cagri Cavdaroglu
- Izmir Institute of Technology, Department of Food Engineering, Urla, Izmir, Turkiye.
| | - Figen Tokatli
- Izmir Institute of Technology, Department of Food Engineering, Urla, Izmir, Turkiye.
| |
Collapse
|
45
|
Yenurkar G, Mal S, Nyangaresi VO, Kamble S, Damahe L, Bankar N. Revolutionizing Chronic Heart Disease Management: The Role of IoT-Based Ambulatory Blood Pressure Monitoring System. Diagnostics (Basel) 2024; 14:1297. [PMID: 38928712 PMCID: PMC11203318 DOI: 10.3390/diagnostics14121297] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2024] [Revised: 06/11/2024] [Accepted: 06/13/2024] [Indexed: 06/28/2024] Open
Abstract
Chronic heart disease (CHD) is a widespread and persistent health challenge that demands immediate attention. Early detection and accurate diagnosis are essential for effective treatment and management of this condition. To overcome this difficulty, we created a state-of-the-art IoT-Based Ambulatory Blood Pressure Monitoring System that provides real-time blood pressure readings, systolic, diastolic, and pulse rates at predefined intervals. This unique technology comes with a module that forecasts CHD's early warning score. Various machine learning algorithms employed comprise Naïve Bayes, K-Nearest Neighbors (K-NN), random forest, decision tree, and Support Vector Machine (SVM). Using Naïve Bayes, the proposed model has achieved an impressive 99.44% accuracy in predicting blood pressure, a vital aspect of real-time intensive care for CHD. This IoT-based ambulatory blood pressure monitoring (IABPM) system will provide some advancement in the field of healthcare. The system overcomes the limitations of earlier BP monitoring devices, significantly reduces healthcare costs, and efficiently detects irregularities in chronic heart diseases. By implementing this system, we can take a significant step forward in improving patient outcomes and reducing the global burden of CHD. The system's advanced features provide an accurate and reliable diagnosis that is essential for treating and managing CHD. Overall, this IoT-based ambulatory blood pressure monitoring system is an important tool for the early identification and treatment of CHD in the field of healthcare.
Collapse
Affiliation(s)
- Ganesh Yenurkar
- Department of Computer Technology, Yeshwantrao Chavan College of Engineering, Wanadongri, Nagpur 441110, Maharashtra, India
| | - Sandip Mal
- School of Computing Science and Engineering, VIT Bhopal University, Bhopal 466114, Madhya Pradesh, India
| | - Vincent O. Nyangaresi
- Department of Computer Science and Engineering, Jaramogi Oginga Odinga University of Science & Technology, Bondo 40601, Kenya
- Department of Applied Electronics, Saveetha School of Engineering, SIMATS, Chennai 602105, Tamilnadu, India
| | - Shailesh Kamble
- Department of Artificial Intelligence and Data Science, Indira Gandhi Delhi Technical University for Women, New Delhi 110006, Delhi, India
| | - Lalit Damahe
- Department of Computer Science and Engineering, Yeshwantrao Chavan College of Engineering, Wanadongri, Nagpur 441110, Maharashtra, India
| | - Nandkishor Bankar
- Department of Microbiology, Jawaharlal Nehru Medical College Sawangi Meghe, Wardha 442005, Maharashtra, India
| |
Collapse
|
46
|
Kiani H, Beheshti B, Borghei AM, Rahmati MH. Determination of heavy metals in edible oils by a novel voltammetry taste sensor array. JOURNAL OF FOOD SCIENCE AND TECHNOLOGY 2024; 61:1126-1137. [PMID: 38562596 PMCID: PMC10981641 DOI: 10.1007/s13197-024-05933-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Revised: 12/17/2023] [Accepted: 01/09/2024] [Indexed: 04/04/2024]
Abstract
Herein, a novel voltammetry taste sensor array (VTSA) using pencil graphite electrode, screen-printed electrode, and glassy carbon electrode was used to identify heavy metals (HM) including Cad, Pb, Sn and Ni in soybean and rapeseed oils. HMs were added to edible oils at three concentrations of 0.05, 0.1 and 0.25 ppm, and then, the output of the device was classified using a chemometric classification method. According to the principal component analysis results, PG electrode explains 96% and 81% of the variance between the data in rapeseed and soybean edible oils, respectively. Additionally, the SP electrode explains 91% of the variance between the data in rapeseed and soybean oils. Moreover, the GC electrode explains 100% and 99% of the variance between the data in rapeseed and soybean edible oils, respectively. K-nearest neighbor exhibited high capability in classifying HMs in edible oils. In addition, partial least squares in the combine of VTSA shows a predict 99% in rapeseed oil. The best electrode for soybean edible oil was GC.
Collapse
Affiliation(s)
- Hasan Kiani
- Department of Biosystem Mechanical Engineering, Science and Research Branch, Islamic Azad University, Tehran, Iran
| | - Babak Beheshti
- Department of Biosystem Mechanical Engineering, Science and Research Branch, Islamic Azad University, Tehran, Iran
| | - Ali Mohammad Borghei
- Department of Biosystem Mechanical Engineering, Science and Research Branch, Islamic Azad University, Tehran, Iran
| | - Mohammad Hashem Rahmati
- Department of Biosystem Mechanical Engineering, Gorgan University of Agricultural Sciences and Natural Resources, Gorgān, Iran
| |
Collapse
|
47
|
Ambre D, Sheyyab M, Lynch P, Mayhew EK, Brezinsky K. A Raman spectroscopy based chemometric approach to predict the derived cetane number of hydrocarbon jet fuels and their mixtures. Talanta 2024; 271:125635. [PMID: 38219321 DOI: 10.1016/j.talanta.2024.125635] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2023] [Revised: 12/29/2023] [Accepted: 01/04/2024] [Indexed: 01/16/2024]
Abstract
Fuel ignition quality, measured in the form of Derived Cetane Number (DCN), is an important part of integrating fuels, including sustainable aviation fuels, in compression ignition engines. DCN has been correlated with simulated and/or real spectroscopic measurements as well as other physical and chemical properties, but rarely have these correlations developed into a pathway to application. One application of the correlations is the use of miniaturized onboard fuel sensors that could assist, by using predicted DCN, in real-time feedforward engine control. To aid in the application of developing such DCN fuel sensors, Raman spectra coupled with chemometrics and a selection of influential spectral features were investigated. In this study, the Raman spectra were obtained from a database that included jet fuels, jet fuel mixtures, pure hydrocarbon components, and their weighted mixtures. The resulting Raman spectral database from the experimental measurements included spectra of components that span a wide range of DCNs and covered all the expected chemical functional groups present in a standard jet fuel. Chemometric models were developed to associate Raman spectra with DCN in subsets of the spectral range to aid in sensor miniaturization. The models were tested on jet fuels such as National Jet Fuel Combustion Program fuels designated A-1, A-2, and A-3 along with mixtures of jet fuels that spanned a wide range of DCN, simulating fuels that could represent real-world scenarios. An Artificial Neural Network (ANN) model trained on the fingerprint region (500 cm-1 - 1800 cm-1) of the Raman spectra was able to capture the non-linearity of the association between the Raman spectra and DCN with a test R2 score of 0.926, a test MSE of 3.61, and a test MPE of 3.41. Around 97 % of the unseen test samples were predicted within 10 % of the DCN measured with an Ignition Quality Tester. One hundred features of the fingerprint region influencing DCN predictions in the optimal ANN model were extracted using a Global Surrogate (GS) model. A reduced ANN model trained on only these one hundred features performed slightly better with a test R2 score of 0.935, test MSE of 3.19, test MPE of 3.20 and with the entire set of unseen test samples predicted within 10 % of the measured DCN. For assessing applicability of real-time and online DCN sensing, the Raman spectrometer was integrated with a flow cell capable of allowing measurements of DCN in flowing fuel samples and included the optimal ANN model of the fingerprint region and the 100-feature GS-ANN model on a Raspberry Pi computer. A number of unseen F-24/alcohol-to-jet fuel mixtures composed of unknown volumes were tested using the flow cell for DCN, and all of these samples were predicted within 10 % of the measured DCN.
Collapse
Affiliation(s)
- Dhananjay Ambre
- Department of Mechanical and Industrial Engineering, University of Illinois at Chicago, Chicago, IL, 60607, USA
| | - Manaf Sheyyab
- Department of Mechanical and Industrial Engineering, University of Illinois at Chicago, Chicago, IL, 60607, USA
| | - Patrick Lynch
- Department of Mechanical and Industrial Engineering, University of Illinois at Chicago, Chicago, IL, 60607, USA
| | - Eric K Mayhew
- US Army Combat Capabilities Development Command, Army Research Laboratory, Aberdeen Proving Ground, MD 21005, USA
| | - Kenneth Brezinsky
- Department of Mechanical and Industrial Engineering, University of Illinois at Chicago, Chicago, IL, 60607, USA.
| |
Collapse
|
48
|
Nian Y, Su X, Yue H, Zhu Y, Li J, Wang W, Sheng Y, Ma Q, Liu J, Li X. Estimation of the rice aboveground biomass based on the first derivative spectrum and Boruta algorithm. FRONTIERS IN PLANT SCIENCE 2024; 15:1396183. [PMID: 38726299 PMCID: PMC11079175 DOI: 10.3389/fpls.2024.1396183] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/05/2024] [Accepted: 04/11/2024] [Indexed: 05/12/2024]
Abstract
Aboveground biomass (AGB) is regarded as a critical variable in monitoring crop growth and yield. The use of hyperspectral remote sensing has emerged as a viable method for the rapid and precise monitoring of AGB. Due to the extensive dimensionality and volume of hyperspectral data, it is crucial to effectively reduce data dimensionality and select sensitive spectral features to enhance the accuracy of rice AGB estimation models. At present, derivative transform and feature selection algorithms have become important means to solve this problem. However, few studies have systematically evaluated the impact of derivative spectrum combined with feature selection algorithm on rice AGB estimation. To this end, at the Xiaogang Village (Chuzhou City, China) Experimental Base in 2020, this study used an ASD FieldSpec handheld 2 ground spectrometer (Analytical Spectroscopy Devices, Boulder, Colorado, USA) to obtain canopy spectral data at the critical growth stage (tillering, jointing, booting, heading, and maturity stages) of rice, and evaluated the performance of the recursive feature elimination (RFE) and Boruta feature selection algorithm through partial least squares regression (PLSR), principal component regression (PCR), support vector machine (SVM) and ridge regression (RR). Moreover, we analyzed the importance of the optimal derivative spectrum. The findings indicate that (1) as the growth stage progresses, the correlation between rice canopy spectrum and AGB shows a trend from high to low, among which the first derivative spectrum (FD) has the strongest correlation with AGB. (2) The number of feature bands selected by the Boruta algorithm is 19~35, which has a good dimensionality reduction effect. (3) The combination of FD-Boruta-PCR (FB-PCR) demonstrated the best performance in estimating rice AGB, with an increase in R² of approximately 10% ~ 20% and a decrease in RMSE of approximately 0.08% ~ 14%. (4) The best estimation stage is the booting stage, with R2 values between 0.60 and 0.74 and RMSE values between 1288.23 and 1554.82 kg/hm2. This study confirms the accuracy of hyperspectral remote sensing in estimating vegetation biomass and further explores the theoretical foundation and future direction for monitoring rice growth dynamics.
Collapse
Affiliation(s)
- Ying Nian
- College of Resource and Environment, Anhui Science and Technology University, Chuzhou, China
| | - Xiangxiang Su
- College of Resource and Environment, Anhui Science and Technology University, Chuzhou, China
| | - Hu Yue
- College of Resource and Environment, Anhui Science and Technology University, Chuzhou, China
| | - Yongji Zhu
- College of Resource and Environment, Anhui Science and Technology University, Chuzhou, China
| | - Jun Li
- College of Resource and Environment, Anhui Science and Technology University, Chuzhou, China
| | - Weiqiang Wang
- College of Resource and Environment, Anhui Science and Technology University, Chuzhou, China
| | - Yali Sheng
- College of Resource and Environment, Anhui Science and Technology University, Chuzhou, China
| | - Qiang Ma
- College of Resource and Environment, Anhui Science and Technology University, Chuzhou, China
| | - Jikai Liu
- College of Resource and Environment, Anhui Science and Technology University, Chuzhou, China
- Anhui Province Crop Intelligent Planting and Processing Technology Engineering Research Center, Anhui Science and Technology University, Chuzhou, Anhui, China
| | - Xinwei Li
- College of Resource and Environment, Anhui Science and Technology University, Chuzhou, China
- Anhui Province Crop Intelligent Planting and Processing Technology Engineering Research Center, Anhui Science and Technology University, Chuzhou, Anhui, China
- Anhui Province Agricultural Waste Fertilizer Utilization and Cultivated Land Quality Improvement Engineering Research Center, Anhui Science and Technology University, Chuzhou, China
| |
Collapse
|
49
|
Ouyang H, Tang L, Ma J, Pang T. Application of Hyperspectral Technology with Machine Learning for Brix Detection of Pastry Pears. PLANTS (BASEL, SWITZERLAND) 2024; 13:1163. [PMID: 38674571 PMCID: PMC11055027 DOI: 10.3390/plants13081163] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/04/2024] [Revised: 04/12/2024] [Accepted: 04/19/2024] [Indexed: 04/28/2024]
Abstract
Sugar content is an essential indicator for evaluating crisp pear quality and categorization, being used for fruit quality identification and market sales prediction. In this study, we paired a support vector machine (SVM) algorithm with genetic algorithm optimization to reliably estimate the sugar content in crisp pears. We evaluated the spectral data and actual sugar content in crisp pears, then applied three preprocessing methods to the spectral data: standard normal variable transformation (SNV), multivariate scattering correction (MSC), and convolution smoothing (SG). Support vector regression (SVR) models were built using processing approaches. According to the findings, the SVM model preprocessed with convolution smoothing (SG) was the most accurate, with a correlation coefficient 0.0742 higher than that of the raw spectral data. Based on this finding, we used competitive adaptive reweighting (CARS) and the continuous projection algorithm (SPA) to select key representative wavelengths from the spectral data. Finally, we used the retrieved characteristic wavelength data to create a support vector machine model (GASVR) that was genetically tuned. The correlation coefficient of the SG-GASVR model in the prediction set was higher by 0.0321 and the root mean square prediction error (RMSEP) was lower by 0.0267 compared with those of the SG-SVR model. The SG-CARS-GASVR model had the highest correlation coefficient, at 0.8992. In conclusion, the developed SG-CARS-GASVR model provides a reliable method for detecting the sugar content in crisp pear using hyperspectral technology, thereby increasing the accuracy and efficiency of the quality assessment of crisp pear.
Collapse
Affiliation(s)
- Hongkun Ouyang
- College of Mechanical and Electrical Engineering, Sichuan Agriculture University, Ya’an 625000, China; (L.T.); (J.M.)
| | | | | | - Tao Pang
- College of Mechanical and Electrical Engineering, Sichuan Agriculture University, Ya’an 625000, China; (L.T.); (J.M.)
| |
Collapse
|
50
|
Frempong SB, Salbreiter M, Mostafapour S, Pistiki A, Bocklitz TW, Rösch P, Popp J. Illuminating the Tiny World: A Navigation Guide for Proper Raman Studies on Microorganisms. Molecules 2024; 29:1077. [PMID: 38474589 PMCID: PMC10934050 DOI: 10.3390/molecules29051077] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2023] [Revised: 02/13/2024] [Accepted: 02/18/2024] [Indexed: 03/14/2024] Open
Abstract
Raman spectroscopy is an emerging method for the identification of bacteria. Nevertheless, a lot of different parameters need to be considered to establish a reliable database capable of identifying real-world samples such as medical or environmental probes. In this review, the establishment of such reliable databases with the proper design in microbiological Raman studies is demonstrated, shining a light into all the parts that require attention. Aspects such as the strain selection, sample preparation and isolation requirements, the phenotypic influence, measurement strategies, as well as the statistical approaches for discrimination of bacteria, are presented. Furthermore, the influence of these aspects on spectra quality, result accuracy, and read-out are discussed. The aim of this review is to serve as a guide for the design of microbiological Raman studies that can support the establishment of this method in different fields.
Collapse
Affiliation(s)
- Sandra Baaba Frempong
- Institute of Physical Chemistry and Abbe Center of Photonics, Friedrich Schiller University, Helmholtzweg 4, 07743 Jena, Germany; (S.B.F.); (M.S.); (S.M.); (A.P.); (T.W.B.); (J.P.)
- InfectoGnostics Research Campus Jena, Center of Applied Research, Philosophenweg 7, 07743 Jena, Germany
| | - Markus Salbreiter
- Institute of Physical Chemistry and Abbe Center of Photonics, Friedrich Schiller University, Helmholtzweg 4, 07743 Jena, Germany; (S.B.F.); (M.S.); (S.M.); (A.P.); (T.W.B.); (J.P.)
- InfectoGnostics Research Campus Jena, Center of Applied Research, Philosophenweg 7, 07743 Jena, Germany
| | - Sara Mostafapour
- Institute of Physical Chemistry and Abbe Center of Photonics, Friedrich Schiller University, Helmholtzweg 4, 07743 Jena, Germany; (S.B.F.); (M.S.); (S.M.); (A.P.); (T.W.B.); (J.P.)
| | - Aikaterini Pistiki
- Institute of Physical Chemistry and Abbe Center of Photonics, Friedrich Schiller University, Helmholtzweg 4, 07743 Jena, Germany; (S.B.F.); (M.S.); (S.M.); (A.P.); (T.W.B.); (J.P.)
- InfectoGnostics Research Campus Jena, Center of Applied Research, Philosophenweg 7, 07743 Jena, Germany
- Leibniz-Institute of Photonic Technology, Member of the Leibniz Research Alliance-Leibniz Health Technologies, Albert-Einstein-Str. 9, 07745 Jena, Germany
| | - Thomas W. Bocklitz
- Institute of Physical Chemistry and Abbe Center of Photonics, Friedrich Schiller University, Helmholtzweg 4, 07743 Jena, Germany; (S.B.F.); (M.S.); (S.M.); (A.P.); (T.W.B.); (J.P.)
- Leibniz-Institute of Photonic Technology, Member of the Leibniz Research Alliance-Leibniz Health Technologies, Albert-Einstein-Str. 9, 07745 Jena, Germany
| | - Petra Rösch
- Institute of Physical Chemistry and Abbe Center of Photonics, Friedrich Schiller University, Helmholtzweg 4, 07743 Jena, Germany; (S.B.F.); (M.S.); (S.M.); (A.P.); (T.W.B.); (J.P.)
- InfectoGnostics Research Campus Jena, Center of Applied Research, Philosophenweg 7, 07743 Jena, Germany
| | - Jürgen Popp
- Institute of Physical Chemistry and Abbe Center of Photonics, Friedrich Schiller University, Helmholtzweg 4, 07743 Jena, Germany; (S.B.F.); (M.S.); (S.M.); (A.P.); (T.W.B.); (J.P.)
- InfectoGnostics Research Campus Jena, Center of Applied Research, Philosophenweg 7, 07743 Jena, Germany
- Leibniz-Institute of Photonic Technology, Member of the Leibniz Research Alliance-Leibniz Health Technologies, Albert-Einstein-Str. 9, 07745 Jena, Germany
- Cluster of Excellence Balance of the Microverse, Friedrich Schiller University Jena, 07743 Jena, Germany
| |
Collapse
|