1
|
Shi S, Tang Z, Ma Y, Cao C, Jiang Y. Application of spectroscopic techniques combined with chemometrics to the authenticity and quality attributes of rice. Crit Rev Food Sci Nutr 2023:1-23. [PMID: 38010116 DOI: 10.1080/10408398.2023.2284246] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]
Abstract
Rice is a staple food for two-thirds of the world's population and is grown in over a hundred countries around the world. Due to its large scale, it is vulnerable to adulteration. In addition, the quality attribute of rice is an important factor affecting the circulation and price, which is also paid more and more attention. The combination of spectroscopy and chemometrics enables rapid detection of authenticity and quality attributes in rice. This article described the application of seven spectroscopic techniques combined with chemometrics to the rice industry. For a long time, near-infrared spectroscopy and linear chemometric methods (e.g., PLSR and PLS-DA) have been widely used in the rice industry. Although some studies have achieved good accuracy, with models in many studies having greater than 90% accuracy. However, higher accuracy and stability were more likely to be obtained using multiple spectroscopic techniques, nonlinear chemometric methods, and key wavelength selection algorithms. Future research should develop larger rice databases to include more rice varieties and larger amounts of rice depending on the type of rice, and then combine various spectroscopic techniques, nonlinear chemometric methods, and key wavelength selection algorithms. This article provided a reference for a more efficient and accurate determination of rice quality and authenticity.
Collapse
Affiliation(s)
- Shijie Shi
- College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, Hubei, China
| | - Zihan Tang
- College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, Hubei, China
| | - Yingying Ma
- College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, Hubei, China
| | - Cougui Cao
- College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, Hubei, China
- Shuangshui Shuanglü Institute, Huazhong Agricultural University, Wuhan, Hubei, China
| | - Yang Jiang
- College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, Hubei, China
- Shuangshui Shuanglü Institute, Huazhong Agricultural University, Wuhan, Hubei, China
| |
Collapse
|
2
|
Yu S, Liu J. Ensemble calibration model of near-infrared spectroscopy based on functional data analysis. SPECTROCHIMICA ACTA. PART A, MOLECULAR AND BIOMOLECULAR SPECTROSCOPY 2022; 280:121569. [PMID: 35780759 DOI: 10.1016/j.saa.2022.121569] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/22/2022] [Revised: 05/26/2022] [Accepted: 06/25/2022] [Indexed: 06/15/2023]
Abstract
As a nondestructive detection technology, near-infrared spectroscopy has been widely applied in various fields. With the wide application of near-infrared spectroscopy, the research on data processing has attracted more attention. Different from the existing discrete data model and based on the functional data analysis method, an ensemble calibration model FDA-EM-PLS (functional data analysis-ensemble learning-partial least squares) of near-infrared spectroscopy is proposed in this paper. Firstly, the near-infrared spectroscopy of each sample is divided into several intervals, and the functional data analysis is carried out on each interval. Then, the samples are clustered according to the generated functions, which can not only reduce the influence of noise, but also provide a theoretical basis for selecting variables. Further, Monte Carlo sampling is used to generate training subsets from clustering samples for ensemble learning, which not only solves the problem of small samples, but also improves the robustness of the model. The relevant experimental results show that the absolute relative error of FDA-EM-PLS for the corn and soil data are both less than 10%.
Collapse
Affiliation(s)
- Shaohui Yu
- School of Mathematics and Statistics, Hefei Normal University, Hefei 230061, China
| | - Jing Liu
- Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei 230031, China.
| |
Collapse
|
3
|
Cruz-Tirado J, Amigo JM, Barbin DF. Determination of protein content in single black fly soldier (Hermetia illucens L.) larvae by near infrared hyperspectral imaging (NIR-HSI) and chemometrics. Food Control 2022. [DOI: 10.1016/j.foodcont.2022.109266] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/16/2022]
|
4
|
Detection of nutshells in cumin powder using NIR hyperspectral imaging and chemometrics tools. J Food Compost Anal 2022. [DOI: 10.1016/j.jfca.2022.104407] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
5
|
Dumancas GG, Setijadi C, Dufour B, Aglobo J, Carisma MS, Bello GA, Dalisay DS, Saludes JP. Comparison of Genetic and Non-genetic Algorithm Partial Least Squares for Sugar Quantification in Philippine Honeys. ANAL LETT 2022. [DOI: 10.1080/00032719.2022.2033985] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/01/2022]
Affiliation(s)
- Gerard G. Dumancas
- Department of Chemistry, Loyola Science Center, The University of Scranton, Scranton, PA, USA
- Balik Scientist Program, Philippine Council for Health Research and Development, Department of Science and Technology, Taguig City, Philippines
| | - Catherine Setijadi
- Department of Mathematics and Physical Sciences, Louisiana State University–Alexandria, Alexandria, LA, USA
| | - Ben Dufour
- Department of Mathematics and Physical Sciences, Louisiana State University–Alexandria, Alexandria, LA, USA
| | - Jastine Aglobo
- Gregor Mendel Research Laboratories, University of San Agustin, Iloilo City, Philippines
| | - Marjorie S. Carisma
- Gregor Mendel Research Laboratories, University of San Agustin, Iloilo City, Philippines
- Department of Chemistry, College of Liberal Arts, Sciences, and Education, University of San Agustin, Iloilo City, Philippines
| | - Ghalib A. Bello
- Department of Environmental Medicine and Public Health, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Doralyn S. Dalisay
- Balik Scientist Program, Philippine Council for Health Research and Development, Department of Science and Technology, Taguig City, Philippines
- Department of Biology, College of Liberal Arts, Sciences, and Education, University of San Agustin, Iloilo City, Philippines
- Center for Chemical Biology and Biotechnology (C2B2), University of San Agustin, Iloilo City, Philippines
| | - Jonel P. Saludes
- Balik Scientist Program, Philippine Council for Health Research and Development, Department of Science and Technology, Taguig City, Philippines
- Gregor Mendel Research Laboratories, University of San Agustin, Iloilo City, Philippines
- Department of Chemistry, College of Liberal Arts, Sciences, and Education, University of San Agustin, Iloilo City, Philippines
- Center for Natural Drug Discovery and Development (CND3), University of San Agustin, Iloilo City, Philippines
| |
Collapse
|
6
|
Kumar K. Competitive adaptive reweighted sampling assisted partial least square analysis of excitation-emission matrix fluorescence spectroscopic data sets of certain polycyclic aromatic hydrocarbons. SPECTROCHIMICA ACTA. PART A, MOLECULAR AND BIOMOLECULAR SPECTROSCOPY 2021; 244:118874. [PMID: 32889337 DOI: 10.1016/j.saa.2020.118874] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/08/2020] [Revised: 07/30/2020] [Accepted: 08/18/2020] [Indexed: 06/11/2023]
Abstract
Excitation-emission matrix fluorescence (EEMF) spectroscopy is a simple and sensitive analytical technique. EEMF spectrum is essentially a collection of emission and excitation spectra acquired as increasing functions of excitation and emission wavelengths, respectively. EEMF spectral data sets produced per sample are highly correlated and larger in amount that need the assistance of chemometric techniques such partial least square (PLS) analysis if one desire to build robust calibration model. The objective of the PLS algorithm is to explain maximum variation of the spectral and concentration data matrices and to maximise the correlation between them. The application of a suitable variable selection technique can significantly improve the performance of PLS calibration model. Towards this, the present work proposes application of competitive adaptive reweighted sampling (CARS) as a variable selection approach prior to PLS analysis of EEMF spectral data sets. The utility of proposed approach was successfully demonstrated by analysing the significantly overlapped EEMF spectral data set of aqueous mixtures of Anthracene, Chrysene, Fluoranthene and Pyrene that are highly carcinogenic and mutagenic in nature. The developed procedure was also successfully used for the analysis of Chrysene and Pyrene mixtures in gasoline spiked ground water samples. The CARS assisted PLS model was also compared with full spectrum PLS, genetic algorithm assisted PLS, ant colony optimisation assisted PLS and N-way PLS models. The obtained results of the present work clearly indicated that application of PLS algorithm on CARS optimised EEMF spectral variables significantly improved the performance of the calibration models.
Collapse
Affiliation(s)
- Keshav Kumar
- Present Address: Hochschule Geisenheim University, Geisenheim 65366, Germany.
| |
Collapse
|
7
|
Zhi R, Hu X, Wang C, Liu S. Development of a direct mapping model between hedonic rating and facial responses by dynamic facial expression representation. Food Res Int 2020; 137:109411. [PMID: 33233098 DOI: 10.1016/j.foodres.2020.109411] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2019] [Revised: 06/02/2020] [Accepted: 06/03/2020] [Indexed: 11/17/2022]
Abstract
Consumer tests are one of the most important activities in product development. More evidence indicates that consumer emotions in real life are mostly driven by unconscious mechanisms, and implicit measurements are regarded as beneficial by an increasing number of sensory and consumer scientists. Nonverbal manner such as facial expression analysis is a supplement to the declarative method and brings very insightful results. Up until now, the facial expression analysis for consumers' acceptance identification is limited to investigate the relationship between hedonic rating and facial expression descriptors, such as facial coding system (FACS or MAX), discrete facial expressions (i.e. happiness, sadness, surprise, fear, anger, and disgust), and affective dimensional model (valence and activation). In this study, we attempt to develop a direct mapping model between the hedonic rating and facial responses evoked by various taste stimuli. Basic taste solutions (sourness, sweetness, bitterness, umami, and saltiness) with six levels, and five types of juice are used as stimuli. Firstly, the hedonic rating categories are defined based on the nine-point hedonic scale, with a coarse-to-fine division of scale levels based on two directions of like and dislike. Secondly, the facial dynamic optical flow method is employed to analyze facial characteristics of the subjects' facial responses evoked by taste stimuli. And the genetic algorithm is conducted to select facial regions that have high contribution to hedonic rating identification. It indicates that the texture changes of eye area, wrinkles at the nasal root, and mouth area can effectively reflect the facial reaction corresponding to hedonic rating. The research shows that it is feasible to establish a direct mapping model between hedonic rating and facial responses. The hedonic rating can be predicted through automatic facial reading technology, without extra transformation from predefined emotional models. In general, this is the first try to discuss the direct prediction of hedonic rating through facial expressions up to now, and it is a complex problem due to various influence factors.
Collapse
Affiliation(s)
- Ruicong Zhi
- School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, PR China; Beijing Key Laboratory of Knowledge Engineering for Materials Science, Beijing 100083, PR China.
| | - Xin Hu
- School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, PR China; Beijing Key Laboratory of Knowledge Engineering for Materials Science, Beijing 100083, PR China
| | - Chenyang Wang
- School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, PR China; Beijing Key Laboratory of Knowledge Engineering for Materials Science, Beijing 100083, PR China
| | - Shuai Liu
- School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, PR China; Beijing Key Laboratory of Knowledge Engineering for Materials Science, Beijing 100083, PR China
| |
Collapse
|
8
|
Anderssen KE, Stormo SK, Skåra T, Skjelvareid MH, Heia K. Predicting liquid loss of frozen and thawed cod from hyperspectral imaging. Lebensm Wiss Technol 2020. [DOI: 10.1016/j.lwt.2020.110093] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
|
9
|
Li H, Huang M, Xu H. High accuracy determination of copper in copper concentrate with double genetic algorithm and partial least square in laser-induced breakdown spectroscopy. OPTICS EXPRESS 2020; 28:2142-2155. [PMID: 32121910 DOI: 10.1364/oe.381582] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/28/2019] [Accepted: 01/04/2020] [Indexed: 06/10/2023]
Abstract
There are many challenges in the determination of elements in complex matrix such as soil, coal and minerals by laser induced breakdown spectroscopy (LIBS) method. Due to the influence of matrix effect, instability of laser plasma and fluctuation of laser parameters, the repeatability and accuracy of quantitative results are always not satisfactory. In order to improve the accuracy, high-energy laser (30mJ-100mJ) with precise control was utilized in many laboratories. In this paper, quantitative analysis of copper in copper concentrate by low-energy (10µJ) LIBS is studied. In order to reduce the influence of matrix effect and other factors, a partial least square regression method based on double genetic algorithm (DGA-PLS) is proposed. The detail operations are as follow: the reference spectral lines are automatically selected by GA as the optimal internal standard for spectral normalization. Then the GA is used to select variables from the normalized spectra for PLS. The results showed that, for univariate model, the coefficient of determination (R2) was improved from 0.6 to 0.97 by the optimal internal standard normalization. Compared with tradition PLS, the root mean square error of cross validation (RMSECV) and root mean square error of prediction (RMSEP) of PLS trained by the normalized spectral data decreased from 1.4% and 0.42% to 0.9% and 0.29% respectively. Compared with the normalized PLS, the RMSECV and RMSEP of the DGA-PLS trained by the normalized and feature selected spectral data decreased from 0.9% and 0.29% to 0.26% and 0.21% respectively. The results show that DGA-PLS can significantly reduce matrix effect, improve prediction accuracy and reduce the risk of overfitting in determination of copper in copper concentrate.
Collapse
|
10
|
Luo Z, Thorp KR, Abdel-Haleem H. A high-throughput quantification of resin and rubber contents in Parthenium argentatum using near-infrared (NIR) spectroscopy. PLANT METHODS 2019; 15:154. [PMID: 31889978 PMCID: PMC6916029 DOI: 10.1186/s13007-019-0544-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/22/2019] [Accepted: 12/09/2019] [Indexed: 06/10/2023]
Abstract
BACKGROUND Guayule (Parthenium argentatum A. Gray), a plant native to semi-arid regions of northern Mexico and southern Texas in the United States, is an alternative source for natural rubber (NR). Rapid screening tools are needed to replace the current labor-intensive and cost-inefficient method for quantifying rubber and resin contents. Near-infrared (NIR) spectroscopy is a promising technique that simplifies and speeds up the quantification procedure without losing precision. In this study, two spectral instruments were used to rapidly quantify resin and rubber contents in 315 ground samples harvested from a guayule germplasm collection grown under different irrigation conditions at Maricopa, AZ. The effects of eight different pretreatment approaches on improving prediction models using partial least squares regression (PLSR) were investigated and compared. Important characteristic wavelengths that contribute to prominent absorbance peaks were identified. RESULTS Using two different NIR devices, ASD FieldSpec®3 performed better than Polychromix Phazir™ in improving R2 and residual predicative deviation (RPD) values of PLSR models. Compared to the models based on full-range spectra (750-2500 nm), using a subset of wavelengths (1100-2400 nm) with high sensitivity to guayule rubber and resin contents could lead to better prediction accuracy. The prediction power of the models for quantifying resin content was better than rubber content. CONCLUSIONS In summary, the calibrated PLSR models for resin and rubber contents were successfully developed for a diverse guayule germplasm collection and were applied to roughly screen samples in a low-cost and efficient way. This improved efficiency could enable breeders to rapidly screen large guayule populations to identify cultivars that are high in rubber and resin contents.
Collapse
Affiliation(s)
- Zinan Luo
- US Arid-Land Agricultural Research Center, USDA-ARS, Maricopa, AZ 85138 USA
| | - Kelly R. Thorp
- US Arid-Land Agricultural Research Center, USDA-ARS, Maricopa, AZ 85138 USA
| | | |
Collapse
|
11
|
Takamura A, Halamkova L, Ozawa T, Lednev IK. Phenotype Profiling for Forensic Purposes: Determining Donor Sex Based on Fourier Transform Infrared Spectroscopy of Urine Traces. Anal Chem 2019; 91:6288-6295. [PMID: 30986037 DOI: 10.1021/acs.analchem.9b01058] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Forensic science is an important field of analytical chemistry where vibrational spectroscopy, in particular Fourier transform infrared spectroscopy and Raman spectroscopy, present advantages as they have a nondestructive nature, high selectivity, and no need for sample preparation. Herein, we demonstrate a method for determination of donor sex, based on attenuated total reflection Fourier transform infrared (ATR FT-IR) spectroscopy of dry urine traces. Trace body fluid evidence is of special importance to the modern criminal investigation as a source of individualizing DNA evidence. However, individual identification of a urine donor is generally difficult because of the small amount of DNA. Therefore, the development of an innovative method to provide phenotype information about the urine donor-including sex-is highly desirable. In this study, we developed a multivariate discriminant model for the ATR FT-IR spectra of dry urine to identify the donor sex. Rigorous selection of significant wavenumbers on the spectrum using a genetic algorithm enabled superb discrimination performance for the model and conclusively indicated a chemical origin for donor sex differences, which was supported by physiological knowledge. Although further investigations need to be conducted, this proof-of-concept study demonstrates the great potential of the developed methodology for phenotype profiling based on the analysis of urine traces.
Collapse
Affiliation(s)
- Ayari Takamura
- Department of Chemistry, Graduate School of Science , The University of Tokyo , 7-3-1, Hongo , Bunkyo, Tokyo 113-0033 , Japan.,First Department of Forensic Science , National Research Institute of Police Science , 6-3-1, Kashiwanoha , Kashiwa , Chiba 277-0882 , Japan
| | - Lenka Halamkova
- Department of Chemistry , University at Albany, SUNY , 1400 Washington Avenue , Albany , New York 12222 , United States
| | - Takeaki Ozawa
- Department of Chemistry, Graduate School of Science , The University of Tokyo , 7-3-1, Hongo , Bunkyo, Tokyo 113-0033 , Japan
| | - Igor K Lednev
- Department of Chemistry , University at Albany, SUNY , 1400 Washington Avenue , Albany , New York 12222 , United States
| |
Collapse
|
12
|
Cultivar Classification of Single Sweet Corn Seed Using Fourier Transform Near-Infrared Spectroscopy Combined with Discriminant Analysis. APPLIED SCIENCES-BASEL 2019. [DOI: 10.3390/app9081530] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Seed purity is a key indicator of crop seed quality. The conventional methods for cultivar identification are time-consuming, expensive, and destructive. Fourier transform near-infrared (FT-NIR) spectroscopy combined with discriminant analyses, was studied as a rapid and nondestructive technique to classify the cultivars of sweet corn seeds. Spectra with a range of 1000–2500 nm collected from 760 seeds of two cultivars were used for the discriminant analyses. Thereafter, 126 feature wavelengths were identified from 1557 wavelengths using a genetic algorithm (GA) to build simplified classification models. Four classification algorithms, namely K-nearest neighbor (KNN), soft independent method of class analogy (SIMCA), partial least-squares discriminant analysis (PLS-DA), and support vector machine discriminant analysis (SVM-DA) were tested on full-range wavelengths and feature wavelengths, respectively. With the full-range wavelengths, all four algorithms achieved a high classification accuracy range from 97.56% to 99.59%, and the SVM-DA worked better than other models. From the feature wavelengths, no significant decline in accuracies was observed in most of the models and a high accuracy of 99.19% was still obtained by the PLS-DA model. This study demonstrated that using the FT-NIR technique with discriminant analyses could be a feasible way to classify sweet corn seed cultivars and the proper classification model could be embedded in seed sorting machinery to select high-purity seeds.
Collapse
|
13
|
Luo N, Han P, Wang S, Wang D, Zhao C. Near-Infrared Spectroscopy Analytical Model Using Ensemble Partial Least Squares Regression. ANAL LETT 2019. [DOI: 10.1080/00032719.2019.1568447] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Affiliation(s)
- Na Luo
- College of Information and Electrical Engineering, Shenyang Agricultural University, Liaoning, China
- Beijing Research Center for Agricultural Standards and Testing, Beijing Academy of Agriculture and Forestry Sciences, Beijing, China
| | - Ping Han
- Beijing Research Center for Agricultural Standards and Testing, Beijing Academy of Agriculture and Forestry Sciences, Beijing, China
| | - Shifang Wang
- Beijing Research Center for Agricultural Standards and Testing, Beijing Academy of Agriculture and Forestry Sciences, Beijing, China
| | - Dong Wang
- Beijing Research Center for Agricultural Standards and Testing, Beijing Academy of Agriculture and Forestry Sciences, Beijing, China
| | - Chunjiang Zhao
- College of Information and Electrical Engineering, Shenyang Agricultural University, Liaoning, China
- National Engineering Research Center for Information Technology in Agriculture, Beijing Academy of Agriculture and Forestry Sciences, Beijing, China
| |
Collapse
|
14
|
Kumar K. Discrete Wavelet Transform (DWT) Assisted Partial Least Square (PLS) Analysis of Excitation-Emission Matrix Fluorescence (EEMF) Spectroscopic Data Sets: Improving the Quantification Accuracy of EEMF Technique. J Fluoresc 2018; 29:185-193. [PMID: 30488232 DOI: 10.1007/s10895-018-2327-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2018] [Accepted: 11/19/2018] [Indexed: 11/25/2022]
Abstract
In the present work, it is shown that quantitative estimation efficiency of the partial least square (PLS) calibration model can be significantly improved by pre-processing the EEMF with discrete wavelet transform (DWT) analysis. The application of DWT essentially reduces the volume of data sets retaining all the analytically relevant information that subsequently helps in establishing a better correlation between the spectral and concentration data matrices. The utility of the proposed approach is successfully validated by analyzing the dilute aqueous mixtures of four fluorophores having significant spectral overlap with each other. The analytical procedure developed in the present study could be useful for analyzing the environmental, agricultural, and biological samples containing the fluorescent molecules at low concentration levels.
Collapse
Affiliation(s)
- Keshav Kumar
- Institute for Wine analysis and Beverage Research, Hochschule Geisenheim University, 65366, Geisenheim, Germany.
| |
Collapse
|
15
|
Kumar K. Introducing 'Simple Variable Selection (SVS) Approach' for Improving the Quantitative Accuracy of Chemometric Assisted Fluorimetric Estimations of Dilute Aqueous Mixtures. J Fluoresc 2018; 28:1163-1171. [PMID: 30117072 DOI: 10.1007/s10895-018-2280-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2018] [Accepted: 08/09/2018] [Indexed: 11/27/2022]
Abstract
Excitation emission matrix fluorescence (EEMF) spectroscopy is a multiparametric fluorescence technique where the fluorescence intensity of a fluorophore is a function of excitation wavelength, emission wavelength and its concentration. The manual analysis of large volume of highly correlated EEMF data sets towards developing a calibration model for quantifying each fluorophores present in multifluorophoric mixtures is a difficult and time-consuming task. Over the years, Partial least square (PLS) algorithm has found its application towards providing swift and efficient analyses of large volumes of highly correlated spectral data sets. The PLS assisted EEMF spectroscopy has been successfully used towards quantifying the fluorophores in multifluorophoric mixtures without involving any pre-separation. However, the accuracy and robustness of developed calibration model can be significantly improved provided PLS analysis is carried out on the analytically relevant EEMF spectral variables. In the present work, a variable selection method baptized as simple variable selection (SVS) approach is introduced that provides a simple and computationally economical means of identifying the useful spectral variables for subsequent PLS analysis. The proposed SVS approach is successfully validated by analyzing the complex EEMF data sets of multifluorophoric mixtures of consisting of multifluorophoric mixtures of biological relevance. The proposed approach is found to provide a simple, swift and efficient means for developing a robust PLS assisted EEMF spectroscopy based calibration model for simultaneous quantification of various fluorophores present in multifluorophoric mixtures.
Collapse
Affiliation(s)
- Keshav Kumar
- Institute for Wine analysis and Beverage Research, Hochschule Geisenheim University, 65366, Geisenheim, Germany.
| |
Collapse
|
16
|
Tang R, Chen X, Li C. Detection of Nitrogen Content in Rubber Leaves Using Near-Infrared (NIR) Spectroscopy with Correlation-Based Successive Projections Algorithm (SPA). APPLIED SPECTROSCOPY 2018; 72:740-749. [PMID: 29617151 DOI: 10.1177/0003702818755142] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Near-infrared spectroscopy is an efficient, low-cost technology that has potential as an accurate method in detecting the nitrogen content of natural rubber leaves. Successive projections algorithm (SPA) is a widely used variable selection method for multivariate calibration, which uses projection operations to select a variable subset with minimum multi-collinearity. However, due to the fluctuation of correlation between variables, high collinearity may still exist in non-adjacent variables of subset obtained by basic SPA. Based on analysis to the correlation matrix of the spectra data, this paper proposed a correlation-based SPA (CB-SPA) to apply the successive projections algorithm in regions with consistent correlation. The result shows that CB-SPA can select variable subsets with more valuable variables and less multi-collinearity. Meanwhile, models established by the CB-SPA subset outperform basic SPA subsets in predicting nitrogen content in terms of both cross-validation and external prediction. Moreover, CB-SPA is assured to be more efficient, for the time cost in its selection procedure is one-twelfth that of the basic SPA.
Collapse
Affiliation(s)
- Rongnian Tang
- School of Mechanical and Electrical Engineering, Hainan University, Haikou, China
| | - Xupeng Chen
- School of Mechanical and Electrical Engineering, Hainan University, Haikou, China
| | - Chuang Li
- School of Mechanical and Electrical Engineering, Hainan University, Haikou, China
| |
Collapse
|
17
|
Kumar K. Application of Genetic Algorithm (GA) Assisted Partial Least Square (PLS) Analysis on Trilinear and Non-trilinear Fluorescence Data Sets to Quantify the Fluorophores in Multifluorophoric Mixtures: Improving Quantification Accuracy of Fluorimetric Estimations of Dilute Aqueous Mixtures. J Fluoresc 2018; 28:589-596. [PMID: 29594637 DOI: 10.1007/s10895-018-2221-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2018] [Accepted: 03/19/2018] [Indexed: 11/29/2022]
Abstract
Excitation-emission matrix fluorescence (EEMF) and total synchronous fluorescence spectroscopy (TSFS) are the 2 fluorescence techniques that are commonly used for the analysis of multifluorophoric mixtures. These 2 fluorescence techniques are conceptually different and provide certain advantages over each other. The manual analysis of such highly correlated large volume of EEMF and TSFS towards developing a calibration model is difficult. Partial least square (PLS) analysis can analyze the large volume of EEMF and TSFS data sets by finding important factors that maximize the correlation between the spectral and concentration information for each fluorophore. However, often the application of PLS analysis on entire data sets does not provide a robust calibration model and requires application of suitable pre-processing step. The present work evaluates the application of genetic algorithm (GA) analysis prior to PLS analysis on EEMF and TSFS data sets towards improving the precision and accuracy of the calibration model. The GA algorithm essentially combines the advantages provided by stochastic methods with those provided by deterministic approaches and can find the set of EEMF and TSFS variables that perfectly correlate well with the concentration of each of the fluorophores present in the multifluorophoric mixtures. The utility of the GA assisted PLS analysis is successfully validated using (i) EEMF data sets acquired for dilute aqueous mixture of four biomolecules and (ii) TSFS data sets acquired for dilute aqueous mixtures of four carcinogenic polycyclic aromatic hydrocarbons (PAHs) mixtures. In the present work, it is shown that by using the GA it is possible to significantly improve the accuracy and precision of the PLS calibration model developed for both EEMF and TSFS data set. Hence, GA must be considered as a useful pre-processing technique while developing an EEMF and TSFS calibration model.
Collapse
Affiliation(s)
- Keshav Kumar
- Institute for Wine analysis and Beverage Research, Hochschule Geisenheim University, 65366, Geisenheim, Germany.
| |
Collapse
|
18
|
Lavine BK, White CG. Boosting the Performance of Genetic Algorithms for Variable Selection in Partial Least Squares Spectral Calibrations. APPLIED SPECTROSCOPY 2017; 71:2092-2101. [PMID: 28537475 DOI: 10.1177/0003702817713501] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
A genetic algorithm (GA) for variable selection in partial least squares (PLS) regression that incorporates adaptive boosting to identify informative wavelengths in near-infrared (NIR) spectra has been developed. Three studies demonstrating the advantages of incorporating an adaptive boosting routine into a GA that employs the root mean square error of calibration as its fitness function are highlighted: (1) prediction of hydroxyl number of terpolymers from NIR diffuse reflectance spectra; (2) calibration of acetone from NIR transmission spectra of mixtures of water, acetone, t-butyl alcohol and isopropyl alcohol; and (3) determination of the active pharmaceutical ingredients in drug tablets from NIR diffuse reflectance spectra. The performance of the GA with adaptive boosting to select wavelengths was compared with one without adaptive boosting. For all three NIR data sets, variable selected PLS models developed by a GA with adaptive boosting performed better. Analysis of the wavelengths selected by the GA with adaptive boosting also demonstrate that chemical information indicative of the analyte was captured by the selected wavelengths.
Collapse
Affiliation(s)
- Barry K Lavine
- Department of Chemistry, Oklahoma State University, Stillwater, OK, USA
| | - Collin G White
- Department of Chemistry, Oklahoma State University, Stillwater, OK, USA
| |
Collapse
|
19
|
Mansour MF, ElKady EF, El-Guindi NM, El-Moghazy SM, Van Schepdael A, Adams E. Simultaneous Spectrophotometric Determination of Imipramine Hydrochloride with Chlordiazepoxide and Nortriptyline Hydrochloride with Fluphenazine Hydrochloride. ANAL LETT 2017. [DOI: 10.1080/00032719.2016.1251448] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Affiliation(s)
- Marwa F. Mansour
- Department of Pharmaceutical and Pharmacological Sciences, Pharmaceutical Analysis, KU Leuven, University of Leuven, Leuven, Belgium
- National Organization for Drug Control and Research, Giza, Egypt
| | - Ehab F. ElKady
- Pharmaceutical Chemistry Department, Faculty of Pharmacy, Cairo University, Cairo, Egypt
| | | | - Samir M. El-Moghazy
- Pharmaceutical Chemistry Department, Faculty of Pharmacy, Cairo University, Cairo, Egypt
| | - Ann Van Schepdael
- Department of Pharmaceutical and Pharmacological Sciences, Pharmaceutical Analysis, KU Leuven, University of Leuven, Leuven, Belgium
| | - Erwin Adams
- Department of Pharmaceutical and Pharmacological Sciences, Pharmaceutical Analysis, KU Leuven, University of Leuven, Leuven, Belgium
| |
Collapse
|
20
|
Attia KAM, Nassar MWI, El-Zeiny MB, Serag A. Effect of genetic algorithm as a variable selection method on different chemometric models applied for the analysis of binary mixture of amoxicillin and flucloxacillin: A comparative study. SPECTROCHIMICA ACTA. PART A, MOLECULAR AND BIOMOLECULAR SPECTROSCOPY 2016; 156:54-62. [PMID: 26641286 DOI: 10.1016/j.saa.2015.11.024] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/09/2015] [Revised: 11/20/2015] [Accepted: 11/23/2015] [Indexed: 06/05/2023]
Abstract
Different chemometric models were applied for the quantitative analysis of amoxicillin (AMX), and flucloxacillin (FLX) in their binary mixtures, namely, partial least squares (PLS), spectral residual augmented classical least squares (SRACLS), concentration residual augmented classical least squares (CRACLS) and artificial neural networks (ANNs). All methods were applied with and without variable selection procedure (genetic algorithm GA). The methods were used for the quantitative analysis of the drugs in laboratory prepared mixtures and real market sample via handling the UV spectral data. Robust and simpler models were obtained by applying GA. The proposed methods were found to be rapid, simple and required no preliminary separation steps.
Collapse
Affiliation(s)
- Khalid A M Attia
- Pharmaceutical Analytical Chemistry Department, Faculty of Pharmacy, Al-Azhar University, 11751, Nasr City, Cairo, Egypt
| | - Mohammed W I Nassar
- Pharmaceutical Analytical Chemistry Department, Faculty of Pharmacy, Al-Azhar University, 11751, Nasr City, Cairo, Egypt
| | - Mohamed B El-Zeiny
- Analytical Chemistry Department, Faculty of Pharmacy, Modern University for Technology and Information (MTI), 12582, Al Hadaba Al Wosta, Cairo, Egypt
| | - Ahmed Serag
- Pharmaceutical Analytical Chemistry Department, Faculty of Pharmacy, Al-Azhar University, 11751, Nasr City, Cairo, Egypt.
| |
Collapse
|
21
|
Variable selection in multivariate calibration based on clustering of variable concept. Anal Chim Acta 2015; 902:70-81. [PMID: 26703255 DOI: 10.1016/j.aca.2015.11.002] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2015] [Revised: 10/31/2015] [Accepted: 11/04/2015] [Indexed: 11/22/2022]
Abstract
Recently we have proposed a new variable selection algorithm, based on clustering of variable concept (CLoVA) in classification problem. With the same idea, this new concept has been applied to a regression problem and then the obtained results have been compared with conventional variable selection strategies for PLS. The basic idea behind the clustering of variable is that, the instrument channels are clustered into different clusters via clustering algorithms. Then, the spectral data of each cluster are subjected to PLS regression. Different real data sets (Cargill corn, Biscuit dough, ACE QSAR, Soy, and Tablet) have been used to evaluate the influence of the clustering of variables on the prediction performances of PLS. Almost in the all cases, the statistical parameter especially in prediction error shows the superiority of CLoVA-PLS respect to other variable selection strategies. Finally the synergy clustering of variable (sCLoVA-PLS), which is used the combination of cluster, has been proposed as an efficient and modification of CLoVA algorithm. The obtained statistical parameter indicates that variable clustering can split useful part from redundant ones, and then based on informative cluster; stable model can be reached.
Collapse
|
22
|
Abstract
Spectroscopy is carried out in almost every field of science, whenever light interacts with matter. Although sophisticated instruments with impressive performance characteristics are available, much effort continues to be invested in the development of miniaturized, cheap and easy-to-use systems. Current microspectrometer designs mostly use interference filters and interferometric optics that limit their photon efficiency, resolution and spectral range. Here we show that many of these limitations can be overcome by replacing interferometric optics with a two-dimensional absorptive filter array composed of colloidal quantum dots. Instead of measuring different bands of a spectrum individually after introducing temporal or spatial separations with gratings or interference-based narrowband filters, a colloidal quantum dot spectrometer measures a light spectrum based on the wavelength multiplexing principle: multiple spectral bands are encoded and detected simultaneously with one filter and one detector, respectively, with the array format allowing the process to be efficiently repeated many times using different filters with different encoding so that sufficient information is obtained to enable computational reconstruction of the target spectrum. We illustrate the performance of such a quantum dot microspectrometer, made from 195 different types of quantum dots with absorption features that cover a spectral range of 300 nanometres, by measuring shifts in spectral peak positions as small as one nanometre. Given this performance, demonstrable avenues for further improvement, the ease with which quantum dots can be processed and integrated, and their numerous finely tuneable bandgaps that cover a broad spectral range, we expect that quantum dot microspectrometers will be useful in applications where minimizing size, weight, cost and complexity of the spectrometer are critical.
Collapse
|
23
|
Multivariate calibration of NIR spectroscopic sensors for continuous glucose monitoring. Trends Analyt Chem 2015. [DOI: 10.1016/j.trac.2014.12.005] [Citation(s) in RCA: 82] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|
24
|
Goodarzi M, dos Santos Coelho L. Firefly as a novel swarm intelligence variable selection method in spectroscopy. Anal Chim Acta 2014; 852:20-7. [DOI: 10.1016/j.aca.2014.09.045] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2014] [Revised: 09/22/2014] [Accepted: 09/25/2014] [Indexed: 10/24/2022]
|
25
|
Song J, Xie J, Li C, Lu JH, Meng QF, Yang Z, Lee RJ, Wang D, Teng LS. Near infrared spectroscopic (NIRS) analysis of drug-loading rate and particle size of risperidone microspheres by improved chemometric model. Int J Pharm 2014; 472:296-303. [PMID: 24954726 DOI: 10.1016/j.ijpharm.2014.06.033] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2014] [Revised: 05/19/2014] [Accepted: 06/10/2014] [Indexed: 11/18/2022]
Abstract
Microspheres have been developed as drug carriers in controlled drug delivery systems for years. In our present study, near infrared spectroscopy (NIRS) is applied to analyze the particle size and drug loading rate in risperidone poly(d,l-lactide-co-glycolide) (PLGA) microspheres. Various batches of risperidone PLGA microspheres were designed and prepared successfully. The particle size and drug-loading rate of all the samples were determined by a laser diffraction particle size analyzer and high performance liquid chromatography (HPLC) system. Monte Carlo algorithm combined with partial least squares (MCPLS) method was applied to identify the outliers and choose the numbers of calibration set. Furthermore, a series of preprocessing methods were performed to remove signal noise in NIR spectra. Moving window PLS and radical basis function neural network (RBFNN) methods were employed to establish calibration model. Our data demonstrated that PLS-developed model was only suitable for drug loading analysis in risperidone PLGA microspheres. Comparatively, RBFNN-based predictive models possess better fitting quality, predictive effect, and stability for both drug loading rate and particle size analysis. The correlation coefficients of calibration set (Rc(2)) were 0.935 and 0.880, respectively. The performance of optimum RBFNN models was confirmed by independent verification test with 15 samples. Collectively, our method is successfully performed to monitor drug-loading rate and particle size during risperidone PLGA microspheres preparation.
Collapse
Affiliation(s)
- Jia Song
- School of Life Sciences, Jilin University, No. 2699, Qianjin Avenue, Changchun, Jilin, China
| | - Jing Xie
- School of Life Sciences, Jilin University, No. 2699, Qianjin Avenue, Changchun, Jilin, China
| | - Chenliang Li
- School of Life Sciences, Jilin University, No. 2699, Qianjin Avenue, Changchun, Jilin, China
| | - Jia-Hui Lu
- School of Life Sciences, Jilin University, No. 2699, Qianjin Avenue, Changchun, Jilin, China
| | - Qing-Fan Meng
- School of Life Sciences, Jilin University, No. 2699, Qianjin Avenue, Changchun, Jilin, China
| | - Zhaogang Yang
- Division of Pharmaceutics, College of Pharmacy, The Ohio State University, Columbus, OH, USA
| | - Robert J Lee
- School of Life Sciences, Jilin University, No. 2699, Qianjin Avenue, Changchun, Jilin, China; Division of Pharmaceutics, College of Pharmacy, The Ohio State University, Columbus, OH, USA
| | - Di Wang
- School of Life Sciences, Jilin University, No. 2699, Qianjin Avenue, Changchun, Jilin, China.
| | - Le-Sheng Teng
- School of Life Sciences, Jilin University, No. 2699, Qianjin Avenue, Changchun, Jilin, China.
| |
Collapse
|
26
|
Yun YH, Wang WT, Tan ML, Liang YZ, Li HD, Cao DS, Lu HM, Xu QS. A strategy that iteratively retains informative variables for selecting optimal variable subset in multivariate calibration. Anal Chim Acta 2014; 807:36-43. [DOI: 10.1016/j.aca.2013.11.032] [Citation(s) in RCA: 133] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2013] [Revised: 11/13/2013] [Accepted: 11/14/2013] [Indexed: 11/12/2022]
|
27
|
Burley JC, Aina A, Matousek P, Brignell C. Quantification of pharmaceuticals via transmission Raman spectroscopy: data sub-selection. Analyst 2014; 139:74-8. [DOI: 10.1039/c3an01293j] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
28
|
Zhang X, Li W, Yin B, Chen W, Kelly DP, Wang X, Zheng K, Du Y. Improvement of near infrared spectroscopic (NIRS) analysis of caffeine in roasted Arabica coffee by variable selection method of stability competitive adaptive reweighted sampling (SCARS). SPECTROCHIMICA ACTA. PART A, MOLECULAR AND BIOMOLECULAR SPECTROSCOPY 2013; 114:350-6. [PMID: 23786975 DOI: 10.1016/j.saa.2013.05.053] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/04/2013] [Revised: 04/08/2013] [Accepted: 05/10/2013] [Indexed: 05/26/2023]
Abstract
Coffee is the most heavily consumed beverage in the world after water, for which quality is a key consideration in commercial trade. Therefore, caffeine content which has a significant effect on the final quality of the coffee products requires to be determined fast and reliably by new analytical techniques. The main purpose of this work was to establish a powerful and practical analytical method based on near infrared spectroscopy (NIRS) and chemometrics for quantitative determination of caffeine content in roasted Arabica coffees. Ground coffee samples within a wide range of roasted levels were analyzed by NIR, meanwhile, in which the caffeine contents were quantitative determined by the most commonly used HPLC-UV method as the reference values. Then calibration models based on chemometric analyses of the NIR spectral data and reference concentrations of coffee samples were developed. Partial least squares (PLS) regression was used to construct the models. Furthermore, diverse spectra pretreatment and variable selection techniques were applied in order to obtain robust and reliable reduced-spectrum regression models. Comparing the respective quality of the different models constructed, the application of second derivative pretreatment and stability competitive adaptive reweighted sampling (SCARS) variable selection provided a notably improved regression model, with root mean square error of cross validation (RMSECV) of 0.375 mg/g and correlation coefficient (R) of 0.918 at PLS factor of 7. An independent test set was used to assess the model, with the root mean square error of prediction (RMSEP) of 0.378 mg/g, mean relative error of 1.976% and mean relative standard deviation (RSD) of 1.707%. Thus, the results provided by the high-quality calibration model revealed the feasibility of NIR spectroscopy for at-line application to predict the caffeine content of unknown roasted coffee samples, thanks to the short analysis time of a few seconds and non-destructive advantages of NIRS.
Collapse
Affiliation(s)
- Xuan Zhang
- Shanghai Key Laboratory of Functional Materials Chemistry, and Research Center of Analysis and Test, East China University of Science and Technology, Shanghai 200237, People's Republic of China
| | | | | | | | | | | | | | | |
Collapse
|
29
|
Li Z, Lu H, Yang J, Zeng X, Zhao L, Li H, Liao Q, Peng S, Zhou M, Wu M, Xiang J, Wang Y, Li G. Analysis of the raw serum peptidomic pattern in glioma patients. Clin Chim Acta 2013; 425:221-6. [DOI: 10.1016/j.cca.2013.08.002] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2013] [Revised: 07/17/2013] [Accepted: 08/02/2013] [Indexed: 12/19/2022]
|
30
|
Shi B, Zhao L, Zhi R, Xi X. Optimization of electronic nose sensor array by genetic algorithms in Xihu-Longjing Tea quality analysis. ACTA ACUST UNITED AC 2013. [DOI: 10.1016/j.mcm.2012.12.029] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
31
|
Yun YH, Li HD, Wood LRE, Fan W, Wang JJ, Cao DS, Xu QS, Liang YZ. An efficient method of wavelength interval selection based on random frog for multivariate spectral calibration. SPECTROCHIMICA ACTA. PART A, MOLECULAR AND BIOMOLECULAR SPECTROSCOPY 2013; 111:31-6. [PMID: 23602956 DOI: 10.1016/j.saa.2013.03.083] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/23/2013] [Revised: 03/08/2013] [Accepted: 03/16/2013] [Indexed: 05/16/2023]
Abstract
Wavelength selection is a critical step for producing better prediction performance when applied to spectral data. Considering the fact that the vibrational and rotational spectra have continuous features of spectral bands, we propose a novel method of wavelength interval selection based on random frog, called interval random frog (iRF). To obtain all the possible continuous intervals, spectra are first divided into intervals by moving window of a fix width over the whole spectra. These overlapping intervals are ranked applying random frog coupled with PLS and the optimal ones are chosen. This method has been applied to two near-infrared spectral datasets displaying higher efficiency in wavelength interval selection than others. The source code of iRF can be freely downloaded for academy research at the website: http://code.google.com/p/multivariate-calibration/downloads/list.
Collapse
Affiliation(s)
- Yong-Huan Yun
- College of Chemistry and Chemical Engineering, Central South University, Changsha 410083, PR China
| | | | | | | | | | | | | | | |
Collapse
|
32
|
Andries JPM, Heyden YV, Buydens LMC. Predictive-property-ranked variable reduction with final complexity adapted models in partial least squares modeling for multiple responses. Anal Chem 2013; 85:5444-53. [PMID: 23679857 DOI: 10.1021/ac400339e] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
For partial least-squares regression with one response (PLS1), many variable-reduction methods have been developed. However, only a few address the case of multiple-response partial-least-squares (PLS2) modeling. The calibration performance of PLS1 can be improved by elimination of uninformative variables. Many variable-reduction methods are based on various PLS-model-related parameters, called predictor-variable properties. Recently, an important adaptation, in which the model complexity is optimized, was introduced in these methods. This method was called Predictive-Property-Ranked Variable Reduction with Final Complexity Adapted Models, denoted as PPRVR-FCAM or simply FCAM. In this study, variable reduction for PLS2 models, using an adapted FCAM method, FCAM-PLS2, is investigated. The utility and effectiveness of four new predictor-variable properties, derived from the multiple response PLS2 regression coefficients, are studied for six data sets consisting of ultraviolet-visible (UV-vis) spectra, near-infrared (NIR) spectra, NMR spectra, and two simulated sets, one with correlated and one with uncorrelated responses. The four properties include the mean of the absolute values as well as the norm of the PLS2 regression coefficients and their significances. The four properties were found to be applicable by the FCAM-PLS2 method for variable reduction. The predictive abilities of models resulting from the four properties are similar. The norm of the PLS2 regression coefficients has the best selective abilities, low numbers of variables with an informative meaning to the responses are retained. The significance of the mean of the PLS2 regression coefficients is found to be the least-selective property.
Collapse
Affiliation(s)
- Jan P M Andries
- Department of Life Sciences, Avans Hogeschool, University of Professional Education, P.O. Box 90116, 4800 RA Breda, The Netherlands
| | | | | |
Collapse
|
33
|
Andries JP, Heyden YV, Buydens LM. Predictive-property-ranked variable reduction in partial least squares modelling with final complexity adapted models: Comparison of properties for ranking. Anal Chim Acta 2013; 760:34-45. [DOI: 10.1016/j.aca.2012.11.012] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2012] [Revised: 10/31/2012] [Accepted: 11/08/2012] [Indexed: 11/25/2022]
|
34
|
Xu H, Qi B, Sun T, Fu X, Ying Y. Variable selection in visible and near-infrared spectra: Application to on-line determination of sugar content in pears. J FOOD ENG 2012. [DOI: 10.1016/j.jfoodeng.2011.09.022] [Citation(s) in RCA: 90] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
|
35
|
Affiliation(s)
- Jennifer Pittman
- Jennifer Pittman is Visiting Assistant Professor, Institute of Statistics and Decision Sciences, Duke University, Durham, NC 27708
| |
Collapse
|
36
|
Andries JP, Vander Heyden Y, Buydens LM. Improved variable reduction in partial least squares modelling based on Predictive-Property-Ranked Variables and adaptation of partial least squares complexity. Anal Chim Acta 2011; 705:292-305. [DOI: 10.1016/j.aca.2011.06.037] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2011] [Revised: 06/16/2011] [Accepted: 06/21/2011] [Indexed: 10/18/2022]
|
37
|
|
38
|
A new and efficient variable selection algorithm based on ant colony optimization. Applications to near infrared spectroscopy/partial least-squares analysis. Anal Chim Acta 2011; 699:18-25. [DOI: 10.1016/j.aca.2011.04.061] [Citation(s) in RCA: 85] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2011] [Revised: 04/06/2011] [Accepted: 04/28/2011] [Indexed: 11/19/2022]
|
39
|
Muehlethaler C, Massonnet G, Esseiva P. The application of chemometrics on Infrared and Raman spectra as a tool for the forensic analysis of paints. Forensic Sci Int 2011; 209:173-82. [DOI: 10.1016/j.forsciint.2011.01.025] [Citation(s) in RCA: 89] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2010] [Revised: 01/17/2011] [Accepted: 01/19/2011] [Indexed: 11/16/2022]
|
40
|
|
41
|
AFIUNI-ZADEH S, AZIMI G. A QSAR Study for Modeling of 8-Azaadenine Analogues Proposed as A1 Adenosine Receptor Antagonists Using Genetic Algorithm Coupling Adaptive Neuro-Fuzzy Inference System (ANFIS). ANAL SCI 2010; 26:897-902. [DOI: 10.2116/analsci.26.897] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
42
|
Singh G, Kaur S, Naik DG, Gupta VK. Evolutionary computing approach for evaluating flory distribution curves in gel permeation chromatography: Study of the poly(1-octene) system. J Appl Polym Sci 2010. [DOI: 10.1002/app.32097] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
43
|
Hegazy MA, Metwaly FH, Abdelkawy M, Abdelwahab NS. Spectrophotometric and chemometric determination of hydrochlorothiazide and spironolactone in binary mixture in the presence of their impurities and degradants. Drug Test Anal 2010; 2:243-51. [DOI: 10.1002/dta.125] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
44
|
Lin P, Chen Y, He Y. Identification of Geographical Origin of Olive Oil Using Visible and Near-Infrared Spectroscopy Technique Combined with Chemometrics. FOOD BIOPROCESS TECH 2009. [DOI: 10.1007/s11947-009-0302-z] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
45
|
Gourzi M, Rouane A, Guelaz R, Nadi M, Jaspard F. Study of a new electromagnetic sensor for glycaemia measurement:in vitroresults on blood pig. J Med Eng Technol 2009; 27:276-81. [PMID: 14602519 DOI: 10.1080/0309190031000098845] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
In this paper, we develop a non-invasive method to measure glycaemia levels. The method is based on an electromagnetic sensor associated with a specific electronic card. Results obtained on whole pig blood show the influence of the temperature and the linearity between the glucose concentration and the process card signal output. Measurements are reproducible and the resolution obtained permits determination of the glucose level around 0.1 g l(-1).
Collapse
Affiliation(s)
- M Gourzi
- L.I.E.N., H. Poincare University, B.P. 239 - 54506 Vandoeuvre Lès Nancy, France
| | | | | | | | | |
Collapse
|
46
|
Li H, Liang Y, Xu Q, Cao D. Key wavelengths screening using competitive adaptive reweighted sampling method for multivariate calibration. Anal Chim Acta 2009; 648:77-84. [DOI: 10.1016/j.aca.2009.06.046] [Citation(s) in RCA: 679] [Impact Index Per Article: 45.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2009] [Revised: 06/18/2009] [Accepted: 06/18/2009] [Indexed: 10/20/2022]
|
47
|
Two new extensions of principal component transform to compute a PLS2 model between two wide matrices: PCT-PLS2 and segmented PCT-PLS2. Anal Chim Acta 2009; 642:37-44. [DOI: 10.1016/j.aca.2009.01.016] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2008] [Revised: 01/12/2009] [Accepted: 01/12/2009] [Indexed: 11/18/2022]
|
48
|
Qu N, Mi H, Wang B, Ren Y. Application of GA-RBF networks to the nondestructive determination of active component in pharmaceutical powder by NIR spectroscopy. J Taiwan Inst Chem Eng 2009. [DOI: 10.1016/j.jtice.2008.08.002] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
49
|
Kramer KE, Small GW. Digital filtering and model updating methods for improving the robustness of near-infrared multivariate calibrations. APPLIED SPECTROSCOPY 2009; 63:246-255. [PMID: 19215656 DOI: 10.1366/000370209787392076] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
Fourier transform near-infrared (NIR) transmission spectra are used for quantitative analysis of glucose for 17 sets of prediction data sampled as much as six months outside the timeframe of the corresponding calibration data. Aqueous samples containing physiological levels of glucose in a matrix of bovine serum albumin and triacetin are used to simulate clinical samples such as blood plasma. Background spectra of a single analyte-free matrix sample acquired during the instrumental warm-up period on the prediction day are used for calibration updating and for determining the optimal frequency response of a preprocessing infinite impulse response time-domain digital filter. By tuning the filter and the calibration model to the specific instrumental response associated with the prediction day, the calibration model is given enhanced ability to operate over time. This methodology is demonstrated in conjunction with partial least squares calibration models built with a spectral range of 4700-4300 cm(-1). By using a subset of the background spectra to evaluate the prediction performance of the updated model, projections can be made regarding the success of subsequent glucose predictions. If a threshold standard error of prediction (SEP) of 1.5 mM is used to establish successful model performance with the glucose samples, the corresponding threshold for the SEP of the background spectra is found to be 1.3 mM. For calibration updating in conjunction with digital filtering, SEP values of all 17 prediction sets collected over 3-178 days displaced from the calibration data are below 1.5 mM. In addition, the diagnostic based on the background spectra correctly assesses the prediction performance in 16 of the 17 cases.
Collapse
Affiliation(s)
- Kirsten E Kramer
- Optical Science and Technology Center and Department of Chemistry, University of Iowa, Iowa City, Iowa 52242, USA
| | | |
Collapse
|
50
|
Bayesian linear regression and variable selection for spectroscopic calibration. Anal Chim Acta 2009; 631:13-21. [DOI: 10.1016/j.aca.2008.10.014] [Citation(s) in RCA: 58] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2008] [Revised: 10/03/2008] [Accepted: 10/03/2008] [Indexed: 11/22/2022]
|