1
|
Yue J, Zhang H, Gao L, Tian W, Luo J, Nie L, Li L, Wu A, Zang H. Benchtop and different miniaturized near-infrared spectrometers application study: Calibration transfer and 2D-COS for in-situ analysis of moisture content in HPMC. SPECTROCHIMICA ACTA. PART A, MOLECULAR AND BIOMOLECULAR SPECTROSCOPY 2025; 333:125889. [PMID: 39955911 DOI: 10.1016/j.saa.2025.125889] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/04/2024] [Revised: 01/23/2025] [Accepted: 02/08/2025] [Indexed: 02/18/2025]
Abstract
The demand for miniaturized near-infrared (NIR) spectrometers has surged due to their potential for in-situ analysis. However, their predictive accuracy has not yet matched that of traditional benchtop instruments. This study evaluates the effectiveness of rapid quantitative moisture analysis in hydroxypropyl methylcellulose (HPMC) using a benchtop spectrometer, Antaris II from Thermo Fisher Scientific Inc., and five miniaturized spectrometers (MicroNIR 1700 from Viavi Solutions, OTO-SW2540 from OtOPhotonics, IAS DLP 1700 from Dallas, NIRONE Sensor 2.2 from Helsinki, and NIRS M1800 from Alian Optoelectronics). This study employed an Improved Principal Component Analysis (IPCA) transfer method to standardize spectra from the diverse miniaturized NIR spectrometers, facilitating calibration transfer across different spectroscopic technologies. The benchtop (Antaris II) delivered the most superior results, indicating that miniaturized spectrometers must refine their methodologies to approach the predictive performance of benchtop counterparts. Further, this work conducted a two-dimensional correlation spectroscopy (2D-COS) analysis on the spectra from various spectrometers. This analysis bolstered the partial least squares regression (PLSR) model, highlighting discrepancies between miniaturized and benchtop spectrometers and deepening understanding of the factors influencing the PLSR models. The IPCA leverages the benchtop model to enhance the precision and reliability of miniaturized NIR spectrometers. This innovative and versatile research approach aims to further optimize the performance of miniaturized NIR spectrometers for specific applications, representing a significant step forward in their development.
Collapse
Affiliation(s)
- Jianan Yue
- NMPA Key Laboratory for Technology Research and Evaluation of Drug Products, School of Pharmaceutical Sciences, Cheeloo College of Medicine, Shandong University, Jinan 250012 China; Key Laboratory of Chemical Biology (Ministry of Education), Shandong University, Jinan 250012 China
| | - Hui Zhang
- National Glycoengineering Research Center, Shandong University, Qingdao 266237 China
| | - Lele Gao
- NMPA Key Laboratory for Technology Research and Evaluation of Drug Products, School of Pharmaceutical Sciences, Cheeloo College of Medicine, Shandong University, Jinan 250012 China; Key Laboratory of Chemical Biology (Ministry of Education), Shandong University, Jinan 250012 China
| | - Weilu Tian
- Shandong Provincial Hospital Affiliated to Shandong First Medical University, Jinan 250021 China
| | - Junsha Luo
- NMPA Key Laboratory for Technology Research and Evaluation of Drug Products, School of Pharmaceutical Sciences, Cheeloo College of Medicine, Shandong University, Jinan 250012 China; Key Laboratory of Chemical Biology (Ministry of Education), Shandong University, Jinan 250012 China
| | - Lei Nie
- NMPA Key Laboratory for Technology Research and Evaluation of Drug Products, School of Pharmaceutical Sciences, Cheeloo College of Medicine, Shandong University, Jinan 250012 China; Key Laboratory of Chemical Biology (Ministry of Education), Shandong University, Jinan 250012 China
| | - Lian Li
- NMPA Key Laboratory for Technology Research and Evaluation of Drug Products, School of Pharmaceutical Sciences, Cheeloo College of Medicine, Shandong University, Jinan 250012 China; Key Laboratory of Chemical Biology (Ministry of Education), Shandong University, Jinan 250012 China
| | - Aoli Wu
- NMPA Key Laboratory for Technology Research and Evaluation of Drug Products, School of Pharmaceutical Sciences, Cheeloo College of Medicine, Shandong University, Jinan 250012 China; Key Laboratory of Chemical Biology (Ministry of Education), Shandong University, Jinan 250012 China
| | - Hengchang Zang
- NMPA Key Laboratory for Technology Research and Evaluation of Drug Products, School of Pharmaceutical Sciences, Cheeloo College of Medicine, Shandong University, Jinan 250012 China; Key Laboratory of Chemical Biology (Ministry of Education), Shandong University, Jinan 250012 China; National Glycoengineering Research Center, Shandong University, Jinan 250012 China.
| |
Collapse
|
2
|
Li B, Li W, Guo J, Wang H, Wan R, Liu Y, Fan M, Wang C, Yang S, Zhao L, Nie C. Outlier Removal with Weight Penalization and Aggregation: A Robust Variable Selection Method for Enhancing Near-Infrared Spectral Analysis Performance. Anal Chem 2025; 97:7325-7332. [PMID: 39970051 DOI: 10.1021/acs.analchem.4c07007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/21/2025]
Abstract
Full-wavelength near-infrared (NIR) spectroscopy faces significant challenges due to the strong collinearity among spectral variables and the presence of variables that are highly sensitive to sample fluctuations. Additionally, not all spectral variables contribute equally to the NIR model. Weakly influential variables, although not important on their own, can provide substantial improvement when combined with stronger variables, thus increasing both model stability and prediction accuracy. Therefore, this study proposes a new variable selection method called outlier removal with weight penalization and aggregation (OR-WPA). The method begins by removing outlier spectral variables with high coefficient of variation, which enhances model stability. During the variable selection process, multiple submodels are constructed based on variable subsets, with variable weights assigned according to the absolute values of regression coefficients. A moving window is applied to average the weights, and variables with excessively high weights are penalized, promoting the selection of weakly influential variables that positively contribute to model accuracy. The variable space is iteratively reduced, and the subset of variables associated with the highest predictive accuracy is selected as the final characteristic variable combination. The OR-WPA method was evaluated on three NIR spectral data sets, involving corn, heated tobacco substrate, and flue-cured tobacco. The results were compared with three advanced variable selection methods: Monte Carlo uninformative variable elimination, competitive adaptive reweighted sampling, and bootstrapping soft shrinkage. The results indicate that OR-WPA demonstrates better predictive performance, particularly in predicting low-content components, where it significantly enhances both the accuracy and stability of the NIR model.
Collapse
Affiliation(s)
- Beibei Li
- Laboratory of Tobacco Chemistry, Zhengzhou Tobacco Research Institute of CNTC, Zhengzhou 450001, China
| | - Wenting Li
- Laboratory of Tobacco Chemistry, Zhengzhou Tobacco Research Institute of CNTC, Zhengzhou 450001, China
| | - Junwei Guo
- Laboratory of Tobacco Chemistry, Zhengzhou Tobacco Research Institute of CNTC, Zhengzhou 450001, China
| | - Hongbo Wang
- Laboratory of Tobacco Chemistry, Zhengzhou Tobacco Research Institute of CNTC, Zhengzhou 450001, China
| | - Ran Wan
- Laboratory of Tobacco Chemistry, Zhengzhou Tobacco Research Institute of CNTC, Zhengzhou 450001, China
| | - Yu Liu
- Laboratory of Tobacco Chemistry, Zhengzhou Tobacco Research Institute of CNTC, Zhengzhou 450001, China
| | - Meijuan Fan
- Laboratory of Tobacco Chemistry, Zhengzhou Tobacco Research Institute of CNTC, Zhengzhou 450001, China
| | - Cong Wang
- Laboratory of Tobacco Chemistry, Zhengzhou Tobacco Research Institute of CNTC, Zhengzhou 450001, China
| | - Song Yang
- Laboratory of Tobacco Chemistry, Zhengzhou Tobacco Research Institute of CNTC, Zhengzhou 450001, China
| | - Le Zhao
- Laboratory of Tobacco Chemistry, Zhengzhou Tobacco Research Institute of CNTC, Zhengzhou 450001, China
| | - Cong Nie
- Laboratory of Tobacco Chemistry, Zhengzhou Tobacco Research Institute of CNTC, Zhengzhou 450001, China
| |
Collapse
|
3
|
Sha W, Hu K, Weng S. Statistic and Network Features of RGB and Hyperspectral Imaging for Determination of Black Root Mold Infection in Apples. Foods 2023; 12:foods12081608. [PMID: 37107403 PMCID: PMC10137991 DOI: 10.3390/foods12081608] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2023] [Revised: 03/24/2023] [Accepted: 03/31/2023] [Indexed: 04/29/2023] Open
Abstract
Apples damaged by black root mold (BRM) lose moisture, vitamins, and minerals as well as carry dangerous toxins. Determination of the infection degree can allow for customized use of apples, reduce financial losses, and ensure food safety. In this study, red-green-blue (RGB) imaging and hyperspectral imaging (HSI) are combined to detect the infection degree of BRM in apple fruits. First, RGB and HSI images of healthy, mildly, moderately, and severely infected fruits are measured, and those with effective wavelengths (EWs) are screened from HSI by random frog. Second, the statistic and network features of images are extracted by using color moment and convolutional neural network. Meanwhile, random forest (RF), K-nearest neighbor, and support vector machine are used to construct classification models with the above two features of RGB and HSI images of EWs. Optimal results with the 100% accuracy of training set and 96% accuracy of prediction set are obtained by RF with the statistic and network features of the two images, outperforming the other cases. The proposed method furnishes an accurate and effective solution for determining the BRM infection degree in apples.
Collapse
Affiliation(s)
- Wen Sha
- School of Electrical Engineering and Automation, Anhui University, 111 Jiulong Road Hefei, Hefei 230601, China
- Engineering Research Center of Autonomous Unmanned System Technology, Ministry of Education, Anhui University, 111 Jiulong Road Hefei, Hefei 230601, China
| | - Kang Hu
- School of Electrical Engineering and Automation, Anhui University, 111 Jiulong Road Hefei, Hefei 230601, China
| | - Shizhuang Weng
- National Engineering Research Center for Agro-Ecological Big Data Analysis & Application, Anhui University, 111 Jiulong Road Hefei, Hefei 230601, China
| |
Collapse
|
4
|
de Araújo Gomes A, Azcarate SM, Diniz PHGD, de Sousa Fernandes DD, Veras G. Variable selection in the chemometric treatment of food data: A tutorial review. Food Chem 2022; 370:131072. [PMID: 34537434 DOI: 10.1016/j.foodchem.2021.131072] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2021] [Revised: 07/15/2021] [Accepted: 09/03/2021] [Indexed: 12/13/2022]
Abstract
Food analysis covers aspects of quality and detection of possible frauds to ensure the integrity of the food. The arsenal of analytical instruments available for food analysis is broad and allows the generation of a large volume of information per sample. But this instrumental information may not yet give the desired answer; it must be processed to provide a final answer for decision making. The possibility of discarding non-informative and/or redundant signals can lead to models of better accuracy, robustness, and chemical interpretability, in line with the principle of parsimony. Thus, in this tutorial review, we cover aspects of variable selection in food analysis, including definitions, theoretical aspects of variable selection, and case studies showing the advantages of variable selection-based models concerning the use of a wide range of non-informative and redundant instrumental information in the analysis of food matrices.
Collapse
Affiliation(s)
- Adriano de Araújo Gomes
- Universidade Federal do Rio Grande do Sul, Instituto de Química, 90650-001 Porto Alegre, RS, Brazil
| | - Silvana M Azcarate
- Facultad de Ciencias Exactas y Naturales, Universidad Nacional de La Pampa, Instituto de Ciencias de la Tierra y Ambientales de La Pampa (INCITAP), Av. Uruguay 151, 630 0 Santa Rosa, La Pampa, Argentina; Consejo Nacional de Investigaciones Científicas y Tecnicas (CONICET), Godoy Cruz 2290 CABA (C1425FQB), Argentina
| | | | | | - Germano Veras
- Laboratório de Química Analítica e Quimiometria, Centro de Ciências e Tecnologia, Universidade Estadual da Paraíba, 58429-500 Campina Grande, PB, Brazil
| |
Collapse
|
5
|
Weng S, Chu Z, Wang M, Han K, Zhu G, Liu C, Li X, Huang L. Reflectance spectroscopy with operator difference for determination of behenic acid in edible vegetable oils by using convolutional neural network and polynomial correction. Food Chem 2021; 367:130668. [PMID: 34343814 DOI: 10.1016/j.foodchem.2021.130668] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2021] [Revised: 07/18/2021] [Accepted: 07/20/2021] [Indexed: 12/23/2022]
Abstract
A novel polynomial correction method, order-adaptive polynomial correction (OAPC), was proposed to correct reflectance spectra with operator differences, and convolutional neural network (CNN) was used to develop analysis model to predict behenic acid in edible oils. With application of OAPC, CNN performed well with coefficient of determination of correction (R2cor) of 0.8843 and root mean square error of correction (RMSEcor) of 0.1182, outperforming partial least squares regression, support vector regression and random forest with OAPC, as well as the cases without OAPC. Based on 16 effective wavelengths selected by combination of bootstrapping soft shrinkage, random frog and Pearson's correlation, CNN and OAPC exhibited excellent performance with R2cor of 0.9560 and RMSEcor of 0.0730. Meanwhile, only 5% correction samples were selected by Kennard-Stone for OAPC. Overall, the proposed method could alleviate the impact of operator differences on spectral analysis, thereby providing potential to correct differences from measurement instruments or environments.
Collapse
Affiliation(s)
- Shizhuang Weng
- National Engineering Research Center for Agro-Ecological Big Data Analysis & Application, Anhui University, 111 Jiulong Road, Hefei, China.
| | - Zhaojie Chu
- National Engineering Research Center for Agro-Ecological Big Data Analysis & Application, Anhui University, 111 Jiulong Road, Hefei, China
| | - Manqin Wang
- National Engineering Research Center for Agro-Ecological Big Data Analysis & Application, Anhui University, 111 Jiulong Road, Hefei, China
| | - Kaixuan Han
- National Engineering Research Center for Agro-Ecological Big Data Analysis & Application, Anhui University, 111 Jiulong Road, Hefei, China
| | - Gongqin Zhu
- National Engineering Research Center for Agro-Ecological Big Data Analysis & Application, Anhui University, 111 Jiulong Road, Hefei, China
| | - Cunchuan Liu
- National Engineering Research Center for Agro-Ecological Big Data Analysis & Application, Anhui University, 111 Jiulong Road, Hefei, China
| | - Xinhua Li
- National Engineering Research Center for Agro-Ecological Big Data Analysis & Application, Anhui University, 111 Jiulong Road, Hefei, China
| | - Linsheng Huang
- National Engineering Research Center for Agro-Ecological Big Data Analysis & Application, Anhui University, 111 Jiulong Road, Hefei, China
| |
Collapse
|
6
|
SUN JJ, YANG WD, FENG MC, XIAO LJ, SUN H, KUBAR MS. Adaptive Variable Re-weighting and Shrinking Approach for Variable Selection in Multivariate Calibration for Near-infrared Spectroscopy. CHINESE JOURNAL OF ANALYTICAL CHEMISTRY 2021. [DOI: 10.1016/s1872-2040(21)60102-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
7
|
Sun J, Yang W, Feng M, Liu Q, Kubar MS. An efficient variable selection method based on random frog for the multivariate calibration of NIR spectra. RSC Adv 2020; 10:16245-16253. [PMID: 35498850 PMCID: PMC9052783 DOI: 10.1039/d0ra00922a] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2020] [Accepted: 04/08/2020] [Indexed: 11/29/2022] Open
Abstract
Variable selection is a critical step for spectrum modeling. In this study, a new method of variable interval selection based on random frog (RF), known as Interval Selection based on Random Frog (ISRF), is developed. In the ISRF algorithm, RF is used to search the most likely informative variables and then, a local search is applied to expand the interval width of the informative variables. Through multiple runs and visualization of the results, the best informative interval variables are obtained. This method was tested on three near infrared (NIR) datasets. Four variable selection methods, namely, genetic algorithm PLS (GA-PLS), random frog, interval random frog (iRF) and interval variable iterative space shrinkage approach (iVISSA) were used for comparison. The results show that the proposed method is very efficient to find the best interval variables and improve the model's prediction performance and interpretation.
Collapse
Affiliation(s)
- Jingjing Sun
- College of Agriculture, Shanxi Agricultural University South Min-Xian Road, Taigu Shanxi China
- College of Arts and Science, Shanxi Agricultural University South Min-Xian Road, Taigu Shanxi China
| | - Wude Yang
- College of Agriculture, Shanxi Agricultural University South Min-Xian Road, Taigu Shanxi China
| | - Meichen Feng
- College of Agriculture, Shanxi Agricultural University South Min-Xian Road, Taigu Shanxi China
| | - Qifang Liu
- College of Information Science and Engineering, Shanxi Agricultural University South Min-Xian Road, Taigu Shanxi China
| | - Muhammad Saleem Kubar
- College of Agriculture, Shanxi Agricultural University South Min-Xian Road, Taigu Shanxi China
| |
Collapse
|
8
|
Yun YH, Li HD, Deng BC, Cao DS. An overview of variable selection methods in multivariate analysis of near-infrared spectra. Trends Analyt Chem 2019. [DOI: 10.1016/j.trac.2019.01.018] [Citation(s) in RCA: 118] [Impact Index Per Article: 19.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
9
|
Guo Q, Nie L, Li L, Zang H. Estimation of the critical quality attributes for hydroxypropyl methylcellulose with near-infrared spectroscopy and chemometrics. SPECTROCHIMICA ACTA. PART A, MOLECULAR AND BIOMOLECULAR SPECTROSCOPY 2017; 177:158-163. [PMID: 28160714 DOI: 10.1016/j.saa.2017.01.012] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/09/2016] [Revised: 12/31/2016] [Accepted: 01/04/2017] [Indexed: 06/06/2023]
Abstract
With the implementation of quality by design (QbD), critical attributes of raw material (drug substance and excipients) are of significantly importance in pharmaceutical manufacturing process. It is desirable for the quality control of critical material attributes (CMAs) of excipients to ensure the quality of end product. This paper explored the feasibility of an at-line method for the quantitative analysis of hydroxypropoxy group in hydroxypropyl methylcellulose (HPMC) with near infrared spectroscopy (NIRS). Hydroxypropoxy group content can be seen as a CMA of HPMC for quality control. The partial least squares (PLS) model was built with 61 samples including 47 samples as calibration set, 14 samples as validation set by sample set partitioning based on joint x-y distances (SPXY) method. Multiplicative scattering correction (MSC) combined with Savitzkye-Golay (SG) smoothing with first derivative was used as the appropriate pretreatment method. Three variable selection methods including interval partial least-squares (iPLS), competitive adaptive reweighted Sampling (CARS), and the combination of the two methods (iPLS-CARS) were performed for optimizing the model. The results indicated that NIRS could predict rapidly and effectively the content of hydroxypropoxy group in HPMC. NIRS could be a potential method for the quality control of CMAs.
Collapse
Affiliation(s)
- Qingli Guo
- National Glycoengineering Research Center, Shandong University, Wenhuaxi Road 44, Jinan 250012, China; School of Pharmaceutical Sciences, Shandong University, Wenhuaxi Road 44, Jinan 250012, China
| | - Lei Nie
- School of Pharmaceutical Sciences, Shandong University, Wenhuaxi Road 44, Jinan 250012, China
| | - Lian Li
- National Glycoengineering Research Center, Shandong University, Wenhuaxi Road 44, Jinan 250012, China; School of Pharmaceutical Sciences, Shandong University, Wenhuaxi Road 44, Jinan 250012, China
| | - Hengchang Zang
- National Glycoengineering Research Center, Shandong University, Wenhuaxi Road 44, Jinan 250012, China; School of Pharmaceutical Sciences, Shandong University, Wenhuaxi Road 44, Jinan 250012, China.
| |
Collapse
|
10
|
Zhang H, Jiang H, Liu G, Mei C, Huang Y. Identification of Radix puerariae starch from different geographical origins by FT-NIR spectroscopy. INTERNATIONAL JOURNAL OF FOOD PROPERTIES 2017. [DOI: 10.1080/10942912.2017.1283325] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Affiliation(s)
- Hang Zhang
- School of Electrical and Information Engineering, Jiangsu University, Zhenjiang, PR China
| | - Hui Jiang
- School of Electrical and Information Engineering, Jiangsu University, Zhenjiang, PR China
| | - Guohai Liu
- School of Electrical and Information Engineering, Jiangsu University, Zhenjiang, PR China
| | - Congli Mei
- School of Electrical and Information Engineering, Jiangsu University, Zhenjiang, PR China
| | - Yonghong Huang
- School of Electrical and Information Engineering, Jiangsu University, Zhenjiang, PR China
| |
Collapse
|
11
|
Deng BC, Yun YH, Cao DS, Yin YL, Wang WT, Lu HM, Luo QY, Liang YZ. A bootstrapping soft shrinkage approach for variable selection in chemical modeling. Anal Chim Acta 2016; 908:63-74. [PMID: 26826688 DOI: 10.1016/j.aca.2016.01.001] [Citation(s) in RCA: 86] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2015] [Revised: 12/14/2015] [Accepted: 01/04/2016] [Indexed: 10/22/2022]
Abstract
In this study, a new variable selection method called bootstrapping soft shrinkage (BOSS) method is developed. It is derived from the idea of weighted bootstrap sampling (WBS) and model population analysis (MPA). The weights of variables are determined based on the absolute values of regression coefficients. WBS is applied according to the weights to generate sub-models and MPA is used to analyze the sub-models to update weights for variables. The optimization procedure follows the rule of soft shrinkage, in which less important variables are not eliminated directly but are assigned smaller weights. The algorithm runs iteratively and terminates until the number of variables reaches one. The optimal variable set with the lowest root mean squared error of cross-validation (RMSECV) is selected. The method was tested on three groups of near infrared (NIR) spectroscopic datasets, i.e. corn datasets, diesel fuels datasets and soy datasets. Three high performing variable selection methods, i.e. Monte Carlo uninformative variable elimination (MCUVE), competitive adaptive reweighted sampling (CARS) and genetic algorithm partial least squares (GA-PLS) are used for comparison. The results show that BOSS is promising with improved prediction performance. The Matlab codes for implementing BOSS are freely available on the website: http://www.mathworks.com/matlabcentral/fileexchange/52770-boss.
Collapse
Affiliation(s)
- Bai-Chuan Deng
- College of Animal Science, South China Agricultural University, Guangzhou 510642, PR China; School of Chemistry and Chemical Engineering, Central South University, Changsha 410083, PR China; Institute of Subtropical Agriculture, Chinese Academy of Sciences, Changsha 410125, PR China
| | - Yong-Huan Yun
- School of Chemistry and Chemical Engineering, Central South University, Changsha 410083, PR China
| | - Dong-Sheng Cao
- School of Pharmaceutical Sciences, Central South University, Changsha 410083, PR China.
| | - Yu-Long Yin
- College of Animal Science, South China Agricultural University, Guangzhou 510642, PR China; Institute of Subtropical Agriculture, Chinese Academy of Sciences, Changsha 410125, PR China
| | - Wei-Ting Wang
- School of Chemistry and Chemical Engineering, Central South University, Changsha 410083, PR China
| | - Hong-Mei Lu
- School of Chemistry and Chemical Engineering, Central South University, Changsha 410083, PR China
| | - Qian-Yi Luo
- School of Chemistry and Chemical Engineering, Central South University, Changsha 410083, PR China
| | - Yi-Zeng Liang
- School of Chemistry and Chemical Engineering, Central South University, Changsha 410083, PR China.
| |
Collapse
|
12
|
Chen J, Ma Q, Hu X, Zhang M, Qin D, Lu X. Gene selection and cancer classification using Monte Carlo and nonnegative matrix factorization. RSC Adv 2016. [DOI: 10.1039/c6ra05694f] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Cancer classification is a key problem for identifying the genomic biomarkers and treating cancerous tumors in clinical research.
Collapse
Affiliation(s)
- Jing Chen
- Key Laboratory of Bioelectrochemistry & Environmental Analysis of Gansu Province
- College of Chemistry & Chemical Engineering
- Northwest Normal University
- P. R. China
| | - Qin Ma
- Key Laboratory of Bioelectrochemistry & Environmental Analysis of Gansu Province
- College of Chemistry & Chemical Engineering
- Northwest Normal University
- P. R. China
| | - Xiaoyan Hu
- Key Laboratory of Bioelectrochemistry & Environmental Analysis of Gansu Province
- College of Chemistry & Chemical Engineering
- Northwest Normal University
- P. R. China
| | - Miao Zhang
- Key Laboratory of Bioelectrochemistry & Environmental Analysis of Gansu Province
- College of Chemistry & Chemical Engineering
- Northwest Normal University
- P. R. China
| | - Dongdong Qin
- Key Laboratory of Bioelectrochemistry & Environmental Analysis of Gansu Province
- College of Chemistry & Chemical Engineering
- Northwest Normal University
- P. R. China
| | - Xiaoquan Lu
- Key Laboratory of Bioelectrochemistry & Environmental Analysis of Gansu Province
- College of Chemistry & Chemical Engineering
- Northwest Normal University
- P. R. China
| |
Collapse
|
13
|
Deng BC, Yun YH, Liang YZ, Yi LZ. A novel variable selection approach that iteratively optimizes variable space using weighted binary matrix sampling. Analyst 2015; 139:4836-45. [PMID: 25083512 DOI: 10.1039/c4an00730a] [Citation(s) in RCA: 74] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
In this study, a new optimization algorithm called the Variable Iterative Space Shrinkage Approach (VISSA) that is based on the idea of model population analysis (MPA) is proposed for variable selection. Unlike most of the existing optimization methods for variable selection, VISSA statistically evaluates the performance of variable space in each step of optimization. Weighted binary matrix sampling (WBMS) is proposed to generate sub-models that span the variable subspace. Two rules are highlighted during the optimization procedure. First, the variable space shrinks in each step. Second, the new variable space outperforms the previous one. The second rule, which is rarely satisfied in most of the existing methods, is the core of the VISSA strategy. Compared with some promising variable selection methods such as competitive adaptive reweighted sampling (CARS), Monte Carlo uninformative variable elimination (MCUVE) and iteratively retaining informative variables (IRIV), VISSA showed better prediction ability for the calibration of NIR data. In addition, VISSA is user-friendly; only a few insensitive parameters are needed, and the program terminates automatically without any additional conditions. The Matlab codes for implementing VISSA are freely available on the website: https://sourceforge.net/projects/multivariateanalysis/files/VISSA/.
Collapse
Affiliation(s)
- Bai-chuan Deng
- Department of Chemistry, University of Bergen, Bergen N-5007, Norway
| | | | | | | |
Collapse
|
14
|
Deng BC, Yun YH, Ma P, Lin CC, Ren DB, Liang YZ. A new method for wavelength interval selection that intelligently optimizes the locations, widths and combinations of the intervals. Analyst 2015; 140:1876-85. [PMID: 25665981 DOI: 10.1039/c4an02123a] [Citation(s) in RCA: 59] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
In this study, a new algorithm for wavelength interval selection, known as interval variable iterative space shrinkage approach (iVISSA), is proposed based on the VISSA algorithm. It combines global and local searches to iteratively and intelligently optimize the locations, widths and combinations of the spectral intervals. In the global search procedure, it inherits the merit of soft shrinkage from VISSA to search the locations and combinations of informative wavelengths, whereas in the local search procedure, it utilizes the information of continuity in spectroscopic data to determine the widths of wavelength intervals. The global and local search procedures are carried out alternatively to realize wavelength interval selection. This method was tested using three near infrared (NIR) datasets. Some high-performing wavelength selection methods, such as synergy interval partial least squares (siPLS), moving window partial least squares (MW-PLS), competitive adaptive reweighted sampling (CARS), genetic algorithm PLS (GA-PLS) and interval random frog (iRF), were used for comparison. The results show that the proposed method is very promising with good results both on prediction capability and stability. The MATLAB codes for implementing iVISSA are freely available on the website: .
Collapse
Affiliation(s)
- Bai-Chuan Deng
- Department of Chemistry, University of Bergen, Bergen N-5007, Norway
| | | | | | | | | | | |
Collapse
|
15
|
Yi L, Dong N, Yun Y, Deng B, Liu S, Zhang Y, Liang Y. WITHDRAWN: Recent advances in chemometric methods for plant metabolomics: A review. Biotechnol Adv 2014:S0734-9750(14)00183-9. [PMID: 25461504 DOI: 10.1016/j.biotechadv.2014.11.008] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2014] [Revised: 11/17/2014] [Accepted: 11/18/2014] [Indexed: 12/17/2022]
Abstract
This article has been withdrawn at the request of the author(s) and/or editor. The Publisher apologizes for any inconvenience this may cause. The full Elsevier Policy on Article Withdrawal can be found at http://www.elsevier.com/locate/withdrawalpolicy.
Collapse
Affiliation(s)
- Lunzhao Yi
- Yunnan Food Safety Research Institute, Kunming University of Science and Technology, Kunming 650500, China.
| | - Naiping Dong
- Department of Applied Biology and Chemical Technology, The Hong Kong Polytechnic University, Hong Kong 999077, Hong Kong, China
| | - Yonghuan Yun
- College of Chemistry and Chemical Engineering, Central South University, Changsha 410083, China
| | - Baichuan Deng
- Department of Chemistry, University of Bergen, Bergen N-5007, Norway
| | - Shao Liu
- Xiangya Hospital, Central South University, Changsha 410008, China
| | - Yi Zhang
- College of Chemistry and Chemical Engineering, Central South University, Changsha 410083, China
| | - Yizeng Liang
- College of Chemistry and Chemical Engineering, Central South University, Changsha 410083, China
| |
Collapse
|
16
|
Zheng K, Hu H, Tong P, Du Y. Ensemble Regression Coefficient Analysis for Application to Near-Infrared Spectroscopy. ANAL LETT 2014. [DOI: 10.1080/00032719.2014.900776] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
17
|
Yun YH, Wang WT, Tan ML, Liang YZ, Li HD, Cao DS, Lu HM, Xu QS. A strategy that iteratively retains informative variables for selecting optimal variable subset in multivariate calibration. Anal Chim Acta 2014; 807:36-43. [DOI: 10.1016/j.aca.2013.11.032] [Citation(s) in RCA: 133] [Impact Index Per Article: 12.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2013] [Revised: 11/13/2013] [Accepted: 11/14/2013] [Indexed: 11/12/2022]
|
18
|
|