1
|
Bhattacharjee R, Gaur S, Chander S, Ohri A, Srivastava PK, Mishra A. Stacked Ensemble with Machine Learning Regressors on Optimal Features (SMOF) of hyperspectral sensor PRISMA for inland water turbidity prediction. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2024; 31:65464-65480. [PMID: 39581926 DOI: 10.1007/s11356-024-35481-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/26/2024] [Accepted: 10/27/2024] [Indexed: 11/26/2024]
Abstract
Leveraging hyperspectral data across various domains yields substantial benefits, yet managing many spectral bands and identifying the essential ones poses a formidable challenge. This study identifies the most relevant bands within a hyperspectral data cube for turbidity prediction in inland water. Nine machine learning regressors Cat Boost, Decision Trees, Extra Trees, Gradient Boost, Light Gradient Boost (LightGBM), Recursive Feature Elimination (RFE), Random Forest, Support Vector Regressor (SVR), and Xtreme Gradient Boost (XGBoost) have been used to compute the feature importance of the hyperspectral bands for predicting turbidity. Random Forest has outperformed the other models with a mean absolute percentage error (MAPE) of 1.61%, and the R2 of the linear fit is 0.96. Band 77, with a central wavelength of 1067.61 nm, is the most dominating band regarding feature importance. We have also developed a novel framework for turbidity prediction: Stacked Ensemble with Machine Learning Regressors on Optimal Features (SMOF). It employs a stacking ensemble of the nine regressors mentioned above with Random Forest as both base and meta-model, leveraging feature selection outputs. With this framework, the MAPE (%) reached 1.21, while the R2 stood at 0.95. The present study also presents a simple statistical algorithm to detect noisy bands in the Hyperspectral Precursor of the Application Mission (PRISMA) data cube. The approach assesses quadrat-wise intra-band spatial coherence using Renyi's entropy thresholding for noisy band segregation. Radiometric calibration error and absorption due to water vapour are the two primary sources of noise within the data cube. Moreover, this research implements the open-source Water Colour Simulator (WASI) to simulate inland water spectra with varied proportions of turbidity. Overall, the study presents an approach to identify noisy bands and integrates the potential wavelengths for turbidity prediction of inland waters.
Collapse
Affiliation(s)
- Rajarshi Bhattacharjee
- Department of Civil Engineering, Indian Institute of Technology (BHU), Varanasi, 221005, India.
| | - Shishir Gaur
- Department of Civil Engineering, Indian Institute of Technology (BHU), Varanasi, 221005, India
| | - Shard Chander
- Land Hydrology Division, Space Application Centre, ISRO, Ahmedabad, 380015, India
| | - Anurag Ohri
- Department of Civil Engineering, Indian Institute of Technology (BHU), Varanasi, 221005, India
| | - Prashant K Srivastava
- Institute of Environment and Sustainable Development, Banaras Hindu University (BHU), Varanasi, 221005, India
| | - Anurag Mishra
- Department of Civil Engineering, Indian Institute of Technology (BHU), Varanasi, 221005, India
| |
Collapse
|
2
|
Hamed AA, Wu X. Detection of ChatGPT fake science with the xFakeSci learning algorithm. Sci Rep 2024; 14:16231. [PMID: 39004625 PMCID: PMC11247077 DOI: 10.1038/s41598-024-66784-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Accepted: 07/03/2024] [Indexed: 07/16/2024] Open
Abstract
Generative AI tools exemplified by ChatGPT are becoming a new reality. This study is motivated by the premise that "AI generated content may exhibit a distinctive behavior that can be separated from scientific articles". In this study, we show how articles can be generated using means of prompt engineering for various diseases and conditions. We then show how we tested this premise in two phases and prove its validity. Subsequently, we introduce xFakeSci, a novel learning algorithm, that is capable of distinguishing ChatGPT-generated articles from publications produced by scientists. The algorithm is trained using network models driven from both sources. To mitigate overfitting issues, we incorporated a calibration step that is built upon data-driven heuristics, including proximity and ratios. Specifically, from a total of a 3952 fake articles for three different medical conditions, the algorithm was trained using only 100 articles, but calibrated using folds of 100 articles. As for the classification step, it was performed using 300 articles per condition. The actual label steps took place against an equal mix of 50 generated articles and 50 authentic PubMed abstracts. The testing also spanned publication periods from 2010 to 2024 and encompassed research on three distinct diseases: cancer, depression, and Alzheimer's. Further, we evaluated the accuracy of the xFakeSci algorithm against some of the classical data mining algorithms (e.g., Support Vector Machines, Regression, and Naive Bayes). The xFakeSci algorithm achieved F1 scores ranging from 80 to 94%, outperforming common data mining algorithms, which scored F1 values between 38 and 52%. We attribute the noticeable difference to the introduction of calibration and a proximity distance heuristic, which underscores this promising performance. Indeed, the prediction of fake science generated by ChatGPT presents a considerable challenge. Nonetheless, the introduction of the xFakeSci algorithm is a significant step on the way to combating fake science.
Collapse
Affiliation(s)
- Ahmed Abdeen Hamed
- Complex Adaptive Systems and Computational Intelligence Laboratory, State University of New York at Binghamton, Binghamton, NY, 13902, USA.
| | - Xindong Wu
- Hefei University of Technology, Key Laboratory of Knowledge Engineering with Big Data (the Ministry of Education of China), Hefei, 230009, China.
- China Electronics Data Industry Group, Shenzhen, 518057, China.
| |
Collapse
|
3
|
Guo S, Ao X, Ma X, Cheng S, Men C, Harada H, Saroj DP, Mang HP, Li Z, Zheng L. Machine-learning-aided application of high-gravity technology to enhance ammonia recovery of fresh waste leachate. WATER RESEARCH 2023; 235:119891. [PMID: 36965295 DOI: 10.1016/j.watres.2023.119891] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/20/2022] [Revised: 02/27/2023] [Accepted: 03/17/2023] [Indexed: 06/18/2023]
Abstract
Stripping is widely applied for the removal of ammonia from fresh waste leachate. However, the development of air stripping technology is restricted by the requirements for large-scale equipment and long operation periods. This paper describes a high-gravity technology that improves ammonia stripping from actual fresh waste leachate and a machine learning approach that predicts the stripping performance under different operational parameters. The high-gravity field is implemented in a co-current-flow rotating packed bed in multi-stage cycle series mode. The eXtreme Gradient Boosting algorithm is applied to the experimental data to predict the liquid volumetric mass transfer coefficient (KLa) and removal efficiency (η) for various rotation speeds, numbers of stripping stages, gas flow rates, and liquid flow rates. Ammonia stripping under a high-gravity field achieves η = 82.73% and KLa = 5.551 × 10-4 s-1 at a pH value of 10 and ambient temperature. The results suggest that the eXtreme Gradient Boosting model provides good accuracy and predictive performance, with R2 values of 0.9923 and 0.9783 for KLa and η, respectively. The machine learning models developed in this study are combined with experimental results to provide more comprehensive information on rotating packed bed operations and more accurate predictions of KLa and η. The information mining behind the model is an important reference for the rational design of high-gravity-field-coupled ammonia stripping projects.
Collapse
Affiliation(s)
- Shaomin Guo
- School of Energy and Environmental Engineering, Beijing Key Laboratory of Resource-oriented Treatment of Industrial Pollutants, University of Science and Technology Beijing, Beijing 100083, PR China
| | - Xiuwei Ao
- School of Energy and Environmental Engineering, Beijing Key Laboratory of Resource-oriented Treatment of Industrial Pollutants, University of Science and Technology Beijing, Beijing 100083, PR China
| | - Xin Ma
- School of Energy and Environmental Engineering, Beijing Key Laboratory of Resource-oriented Treatment of Industrial Pollutants, University of Science and Technology Beijing, Beijing 100083, PR China
| | - Shikun Cheng
- School of Energy and Environmental Engineering, Beijing Key Laboratory of Resource-oriented Treatment of Industrial Pollutants, University of Science and Technology Beijing, Beijing 100083, PR China
| | - Cong Men
- School of Energy and Environmental Engineering, Beijing Key Laboratory of Resource-oriented Treatment of Industrial Pollutants, University of Science and Technology Beijing, Beijing 100083, PR China
| | - Hidenori Harada
- Graduate School of Asian and African Area Studies, Kyoto University, Kyoto 606-8501, Japan
| | - Devendra P Saroj
- Department of Civil and Environmental Engineering, Centre for Environmental Health Engineering (CEHE), Faculty of Engineering and Physical Sciences, University of Surrey, Surrey GU27XH, United Kingdom
| | - Heinz-Peter Mang
- School of Energy and Environmental Engineering, Beijing Key Laboratory of Resource-oriented Treatment of Industrial Pollutants, University of Science and Technology Beijing, Beijing 100083, PR China
| | - Zifu Li
- School of Energy and Environmental Engineering, Beijing Key Laboratory of Resource-oriented Treatment of Industrial Pollutants, University of Science and Technology Beijing, Beijing 100083, PR China.
| | - Lei Zheng
- School of Energy and Environmental Engineering, Beijing Key Laboratory of Resource-oriented Treatment of Industrial Pollutants, University of Science and Technology Beijing, Beijing 100083, PR China.
| |
Collapse
|
4
|
UAV Multispectral Image-Based Urban River Water Quality Monitoring Using Stacked Ensemble Machine Learning Algorithms—A Case Study of the Zhanghe River, China. REMOTE SENSING 2022. [DOI: 10.3390/rs14143272] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Timely monitoring of inland water quality using unmanned aerial vehicle (UAV) remote sensing is critical for water environmental conservation and management. In this study, two UAV flights were conducted (one in February and the other in December 2021) to acquire images of the Zhanghe River (China), and a total of 45 water samples were collected concurrently with the image acquisition. Machine learning (ML) methods comprising Multiple Linear Regression, the Least Absolute Shrinkage and Selection Operator, a Backpropagation Neural Network (BP), Random Forest (RF), and eXtreme Gradient Boosting (XGBoost) were applied to retrieve four water quality parameters: chlorophyll-a (Chl-a), total nitrogen (TN), total phosphors (TP), and permanganate index (CODMn). Then, ML models based on the stacking approach were developed. Results show that stacked ML models could achieve higher accuracy than a single ML model; the optimal methods for Chl-a, TN, TP, and CODMn were RF-XGB, BP-RF, RF, and BP-RF, respectively. For the testing dataset, the R2 values of the best inversion models for Chl-a, TN, TP, and CODMn were 0.504, 0.839, 0.432, and 0.272, the root mean square errors were 1.770 μg L−1, 0.189 mg L−1, 0.053 mg L−1, and 0.767 mg L−1, and the mean absolute errors were 1.272 μg L−1, 0.632 mg L−1, 0.045 mg L−1, and 0.674 mg L−1, respectively. This study demonstrated the great potential of combined UAV remote sensing and stacked ML algorithms for water quality monitoring.
Collapse
|
5
|
Kong PH, Chiang CH, Lin TC, Kuo SC, Li CF, Hsiung CA, Shiue YL, Chiou HY, Wu LC, Tsou HH. Discrimination of Methicillin-resistant Staphylococcus aureus by MALDI-TOF Mass Spectrometry with Machine Learning Techniques in Patients with Staphylococcus aureus Bacteremia. Pathogens 2022; 11:586. [PMID: 35631107 PMCID: PMC9143686 DOI: 10.3390/pathogens11050586] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2022] [Revised: 05/11/2022] [Accepted: 05/13/2022] [Indexed: 11/29/2022] Open
Abstract
Early administration of proper antibiotics is considered to improve the clinical outcomes of Staphylococcus aureus bacteremia (SAB), but routine clinical antimicrobial susceptibility testing takes an additional 24 h after species identification. Recent studies elucidated matrix-assisted laser desorption/ionization time-of-flight mass spectra to discriminate methicillin-resistant strains (MRSA) or even incorporated with machine learning (ML) techniques. However, no universally applicable mass peaks were revealed, which means that the discrimination model might need to be established or calibrated by local strains' data. Here, a clinically feasible workflow was provided. We collected mass spectra from SAB patients over an 8-month duration and preprocessed by binning with reference peaks. Machine learning models were trained and tested by samples independently of the first six months and the following two months, respectively. The ML models were optimized by genetic algorithm (GA). The accuracy, sensitivity, specificity, and AUC of the independent testing of the best model, i.e., SVM, under the optimal parameters were 87%, 75%, 95%, and 87%, respectively. In summary, almost all resistant results were truly resistant, implying that physicians might escalate antibiotics for MRSA 24 h earlier. This report presents an attainable method for clinical laboratories to build an MRSA model and boost the performance using their local data.
Collapse
Affiliation(s)
- Po-Hsin Kong
- Institute of Biomedical Sciences, National Sun Yat-sen University, Kaohsiung 80424, Taiwan; (P.-H.K.); (Y.-L.S.)
- Center for Precision Medicine, Chi Mei Medical Center, Tainan 71004, Taiwan;
| | - Cheng-Hsiung Chiang
- Institute of Population Health Sciences, National Health Research Institutes, Zhunan, Miaoli 35053, Taiwan; (C.-H.C.); (C.A.H.); (H.-Y.C.)
| | - Ting-Chia Lin
- Center for Precision Medicine, Chi Mei Medical Center, Tainan 71004, Taiwan;
- Institute of Precision Medicine, National Sun Yat-sen University, Kaohsiung 80424, Taiwan
| | - Shu-Chen Kuo
- National Institute of Infectious Diseases and Vaccinology, National Health Research Institutes, Zhunan, Miaoli 35053, Taiwan;
| | - Chien-Feng Li
- Department of Medical Research, Chi Mei Medical Center, Tainan 71004, Taiwan;
| | - Chao A. Hsiung
- Institute of Population Health Sciences, National Health Research Institutes, Zhunan, Miaoli 35053, Taiwan; (C.-H.C.); (C.A.H.); (H.-Y.C.)
| | - Yow-Ling Shiue
- Institute of Biomedical Sciences, National Sun Yat-sen University, Kaohsiung 80424, Taiwan; (P.-H.K.); (Y.-L.S.)
- Institute of Precision Medicine, National Sun Yat-sen University, Kaohsiung 80424, Taiwan
| | - Hung-Yi Chiou
- Institute of Population Health Sciences, National Health Research Institutes, Zhunan, Miaoli 35053, Taiwan; (C.-H.C.); (C.A.H.); (H.-Y.C.)
- School of Public Health, College of Public Health, Taipei Medical University, Taipei 11031, Taiwan
- Master’s Program in Applied Epidemiology, College of Public Health, Taipei Medical University, Taipei 11031, Taiwan
| | - Li-Ching Wu
- Institute of Biomedical Sciences, National Sun Yat-sen University, Kaohsiung 80424, Taiwan; (P.-H.K.); (Y.-L.S.)
- Center for Precision Medicine, Chi Mei Medical Center, Tainan 71004, Taiwan;
| | - Hsiao-Hui Tsou
- Institute of Population Health Sciences, National Health Research Institutes, Zhunan, Miaoli 35053, Taiwan; (C.-H.C.); (C.A.H.); (H.-Y.C.)
- Graduate Institute of Biostatistics, College of Public Health, China Medical University, Taichung 40402, Taiwan
| |
Collapse
|
6
|
Zhao Y, Yu Y, Wang H, Li Y, Deng Y, Jiang G, Luo Y. Machine Learning in Causal Inference: Application in Pharmacovigilance. Drug Saf 2022; 45:459-476. [PMID: 35579811 PMCID: PMC9114053 DOI: 10.1007/s40264-022-01155-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/09/2022] [Indexed: 01/28/2023]
Abstract
Monitoring adverse drug events or pharmacovigilance has been promoted by the World Health Organization to assure the safety of medicines through a timely and reliable information exchange regarding drug safety issues. We aim to discuss the application of machine learning methods as well as causal inference paradigms in pharmacovigilance. We first reviewed data sources for pharmacovigilance. Then, we examined traditional causal inference paradigms, their applications in pharmacovigilance, and how machine learning methods and causal inference paradigms were integrated to enhance the performance of traditional causal inference paradigms. Finally, we summarized issues with currently mainstream correlation-based machine learning models and how the machine learning community has tried to address these issues by incorporating causal inference paradigms. Our literature search revealed that most existing data sources and tasks for pharmacovigilance were not designed for causal inference. Additionally, pharmacovigilance was lagging in adopting machine learning-causal inference integrated models. We highlight several currently trending directions or gaps to integrate causal inference with machine learning in pharmacovigilance research. Finally, our literature search revealed that the adoption of causal paradigms can mitigate known issues with machine learning models. We foresee that the pharmacovigilance domain can benefit from the progress in the machine learning field.
Collapse
Affiliation(s)
- Yiqing Zhao
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, 750 N Lake Shore Drive, Room 11-189, Chicago, IL, 60611, USA
| | - Yue Yu
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN, 55902, USA
| | - Hanyin Wang
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, 750 N Lake Shore Drive, Room 11-189, Chicago, IL, 60611, USA
| | - Yikuan Li
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, 750 N Lake Shore Drive, Room 11-189, Chicago, IL, 60611, USA
| | - Yu Deng
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, 750 N Lake Shore Drive, Room 11-189, Chicago, IL, 60611, USA
| | - Guoqian Jiang
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN, 55902, USA
| | - Yuan Luo
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, 750 N Lake Shore Drive, Room 11-189, Chicago, IL, 60611, USA.
| |
Collapse
|
7
|
A new neutrosophic TF-IDF term weighting for text mining tasks: text classification use case. INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS 2021. [DOI: 10.1108/ijwis-11-2020-0067] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Purpose
This paper aims to present a new term weighting approach for text classification as a text mining task. The original method, neutrosophic term frequency – inverse term frequency (NTF-IDF), is an extended version of the popular fuzzy TF-IDF (FTF-IDF) and uses the neutrosophic reasoning to analyze and generate weights for terms in natural languages. The paper also propose a comparative study between the popular FTF-IDF and NTF-IDF and their impacts on different machine learning (ML) classifiers for document categorization goals.
Design/methodology/approach
After preprocessing textual data, the original Neutrosophic TF-IDF applies the neutrosophic inference system (NIS) to produce weights for terms representing a document. Using the local frequency TF, global frequency IDF and text N's length as NIS inputs, this study generate two neutrosophic weights for a given term. The first measure provides information on the relevance degree for a word, and the second one represents their ambiguity degree. Next, the Zhang combination function is applied to combine neutrosophic weights outputs and present the final term weight, inserted in the document's representative vector. To analyze the NTF-IDF impact on the classification phase, this study uses a set of ML algorithms.
Findings
Practicing the neutrosophic logic (NL) characteristics, the authors have been able to study the ambiguity of the terms and their degree of relevance to represent a document. NL's choice has proven its effectiveness in defining significant text vectorization weights, especially for text classification tasks. The experimentation part demonstrates that the new method positively impacts the categorization. Moreover, the adopted system's recognition rate is higher than 91%, an accuracy score not attained using the FTF-IDF. Also, using benchmarked data sets, in different text mining fields, and many ML classifiers, i.e. SVM and Feed-Forward Network, and applying the proposed term scores NTF-IDF improves the accuracy by 10%.
Originality/value
The novelty of this paper lies in two aspects. First, a new term weighting method, which uses the term frequencies as components to define the relevance and the ambiguity of term; second, the application of NL to infer weights is considered as an original model in this paper, which also aims to correct the shortcomings of the FTF-IDF which uses fuzzy logic and its drawbacks. The introduced technique was combined with different ML models to improve the accuracy and relevance of the obtained feature vectors to fed the classification mechanism.
Collapse
|
8
|
Fei S, Hassan MA, Ma Y, Shu M, Cheng Q, Li Z, Chen Z, Xiao Y. Entropy Weight Ensemble Framework for Yield Prediction of Winter Wheat Under Different Water Stress Treatments Using Unmanned Aerial Vehicle-Based Multispectral and Thermal Data. FRONTIERS IN PLANT SCIENCE 2021; 12:730181. [PMID: 34987529 PMCID: PMC8721222 DOI: 10.3389/fpls.2021.730181] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/24/2021] [Accepted: 11/08/2021] [Indexed: 05/05/2023]
Abstract
Crop breeding programs generally perform early field assessments of candidate selection based on primary traits such as grain yield (GY). The traditional methods of yield assessment are costly, inefficient, and considered a bottleneck in modern precision agriculture. Recent advances in an unmanned aerial vehicle (UAV) and development of sensors have opened a new avenue for data acquisition cost-effectively and rapidly. We evaluated UAV-based multispectral and thermal images for in-season GY prediction using 30 winter wheat genotypes under 3 water treatments. For this, multispectral vegetation indices (VIs) and normalized relative canopy temperature (NRCT) were calculated and selected by the gray relational analysis (GRA) at each growth stage, i.e., jointing, booting, heading, flowering, grain filling, and maturity to reduce the data dimension. The elastic net regression (ENR) was developed by using selected features as input variables for yield prediction, whereas the entropy weight fusion (EWF) method was used to combine the predicted GY values from multiple growth stages. In our results, the fusion of dual-sensor data showed high yield prediction accuracy [coefficient of determination (R 2) = 0.527-0.667] compared to using a single multispectral sensor (R 2 = 0.130-0.461). Results showed that the grain filling stage was the optimal stage to predict GY with R 2 = 0.667, root mean square error (RMSE) = 0.881 t ha-1, relative root-mean-square error (RRMSE) = 15.2%, and mean absolute error (MAE) = 0.721 t ha-1. The EWF model outperformed at all the individual growth stages with R 2 varying from 0.677 to 0.729. The best prediction result (R 2 = 0.729, RMSE = 0.831 t ha-1, RRMSE = 14.3%, and MAE = 0.684 t ha-1) was achieved through combining the predicted values of all growth stages. This study suggests that the fusion of UAV-based multispectral and thermal IR data within an ENR-EWF framework can provide a precise and robust prediction of wheat yield.
Collapse
Affiliation(s)
- Shuaipeng Fei
- Institute of Farmland Irrigation, Chinese Academy of Agricultural Sciences, Xinxiang, China
- National Wheat Improvement Center, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
- College of Land Science and Technology, China Agricultural University, Beijing, China
| | - Muhammad Adeel Hassan
- National Wheat Improvement Center, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
- Dezhou Academy of Agricultural Sciences, Dezhou, China
| | - Yuntao Ma
- College of Land Science and Technology, China Agricultural University, Beijing, China
| | - Meiyan Shu
- College of Land Science and Technology, China Agricultural University, Beijing, China
| | - Qian Cheng
- Institute of Farmland Irrigation, Chinese Academy of Agricultural Sciences, Xinxiang, China
| | - Zongpeng Li
- Institute of Farmland Irrigation, Chinese Academy of Agricultural Sciences, Xinxiang, China
| | - Zhen Chen
- Institute of Farmland Irrigation, Chinese Academy of Agricultural Sciences, Xinxiang, China
- *Correspondence: Zhen Chen,
| | - Yonggui Xiao
- National Wheat Improvement Center, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
- Yonggui Xiao,
| |
Collapse
|
9
|
Liu X, Zhou Y, Zhao J, Yao R, Liu B, Ma D, Zheng Y. Multiobjective ResNet pruning by means of EMOAs for remote sensing scene classification. Neurocomputing 2020. [DOI: 10.1016/j.neucom.2019.11.097] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
10
|
Farmer J, Merino Z, Gray A, Jacobs D. Universal Sample Size Invariant Measures for Uncertainty Quantification in Density Estimation. ENTROPY 2019. [PMCID: PMC7514464 DOI: 10.3390/e21111120] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Previously, we developed a high throughput non-parametric maximum entropy method (PLOS ONE, 13(5): e0196937, 2018) that employs a log-likelihood scoring function to characterize uncertainty in trial probability density estimates through a scaled quantile residual (SQR). The SQR for the true probability density has universal sample size invariant properties equivalent to sampled uniform random data (SURD). Alternative scoring functions are considered that include the Anderson-Darling test. Scoring function effectiveness is evaluated using receiver operator characteristics to quantify efficacy in discriminating SURD from decoy-SURD, and by comparing overall performance characteristics during density estimation across a diverse test set of known probability distributions.
Collapse
Affiliation(s)
- Jenny Farmer
- Department of Physics and Optical Science, University of North Carolina at Charlotte, Charlotte, NC 28223, USA; (J.F.); (Z.M.); (A.G.)
| | - Zach Merino
- Department of Physics and Optical Science, University of North Carolina at Charlotte, Charlotte, NC 28223, USA; (J.F.); (Z.M.); (A.G.)
| | - Alexander Gray
- Department of Physics and Optical Science, University of North Carolina at Charlotte, Charlotte, NC 28223, USA; (J.F.); (Z.M.); (A.G.)
| | - Donald Jacobs
- Department of Physics and Optical Science, University of North Carolina at Charlotte, Charlotte, NC 28223, USA; (J.F.); (Z.M.); (A.G.)
- Center for Biomedical Engineering and Science, University of North Carolina at Charlotte, Charlotte, NC 28223, USA
- Correspondence:
| |
Collapse
|