Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For:	Sagi O, Rokach L. Approximating XGBoost with an interpretable decision tree. Inf Sci (N Y) 2021. [DOI: 10.1016/j.ins.2021.05.055] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]

Number

Cited by Other Article(s)

Lu X, Chen Y, Zhang G, Zeng X, Lai L, Qu C. Application of interpretable machine learning algorithms to predict acute kidney injury in patients with cerebral infarction in ICU. J Stroke Cerebrovasc Dis 2024;33:107729. [PMID: 38657830 DOI: 10.1016/j.jstrokecerebrovasdis.2024.107729] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2024] [Revised: 04/14/2024] [Accepted: 04/20/2024] [Indexed: 04/26/2024] Open

Hyun S, Lee H, Park W. Individual-specific postural discomfort prediction using decision tree models. APPLIED ERGONOMICS 2024;118:104282. [PMID: 38574593 DOI: 10.1016/j.apergo.2024.104282] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/07/2023] [Revised: 03/19/2024] [Accepted: 03/29/2024] [Indexed: 04/06/2024]

Li X, Zhang F, Zheng L, Guo J. Advancing ecotoxicity assessment: Leveraging pre-trained model for bee toxicity and compound degradability prediction. JOURNAL OF HAZARDOUS MATERIALS 2024;475:134828. [PMID: 38876015 DOI: 10.1016/j.jhazmat.2024.134828] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/15/2024] [Revised: 05/09/2024] [Accepted: 06/03/2024] [Indexed: 06/16/2024]

Sun Z, Wang Z, Qi X, Wang D, Gu X, Wang J, Lu H, Chen Y. Understanding key contributing factors on the severity of traffic violations by elderly drivers: a hybrid approach of latent class analysis and XGBoost based SHAP. Int J Inj Contr Saf Promot 2024;31:273-293. [PMID: 38284989 DOI: 10.1080/17457300.2023.2300479] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Accepted: 12/24/2023] [Indexed: 01/30/2024]

Abstract

Traffic violation is one of the leading causes of traffic crashes. In the context of global aging, it is important to study traffic violations by elderly drivers for improving traffic safety in preparation for a worldwide aging population. In this study, a hybrid approach of Latent Class Analysis (LCA) and XGBoost based SHAP is proposed to identify hidden clusters and to understand the key contributing factors on the severity of traffic violations by elderly drivers, based on the police-reported traffic violation dataset of Beijing (China). First, LCA is applied to segment the dataset into several latent homogeneous clusters, then XGBoost based SHAP is established on each cluster to identify feature contributions and the interaction effects of the key contributing factors on the severity of traffic violations by elderly drivers. Two comparison groups were set up to analyze factors, which are responsible for the different severities of traffic violations. The results show that elderly drivers can be classified into four groups by age, urban or not, license, and season; factors such as less annual number of traffic violations, national & provincial highway, night and winter are key contributing factors for higher severity of traffic violations, which are consistent with common cognition; key contributing factors for all clusters are similar but not identical, for example, more annual number of traffic violations contribute to more severe violation for all clusters except for Cluster 2; some factors which are not key contributing factors may affect the severity of traffic violations when they are combined with other factors, for example, the combination of lower annual number of traffic violations and county & township highway contributes to more severe violation for Cluster 1. These findings can help government to formulate targeted countermeasures to decrease the severity of traffic violations by specific elderly groups and improve road service for the driving population.

Collapse

Liu H, Dong S, Yang H, Wang L, Liu J, Du Y, Liu J, Lyu Z, Wang Y, Jiang L, Yu S, Fu X. Comparing the accuracy of four machine learning models in predicting type 2 diabetes onset within the Chinese population: a retrospective study. J Int Med Res 2024;52:3000605241253786. [PMID: 38870271 DOI: 10.1177/03000605241253786] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2024] Open

Ziaikin E, Tello E, Peterson DG, Niv MY. BitterMasS: Predicting Bitterness from Mass Spectra. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2024;72:10537-10547. [PMID: 38685906 PMCID: PMC11082931 DOI: 10.1021/acs.jafc.3c09767] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/25/2023] [Revised: 04/18/2024] [Accepted: 04/18/2024] [Indexed: 05/02/2024]

Matin M, Dehghanian A, Dastranj M, Darijani H. Explainable artificial intelligence modeling of internal arc in a medium voltage switchgear based on different CFD simulations. Heliyon 2024;10:e29594. [PMID: 38665570 PMCID: PMC11044042 DOI: 10.1016/j.heliyon.2024.e29594] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2024] [Revised: 03/17/2024] [Accepted: 04/10/2024] [Indexed: 04/28/2024] Open

Liu X, Niu H, Peng J. Improving predictions: Enhancing in-hospital mortality forecast for ICU patients with sepsis-induced coagulopathy using a stacking ensemble model. Medicine (Baltimore) 2024;103:e37634. [PMID: 38579092 PMCID: PMC10994494 DOI: 10.1097/md.0000000000037634] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/13/2023] [Accepted: 02/26/2024] [Indexed: 04/07/2024] Open

Liang X, Liu S, Li Z, Deng Y, Jiang Y, Yang H. Efficient cocrystal coformer screening based on a Machine learning Strategy: A case study for the preparation of imatinib cocrystal with enhanced physicochemical properties. Eur J Pharm Biopharm 2024;196:114201. [PMID: 38309538 DOI: 10.1016/j.ejpb.2024.114201] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2023] [Revised: 01/18/2024] [Accepted: 01/29/2024] [Indexed: 02/05/2024]

Joe H, Kim HG. Multi-label classification with XGBoost for metabolic pathway prediction. BMC Bioinformatics 2024;25:52. [PMID: 38297220 PMCID: PMC10832249 DOI: 10.1186/s12859-024-05666-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Accepted: 01/22/2024] [Indexed: 02/02/2024] Open

Qian S, Qiao X, Zhang W, Yu Z, Dong S, Feng J. Machine learning-based prediction for settling velocity of microplastics with various shapes. WATER RESEARCH 2024;249:121001. [PMID: 38113602 DOI: 10.1016/j.watres.2023.121001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/11/2023] [Revised: 11/22/2023] [Accepted: 12/07/2023] [Indexed: 12/21/2023]

Feng S, Wang J. Prediction of Organic-Inorganic Hybrid Perovskite Band Gap by Multiple Machine Learning Algorithms. Molecules 2024;29:499. [PMID: 38276577 PMCID: PMC10820808 DOI: 10.3390/molecules29020499] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2023] [Revised: 01/13/2024] [Accepted: 01/16/2024] [Indexed: 01/27/2024] Open

Abstract

As an indicator of the optical characteristics of perovskite materials, the band gap is a crucial parameter that impacts the functionality of a wide range of optoelectronic devices. Obtaining the band gap of a material via a labor-intensive, time-consuming, and inefficient high-throughput calculation based on first principles is possible. However, it does not yield the most accurate results. Machine learning techniques emerge as a viable and effective substitute for conventional approaches in band gap prediction. This paper collected 201 pieces of data through the literature and open-source databases. By separating the features related to bits A, B, and X, a dataset of 1208 pieces of data containing 30 feature descriptors was established. The dataset underwent preprocessing, and the Pearson correlation coefficient method was employed to eliminate non-essential features as a subset of features. The band gap was predicted using the GBR algorithm, the random forest algorithm, the LightGBM algorithm, and the XGBoost algorithm, in that order, to construct a prediction model for organic-inorganic hybrid perovskites. The outcomes demonstrate that the XGBoost algorithm yielded an MAE value of 0.0901, an MSE value of 0.0173, and an R2 value of 0.991310. These values suggest that, compared to the other two models, the XGBoost model exhibits the lowest prediction error, suggesting that the input features may better fit the prediction model. Finally, analysis of the XGBoost-based prediction model's prediction results using the SHAP model interpretation method reveals that the occupancy rate of the A-position ion has the greatest impact on the prediction of the band gap and has an A-negative correlation with the prediction results of the band gap. The findings provide valuable insights into the relationship between the prediction of band gaps and significant characteristics of organic-inorganic hybrid perovskites.

Collapse

Munshi RM. Novel ensemble learning approach with SVM-imputed ADASYN features for enhanced cervical cancer prediction. PLoS One 2024;19:e0296107. [PMID: 38198475 PMCID: PMC10781159 DOI: 10.1371/journal.pone.0296107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Accepted: 12/06/2023] [Indexed: 01/12/2024] Open

Ma J, Zhang S, Liu X, Wang J. Machine learning prediction of biochar yield based on biomass characteristics. BIORESOURCE TECHNOLOGY 2023;389:129820. [PMID: 37805089 DOI: 10.1016/j.biortech.2023.129820] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/12/2023] [Revised: 10/01/2023] [Accepted: 10/01/2023] [Indexed: 10/09/2023]

Mavaie P, Holder L, Skinner MK. Hybrid deep learning approach to improve classification of low-volume high-dimensional data. BMC Bioinformatics 2023;24:419. [PMID: 37936066 PMCID: PMC10631218 DOI: 10.1186/s12859-023-05557-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Accepted: 11/01/2023] [Indexed: 11/09/2023] Open

Meng F, Wang J, Chen Z, Qiao F, Yang D. Shaping the concentration of petroleum hydrocarbon pollution in soil: A machine learning and resistivity-based prediction method. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2023;345:118817. [PMID: 37597372 DOI: 10.1016/j.jenvman.2023.118817] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Revised: 08/03/2023] [Accepted: 08/12/2023] [Indexed: 08/21/2023]

Bakasa W, Viriri S. Stacked ensemble deep learning for pancreas cancer classification using extreme gradient boosting. Front Artif Intell 2023;6:1232640. [PMID: 37876961 PMCID: PMC10591225 DOI: 10.3389/frai.2023.1232640] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Accepted: 09/04/2023] [Indexed: 10/26/2023] Open

Tomita K, Yamasaki A, Katou R, Ikeuchi T, Touge H, Sano H, Tohda Y. Construction of a Diagnostic Algorithm for Diagnosis of Adult Asthma Using Machine Learning with Random Forest and XGBoost. Diagnostics (Basel) 2023;13:3069. [PMID: 37835811 PMCID: PMC10572917 DOI: 10.3390/diagnostics13193069] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2023] [Revised: 09/25/2023] [Accepted: 09/26/2023] [Indexed: 10/15/2023] Open

Yan Y, Shi Z, Wei H. ROSes-FINDER: a multi-task deep learning framework for accurate prediction of microorganism reactive oxygen species scavenging enzymes. Front Microbiol 2023;14:1245805. [PMID: 37744924 PMCID: PMC10513406 DOI: 10.3389/fmicb.2023.1245805] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2023] [Accepted: 08/21/2023] [Indexed: 09/26/2023] Open

Luo G, Zou F, Guo F, Liu J, Cai X, Cai Q, Xia C. An over-the-horizon potential safety threat vehicle identification method based on ETC big data. Heliyon 2023;9:e20050. [PMID: 37810065 PMCID: PMC10559829 DOI: 10.1016/j.heliyon.2023.e20050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2023] [Revised: 09/08/2023] [Accepted: 09/09/2023] [Indexed: 10/10/2023] Open

Unal M, Bostanci E, Ozkul C, Acici K, Asuroglu T, Guzel MS. Crohn's Disease Prediction Using Sequence Based Machine Learning Analysis of Human Microbiome. Diagnostics (Basel) 2023;13:2835. [PMID: 37685376 PMCID: PMC10486516 DOI: 10.3390/diagnostics13172835] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2023] [Revised: 08/24/2023] [Accepted: 08/31/2023] [Indexed: 09/10/2023] Open

Faraz A, Tırınk C, Önder H, Şen U, Ishaq HM, Tauqir NA, Waheed A, Nabeel MS. Usage of the XGBoost and MARS algorithms for predicting body weight in Kajli sheep breed. Trop Anim Health Prod 2023;55:276. [PMID: 37500805 DOI: 10.1007/s11250-023-03700-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Accepted: 07/19/2023] [Indexed: 07/29/2023]

Xiang T, Li T, Li J, Li X, Wang J. Using machine learning to realize genetic site screening and genomic prediction of productive traits in pigs. FASEB J 2023;37:e22961. [PMID: 37178007 DOI: 10.1096/fj.202300245r] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2023] [Revised: 03/30/2023] [Accepted: 04/25/2023] [Indexed: 05/15/2023]

Abstract

Genomic prediction, which is based on solving linear mixed-model (LMM) equations, is the most popular method for predicting breeding values or phenotypic performance for economic traits in livestock. With the need to further improve the performance of genomic prediction, nonlinear methods have been considered as an alternative and promising approach. The excellent ability to predict phenotypes in animal husbandry has been demonstrated by machine learning (ML) approaches, which have been rapidly developed. To investigate the feasibility and reliability of implementing genomic prediction using nonlinear models, the performances of genomic predictions for pig productive traits using the linear genomic selection model and nonlinear machine learning models were compared. Then, to reduce the high-dimensional features of genome sequence data, different machine learning algorithms, including the random forest (RF), support vector machine (SVM), extreme gradient boosting (XGBoost) and convolutional neural network (CNN) algorithms, were used to perform genomic feature selection as well as genomic prediction on reduced feature genome data. All of the analyses were processed on two real pig datasets: the published PIC pig dataset and a dataset comprising data from a national pig nucleus herd in Chifeng, North China. Overall, the accuracies of predicted phenotypic performance for traits T1, T2, T3 and T5 in the PIC dataset and average daily gain (ADG) in the Chifeng dataset were higher using the ML methods than the LMM method, while those for trait T4 in the PIC dataset and total number of piglets born (TNB) in the Chifeng dataset were slightly lower using the ML methods than the LMM method. Among all the different ML algorithms, SVM was the most appropriate for genomic prediction. For the genomic feature selection experiment, the most stable and most accurate results across different algorithms were achieved using XGBoost in combination with the SVM algorithm. Through feature selection, the number of genomic markers can be reduced to 1 in 20, while the predictive performance on some traits can even be improved compared to using the full genome data. Finally, we developed a new tool that can be used to execute combined XGBoost and SVM algorithms to realize genomic feature selection and phenotypic prediction.

Collapse

Li Z, Zhao Y, Duan T, Dai J. Configurational patterns for COVID-19 related social media rumor refutation effectiveness enhancement based on machine learning and fsQCA. Inf Process Manag 2023;60:103303. [PMID: 36741251 PMCID: PMC9889264 DOI: 10.1016/j.ipm.2023.103303] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2022] [Revised: 01/25/2023] [Accepted: 01/26/2023] [Indexed: 02/04/2023]

Pezoa R, Basso F, Quilodrán P, Varas M. Estimation of trip purposes in public transport during the COVID-19 pandemic: The case of Santiago, Chile. JOURNAL OF TRANSPORT GEOGRAPHY 2023;109:103594. [PMID: 37123884 PMCID: PMC10121142 DOI: 10.1016/j.jtrangeo.2023.103594] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/11/2022] [Revised: 03/15/2023] [Accepted: 04/17/2023] [Indexed: 05/03/2023]

Shao X, Wang H, Zhu X, Xiong F, Mu T, Zhang Y. EFFECT: Explainable framework for meta-learning in automatic classification algorithm selection. Inf Sci (N Y) 2023. [DOI: 10.1016/j.ins.2022.11.144] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

Chen X, Lin S, Zheng Y, He L, Fang Y. Long-term trajectories of depressive symptoms and machine learning techniques for fall prediction in older adults:Evidence from the China Health and Retirement Longitudinal Study (CHARLS). Arch Gerontol Geriatr 2023;111:105012. [PMID: 37030148 DOI: 10.1016/j.archger.2023.105012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Revised: 03/27/2023] [Accepted: 03/29/2023] [Indexed: 04/01/2023]

Wang X, Sheng Y, Ning J, Xi J, Xi L, Qiu D, Yang J, Ke X. A Critical Review of Machine Learning Techniques on Thermoelectric Materials. J Phys Chem Lett 2023;14:1808-1822. [PMID: 36763950 DOI: 10.1021/acs.jpclett.2c03073] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]

Huang C, Gao W, Zheng Y, Wang W, Zhang Y, Liu K. Universal machine-learning algorithm for predicting adsorption performance of organic molecules based on limited data set: Importance of feature description. THE SCIENCE OF THE TOTAL ENVIRONMENT 2023;859:160228. [PMID: 36402319 DOI: 10.1016/j.scitotenv.2022.160228] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/19/2022] [Revised: 11/09/2022] [Accepted: 11/12/2022] [Indexed: 06/16/2023]

Fei Z, Liang S, Cai Y, Shen Y. Ensemble Machine-Learning-Based Prediction Models for the Compressive Strength of Recycled Powder Mortar. MATERIALS (BASEL, SWITZERLAND) 2023;16:583. [PMID: 36676320 PMCID: PMC9862350 DOI: 10.3390/ma16020583] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Revised: 12/27/2022] [Accepted: 01/04/2023] [Indexed: 06/17/2023]

Abstract

Recycled powder (RP) serves as a potential and prospective substitute for cementitious materials in concrete. The compressive strength of RP mortar is a pivotal factor affecting the mechanical properties of RP concrete. The application of machine learning (ML) approaches in the engineering problems, particularly for predicting the mechanical properties of construction materials, leads to high prediction accuracy and low experimental costs. In this study, 204 groups of RP mortar compression experimental data are collected from the literature to establish a dataset for ML, including 163 groups in the training set and 41 groups in the test set. Four ensemble ML models, namely eXtreme Gradient-Boosting (XGBoost), Random Forest (RF), Light Gradient-Boosting Machine (LightGBM) and Adaptive Boosting (AdaBoost), were selected to predict the compressive strength of RP mortar. The comparative results demonstrate that XGBoost has the highest prediction accuracy when the a10-index, MAE, RMSE and R² of the training set are 0.926, 1.596, 2.155 and 0.950 and the a10-index, MAE, RMSE and R² of the test set are 0.659, 3.182, 4.285 and 0.842, respectively. SHapley Additive exPlanation (SHAP) is adopted to interpret the prediction process of XGBoost and explain the influence of influencing factors on the compressive strength of RP mortar. According to the importance of influencing factors, the order is the mass replacement rate of RP, the size of RP, the kind of RP and the water binder ratio of RP. The compressive strength of RP mortar decreases with the increase in the RP mass replacement rate. The compressive strength of RBP mortar is slightly higher than that of RCP mortar. Machine learning technologies will benefit the construction industry by facilitating the rapid and cost-effective evaluation of RP material properties.

Collapse

Interpretable Machine Learning Techniques in ECG-Based Heart Disease Classification: A Systematic Review. Diagnostics (Basel) 2022;13:diagnostics13010111. [PMID: 36611403 PMCID: PMC9818170 DOI: 10.3390/diagnostics13010111] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2022] [Revised: 12/22/2022] [Accepted: 12/23/2022] [Indexed: 12/31/2022] Open

Osman SMI, Sabit A. Predictors of COVID-19 vaccination rate in USA: A machine learning approach. MACHINE LEARNING WITH APPLICATIONS 2022;10:100408. [PMID: 36128042 PMCID: PMC9479385 DOI: 10.1016/j.mlwa.2022.100408] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2021] [Revised: 09/02/2022] [Accepted: 09/02/2022] [Indexed: 12/14/2022] Open

Li Z, Du X, Zhao Y, Tu Y, Lev B, Gan L. Lifecycle research of social media rumor refutation effectiveness based on machine learning and visualization technology. Inf Process Manag 2022. [DOI: 10.1016/j.ipm.2022.103077] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Liu L, Qiao C, Zha JR, Qin H, Wang XR, Zhang XY, Wang YO, Yang XM, Zhang SL, Qin J. Early prediction of clinical scores for left ventricular reverse remodeling using extreme gradient random forest, boosting, and logistic regression algorithm representations. Front Cardiovasc Med 2022;9:864312. [PMID: 36061535 PMCID: PMC9428443 DOI: 10.3389/fcvm.2022.864312] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2022] [Accepted: 07/13/2022] [Indexed: 11/13/2022] Open

Abstract ObjectiveAt present, there is no early prediction model of left ventricular reverse remodeling (LVRR) for people who are in cardiac arrest with an ejection fraction (EF) of ≤35% at first diagnosis; thus, the purpose of this article is to provide a supplement to existing research.Materials and methodsA total of 109 patients suffering from heart attack with an EF of ≤35% at first diagnosis were involved in this single-center research study. LVRR was defined as an absolute increase in left ventricular ejection fraction (LVEF) from ≥10% to a final value of >35%, with analysis features including demographic characteristics, diseases, biochemical data, echocardiography, and drug therapy. Extreme gradient boosting (XGBoost), random forest, and logistic regression algorithm models were used to distinguish between LVRR and non-LVRR cases and to obtain the most important features.ResultsThere were 47 cases (42%) of LVRR in patients suffering from heart failure with an EF of ≤35% at first diagnosis after optimal drug therapy. General statistical analysis and machine learning methods were combined to exclude a number of significant feature groups. The median duration of disease in the LVRR group was significantly lower than that in the non-LVRR group (7 vs. 48 months); the mean values of creatine kinase (CK) and MB isoenzyme of creatine kinase (CK-MB) in the LVRR group were lower than those in the non-LVRR group (80.11 vs. 94.23 U/L; 2.61 vs. 2.99 ng/ml; 27.19 vs. 28.54 mm). Moreover, AUC values for our feature combinations ranged from 97 to 94% and to 87% when using the XGBoost, random forest, and logistic regression techniques, respectively. The ablation test revealed that beats per minute (BPM) and disease duration had a greater impact on the model’s ability to accurately forecast outcomes.ConclusionShorter disease duration, slightly lower CK and CK-MB levels, slightly smaller right and left ventricular and left atrial dimensions, and lower mean heart rates were found to be most strongly predictive of LVRR development (BPM). Collapse

Wang S, Jia Z, Cao N. Research on optimization and application of Spark decision tree algorithm under cloud‐edge collaboration. INT J INTELL SYST 2022. [DOI: 10.1002/int.22970] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]

Using Deep Learning Networks to Identify Cyber Attacks on Intrusion Detection for In-Vehicle Networks. ELECTRONICS 2022. [DOI: 10.3390/electronics11142180] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

Ding Y, Liu C, Zhu H, Chen Q, Liu J. Visualizing Deep Networks using Segmentation Recognition and Interpretation Algorithm. Inf Sci (N Y) 2022. [DOI: 10.1016/j.ins.2022.07.160] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Short- and Medium-Term Power Demand Forecasting with Multiple Factors Based on Multi-Model Fusion. MATHEMATICS 2022. [DOI: 10.3390/math10122148] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Abstract With the continuous development of economy and society, power demand forecasting has become an important task of the power industry. Accurate power demand forecasting can promote the operation and development of the power supply industry. However, since power consumption is affected by a number of factors, it is difficult to accurately predict the power demand data. With the accumulation of data in the power industry, machine learning technology has shown great potential in power demand forecasting. In this study, gradient boosting decision tree (GBDT), extreme gradient boosting (XGBoost) and light gradient boosting machine (LightGBM) are integrated by stacking to build an XLG-LR fusion model to predict power demand. Firstly, preprocessing was carried out on 13 months of electricity and meteorological data. Next, the hyperparameters of each model were adjusted and optimized. Secondly, based on the optimal hyperparameter configuration, a prediction model was built using the training set (70% of the data). Finally, the test set (30% of the data) was used to evaluate the performance of each model. Mean absolute error (MAE), root mean square error (RMSE), mean absolute percentage error (MAPE), and goodness-of-fit coefficient (R^2) were utilized to analyze each model at different lengths of time, including their seasonal, weekly, and monthly forecast effect. Furthermore, the proposed fusion model was compared with other neural network models such as the GRU, LSTM and TCN models. The results showed that the XLG-LR model achieved the best prediction results at different time lengths, and at the same time consumed the least time compared to the neural network model. This method can provide a more reliable reference for the operation and dispatch of power enterprises and future power construction and planning. Collapse

Xu Q, Peng Y, Tan J, Zhao W, Yang M, Tian J. Prediction of Atrial Fibrillation in Hospitalized Elderly Patients With Coronary Heart Disease and Type 2 Diabetes Mellitus Using Machine Learning: A Multicenter Retrospective Study. Front Public Health 2022;10:842104. [PMID: 35309227 PMCID: PMC8931193 DOI: 10.3389/fpubh.2022.842104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2021] [Accepted: 02/09/2022] [Indexed: 12/01/2022] Open

Customer Churn in Retail E-Commerce Business: Spatial and Machine Learning Approach. JOURNAL OF THEORETICAL AND APPLIED ELECTRONIC COMMERCE RESEARCH 2022. [DOI: 10.3390/jtaer17010009] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/07/2022]

Understanding Query Combination Behavior in Exploratory Searches. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12020706] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/10/2022]

Rostami M, Oussalah M. A novel explainable COVID-19 diagnosis method by integration of feature selection with random forest. INFORMATICS IN MEDICINE UNLOCKED 2022;30:100941. [PMID: 35399333 PMCID: PMC8985417 DOI: 10.1016/j.imu.2022.100941] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Revised: 04/01/2022] [Accepted: 04/01/2022] [Indexed: 12/12/2022] Open

Fu X, Wang Y, Cates RS, Li N, Liu J, Ke D, Liu J, Liu H, Yan S. Implementation of five machine learning methods to predict the 52-week blood glucose level in patients with type 2 diabetes. Front Endocrinol (Lausanne) 2022;13:1061507. [PMID: 36743935 PMCID: PMC9895792 DOI: 10.3389/fendo.2022.1061507] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/04/2022] [Accepted: 12/30/2022] [Indexed: 01/22/2023] Open

Abstract

OBJECTIVE

For the patients who are suffering from type 2 diabetes, blood glucose level could be affected by multiple factors. An accurate estimation of the trajectory of blood glucose is crucial in clinical decision making. Frequent glucose measurement serves as a good source of data to train machine learning models for prediction purposes. This study aimed at using machine learning methods to predict blood glucose for type 2 diabetic patients. We investigated various parameters influencing blood glucose, as well as determined the most effective machine learning algorithm in predicting blood glucose.

PATIENTS AND METHODS

273 patients were recruited in this research. Several parameters such as age, diet, family history, BMI, alcohol intake, smoking status et al were analyzed. Patients who had glycosylated hemoglobin less than 6.5% after 52 weeks were considered as having achieved glycemic control and the rest as not achieving it. Five machine learning methods (KNN algorithm, logistic regression algorithm, random forest algorithm, support vector machine, and XGBoost algorithm) were compared to evaluate their performances in prediction accuracy. R 3.6.3 and Python 3.12 were used in data analysis.

RESULTS

The statistical variables for which p< 0.05 was obtained were BMI, pulse, Na, Cl, AKP. Compared with the other four algorithms, XGBoost algorithm has the highest accuracy (Accuracy=99.54% in training set and 78.18% in testing set) and AUC values (1.0 in training set and 0.68 in testing set), thus it is recommended to be used for prediction in clinical practice.

CONCLUSION

When it comes to future blood glucose level prediction using machine learning methods, XGBoost algorithm scores the highest in effectiveness. This algorithm could be applied to assist clinical decision making, as well as guide the lifestyle of diabetic patients, in pursuit of minimizing risks of hyperglycemic or hypoglycemic events.

Collapse

Cui R, Hua W, Qu K, Yang H, Tong Y, Li Q, Wang H, Ma Y, Liu S, Lin T, Zhang J, Sun J, Liu C. An Interpretable Early Dynamic Sequential Predictor for Sepsis-Induced Coagulopathy Progression in the Real-World Using Machine Learning. Front Med (Lausanne) 2021;8:775047. [PMID: 34926518 PMCID: PMC8678506 DOI: 10.3389/fmed.2021.775047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2021] [Accepted: 11/08/2021] [Indexed: 11/17/2022] Open

Affiliation(s)

Ruixia Cui Department of Hepatobiliary Surgery, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China.,Department of SICU, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China
Wenbo Hua School of Mathematics and Statistics, Xi'an Jiaotong University, Xi'an, China
Kai Qu Department of Hepatobiliary Surgery, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China
Heran Yang School of Mathematics and Statistics, Xi'an Jiaotong University, Xi'an, China
Yingmu Tong Department of Hepatobiliary Surgery, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China.,Department of SICU, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China
Qinglin Li Department of Hepatobiliary Surgery, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China.,Department of SICU, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China
Hai Wang Department of Hepatobiliary Surgery, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China.,Department of SICU, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China
Yanfen Ma Department of Clinical Laboratory, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China
Sinan Liu Department of Hepatobiliary Surgery, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China.,Department of SICU, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China
Ting Lin Department of Hepatobiliary Surgery, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China.,Department of SICU, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China
Jingyao Zhang Department of Hepatobiliary Surgery, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China.,Department of SICU, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China.,Biobank, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China
Jian Sun School of Mathematics and Statistics, Xi'an Jiaotong University, Xi'an, China
Chang Liu Department of Hepatobiliary Surgery, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China.,Department of SICU, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China.,Biobank, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China

Collapse