1
|
Sallahi N, Zainel AA, Bensmail HN, Syed MA, Arredouani A. Real-world clinical validation of the Qatar pre-diabetes risk score: a cross-sectional study. BMJ PUBLIC HEALTH 2024; 2:e000957. [PMID: 40018134 PMCID: PMC11812911 DOI: 10.1136/bmjph-2024-000957] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/23/2024] [Accepted: 05/29/2024] [Indexed: 03/01/2025]
Abstract
Introduction Pre-diabetes stands as a prominent, independent risk factor for the onset of type 2 diabetes (T2D), with 5%-10% of individuals with pre-diabetes progressing to T2D annually. The effectiveness of rigorous lifestyle interventions in averting the transition from pre-diabetes to T2D has been substantiated across multiple investigations and populations. Consequently, the clinical imperative of early pre-diabetes detection becomes unequivocal. This study assessed the validity of the recently developed pre-diabetes risk score in Qatar (PRISQ) in a real-world clinical setting. Research design and methods We recruited 1021 walk-in participants from 3 different health centres of Qatar's Primary Health Care Corporation. Only adult people without known pre-diabetes or diabetes were included in the study. Along with blood collected for the haemoglobin A1c (HbA1c) test to confirm pre-diabetes, we recorded the age, gender, weight, waist circumference, systolic and diastolic blood pressure, nationality, smoking state and family history of diabetes. Negative predictive value, positive predictive value, sensitivity and specificity of PRISQ were computed. Results Of the 1021 participants, 797 agreed to provide blood. HbA1c test revealed that 21.9% of the 797 subjects had pre-diabetes (HbA1c between 5.7% and 6.5%) while 3.3% had undiagnosed diabetes (HbA1c≥ 6.5%). Using a PRISQ cut-off of 16, PRISQ sensitivity exceeded 90% in all subgroups of individuals aged 40 years and above, regardless of ethnicity. We did not see any significant improvement in PRISQ sensitivity when we considered the family history of diabetes. Conclusions We confirmed a good PRISQ diagnostic rate for pre-diabetes from a representative sample of the Qatar population recruited in a real-world clinical setting. PRISQ can potentially play a significant role in curbing the T2D epidemic sweeping Qatar and beyond.
Collapse
Affiliation(s)
| | | | - Halima None Bensmail
- Data Analytics, Qatar Computing Research Institute, Doha, Qatar
- Hamad Bin Khalifa University, Doha, Qatar
| | | | - Abdelilah Arredouani
- Qatar Biomedical Research Institute, Hamad Bin Khalifa University, Doha, Qatar
- College of Health and Life Sciences, Hamad Bin Khalifa University, Doha, Qatar
| |
Collapse
|
2
|
Hu Y, Han Y, Liu Y, Cui Y, Ni Z, Wei L, Cao C, Hu H, He Y. A nomogram model for predicting 5-year risk of prediabetes in Chinese adults. Sci Rep 2023; 13:22523. [PMID: 38110661 PMCID: PMC10728122 DOI: 10.1038/s41598-023-50122-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Accepted: 12/15/2023] [Indexed: 12/20/2023] Open
Abstract
Early identification is crucial to effectively intervene in individuals at high risk of developing pre-diabetes. This study aimed to create a personalized nomogram to determine the 5-year risk of pre-diabetes among Chinese adults. This retrospective cohort study included 184,188 participants without prediabetes at baseline. Training cohorts (92,177) and validation cohorts (92,011) were randomly assigned (92,011). We compared five prediction models on the training cohorts: full cox proportional hazards model, stepwise cox proportional hazards model, multivariable fractional polynomials (MFP), machine learning, and least absolute shrinkage and selection operator (LASSO) models. At the same time, we validated the above five models on the validation set. And we chose the LASSO model as the final risk prediction model for prediabetes. We presented the model with a nomogram. The model's performance was evaluated in terms of its discriminative ability, clinical utility, and calibration using the area under the receiver operating characteristic (ROC) curve, decision curve analysis, and calibration analysis on the training cohorts. Simultaneously, we also evaluated the above nomogram on the validation set. The 5-year incidence of prediabetes was 10.70% and 10.69% in the training and validation cohort, respectively. We developed a simple nomogram that predicted the risk of prediabetes by using the parameters of age, body mass index (BMI), fasting plasma glucose (FBG), triglycerides (TG), systolic blood pressure (SBP), and serum creatinine (Scr). The nomogram's area under the receiver operating characteristic curve (AUC) was 0.7341 (95% CI 0.7290-0.7392) for the training cohort and 0.7336 (95% CI 0.7285-0.7387) for the validation cohort, indicating good discriminative ability. The calibration curve showed a perfect fit between the predicted prediabetes risk and the observed prediabetes risk. An analysis of the decision curve presented the clinical application of the nomogram, with alternative threshold probability spectrums being presented as well. A personalized prediabetes prediction nomogram was developed and validated among Chinese adults, identifying high-risk individuals. Doctors and others can easily and efficiently use our prediabetes prediction model when assessing prediabetes risk.
Collapse
Affiliation(s)
- Yanhua Hu
- College of Information Science and Engineering, Liuzhou Institute of Technology, Liuzhou, 545616, Guangxi Zhuang Autonomous Region, China
| | - Yong Han
- Department of Emergency, Shenzhen Second People's Hospital, Shenzhen, 518000, Guangdong Province, China
- Department of Emergency, The First Affiliated Hospital of Shenzhen University, Shenzhen, 518000, Guangdong Province, China
| | - Yufei Liu
- Department of Neurosurgery, Shenzhen Second People's Hospital, Shenzhen, 518000, Guangdong Province, China
- Department of Neurosurgery, The First Affiliated Hospital of Shenzhen University, Shenzhen, 518000, Guangdong Province, China
| | - Yanan Cui
- College of Information Science and Engineering, Liuzhou Institute of Technology, Liuzhou, 545616, Guangxi Zhuang Autonomous Region, China
| | - Zhiping Ni
- College of Information Science and Engineering, Liuzhou Institute of Technology, Liuzhou, 545616, Guangxi Zhuang Autonomous Region, China
| | - Ling Wei
- College of Information Science and Engineering, Liuzhou Institute of Technology, Liuzhou, 545616, Guangxi Zhuang Autonomous Region, China
| | - Changchun Cao
- Department of Rehabilitation, Shenzhen Dapeng New District Nan'ao People's Hospital, No. 6, Renmin Road, Dapeng New District, Shenzhen, 518000, Guangdong Province, China.
| | - Haofei Hu
- Department of Nephrology, Shenzhen Second People's Hospital, No. 3002 Sungang Road, Futian District, Shenzhen, 518000, Guangdong Province, China.
- Department of Nephrology, The First Affiliated Hospital of Shenzhen University, Shenzhen, 518000, Guangdong Province, China.
| | - Yongcheng He
- Department of Nephrology, Shenzhen Hengsheng Hospital, No. 20 Yintian Road, Baoan District, Shenzhen, 518000, Guangdong Province, China.
- Department of Nephrology, Affiliated Hospital of North Sichuan Medical College, Nanchong, 637000, Sichuan, China.
| |
Collapse
|
3
|
Liu Y, Feng W, Lou J, Qiu W, Shen J, Zhu Z, Hua Y, Zhang M, Billong LF. Performance of a prediabetes risk prediction model: A systematic review. Heliyon 2023; 9:e15529. [PMID: 37215820 PMCID: PMC10196520 DOI: 10.1016/j.heliyon.2023.e15529] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2022] [Revised: 04/11/2023] [Accepted: 04/12/2023] [Indexed: 05/24/2023] Open
Abstract
Backgrounds The prediabetes population is large and easily overlooked because of the lack of obvious symptoms, which can progress to diabetes. Early screening and targeted interventions can substantially reduce the rate of conversion of prediabetes to diabetes. Therefore, this study systematically reviewed prediabetes risk prediction models, performed a summary and quality evaluation, and aimed to recommend the optimal model. Methods We systematically searched five databases (Cochrane, PubMed, Embase, Web Of Science, and CNKI) for published literature related to prediabetes risk prediction models and excluded preprints, duplicate publications, reviews, editorials, and other studies, with a search time frame of March 01, 2023. Data were categorized and summarized using a standardized data extraction form that extracted data including author; publication date; study design; country; demographic characteristics; assessment tool name; sample size; study type; and model-related indicators. The PROBAST tool was used to assess the risk of bias profile of included studies. Findings 14 studies with a total of 15 models were eventually included in the systematic review. We found that the most common predictors of models were age, family history of diabetes, gender, history of hypertension, and BMI. Most of the studies (83.3%) had a high risk of bias, mainly related to under-reporting of outcome information and poor methodological design during the development and validation of models. Due to the low quality of included studies, the evidence for predictive validity of the available models is unclear. Interpretation We should pay attention to the early screening of prediabetes patients and give timely pharmacological and lifestyle interventions. The predictive performance of the existing model is not satisfactory, and the model building process can be standardized and external validation can be added to improve the accuracy of the model in the future.
Collapse
Affiliation(s)
- Yujin Liu
- Schools of Nursing and Medicine, Huzhou University, Huzhou, 313000, China
| | - Wenming Feng
- Huzhou First People's Hospital, Huzhou, 313000, China
| | - Jianlin Lou
- Huzhou Key Laboratory of Precise Prevention and Control of Major Chronic Diseases, Huzhou University, Huzhou, 313000, China
| | - Wei Qiu
- Department of Endocrinology, Huzhou Central Hospital, Huzhou, 313000, China
| | - Jiantong Shen
- Schools of Nursing and Medicine, Huzhou University, Huzhou, 313000, China
- Huzhou Key Laboratory of Precise Prevention and Control of Major Chronic Diseases, Huzhou University, Huzhou, 313000, China
| | - Zhichao Zhu
- Schools of Nursing and Medicine, Huzhou University, Huzhou, 313000, China
- Internal Medicine General Ward, Jinhua Municipal Central Hospital Medical Group, Jinhua, 321200, China
| | - Yuting Hua
- Schools of Nursing and Medicine, Huzhou University, Huzhou, 313000, China
| | - Mei Zhang
- Schools of Nursing and Medicine, Huzhou University, Huzhou, 313000, China
| | | |
Collapse
|
4
|
Yu LP, Dong F, Li YZ, Yang WY, Wu SN, Shan ZY, Teng WP, Zhang B. Development and validation of a risk assessment model for prediabetes in China national diabetes survey. World J Clin Cases 2022; 10:11789-11803. [PMID: 36405266 PMCID: PMC9669875 DOI: 10.12998/wjcc.v10.i32.11789] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/05/2022] [Revised: 10/07/2022] [Accepted: 10/17/2022] [Indexed: 11/08/2022] Open
Abstract
BACKGROUND Prediabetes risk assessment models derived from large sample sizes are scarce.
AIM To establish a robust assessment model for prediabetes and to validate the model in different populations.
METHODS The China National Diabetes and Metabolic Disorders Study (CNDMDS) collected information from 47325 participants aged at least 20 years across China from 2007 to 2008. The Thyroid Disorders, Iodine Status and Diabetes Epidemiological Survey (TIDE) study collected data from 66108 participants aged at least 18 years across China from 2015 to 2017. A logistic model with stepwise selection was performed to identify significant risk factors for prediabetes and was internally validated by bootstrapping in the CNDMDS. External validations were performed in diverse populations, including populations of Hispanic (Mexican American, other Hispanic) and non-Hispanic (White, Black and Asian) participants in the National Health and Nutrition Examination Survey (NHANES) in the United States and 66108 participants in the TIDE study in China. C statistics and calibration plots were adopted to evaluate the model’s discrimination and calibration performance.
RESULTS A set of easily measured indicators (age, education, family history of diabetes, waist circumference, body mass index, and systolic blood pressure) were selected as significant risk factors. A risk assessment model was established for prediabetes with a C statistic of 0.6998 (95%CI: 0.6933 to 0.7063) and a calibration slope of 1.0002. When externally validated in the NHANES and TIDE studies, the model showed increased C statistics in Mexican American, other Hispanic, Non-Hispanic Black, Asian and Chinese populations but a slightly decreased C statistic in non-Hispanic White individuals. Applying the risk assessment model to the TIDE population, we obtained a C statistic of 0.7308 (95%CI: 0.7260 to 0.7357) and a calibration slope of 1.1137. A risk score was derived to assess prediabetes. Individuals with scores ≥ 7 points were at high risk of prediabetes, with a sensitivity of 60.19% and specificity of 67.59%.
CONCLUSION An easy-to-use assessment model for prediabetes was established and was internally and externally validated in different populations. The model had a satisfactory performance and could screen individuals with a high risk of prediabetes.
Collapse
Affiliation(s)
- Li-Ping Yu
- Department of Endocrinology, China-Japan Friendship Hospital, Beijing 100029, China
| | - Fen Dong
- Institute of Clinical Medical Sciences, China-Japan Friendship Hospital, Beijing 100029, China
| | - Yong-Ze Li
- Department of Endocrinology and Metabolism, First Hospital of China Medical University, Shenyang 110001, Liaoning Province, China
| | - Wen-Ying Yang
- Department of Endocrinology, China-Japan Friendship Hospital, Beijing 100029, China
| | - Si-Nan Wu
- Institute of Clinical Medical Sciences, China-Japan Friendship Hospital, Beijing 100029, China
| | - Zhong-Yan Shan
- Department of Endocrinology and Metabolism, First Hospital of China Medical University, Shenyang 110001, Liaoning Province, China
| | - Wei-Ping Teng
- Department of Endocrinology and Metabolism, First Hospital of China Medical University, Shenyang 110001, Liaoning Province, China
| | - Bo Zhang
- Department of Endocrinology, China-Japan Friendship Hospital, Beijing 100029, China
| |
Collapse
|
5
|
Abbas M, Mall R, Errafii K, Lattab A, Ullah E, Bensmail H, Arredouani A. Simple risk score to screen for prediabetes: A cross-sectional study from the Qatar Biobank cohort. J Diabetes Investig 2021; 12:988-997. [PMID: 33075216 PMCID: PMC8169357 DOI: 10.1111/jdi.13445] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/20/2020] [Revised: 10/01/2020] [Accepted: 10/06/2020] [Indexed: 12/30/2022] Open
Abstract
AIMS/INTRODUCTION The progression from prediabetes to type 2 diabetes is preventable by lifestyle intervention and/or pharmacotherapy in a large fraction of individuals with prediabetes. Our objective was to develop a risk score to screen for prediabetes in the Middle East, where diabetes prevalence is one of the highest in the world. MATERIALS AND METHODS In this cross-sectional, case-control study, we used data of 4,895 controls and 2,373 prediabetic adults obtained from the Qatar Biobank cohort. Significant risk factors were identified by logistic regression and other machine learning methods. The receiver operating characteristic was used to calculate the area under curve, cut-off point, sensitivity, specificity, positive and negative predictive values. The prediabetes risk score was developed from data of Qatari citizens, as well as long-term (≥15 years) residents. RESULTS The significant risk factors for the Prediabetes Risk Score in Qatar were age, sex, body mass index, waist circumference and blood pressure. The risk score ranges from 0 to 45. The area under the curve of the score was 80% (95% confidence interval 78-83%), and the cut-off point of 16 yielded sensitivity and specificity of 86.2% (95% confidence interval 82.7-89.2%) and 57.9% (95% confidence interval 65.5-71.4%), respectively. Prediabetes Risk Score in Qatar performed equally in Qatari nationals and long-term residents. CONCLUSIONS Prediabetes Risk Score in Qatar is the first prediabetes screening score developed in a Middle Eastern population. It only uses risk factors measured non-invasively, is simple, cost-effective, and can be easily understood by the general public and health providers. Prediabetes Risk Score in Qatar is an important tool for early detection of prediabetes, and can help tremendously in curbing the diabetes epidemic in the region.
Collapse
Affiliation(s)
- Mostafa Abbas
- Qatar Computing Research InstituteHamad Bin Khalifa UniversityDohaQatar
- Department of Imaging Science and InnovationGeisingerDanvillePennsylvaniaUSA
| | - Raghvendra Mall
- Qatar Computing Research InstituteHamad Bin Khalifa UniversityDohaQatar
| | - Khaoula Errafii
- Qatar Biomedical Research InstituteHamad Bin Khalifa UniversityDohaQatar
- College of Health and Life SciencesHamad Bin Khalifa UniversityDohaQatar
| | - Abdelkader Lattab
- Qatar Computing Research InstituteHamad Bin Khalifa UniversityDohaQatar
| | - Ehsan Ullah
- Qatar Computing Research InstituteHamad Bin Khalifa UniversityDohaQatar
| | - Halima Bensmail
- Qatar Computing Research InstituteHamad Bin Khalifa UniversityDohaQatar
| | - Abdelilah Arredouani
- Qatar Biomedical Research InstituteHamad Bin Khalifa UniversityDohaQatar
- College of Health and Life SciencesHamad Bin Khalifa UniversityDohaQatar
| |
Collapse
|
6
|
Wu J, Zhou J, Yin X, Chen Y, Lin X, Xu Z, Li H. A Prediction Model for Prediabetes Risk in Middle-Aged and Elderly Populations: A Prospective Cohort Study in China. Int J Endocrinol 2021; 2021:2520806. [PMID: 34804156 PMCID: PMC8601847 DOI: 10.1155/2021/2520806] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/24/2021] [Accepted: 09/29/2021] [Indexed: 11/18/2022] Open
Abstract
BACKGROUND To investigate indicators for prediabetes risk and construct a prediction model for prediabetes incidences in China. METHODS In this study, 551 adults aged 40-70 years had normal glucose tolerance (NGT) and normal hemoglobin A1c (HbA1c) levels at baseline. Baseline data including demographic information, anthropometric measurements, and metabolic profile measurements were collected. The associations between possible indicators and prediabetes were assessed by the Cox proportional-hazards model. The predictive values were evaluated by the area under the receiver operating characteristic (ROC) curve (AUC). RESULTS During an average of 3.35 years of follow-up, the incidence of prediabetes was found to be 19.96% (n = 110). In the univariate analyses, fasting plasma glucose (FPG), fasting serum insulin (FINS), 2 h plasma glucose (2hPG), HbA1c, serum uric acid (SUA), waist circumference (WC), smoking, and family history of diabetes (FHD) were found to be significantly correlated with prediabetes. In the multivariable analyses, WC (hazard ratio (HR): 1.032; 95% confidence interval (CI): 1.010, 1.053; p = 0.003), FHD (HR: 1.824; 95% CI: 1.250, 2.661; p = 0.002), HbA1c (HR: 1.825; 95% CI: 1.227, 2.714; p = 0.003), and FPG (HR: 2.284; 95% CI: 1.556, 3.352; p < 0.001) were found to be independent risk factors for prediabetes. A model that encompassed WC, FHD, HbA1c, and FPG for predicting prediabetes exhibited the largest discriminative ability (AUC: 0.702). CONCLUSIONS WC, FHD, HbA1c, and FPG are independently correlated with the risk of prediabetes. Furthermore, the combination of these predictors enhances the predictive accuracy of prediabetes.
Collapse
Affiliation(s)
- Jiahua Wu
- Department of Endocrinology, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, 3 East Qingchun Road, Hangzhou 310016, China
| | - Jiaqiang Zhou
- Department of Endocrinology, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, 3 East Qingchun Road, Hangzhou 310016, China
| | - Xueyao Yin
- Department of Endocrinology, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, 3 East Qingchun Road, Hangzhou 310016, China
| | - Yixin Chen
- Department of Endocrinology, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, 3 East Qingchun Road, Hangzhou 310016, China
| | - Xihua Lin
- Department of Endocrinology, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, 3 East Qingchun Road, Hangzhou 310016, China
| | - Zhiye Xu
- Department of Endocrinology, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, 3 East Qingchun Road, Hangzhou 310016, China
| | - Hong Li
- Department of Endocrinology, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, 3 East Qingchun Road, Hangzhou 310016, China
| |
Collapse
|
7
|
Secondhand Smoke Correlates with Elevated Neutrophil-Lymphocyte Ratio and Has a Synergistic Effect with Physical Inactivity on Increasing Susceptibility to Type 2 Diabetes Mellitus: A Community-Based Case Control Study. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2020; 17:ijerph17165696. [PMID: 32781787 PMCID: PMC7459643 DOI: 10.3390/ijerph17165696] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/16/2020] [Revised: 08/01/2020] [Accepted: 08/03/2020] [Indexed: 01/12/2023]
Abstract
Secondhand smoke (SHS) and physical inactivity are thought to be associated with type 2 diabetes mellitus (T2DM), but the synergistic effect of SHS with physical inactivity and their relationships with T2DM–associated inflammation biomarkers have not been estimated. We investigated the roles of SHS exposure and physical inactivity and their synergistic effect on T2DM risk and their relationships with T2DM associated inflammation biomarkers, neutrophil–lymphocyte ratio (NLR) and white blood cells (WBCs). A case–control study was conducted in total 588 participants (294 case T2DM and 294 healthy controls) from five community clinics in Indonesia. Participants completed a standardized questionnaire on demographic information, smoking status, physical activity habits and food consumption. WBCs and NLR levels were determined using an automated hematology analyzer. Adjusted odds ratios (AORs) and 95% confidence intervals (CIs) were analyzed using multiple logistic regression model. The synergistic effect was analyzed using additive interaction for logistic regression. Physical inactive people exposed to SHS exhibited a synergistically increased 7.78-fold risk of T2DM compared with people who were not exposed to SHS and who were physically active. SHS is significantly correlated with a high NLR, WBCs and has a synergistic effect with physical inactivity on increasing susceptibility to T2DM.
Collapse
|
8
|
Maeta K, Nishiyama Y, Fujibayashi K, Gunji T, Sasabe N, Iijima K, Naito T. Prediction of Glucose Metabolism Disorder Risk Using a Machine Learning Algorithm: Pilot Study. JMIR Diabetes 2018; 3:e10212. [PMID: 30478026 PMCID: PMC6288596 DOI: 10.2196/10212] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2018] [Revised: 08/16/2018] [Accepted: 10/17/2018] [Indexed: 01/10/2023] Open
Abstract
Background A 75-g oral glucose tolerance test (OGTT) provides important information about glucose metabolism, although the test is expensive and invasive. Complete OGTT information, such as 1-hour and 2-hour postloading plasma glucose and immunoreactive insulin levels, may be useful for predicting the future risk of diabetes or glucose metabolism disorders (GMD), which includes both diabetes and prediabetes. Objective We trained several classification models for predicting the risk of developing diabetes or GMD using data from thousands of OGTTs and a machine learning technique (XGBoost). The receiver operating characteristic (ROC) curves and their area under the curve (AUC) values for the trained classification models are reported, along with the sensitivity and specificity determined by the cutoff values of the Youden index. We compared the performance of the machine learning techniques with logistic regressions (LR), which are traditionally used in medical research studies. Methods Data were collected from subjects who underwent multiple OGTTs during comprehensive check-up medical examinations conducted at a single facility in Tokyo, Japan, from May 2006 to April 2017. For each examination, a subject was diagnosed with diabetes or prediabetes according to the American Diabetes Association guidelines. Given the data, 2 studies were conducted: predicting the risk of developing diabetes (study 1) or GMD (study 2). For each study, to apply supervised machine learning methods, the required label data was prepared. If a subject was diagnosed with diabetes or GMD at least once during the period, then that subject’s data obtained in previous trials were classified into the risk group (y=1). After data processing, 13,581 and 6760 OGTTs were analyzed for study 1 and study 2, respectively. For each study, a randomly chosen subset representing 80% of the data was used for training 9 classification models and the remaining 20% was used for evaluating the models. Three classification models, A to C, used XGBoost with various input variables, some including OGTT data. The other 6 classification models, D to I, used LR for comparison. Results For study 1, the AUC values ranged from 0.78 to 0.93. For study 2, the AUC values ranged from 0.63 to 0.78. The machine learning approach using XGBoost showed better performance compared with traditional LR methods. The AUC values increased when the full OGTT variables were included. In our analysis using a particular setting of input variables, XGBoost showed that the OGTT variables were more important than fasting plasma glucose or glycated hemoglobin. Conclusions A machine learning approach, XGBoost, showed better prediction accuracy compared with LR, suggesting that advanced machine learning methods are useful for detecting the early signs of diabetes or GMD. The prediction accuracy increased when all OGTT variables were added. This indicates that complete OGTT information is important for predicting the future risk of diabetes and GMD accurately.
Collapse
Affiliation(s)
- Katsutoshi Maeta
- Faculty of Informatics and Engineering, The University of Electro-Communications, Tokyo, Japan
| | - Yu Nishiyama
- Faculty of Informatics and Engineering, The University of Electro-Communications, Tokyo, Japan
| | - Kazutoshi Fujibayashi
- Department of General Medicine, School of Medicine, Juntendo University, Tokyo, Japan
| | - Toshiaki Gunji
- Center for Preventive Medicine, NTT Medical Center Tokyo, Tokyo, Japan
| | - Noriko Sasabe
- Center for Preventive Medicine, NTT Medical Center Tokyo, Tokyo, Japan
| | - Kimiko Iijima
- Center for Preventive Medicine, NTT Medical Center Tokyo, Tokyo, Japan
| | - Toshio Naito
- Department of General Medicine, School of Medicine, Juntendo University, Tokyo, Japan
| |
Collapse
|