Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For:	Chen JJ, Tsai CA, Moon H, Ahn H, Young JJ, Chen CH. Decision threshold adjustment in class prediction. SAR QSAR Environ Res 2006;17:337-52. [PMID: 16815772 DOI: 10.1080/10659360600787700] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.8] [Indexed: 05/10/2023]

Cited by Other Article(s)

Monthatip K, Boonnag C, Muangmool T, Charoenkwan K. A machine learning-based prediction model of pelvic lymph node metastasis in women with early-stage cervical cancer. J Gynecol Oncol 2024;35:e17. [PMID: 37921601 PMCID: PMC10948976 DOI: 10.3802/jgo.2024.35.e17] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/11/2023] [Revised: 09/03/2023] [Accepted: 10/03/2023] [Indexed: 11/04/2023] Open

Abstract

OBJECTIVE

To develop a novel machine learning-based preoperative prediction model for pelvic lymph node metastasis (PLNM) in early-stage cervical cancer by combining the clinical findings and preoperative computerized tomography (CT) of the whole abdomen and pelvis.

METHODS

Patients diagnosed with International Federation of Gynecology and Obstetrics stage IA2-IIA1 squamous cell carcinoma, adenocarcinoma, and adenosquamous carcinoma of the cervix who had primary radical surgery with bilateral pelvic lymphadenectomy from January 1, 2003 to December 31, 2020, were included. Seven supervised machine learning algorithms, including logistic regression, random forest, support vector machine, adaptive boosting, gradient boosting, extreme gradient boosting, and category boosting, were used to evaluate the risk of PLNM.

RESULTS

PLNM was found in 199 (23.9%) of 832 patients included. Younger age, larger tumor size, higher stage, no prior conization, tumor appearance, adenosquamous histology, and vaginal metastasis as well as the CT findings of larger tumor size, parametrial metastasis, pelvic lymph node enlargement, and vaginal metastasis, were significantly associated with PLNM. The models' predictive performance, including accuracy (89.1%-90.6%), area under the receiver operating characteristics curve (86.9%-91.0%), sensitivity (77.4%-82.4%), specificity (92.1%-94.3%), positive predictive value (77.0%-81.7%), and negative predictive value (93.0%-94.4%), appeared satisfactory and comparable among all the algorithms. After optimizing the model's decision threshold to enhance the sensitivity to at least 95%, the 'highly sensitive' model was obtained with a 2.5%-4.4% false-negative rate of PLNM prediction.

CONCLUSION

We developed prediction models for PLNM in early-stage cervical cancer with promising prediction performance in our setting. External validation in other populations is needed before potential clinical application.
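
The decision-threshold adjustment mentioned in the Results can be illustrated with a minimal Python sketch. It is not the authors' procedure: the helper names `threshold_for_sensitivity` and `sensitivity_specificity`, the toy scores, and the rule of taking the highest cut-off that still reaches the target sensitivity are all assumptions made for illustration.

```python
import math

def threshold_for_sensitivity(scores, labels, target):
    """Return the highest cut-off t such that calling score >= t positive
    gives sensitivity >= target (hypothetical helper, not from the paper)."""
    positives = sorted((s for s, y in zip(scores, labels) if y == 1), reverse=True)
    if not positives:
        raise ValueError("need at least one positive case")
    # Number of positives that must be captured; the small epsilon guards
    # against floating-point noise in target * len(positives).
    k = max(1, math.ceil(target * len(positives) - 1e-9))
    return positives[k - 1]

def sensitivity_specificity(scores, labels, threshold):
    """Sensitivity and specificity when score >= threshold means positive."""
    tp = sum(1 for s, y in zip(scores, labels) if y == 1 and s >= threshold)
    fn = sum(1 for s, y in zip(scores, labels) if y == 1 and s < threshold)
    tn = sum(1 for s, y in zip(scores, labels) if y == 0 and s < threshold)
    fp = sum(1 for s, y in zip(scores, labels) if y == 0 and s >= threshold)
    return tp / (tp + fn), tn / (tn + fp)

# Toy predicted probabilities and true labels (illustrative only).
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.3, 0.2, 0.1]
labels = [1, 1, 0, 1, 0, 1, 0, 0]

t = threshold_for_sensitivity(scores, labels, target=0.75)
sens, spec = sensitivity_specificity(scores, labels, t)
print(f"threshold={t}, sensitivity={sens}, specificity={spec}")
```

Raising the target sensitivity pushes the cut-off down, so more negatives are flagged as positive and specificity falls. In practice the cut-off would be chosen on development data and then checked on held-out data.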

Teza H, Pattanateepapon A, Lertpimonchai A, Vathesatogkit P, J McKay G, Attia J, Thakkinstian A. Development of Risk Prediction Models for Severe Periodontitis in a Thai Population: Statistical and Machine Learning Approaches. JMIR Form Res 2023;7:e48351. [PMID: 38096008 PMCID: PMC10755655 DOI: 10.2196/48351] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 04/20/2023] [Revised: 10/31/2023] [Accepted: 11/01/2023] [Indexed: 12/31/2023] Open

Abstract

BACKGROUND

Severe periodontitis affects 26% of Thai adults and 11.2% of adults globally and is characterized by the loss of alveolar bone height. Full-mouth examination by periodontal probing is the gold standard for diagnosis but is time- and resource-intensive. A screening model to identify those at high risk of severe periodontitis would offer a targeted approach and aid in reducing the workload for dentists. While statistical modelling by logistic regression is commonly applied, optimal performance depends on feature selection and engineering. Machine learning has recently been gaining favor given its potential discriminatory power and ability to deal with multiway interactions without requiring linearity assumptions.

OBJECTIVE

We aim to compare the performance of screening models developed using statistical and machine learning approaches for the risk prediction of severe periodontitis.

METHODS

This study used data from the prospective Electricity Generating Authority of Thailand cohort. Dental examinations were performed for the 2008 and 2013 surveys. Oral examinations (ie, number of teeth, oral hygiene index, and plaque scores) and measurements of periodontal pocket depth and gingival recession were performed by dentists. The outcome of interest was severe periodontitis diagnosed according to the Centers for Disease Control and Prevention-American Academy of Periodontology case definition: 2 or more interproximal sites with a clinical attachment level ≥6 mm (on different teeth) and 1 or more interproximal sites with a periodontal pocket depth ≥5 mm. Risk prediction models were developed using mixed-effects logistic regression (MELR), recurrent neural network, mixed-effects support vector machine, and mixed-effects decision tree models. A total of 21 features were considered as predictive features, including 4 demographic characteristics, 2 physical examinations, 4 underlying diseases, 1 medication, 2 risk behaviors, 2 oral features, and 6 laboratory features.

RESULTS

A total of 3883 observations from 2086 participants were split into development (n=3112, 80.1%) and validation (n=771, 19.9%) sets with prevalences of periodontitis of 34.4% (n=1070) and 34.1% (n=263), respectively. The final MELR model contained 6 features (gender, education, smoking, diabetes mellitus, number of teeth, and plaque score) with an area under the curve (AUC) of 0.983 (95% CI 0.977-0.989) and positive likelihood ratio (LR+) of 11.9 (95% CI 8.8-16.3). Machine learning yielded lower performance than the MELR model, with AUC (95% CI) and LR+ (95% CI) values of 0.712 (0.669-0.754) and 2.1 (1.8-2.6), respectively, for the recurrent neural network model; 0.698 (0.681-0.734) and 2.1 (1.7-2.6), respectively, for the mixed-effects support vector machine model; and 0.662 (0.621-0.702) and 2.4 (1.9-3.0), respectively, for the mixed-effects decision tree model.

CONCLUSIONS

The MELR model might be more useful than machine learning for large-scale screening to identify those at high risk of severe periodontitis for periodontal evaluation. External validation using data from other centers is required to evaluate the generalizability of the model.
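
The positive likelihood ratio (LR+) reported in the Results is conventionally defined as sensitivity divided by (1 − specificity). The sketch below shows that arithmetic from a 2×2 confusion table; the function name and the counts are hypothetical and are not taken from the study.

```python
def positive_likelihood_ratio(tp, fn, fp, tn):
    """LR+ = sensitivity / (1 - specificity), computed from confusion counts."""
    sensitivity = tp / (tp + fn)
    false_positive_rate = fp / (fp + tn)  # equals 1 - specificity
    if false_positive_rate == 0:
        return float("inf")  # no false positives: LR+ is unbounded
    return sensitivity / false_positive_rate

# Hypothetical screening result: 100 diseased and 100 healthy participants.
lr_plus = positive_likelihood_ratio(tp=80, fn=20, fp=10, tn=90)
print(round(lr_plus, 2))
```

An LR+ well above 1 means a positive screen raises the odds of disease substantially, which is why it is reported alongside the AUC for screening models.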

Roschewitz M, Khara G, Yearsley J, Sharma N, James JJ, Ambrózay É, Heroux A, Kecskemethy P, Rijken T, Glocker B. Automatic correction of performance drift under acquisition shift in medical image classification. Nat Commun 2023;14:6608. [PMID: 37857643 PMCID: PMC10587231 DOI: 10.1038/s41467-023-42396-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/24/2023] [Accepted: 10/10/2023] [Indexed: 10/21/2023] Open

Gantenbein J, Ahmadizadeh C, Heeb O, Lambercy O, Menon C. Feasibility of force myography for the direct control of an assistive robotic hand orthosis in non-impaired individuals. J Neuroeng Rehabil 2023;20:101. [PMID: 37537602 PMCID: PMC10399035 DOI: 10.1186/s12984-023-01222-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/16/2023] [Accepted: 07/21/2023] [Indexed: 08/05/2023] Open

Abstract

BACKGROUND

Assistive robotic hand orthoses can support people with sensorimotor hand impairment in many activities of daily living and therefore help to regain independence. However, in order for the users to fully benefit from the functionalities of such devices, a safe and reliable way to detect their movement intention for device control is crucial. Gesture recognition based on force myography measuring volumetric changes in the muscles during contraction has been previously shown to be a viable and easy to implement strategy to control hand prostheses. Whether this approach could be efficiently applied to intuitively control an assistive robotic hand orthosis remains to be investigated.

METHODS

In this work, we assessed the feasibility of using force myography measured from the forearm to control a robotic hand orthosis worn on the hand ipsilateral to the measurement site. In ten neurologically-intact participants wearing a robotic hand orthosis, we collected data for four gestures trained in nine arm configurations, i.e., seven static positions and two dynamic movements, corresponding to typical activities of daily living conditions. In an offline analysis, we determined classification accuracies for two binary classifiers (one for opening and one for closing) and further assessed the impact of individual training arm configurations on the overall performance.

RESULTS

We achieved an overall classification accuracy of 92.9% (averaged over two binary classifiers, individual accuracies 95.5% and 90.3%, respectively) but found a large variation in performance between participants, ranging from 75.4 up to 100%. Averaged inference times per sample were measured below 0.15 ms. Further, we found that the number of training arm configurations could be reduced from nine to six without notably decreasing classification performance.

CONCLUSION

The results of this work support the general feasibility of using force myography as an intuitive intention detection strategy for a robotic hand orthosis. Further, the findings also generated valuable insights into challenges and potential ways to overcome them in view of applying such technologies for assisting people with sensorimotor hand impairment during activities of daily living.

Stolbov LA, Filimonov DA, Poroikov VV. SAR based on self consistent classifier. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2022;33:793-804. [PMID: 36369710 DOI: 10.1080/1062936x.2022.2139751] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/29/2022] [Accepted: 10/20/2022] [Indexed: 06/16/2023]

Threshold prediction for detecting rare positive samples using a meta-learner. Pattern Anal Appl 2022. [DOI: 10.1007/s10044-022-01103-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/14/2022]

Vijithananda SM, Jayatilake ML, Hewavithana B, Gonçalves T, Rato LM, Weerakoon BS, Kalupahana TD, Silva AD, Dissanayake KD. Feature extraction from MRI ADC images for brain tumor classification using machine learning techniques. Biomed Eng Online 2022;21:52. [PMID: 35915448 PMCID: PMC9344709 DOI: 10.1186/s12938-022-01022-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 01/07/2022] [Accepted: 07/13/2022] [Indexed: 11/10/2022] Open

Abstract

Background

Diffusion-weighted (DW) imaging is a well-recognized magnetic resonance imaging (MRI) technique that is being routinely used in brain examinations in modern clinical radiology practices. This study focuses on extracting demographic and texture features from MRI Apparent Diffusion Coefficient (ADC) images of human brain tumors, identifying the distribution patterns of each feature and applying Machine Learning (ML) techniques to differentiate malignant from benign brain tumors.

Methods

This prospective study was carried out using 1599 labeled MRI brain ADC image slices (995 malignant, 604 benign) from 195 patients who were radiologically diagnosed and histopathologically confirmed as brain tumor patients. The demographics, mean pixel values, skewness, kurtosis, and Grey Level Co-occurrence Matrix (GLCM) features (mean, variance, energy, entropy, contrast, homogeneity, correlation, prominence, and shade) were extracted from the MRI ADC images of each patient. At the feature selection phase, the validity of the extracted features was measured using the ANOVA f-test. Then, these features were used as input to several Machine Learning classification algorithms and the respective models were assessed.

Results

According to the results of the ANOVA f-test feature selection process, two attributes, skewness (3.34) and GLCM homogeneity (3.45), had the lowest ANOVA f-test scores. Therefore, both features were excluded from the rest of the experiment. Of the different ML algorithms tested, the Random Forest classifier was chosen to build the final ML model, since it presented the highest accuracy. The final model was able to predict malignant and benign neoplasms with 90.41% accuracy after the hyperparameter tuning process.

Conclusions

This study concludes that the above mentioned features (except skewness and GLCM homogeneity) are informative to identify and differentiate malignant from benign brain tumors. Moreover, they enable the development of a high-performance ML model that has the ability to assist in the decision-making steps of brain tumor diagnosis process, prior to attempting invasive diagnostic procedures, such as brain biopsies.

Huynh T, Nibali A, He Z. Semi-supervised learning for medical image classification using imbalanced training data. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2022;216:106628. [PMID: 35101700 DOI: 10.1016/j.cmpb.2022.106628] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Received: 09/01/2021] [Revised: 12/20/2021] [Accepted: 01/07/2022] [Indexed: 06/14/2023]

Abstract

BACKGROUND AND OBJECTIVE

Medical image classification is often challenging for two reasons: a lack of labelled examples due to expensive and time-consuming annotation protocols, and imbalanced class labels due to the relative scarcity of disease-positive individuals in the wider population. Semi-supervised learning methods exist for dealing with a lack of labels, but they generally do not address the problem of class imbalance. Hence, the purpose of this study is to explore a new approach to perturbation-based semi-supervised learning which tackles the problem of applying semi-supervised learning to medical image classification with imbalanced training data.

METHODS

In this study we propose Adaptive Blended Consistency Loss (ABCL), a simple yet effective drop-in replacement for consistency loss in perturbation-based semi-supervised learning methods. ABCL counteracts data skew by adaptively mixing the target class distribution of the consistency loss in accordance with class frequency. Our proposed method is evaluated and compared with existing methods on two different imbalanced medical image classification datasets. An ablation study is also provided to analyse the properties and effectiveness of our proposed method.

RESULTS

Our experiments with ABCL reveal improvements to unweighted average recall (UAR) when compared with existing consistency losses that are not designed to counteract class imbalance and other existing methods. Our proposed ABCL method is able to improve the performance of the baseline consistency loss approach from 0.59 to 0.67 UAR and outperforms methods that address the class imbalance problem for labelled data (between 0.51 and 0.59 UAR) and for unlabelled data (0.61 UAR) on the imbalanced skin cancer dataset. On the imbalanced retinal fundus glaucoma dataset, ABCL (combined with Weighted Cross Entropy loss) achieves 0.67 UAR, which is an improvement over the best existing approach (0.57 UAR).

CONCLUSIONS

Overall the results show the effectiveness of ABCL to alleviate the class imbalance problem for semi-supervised classification for medical images.
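
Unweighted average recall (UAR), the metric used in the Results, is the plain mean of the per-class recalls, so every class counts equally regardless of its size. The sketch below is illustrative only: the function name and toy labels are assumptions, and it does not reproduce the ABCL loss itself.

```python
def unweighted_average_recall(y_true, y_pred):
    """Mean of per-class recall; each class contributes equally."""
    recalls = []
    for c in sorted(set(y_true)):
        idx = [i for i, y in enumerate(y_true) if y == c]
        correct = sum(1 for i in idx if y_pred[i] == c)
        recalls.append(correct / len(idx))
    return sum(recalls) / len(recalls)

# Toy imbalanced example: 8 negatives, 2 positives (illustrative only).
y_true = [0] * 8 + [1] * 2
y_pred = [0] * 8 + [0, 1]  # one of the two positives is missed

accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
uar = unweighted_average_recall(y_true, y_pred)
print(accuracy, uar)
```

On this toy data the accuracy looks high because the majority class dominates, while UAR is pulled down by the missed minority case. That sensitivity to minority-class errors is the reason UAR suits imbalanced problems.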

Zimmerman J, Soler RE, Lavinder J, Murphy S, Atkins C, Hulbert L, Lusk R, Ng BP. Iterative guided machine learning-assisted systematic literature reviews: a diabetes case study. Syst Rev 2021;10:97. [PMID: 33810798 PMCID: PMC8017891 DOI: 10.1186/s13643-021-01640-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 07/22/2020] [Accepted: 03/19/2021] [Indexed: 11/10/2022] Open

Rácz A, Bajusz D, Héberger K. Effect of Dataset Size and Train/Test Split Ratios in QSAR/QSPR Multiclass Classification. Molecules 2021;26:1111. [PMID: 33669834 PMCID: PMC7922354 DOI: 10.3390/molecules26041111] [Citation(s) in RCA: 38] [Impact Index Per Article: 12.7] [Received: 12/23/2020] [Revised: 02/04/2021] [Accepted: 02/16/2021] [Indexed: 01/04/2023] Open

Jing XY, Zhang X, Zhu X, Wu F, You X, Gao Y, Shan S, Yang JY. Multiset Feature Learning for Highly Imbalanced Data Classification. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2021;43:139-156. [PMID: 31331881 DOI: 10.1109/tpami.2019.2929166] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Indexed: 06/10/2023]

Raj A, Dehingia N, Singh A, McDougal L, McAuley J. Application of machine learning to understand child marriage in India. SSM Popul Health 2020;12:100687. [PMID: 33335970 PMCID: PMC7732880 DOI: 10.1016/j.ssmph.2020.100687] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Received: 04/15/2020] [Revised: 10/29/2020] [Accepted: 10/30/2020] [Indexed: 11/22/2022] Open

Abstract

BACKGROUND

Prior research documents that India has the greatest number of girls married as minors of any nation in the world, increasing social and health risks for both these young wives and their children. While the prevalence of child marriage has declined in the nation, more work is needed to accelerate this decline and reduce the negative consequences of the practice. Expanded targets for intervention require greater identification of these targets. Machine learning can offer insight into identification of novel factors associated with child marriage that can serve as targets for intervention.

METHODS

We applied machine learning methods to retrospective cross-sectional survey data from India on demographics and health, the nationally representative National Family Health Survey, conducted in 2015-16. We analyzed data using a traditional regression model, with child marriage as the dependent variable and 4000+ variables from the survey as the independent variables. We also used three commonly used machine learning algorithms: Least Absolute Shrinkage and Selection Operator (lasso), or L1-regularized logistic regression, models; L2-regularized logistic regression, or ridge, models; and neural network models. Finally, we developed and applied a novel and rigorous approach involving expert qualitative review and coding of variables generated from an iterative series of regularized models to assess thematically key variable groupings associated with child marriage.

FINDINGS

Analyses revealed that regularized logistic and neural network applications demonstrated better accuracy and lower error rates than traditional logistic regression, with a greater number of features and variables generated. Regularized models highlight higher fertility and contraception, longer duration of marriage, geographic, and socioeconomic vulnerabilities as key correlates; findings shown in prior research. However, our novel method involving expert qualitative coding of variables generated from iterative regularized models and resultant thematic generation offered clarity on variables not focused upon in prior research, specifically non-utilization of health system benefits related to nutrition for mothers and infants.

INTERPRETATION

Machine learning appears to be a valid means of identifying key correlates of child marriage in India and, via our innovative iterative thematic approach, can be useful to identify novel variables associated with this outcome. Findings related to low nutritional service uptake also demonstrate the need for more focus on public health outreach for nutritional programs tailored to this population.

Wang K, Zhou Z, Wang R, Chen L, Zhang Q, Sher D, Wang J. A multi-objective radiomics model for the prediction of locoregional recurrence in head and neck squamous cell cancer. Med Phys 2020;47:5392-5400. [DOI: 10.1002/mp.14388] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Received: 02/18/2020] [Revised: 05/11/2020] [Accepted: 07/02/2020] [Indexed: 02/05/2023] Open

Almilaji O, Smith C, Surgenor S, Clegg A, Williams E, Thomas P, Snook J. Refinement and validation of the IDIOM score for predicting the risk of gastrointestinal cancer in iron deficiency anaemia. BMJ Open Gastroenterol 2020;7:e000403. [PMID: 32444424 PMCID: PMC7247388 DOI: 10.1136/bmjgast-2020-000403] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Received: 03/15/2020] [Revised: 03/30/2020] [Accepted: 04/08/2020] [Indexed: 01/27/2023] Open

Deep reinforcement learning for imbalanced classification. APPL INTELL 2020. [DOI: 10.1007/s10489-020-01637-z] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Indexed: 10/24/2022]

Féré M, Gobinet C, Liu LH, Beljebbar A, Untereiner V, Gheldof D, Chollat M, Klossa J, Chatelain B, Piot O. Implementation of a classification strategy of Raman data collected in different clinical conditions: application to the diagnosis of chronic lymphocytic leukemia. Anal Bioanal Chem 2019;412:949-962. [PMID: 31853604 DOI: 10.1007/s00216-019-02321-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Received: 09/16/2019] [Revised: 10/31/2019] [Accepted: 12/03/2019] [Indexed: 02/06/2023]

Dong Q, Gong S, Zhu X. Imbalanced Deep Learning by Minority Class Incremental Rectification. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2019;41:1367-1381. [PMID: 29993438 DOI: 10.1109/tpami.2018.2832629] [Citation(s) in RCA: 52] [Impact Index Per Article: 10.4] [Indexed: 06/08/2023]

Improving Intrusion Detection Model Prediction by Threshold Adaptation. INFORMATION 2019. [DOI: 10.3390/info10050159] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Indexed: 11/16/2022] Open

Enhancing techniques for learning decision trees from imbalanced data. ADV DATA ANAL CLASSI 2019. [DOI: 10.1007/s11634-019-00354-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Indexed: 10/27/2022]

Barzegar R, Asghari Moghaddam A, Adamowski J, Nazemi AH. Delimitation of groundwater zones under contamination risk using a bagged ensemble of optimized DRASTIC frameworks. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2019;26:8325-8339. [PMID: 30706265 DOI: 10.1007/s11356-019-04252-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Received: 10/24/2018] [Accepted: 01/14/2019] [Indexed: 06/09/2023]

Abstract

Developing a reliable groundwater vulnerability and contamination risk map is very important for groundwater management and protection. This study aims to compare various modified DRASTIC vulnerability frameworks based on rate calibration using the Wilcoxon rank-sum test (WRST), frequency ratio (FR) and weight optimization using the correlation coefficient (CC), the analytic hierarchy process (AHP), and genetic algorithms (GA), as well as to introduce, for the first time, an aggregated approach based on a bagging ensemble to develop a combined modified DRASTIC model. This research was conducted in the Khoy plain, NW Iran. To develop a typical DRASTIC map, seven DRASTIC data layers were generated, weighted, and then overlaid in ArcGIS. The nitrate (NO₃) concentrations at 54 sites in the study area were used to validate the models by calculating the correlation coefficient (r) between the vulnerability/risk indices and NO₃ concentrations. The calculated r value for the typical DRASTIC was 0.12. A sensitivity analysis reveals that the impact of the vadose zone and conductivity parameters with mean variation indices of 22.2 and 7.5%, respectively, have the highest and lowest influence on aquifer vulnerability. The r values increased for all the optimized frameworks. The results show that the WRST and GA methods are the most effective methods for calibration and optimization of DRASTIC rates and weights, with the WRST-GA-DRASTIC model obtaining an r value of 0.64. A bagging ensemble model was employed to combine the advantages of each standalone model. The bagging ensemble model yields an r value of 0.67. The ensemble model has the potential to increase the r value further than both the standalone optimized frameworks and the typical DRASTIC approach. In terms of spatial distribution class area (%), the bagging ensemble-DRASTIC model demonstrates that the moderate and low contamination risk classes with 16.4 and 23.1% of the total area cover the lowest and highest parts of the plain.

van Wyk F, Khojandi A, Kamaleswaran R. Improving Prediction Performance Using Hierarchical Analysis of Real-Time Data: A Sepsis Case Study. IEEE J Biomed Health Inform 2019;23:978-986. [PMID: 30676988 DOI: 10.1109/jbhi.2019.2894570] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Indexed: 11/09/2022]

Guermazi R, Chaabane I, Hammami M. AECID: Asymmetric entropy for classifying imbalanced data. Inf Sci (N Y) 2018. [DOI: 10.1016/j.ins.2018.07.076] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Indexed: 10/28/2022]

Zakeri V, Hodgson AJ. Classifying hard and soft bone tissues using drilling sounds. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2018;2017:2855-2858. [PMID: 29060493 DOI: 10.1109/embc.2017.8037452] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Indexed: 11/08/2022]

Baseer A, Weddell SJ, Jones RD. Prediction of microsleeps using pairwise joint entropy and mutual information between EEG channels. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2018;2017:4495-4498. [PMID: 29060896 DOI: 10.1109/embc.2017.8037855] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Indexed: 11/09/2022]

Mei N, Grossberg MD, Ng K, Navarro KT, Ellmore TM. Identifying sleep spindles with multichannel EEG and classification optimization. Comput Biol Med 2017;89:441-453. [PMID: 28886481 PMCID: PMC5650544 DOI: 10.1016/j.compbiomed.2017.08.030] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Received: 05/17/2017] [Revised: 08/28/2017] [Accepted: 08/29/2017] [Indexed: 11/18/2022]

Ahn H. Discussion. Int Stat Rev 2014. [DOI: 10.1111/insr.12061] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 12/01/2022]

Hagar JC, Eskelson BNI, Haggerty PK, Nelson SK, Vesely DG. Modeling marbled murrelet (Brachyramphus marmoratus) habitat using LiDAR-derived canopy data. WILDLIFE SOC B 2014. [DOI: 10.1002/wsb.407] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Indexed: 11/08/2022]

Lin WJ, Chen JJ. Class-imbalanced classifiers for high-dimensional data. Brief Bioinform 2012;14:13-26. [DOI: 10.1093/bib/bbs006] [Citation(s) in RCA: 178] [Impact Index Per Article: 14.8] [Indexed: 12/19/2022] Open

Lim N, Ahn H, Moon H, Chen JJ. Classification of High-Dimensional Data with Ensemble of Logistic Regression Models. J Biopharm Stat 2010;20:160-71. [DOI: 10.1080/10543400903280639] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Indexed: 10/20/2022]

Wang Y, Li Y, Ding J, Wang Y, Chang Y. Prediction of binding affinity for estrogen receptor alpha modulators using statistical learning approaches. Mol Divers 2008;12:93-102. [PMID: 18661245 DOI: 10.1007/s11030-008-9080-1] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Received: 02/08/2008] [Accepted: 05/23/2008] [Indexed: 02/06/2023]

Cox LA. What's wrong with risk matrices? RISK ANALYSIS : AN OFFICIAL PUBLICATION OF THE SOCIETY FOR RISK ANALYSIS 2008;28:497-512. [PMID: 18419665 DOI: 10.1111/j.1539-6924.2008.01030.x] [Citation(s) in RCA: 156] [Impact Index Per Article: 9.8] [Indexed: 05/26/2023]

Abstract

Risk matrices (tables mapping "frequency" and "severity" ratings to corresponding risk priority levels) are popular in applications as diverse as terrorism risk analysis, highway construction project management, office building risk analysis, climate change risk management, and enterprise risk management (ERM). National and international standards (e.g., Military Standard 882C and AS/NZS 4360:1999) have stimulated adoption of risk matrices by many organizations and risk consultants. However, little research rigorously validates their performance in actually improving risk management decisions. This article examines some mathematical properties of risk matrices and shows that they have the following limitations. (a) Poor Resolution. Typical risk matrices can correctly and unambiguously compare only a small fraction (e.g., less than 10%) of randomly selected pairs of hazards. They can assign identical ratings to quantitatively very different risks ("range compression"). (b) Errors. Risk matrices can mistakenly assign higher qualitative ratings to quantitatively smaller risks. For risks with negatively correlated frequencies and severities, they can be "worse than useless," leading to worse-than-random decisions. (c) Suboptimal Resource Allocation. Effective allocation of resources to risk-reducing countermeasures cannot be based on the categories provided by risk matrices. (d) Ambiguous Inputs and Outputs. Categorizations of severity cannot be made objectively for uncertain consequences. Inputs to risk matrices (e.g., frequency and severity categorizations) and resulting outputs (i.e., risk ratings) require subjective interpretation, and different users may obtain opposite ratings of the same quantitative risks. These limitations suggest that risk matrices should be used with caution, and only with careful explanations of embedded judgments.
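
The "range compression" limitation in point (a) can be made concrete with a small Python sketch. The setup is an assumption for illustration rather than the article's own construction: frequency and severity are scaled to [0, 1], quantitative risk is taken as their product, and the matrix is an equal-width 3×3 grid; the function names are hypothetical.

```python
def category(value, n_levels=3):
    """Map a value in [0, 1] to an equal-width category index 0..n_levels-1."""
    return min(int(value * n_levels), n_levels - 1)

def matrix_cell(frequency, severity, n_levels=3):
    """Cell of the risk matrix that a (frequency, severity) pair falls into."""
    return category(frequency, n_levels), category(severity, n_levels)

def quantitative_risk(frequency, severity):
    """Assumed quantitative risk: frequency times severity."""
    return frequency * severity

# Two hypothetical hazards that land in the same (lowest) cell.
hazard_a = (0.01, 0.01)
hazard_b = (0.33, 0.33)

same_cell = matrix_cell(*hazard_a) == matrix_cell(*hazard_b)
risk_ratio = quantitative_risk(*hazard_b) / quantitative_risk(*hazard_a)
print(same_cell, round(risk_ratio))
```

Both hazards receive whatever rating the lowest cell carries, even though their quantitative risks differ by roughly three orders of magnitude under the assumed product rule.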

Liu H, Papa E, Walker JD, Gramatica P. In silico screening of estrogen-like chemicals based on different nonlinear classification models. J Mol Graph Model 2007;26:135-44. [PMID: 17293141 DOI: 10.1016/j.jmgm.2007.01.003] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.9] [Received: 10/13/2006] [Revised: 01/10/2007] [Accepted: 01/12/2007] [Indexed: 01/28/2023]