Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Hu J, Yang H, Lyu MR, King I, Man-Cho So A. Online Nonlinear AUC Maximization for Imbalanced Data Sets. IEEE Trans Neural Netw Learn Syst 2018;29:882-895. [PMID: 28141529 DOI: 10.1109/tnnls.2016.2610465] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]

For:	Hu J, Yang H, Lyu MR, King I, Man-Cho So A. Online Nonlinear AUC Maximization for Imbalanced Data Sets. IEEE Trans Neural Netw Learn Syst 2018;29:882-895. [PMID: 28141529 DOI: 10.1109/tnnls.2016.2610465] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]

Number

Cited by Other Article(s)

Jimenez-Cruz R, Yáñez-Márquez C, Gonzalez-Mendoza M, Villuendas-Rey Y, Monroy R. Spherical model for Minimalist Machine Learning paradigm in handling complex databases. Front Artif Intell 2025;8:1521063. [PMID: 40028230 PMCID: PMC11868079 DOI: 10.3389/frai.2025.1521063] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2024] [Accepted: 01/27/2025] [Indexed: 03/05/2025] Open

Gu B, Bao R, Zhang C, Huang H. New Scalable and Efficient Online Pairwise Learning Algorithm. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024;35:17099-17110. [PMID: 37656641 DOI: 10.1109/tnnls.2023.3299756] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/03/2023]

Mao Y, Hao Y, Liu W, Lin X, Cao X. Class-Imbalanced-Aware Distantly Supervised Named Entity Recognition. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024;35:12117-12129. [PMID: 37099461 DOI: 10.1109/tnnls.2023.3252084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]

Bhat S, Mansoor A, Georgescu B, Panambur AB, Ghesu FC, Islam S, Packhäuser K, Rodríguez-Salas D, Grbic S, Maier A. AUCReshaping: improved sensitivity at high-specificity. Sci Rep 2023;13:21097. [PMID: 38036602 PMCID: PMC10689839 DOI: 10.1038/s41598-023-48482-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2023] [Accepted: 11/27/2023] [Indexed: 12/02/2023] Open

Abstract

The evaluation of deep-learning (DL) systems typically relies on the Area under the Receiver-Operating-Curve (AU-ROC) as a performance metric. However, AU-ROC, in its holistic form, does not sufficiently consider performance within specific ranges of sensitivity and specificity, which are critical for the intended operational context of the system. Consequently, two systems with identical AU-ROC values can exhibit significantly divergent real-world performance. This issue is particularly pronounced in the context of anomaly detection tasks, a commonly employed application of DL systems across various research domains, including medical imaging, industrial automation, manufacturing, cyber security, fraud detection, and drug research, among others. The challenge arises from the heavy class imbalance in training datasets, with the abnormality class often incurring a considerably higher misclassification cost compared to the normal class. Traditional DL systems address this by adjusting the weighting of the cost function or optimizing for specific points along the ROC curve. While these approaches yield reasonable results in many cases, they do not actively seek to maximize performance for the desired operating point. In this study, we introduce a novel technique known as AUCReshaping, designed to reshape the ROC curve exclusively within the specified sensitivity and specificity range, by optimizing sensitivity at a predetermined specificity level. This reshaping is achieved through an adaptive and iterative boosting mechanism that allows the network to focus on pertinent samples during the learning process. We primarily investigated the impact of AUCReshaping in the context of abnormality detection tasks, specifically in Chest X-Ray (CXR) analysis, followed by breast mammogram and credit card fraud detection tasks. The results reveal a substantial improvement, ranging from 2 to 40%, in sensitivity at high-specificity levels for binary classification tasks.

Collapse

Sun Y, Vong CM, Wang S. Fast AUC Maximization Learning Machine With Simultaneous Outlier Detection. IEEE TRANSACTIONS ON CYBERNETICS 2023;53:6843-6857. [PMID: 35476558 DOI: 10.1109/tcyb.2022.3164900] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]

Luo J, Qiao H, Zhang B. A Minimax Probability Machine for Nondecomposable Performance Measures. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2023;34:2353-2365. [PMID: 34473631 DOI: 10.1109/tnnls.2021.3106484] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]

Li Y, Hsu W. A classification for complex imbalanced data in disease screening and early diagnosis. Stat Med 2022;41:3679-3695. [PMID: 35603639 PMCID: PMC9541048 DOI: 10.1002/sim.9442] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2021] [Revised: 04/11/2022] [Accepted: 05/10/2022] [Indexed: 11/09/2022]

Li H, Guo W, Lu G, Shi Y. Augmentation Method for High Intra-Class Variation Data in Apple Detection. SENSORS (BASEL, SWITZERLAND) 2022;22:6325. [PMID: 36080783 PMCID: PMC9460715 DOI: 10.3390/s22176325] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/17/2022] [Revised: 08/17/2022] [Accepted: 08/18/2022] [Indexed: 06/15/2023]

Choi HS, Jung D, Kim S, Yoon S. Imbalanced Data Classification via Cooperative Interaction Between Classifier and Generator. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2022;33:3343-3356. [PMID: 33531305 DOI: 10.1109/tnnls.2021.3052243] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]

Dang Z, Li X, Gu B, Deng C, Huang H. Large-Scale Nonlinear AUC Maximization via Triply Stochastic Gradients. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2022;44:1385-1398. [PMID: 32946382 DOI: 10.1109/tpami.2020.3024987] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]

AI Models for Predicting Readmission of Pneumonia Patients within 30 Days after Discharge. ELECTRONICS 2022. [DOI: 10.3390/electronics11050673] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]

A dual encoder DAE neural network for imbalanced binary classification based on NSGA-III and GAN. Pattern Anal Appl 2021. [DOI: 10.1007/s10044-021-01035-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]

Gultekin S, Saha A, Ratnaparkhi A, Paisley J. MBA: Mini-Batch AUC Optimization. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2020;31:5561-5574. [PMID: 32142457 DOI: 10.1109/tnnls.2020.2969527] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]

Incorporating Particle Swarm Optimization into Improved Bacterial Foraging Optimization Algorithm Applied to Classify Imbalanced Data. Symmetry (Basel) 2020. [DOI: 10.3390/sym12020229] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023] Open

Chen YF, Lin CS, Hong CF, Lee DJ, Sun C, Lin HH. Design of a Clinical Decision Support System for Predicting Erectile Dysfunction in Men Using NHIRD Dataset. IEEE J Biomed Health Inform 2018;23:2127-2137. [PMID: 30369456 DOI: 10.1109/jbhi.2018.2877595] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]

Abstract

Erectile dysfunction (ED) affects millions of men worldwide. Men with ED generally complain failure to attain or maintain an adequate erection during sexual activity. The prevalence of ED is strongly correlated with age, affecting about 40% of men at age 40 and nearly 70% at age 70. A variety of chronic diseases, including diabetes, ischemic heart disease, congestive heart failure, hypertension, depression, chronic renal failure, obstructive sleep apnea, prostate disease, gout, and sleep disorder, were reported to be associated with ED. In this study, data retrieved from a subset of the National Health Insurance Research Database of Taiwan were used for designing the clinical decision support system (CDSS) for predicting ED incidences in men. The positive cases were male patients aged 20-65 who were diagnosed with ED between January 2000 and December 2010 confirmed by at least three outpatient visits or at least one inpatient visit, while the negative cases were randomly selected from the database without a history of ED and were frequency (1:1), age, and index year matched with the ED patients. Data of a total of 2832 ED patients and 2832 non-ED patients, each consisting of 41 features including index age, 10 comorbidities, and 30 other comorbidity-related variables, were retrieved for designing the predictive models. Integrated genetic algorithm and support vector machine was adopted to design the CDSSs with two experiments of independent training and testing (ITT) conducted to verify their effectiveness. In the 1st ITT experiment, data extracted from January 2000 till December 2005 (61.51%, 1742 positive cases and 1742 negative cases) were used for training and validating and the data retrieved from January 2006 till December 2010 were used for testing (38.49%), whereas in the 2nd ITT experiment, data in the training set (77.78%) were extracted from January 2000 till Deceber 2007 and those in the testing set (22.22%) were retrieved afterward. Tenfold cross validation and three different objective functions were adopted for obtaining the optimal models with best predictive performance in the training phase. The testing results show that the CDSSs achieved a predictive performance with accuracy, sensitivity, specificity, g-mean, and area under ROC curve of 74.72%-76.65%, 72.33%-83.76%, 69.54%-77.10%, 0.7468-0.7632, and 0.766-0.817, respectively. In conclusion, the CDSSs designed based on cost-sensitive objective functions as well as salient comorbidity-related features achieve satisfactory predictive performance for predicting ED incidences.

Collapse

Zhang Z, Hu Z, Yang H, Zhu R, Zuo D. Factorization machines and deep views-based co-training for improving answer quality prediction in online health expert question-answering services. J Biomed Inform 2018;87:21-36. [PMID: 30240803 DOI: 10.1016/j.jbi.2018.09.011] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2018] [Revised: 08/27/2018] [Accepted: 09/17/2018] [Indexed: 11/26/2022]

Abstract

In online health expert question-answering (HQA) services, it is significant to automatically determine the quality of the answers. There are two prominent challenges in this task. First, the answers are usually written in short text, which makes it difficult to absorb the text semantic information. Second, it usually lacks sufficient labeled data but contains a huge amount of unlabeled data. To tackle these challenges, we propose a novel deep co-training framework based on factorization machines (FM) and deep textual views to intelligently and automatically identify the quality of HQA systems. More specifically, we exploit additional domain-specific semantic information from domain-specific word embeddings to expand the semantic space of short text and apply FM to excavate the non-independent interaction relationships among diverse features within individual views for improving the performance of the base classifier via co-training. Our learned deep textual views, the convolutional neural networks (CNN) view which focuses on extracting local features using convolution filters to locally model short text and the dependency-sensitive convolutional neural networks (DSCNN) view which focuses on capturing long-distance dependency information within the text to globally model short text, can then overcome the challenge of feature sparseness in the short text answers from the doctors. The developed co-training framework can effectively mine the highly non-linear semantic information embedded in the unlabeled data and expose the highly non-linear relationships between different views, which minimizes the labeling effort. Finally, we conduct extensive empirical evaluations and demonstrate that our proposed method can significantly improve the predictive performance of the answer quality in the context of HQA services.

Collapse

Design of a Clinical Decision Support System for Fracture Prediction Using Imbalanced Dataset. JOURNAL OF HEALTHCARE ENGINEERING 2018;2018:9621640. [PMID: 29765586 PMCID: PMC5885339 DOI: 10.1155/2018/9621640] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/26/2017] [Revised: 01/11/2018] [Accepted: 01/23/2018] [Indexed: 11/18/2022]

Abstract

More than 1 billion people suffer from chronic respiratory diseases worldwide, accounting for more than 4 million deaths annually. Inhaled corticosteroid is a popular medication for treating chronic respiratory diseases. Its side effects include decreased bone mineral density and osteoporosis. The aims of this study are to investigate the association of inhaled corticosteroids and fracture and to design a clinical support system for fracture prediction. The data of patients aged 20 years and older, who had visited healthcare centers and been prescribed with inhaled corticosteroids within 2002-2010, were retrieved from the National Health Insurance Research Database (NHIRD). After excluding patients diagnosed with hip fracture or vertebrate fractures before using inhaled corticosteroid, a total of 11645 patients receiving inhaled corticosteroid therapy were included for this study. Among them, 1134 (9.7%) were diagnosed with hip fracture or vertebrate fracture. The statistical results showed that demographic information, chronic respiratory diseases and comorbidities, and corticosteroid-related variables (cumulative dose, mean exposed daily dose, follow-up duration, and exposed duration) were significantly different between fracture and nonfracture patients. The clinical decision support systems (CDSSs) were designed with integrated genetic algorithm (GA) and support vector machine (SVM) by training and validating the models with balanced training sets obtained by random and cluster-based undersampling methods and testing with the imbalanced NHIRD dataset. Two different objective functions were adopted for obtaining optimal models with best predictive performance. The predictive performance of the CDSSs exhibits a sensitivity of 69.84-77.00% and an AUC of 0.7495-0.7590. It was concluded that long-term use of inhaled corticosteroids may induce osteoporosis and exhibit higher incidence of hip or vertebrate fractures. The accumulated dose of ICS and OCS therapies should be continuously monitored, especially for patients with older age and women after menopause, to prevent from exceeding the maximum dosage.

Collapse

Implicit Heterogeneous Features Embedding in Deep Knowledge Tracing. Cognit Comput 2017. [DOI: 10.1007/s12559-017-9522-0] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]