1
|
Xu X, Wang Z, Ren S, Niu S, Li D. Local-Global Geometric Information and View Complementarity Introduced Multiview Metric Learning. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2025; 36:5428-5441. [PMID: 38546990 DOI: 10.1109/tnnls.2024.3380020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/06/2025]
Abstract
Geometry studies the spatial structure and location information of objects, providing a priori knowledge and intuitive explanation for classification methods. Considering samples from a geometric perspective offers a novel approach to understanding their information. In this article, we propose a method called local-global geometric information and view complementarity introduced multiview metric learning (GIVCMML). Our method effectively exploits the geometric information of multiview samples. The learned metric space retains the geometric relations of samples and makes them more separable. First, we propose the global geometrical constraint in the maximum margin criterion framework. By maximizing the distance between class centers in the metric space, we ensure that samples from different classes are well separated. Second, to maintain the manifold structure of the original space, we build an adjacency matrix that contains the sample label information. This helps explore the local geometric information of sample pairs. Finally, to better mine the complementary information of multiview samples, GIVCMML maximizes the correlation between each view in the metric space. This enables each view to adaptively learn from the others and explore the complementary information between views. We extensively evaluate the effectiveness of our method on real-world datasets. The experimental results demonstrate that GIVCMML achieves competitive performance compared with multiview metric learning (MvML) methods.
Collapse
|
2
|
Gu T, Wang Z, Xu X, Li D, Yang H, Du W. Frame-Level Teacher-Student Learning With Data Privacy for EEG Emotion Recognition. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2023; 34:11021-11028. [PMID: 35486553 DOI: 10.1109/tnnls.2022.3168935] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Recently, electroencephalogram (EEG) emotion recognition has gradually attracted a lot of attention. This brief designs a novel frame-level teacher-student framework with data privacy (FLTSDP) for EEG emotion recognition. The framework first proposes a teacher-student network without prior professional information for automated filtering of useful frame-level features by a gated mechanism and extracting high-level features by using knowledge distillation to capture the results of EEG emotion recognition from a teacher network and student networks. Then, the results from subnetworks are integrated by using the novel decision module, which, motivated by the voting mechanism, adjusts the composition of feature vectors and improves the weight of accurate prediction to optimize the integration effect. During training, an innovative data privacy protection mechanism is applied for avoiding data sharing, where each student network only inherits weights from all trained networks and does not inherit the training dataset. Here, the framework can be repeatedly optimized and improved by only training the next student subnetwork on new EEG signals. Experimental results show that our framework improves the accuracy of EEG emotion recognition by more than 5% and gets state-of-the-art performance for EEG emotion recognition in the subject-independent mode.
Collapse
|
3
|
Du G, Zhang J, Jiang M, Long J, Lin Y, Li S, Tan KC. Graph-Based Class-Imbalance Learning With Label Enhancement. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2023; 34:6081-6095. [PMID: 34928806 DOI: 10.1109/tnnls.2021.3133262] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Class imbalance is a common issue in the community of machine learning and data mining. The class-imbalance distribution can make most classical classification algorithms neglect the significance of the minority class and tend toward the majority class. In this article, we propose a label enhancement method to solve the class-imbalance problem in a graph manner, which estimates the numerical label and trains the inductive model simultaneously. It gives a new perspective on the class-imbalance learning based on the numerical label rather than the original logical label. We also present an iterative optimization algorithm and analyze the computation complexity and its convergence. To demonstrate the superiority of the proposed method, several single-label and multilabel datasets are applied in the experiments. The experimental results show that the proposed method achieves a promising performance and outperforms some state-of-the-art single-label and multilabel class-imbalance learning methods.
Collapse
|
4
|
Liu Z, Wang Y, Feng F, Liu Y, Li Z, Shan Y. A DDoS Detection Method Based on Feature Engineering and Machine Learning in Software-Defined Networks. SENSORS (BASEL, SWITZERLAND) 2023; 23:6176. [PMID: 37448025 DOI: 10.3390/s23136176] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Revised: 06/26/2023] [Accepted: 07/03/2023] [Indexed: 07/15/2023]
Abstract
Distributed denial-of-service (DDoS) attacks pose a significant cybersecurity threat to software-defined networks (SDNs). This paper proposes a feature-engineering- and machine-learning-based approach to detect DDoS attacks in SDNs. First, the CSE-CIC-IDS2018 dataset was cleaned and normalized, and the optimal feature subset was found using an improved binary grey wolf optimization algorithm. Next, the optimal feature subset was trained and tested in Random Forest (RF), Support Vector Machine (SVM), K-Nearest Neighbor (k-NN), Decision Tree, and XGBoost machine learning algorithms, from which the best classifier was selected for DDoS attack detection and deployed in the SDN controller. The results show that RF performs best when compared across several performance metrics (e.g., accuracy, precision, recall, F1 and AUC values). We also explore the comparison between different models and algorithms. The results show that our proposed method performed the best and can effectively detect and identify DDoS attacks in SDNs, providing a new idea and solution for the security of SDNs.
Collapse
Affiliation(s)
- Zhenpeng Liu
- School of Electronic Information Engineering, Hebei University, Baoding 071002, China
- Information Technology Center, Hebei University, Baoding 071002, China
| | - Yihang Wang
- School of Electronic Information Engineering, Hebei University, Baoding 071002, China
| | - Fan Feng
- Information Technology Center, Hebei University, Baoding 071002, China
| | - Yifan Liu
- School of Cyberspace Security and Computer, Hebei University, Baoding 071002, China
| | - Zelin Li
- School of Electronic Information Engineering, Hebei University, Baoding 071002, China
| | - Yawei Shan
- School of Electronic Information Engineering, Hebei University, Baoding 071002, China
| |
Collapse
|
5
|
ProtoGAN: Towards high diversity and fidelity image synthesis under limited data. Inf Sci (N Y) 2023. [DOI: 10.1016/j.ins.2023.03.042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/13/2023]
|
6
|
Xu X, Wang Z, Fu Z, Guo W, Chi Z, Li D. Flexible few-shot class-incremental learning with prototype container. Neural Comput Appl 2023. [DOI: 10.1007/s00521-023-08272-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/09/2023]
|
7
|
Federated Probability Memory Recall for Federated Continual Learning. Inf Sci (N Y) 2023. [DOI: 10.1016/j.ins.2023.02.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/12/2023]
|
8
|
Multi-feature space similarity supplement for few-shot class incremental learning. Knowl Based Syst 2023. [DOI: 10.1016/j.knosys.2023.110394] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/16/2023]
|
9
|
Huang ZA, Sang Y, Sun Y, Lv J. Neural network with absent minority class samples and boundary shifting for imbalanced data classification. Neural Comput Appl 2023. [DOI: 10.1007/s00521-022-08135-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
|
10
|
Choi YH, Yang J. Machine learning iterative filtering algorithm for field defect detection in the process stage. COMPUT IND 2022. [DOI: 10.1016/j.compind.2022.103740] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
11
|
Multi-view multi-manifold learning with local and global structure preservation. APPL INTELL 2022. [DOI: 10.1007/s10489-022-04101-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
12
|
Xia S, Zheng Y, Wang G, He P, Li H, Chen Z. Random Space Division Sampling for Label-Noisy Classification or Imbalanced Classification. IEEE TRANSACTIONS ON CYBERNETICS 2022; 52:10444-10457. [PMID: 33909577 DOI: 10.1109/tcyb.2021.3070005] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
This article presents a simple sampling method, which is very easy to be implemented, for classification by introducing the idea of random space division, called "random space division sampling" (RSDS). It can extract the boundary points as the sampled result by efficiently distinguishing the label noise points, inner points, and boundary points. This makes it the first general sampling method for classification that not only can reduce the data size but also enhance the classification accuracy of a classifier, especially in the label-noisy classification. The "general" means that it is not restricted to any specific classifiers or datasets (regardless of whether a dataset is linear or not). Furthermore, the RSDS can online accelerate most classifiers because of its lower time complexity than most classifiers. Moreover, the RSDS can be used as an undersampling method for imbalanced classification. The experimental results on benchmark datasets demonstrate its effectiveness and efficiency. The code of the RSDS and comparison algorithms is available at: https://github.com/syxiaa/RSDS.
Collapse
|
13
|
Chi Z, Wang Z, Du W. Explicit Metric-Based Multiconcept Multi-Instance Learning With Triplet and Superbag. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2022; 33:5888-5897. [PMID: 33882006 DOI: 10.1109/tnnls.2021.3071814] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Multi-instance learning (MIL) has garnered considerable attention in recent years due to its favorable performance in various scenarios. Nonetheless, most previous studies have implicitly expressed the correlation between instances and bags. Moreover, the importance of negative instances has been largely overlooked. Hence, we seek to present an explicit and intuitively understandable method that can compensate for these deficiencies. In this article, we creatively introduce a metric-based multiconcept MIL approach based on two aspects. First, the triplet-based bag embedding method identifies instance categories and builds attention weights for every instance explicitly. Accordingly, bag embedding is accomplished under the limitation of weak supervision. Second, the developed instance correlation metric approach in the superbag considers the multiconcept issue to boost the model generalization performance. We have designed a rich variety of experiments to demonstrate the performance of our algorithm. The artificial data experiment reveals the interpretability of the proposed network. The results of the comparison experiment confirm that our method shows favorable performance in multiple tasks. Finally, we illustrate the motivation of the presented method by the ablation experiments.
Collapse
|
14
|
A framework to detect DDoS attack in Ryu controller based software defined networks using feature extraction and classification. APPL INTELL 2022. [DOI: 10.1007/s10489-022-03565-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
15
|
Huang Z, Li J. Noise-Tolerant Discrimination Indexes for Fuzzy ɣ Covering and Feature Subset Selection. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2022; PP:609-623. [PMID: 35622800 DOI: 10.1109/tnnls.2022.3175922] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Fuzzy β covering (FBC) has attracted considerable attention in recent years. Nevertheless, as the basic information granularity of FBC, fuzzy β neighborhood does not satisfy reflexivity, which may lead to instability in classification learning and decision-making. Although a few studies have involved reflexive fuzzy β neighborhoods, they only focus on a single fuzzy covering and cannot effectively deal with the information representation and information fusion of multiple fuzzy coverings. Moreover, there is a lack of investigation on noise-tolerant uncertainty measures for FBC, as well as their application in feature selection. Motivated by these issues, we investigate a noise-tolerant variable precision discrimination index (VPDI) by means of a new reflexive fuzzy covering neighborhood. To this end, fuzzy ɣ neighborhood with reflexivity is introduced to characterize the information fusion of a fuzzy covering family. An uncertainty measure called fuzzy ɣ neighborhood discrimination index is then presented to reflect the discriminatory power of fuzzy covering families. Some variants of the uncertainty measure, such as variable precision joint discrimination index, variable precision conditional discrimination index, and variable precision mutual discrimination index, are then put forth by means of fuzzy decision. These VPDIs can be used as an evaluation metric for a family of fuzzy coverings. Finally, the knowledge reduction of fuzzy covering decision systems is addressed from the point of keeping the discriminatory power, and a heuristic feature selection algorithm is designed by means of the variable precision conditional discrimination index. The experiments on 16 public datasets exhibit that the proposed algorithm can effectively reduce redundant features and achieve competitive results compared with six state-of-the-art feature selection algorithms. Moreover, it demonstrates strong robustness to the interference of random noise.
Collapse
|
16
|
Ren J, Wang Y, Mao M, Cheung YM. Equalization ensemble for large scale highly imbalanced data classification. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.108295] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
17
|
Jiang M, Yang Y, Qiu H. Fuzzy entropy and fuzzy support-based boosting random forests for imbalanced data. APPL INTELL 2022. [DOI: 10.1007/s10489-021-02620-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
18
|
Binary imbalanced big data classification based on fuzzy data reduction and classifier fusion. Soft comput 2022. [DOI: 10.1007/s00500-021-06654-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
19
|
Wang Z, Jia P, Xu X, Wang B, Zhu Y, Li D. Sample and feature selecting based ensemble learning for imbalanced problems. Appl Soft Comput 2021. [DOI: 10.1016/j.asoc.2021.107884] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
20
|
Feature selection via minimizing global redundancy for imbalanced data. APPL INTELL 2021. [DOI: 10.1007/s10489-021-02855-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
21
|
UFFDFR: Undersampling framework with denoising, fuzzy c-means clustering, and representative sample selection for imbalanced data classification. Inf Sci (N Y) 2021. [DOI: 10.1016/j.ins.2021.07.053] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
22
|
An Intelligent Fusion Algorithm and Its Application Based on Subgroup Migration and Adaptive Boosting. Symmetry (Basel) 2021. [DOI: 10.3390/sym13040569] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Imbalanced data and feature redundancies are common problems in many fields, especially in software defect prediction, data mining, machine learning, and industrial big data application. To resolve these problems, we propose an intelligent fusion algorithm, SMPSO-HS-AdaBoost, which combines particle swarm optimization based on subgroup migration and adaptive boosting based on hybrid-sampling. In this paper, we apply the proposed intelligent fusion algorithm to software defect prediction to improve the prediction efficiency and accuracy by solving the issues caused by imbalanced data and feature redundancies. The results show that the proposed algorithm resolves the coexisting problems of imbalanced data and feature redundancies, and ensures the efficiency and accuracy of software defect prediction.
Collapse
|