1
|
Yan X, Wang S, Chen H, Zhu H. Multi-view learning with enhanced multi-weight vector projection support vector machine. Neural Netw 2025; 185:107180. [PMID: 39864229 DOI: 10.1016/j.neunet.2025.107180] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2024] [Revised: 01/11/2025] [Accepted: 01/14/2025] [Indexed: 01/28/2025]
Abstract
Multi-view learning aims on learning from the data represented by multiple distinct feature sets. Various multi-view support vector machine methods have been successfully applied to classification tasks. However, the existed methods often face the problems of long processing time or weak generalization on some complex datasets. In this paper, two multi-view enhanced multi-weight vector projection support vector machine models are proposed. One is a ratio form of multi-view enhanced multi-weight vector projection support vector machine (R-MvEMV), while the other is a difference form (D-MvEMV). Instead of searching for specific classification hyperplanes, each proposed model tries to generate two projection matrices composed of a set of projection vectors for each view. A co-regularization term is added to maximize the consistency of different views. R-MvEMV and D-MvEMV can be simplified to two generalized eigenvalue problems and two eigenvalue problems, respectively. The optimal weight vector projections are the eigenvectors corresponding to the smallest eigenvalues. Some numerical tests are conducted to compare the proposed methods with the other state-of-art multi-view support vector machine methods. The numerical results show the better classification performance and higher efficiency of the proposed methods.
Collapse
Affiliation(s)
- Xin Yan
- School of Statistics and Information, Shanghai University of International Business and Economics, Shanghai 201620, China
| | - Shuaixing Wang
- School of Statistics and Information, Shanghai University of International Business and Economics, Shanghai 201620, China
| | - Huina Chen
- School of Statistics and Information, Shanghai University of International Business and Economics, Shanghai 201620, China
| | - Hongmiao Zhu
- School of Management, Shanghai University of International Business and Economics, Shanghai 201620, China.
| |
Collapse
|
2
|
Xin L, Yang W, Wang L, Yang M. Selective Contrastive Learning for Unpaired Multi-View Clustering. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2025; 36:1749-1763. [PMID: 37995163 DOI: 10.1109/tnnls.2023.3329658] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/25/2023]
Abstract
In this article, we investigate a novel but insufficiently studied issue, unpaired multi-view clustering (UMC), where no paired observed samples exist in multi-view data, and the goal is to leverage the unpaired observed samples in all views for effective joint clustering. Existing methods in incomplete multi-view clustering usually utilize the sample pairing relationship between views to connect the views for joint clustering, but unfortunately, it is invalid for the UMC case. Therefore, we strive to mine a consistent cluster structure between views and propose an effective method, namely selective contrastive learning for UMC (scl-UMC), which needs to solve the following two challenging issues: 1) uncertain clustering structure under no supervision information and 2) uncertain pairing relationship between the clusters of views. Specifically, for the first one, we design an inner-view (IV) selective contrastive learning module to enhance the clustering structures and alleviate the uncertainty, which selects confident samples near the cluster centroids to perform contrastive learning in each view. For the second one, we design a cross-view (CV) selective contrastive learning module to first iteratively match the clusters between views and then tighten the matched clusters. Also, we utilize mutual information to further enhance the correlation of the matched clusters between views. Extensive experiments show the efficiency of our methods for UMC, compared with the state-of-the-art methods.
Collapse
|
3
|
Dong Q, Cai H, Li Z, Liu J, Hu B. A Multiview Brain Network Transformer Fusing Individualized Information for Autism Spectrum Disorder Diagnosis. IEEE J Biomed Health Inform 2024; 28:4854-4865. [PMID: 38700974 DOI: 10.1109/jbhi.2024.3396457] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/05/2024]
Abstract
Functional connectivity (FC) networks, built from analyses of resting-state magnetic resonance imaging (rs-fMRI), serve as efficacious biomarkers for identifying Autism Spectrum Disorders (ASD) patients. Given the neurobiological heterogeneity across individuals and the unique presentation of ASD symptoms, the fusion of individualized information into diagnosis becomes essential. However, this aspect is overlooked in most methods. Furthermore, the existing methods typically focus on studying direct pairwise connections between brain ROIs, while disregarding interactions between indirectly connected neighbors. To overcome above challenges, we build common FC and individualized FC by tangent pearson embedding (TP) and common orthogonal basis extraction (COBE) respectively, and present a novel multiview brain transformer (MBT) aimed at effectively fusing common and indivinformation of subjects. MBT is mainly constructed by transformer layers with diffusion kernel (DK), fusion quality-inspired weighting module (FQW), similarity loss and orthonormal clustering fusion readout module (OCFRead). DK transformer can incorporate higher-order random walk methods to capture wider interactions among indirectly connected brain regions. FQW promotes adaptive fusion of features between views, and similarity loss and OCFRead are placed on the last layer to accomplish the ultimate integration of information. In our method, TP, DK and FQW modules all help to model wider connectivity in the brain that make up for the shortcomings of traditional methods. We conducted experiments on the public ABIDE dataset based on AAL and CC200 respectively. Our framework has shown promising results, outperforming state-of-the-art methods on both templates. This suggests its potential as a valuable approach for clinical ASD diagnosis.
Collapse
|
4
|
Sheikhpour R. A local spline regression-based framework for semi-supervised sparse feature selection. Knowl Based Syst 2023. [DOI: 10.1016/j.knosys.2023.110265] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
|
5
|
Shang R, Kong J, Wang L, Zhang W, Wang C, Li Y, Jiao L. Unsupervised feature selection via discrete spectral clustering and feature weights. Neurocomputing 2023. [DOI: 10.1016/j.neucom.2022.10.053] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
6
|
Wang Q, Jiang X, Chen M, Li X. Autoweighted Multiview Feature Selection With Graph Optimization. IEEE TRANSACTIONS ON CYBERNETICS 2022; 52:12966-12977. [PMID: 34398782 DOI: 10.1109/tcyb.2021.3094843] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
In this article, we focus on the unsupervised multiview feature selection, which tries to handle high-dimensional data in the field of multiview learning. Although some graph-based methods have achieved satisfactory performance, they ignore the underlying data structure across different views. Besides, their predefined Laplacian graphs are sensitive to the noises in the original data space and fail to obtain the optimal neighbor assignment. To address the above problems, we propose a novel unsupervised multiview feature selection model based on graph learning, and the contributions are three-fold: 1) during the feature selection procedure, the consensus similarity graph shared by different views is learned. Therefore, the proposed model can reveal the data relationship from the feature subset; 2) a reasonable rank constraint is added to optimize the similarity matrix to obtain more accurate information; and 3) an autoweighted framework is presented to assign view weights adaptively, and an effective alternative iterative algorithm is proposed to optimize the problem. Experiments on various datasets demonstrate the superiority of the proposed method compared to the state-of-the-art methods.
Collapse
|
7
|
Multiview nonlinear discriminant structure learning for emotion recognition. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.110042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
8
|
Chen T, Zeng Y, Yuan H, Zhong G, Lai LL, Tang YY. Multi-level regularization-based unsupervised multi-view feature selection with adaptive graph learning. INT J MACH LEARN CYB 2022. [DOI: 10.1007/s13042-022-01721-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
9
|
Multi-view dimensionality reduction learning with hierarchical sparse feature selection. APPL INTELL 2022. [DOI: 10.1007/s10489-022-04161-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
10
|
Xu C, Liu H, Guan Z, Wu X, Tan J, Ling B. Adversarial Incomplete Multiview Subspace Clustering Networks. IEEE TRANSACTIONS ON CYBERNETICS 2022; 52:10490-10503. [PMID: 33750730 DOI: 10.1109/tcyb.2021.3062830] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Multiview clustering aims to leverage information from multiple views to improve the clustering performance. Most previous works assumed that each view has complete data. However, in real-world datasets, it is often the case that a view may contain some missing data, resulting in the problem of incomplete multiview clustering (IMC). Previous approaches to this problem have at least one of the following drawbacks: 1) employing shallow models, which cannot well handle the dependence and discrepancy among different views; 2) ignoring the hidden information of the missing data; and 3) being dedicated to the two-view case. To eliminate all these drawbacks, in this work, we present the adversarial IMC (AIMC) framework. In particular, AIMC seeks the common latent representation of multiview data for reconstructing raw data and inferring missing data. The elementwise reconstruction and the generative adversarial network are integrated to evaluate the reconstruction. They aim to capture the overall structure and get a deeper semantic understanding, respectively. Moreover, the clustering loss is designed to obtain a better clustering structure. We explore two variants of AIMC, namely: 1) autoencoder-based AIMC (AAIMC) and 2) generalized AIMC (GAIMC), with different strategies to obtain the multiview common representation. Experiments conducted on six real-world datasets show that AAIMC and GAIMC perform well and outperform the baseline methods.
Collapse
|
11
|
Wang Y, Wang Z, Hu Q, Zhou Y, Su H. Hierarchical Semantic Risk Minimization for Large-Scale Classification. IEEE TRANSACTIONS ON CYBERNETICS 2022; 52:9546-9558. [PMID: 33729972 DOI: 10.1109/tcyb.2021.3059631] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Hierarchical structures of labels usually exist in large-scale classification tasks, where labels can be organized into a tree-shaped structure. The nodes near the root stand for coarser labels, while the nodes close to leaves mean the finer labels. We label unseen samples from the root node to a leaf node, and obtain multigranularity predictions in the hierarchical classification. Sometimes, we cannot obtain a leaf decision due to uncertainty or incomplete information. In this case, we should stop at an internal node, rather than going ahead rashly. However, most existing hierarchical classification models aim at maximizing the percentage of correct predictions, and do not take the risk of misclassifications into account. Such risk is critically important in some real-world applications, and can be measured by the distance between the ground truth and the predicted classes in the class hierarchy. In this work, we utilize the semantic hierarchy to define the classification risk and design an optimization technique to reduce such risk. By defining the conservative risk and the precipitant risk as two competing risk factors, we construct the balanced conservative/precipitant semantic (BCPS) risk matrix across all nodes in the semantic hierarchy with user-defined weights to adjust the tradeoff between two kinds of risks. We then model the classification process on the semantic hierarchy as a sequential decision-making task. We design an algorithm to derive the risk-minimized predictions. There are two modules in this model: 1) multitask hierarchical learning and 2) deep reinforce multigranularity learning. The first one learns classification confidence scores of multiple levels. These scores are then fed into deep reinforced multigranularity learning for obtaining a global risk-minimized prediction with flexible granularity. Experimental results show that the proposed model outperforms state-of-the-art methods on seven large-scale classification datasets with the semantic tree.
Collapse
|
12
|
Co-consensus semi-supervised multi-view learning with orthogonal non-negative matrix factorization. Inf Process Manag 2022. [DOI: 10.1016/j.ipm.2022.103054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|
13
|
Xiong D, Ying S, Zhu H. Intrinsic partial linear models for manifold-valued data. Inf Process Manag 2022. [DOI: 10.1016/j.ipm.2022.102954] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
14
|
Zhang Y, Zhang H, Adeli E, Chen X, Liu M, Shen D. Multiview Feature Learning With Multiatlas-Based Functional Connectivity Networks for MCI Diagnosis. IEEE TRANSACTIONS ON CYBERNETICS 2022; 52:6822-6833. [PMID: 33306476 DOI: 10.1109/tcyb.2020.3016953] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Functional connectivity (FC) networks built from resting-state functional magnetic resonance imaging (rs-fMRI) has shown promising results for the diagnosis of Alzheimer's disease and its prodromal stage, that is, mild cognitive impairment (MCI). FC is usually estimated as a temporal correlation of regional mean rs-fMRI signals between any pair of brain regions, and these regions are traditionally parcellated with a particular brain atlas. Most existing studies have adopted a predefined brain atlas for all subjects. However, the constructed FC networks inevitably ignore the potentially important subject-specific information, particularly, the subject-specific brain parcellation. Similar to the drawback of the "single view" (versus the "multiview" learning) in medical image-based classification, FC networks constructed based on a single atlas may not be sufficient to reveal the underlying complicated differences between normal controls and disease-affected patients due to the potential bias from that particular atlas. In this study, we propose a multiview feature learning method with multiatlas-based FC networks to improve MCI diagnosis. Specifically, a three-step transformation is implemented to generate multiple individually specified atlases from the standard automated anatomical labeling template, from which a set of atlas exemplars is selected. Multiple FC networks are constructed based on these preselected atlas exemplars, providing multiple views of the FC network-based feature representations for each subject. We then devise a multitask learning algorithm for joint feature selection from the constructed multiple FC networks. The selected features are jointly fed into a support vector machine classifier for multiatlas-based MCI diagnosis. Extensive experimental comparisons are carried out between the proposed method and other competing approaches, including the traditional single-atlas-based method. The results indicate that our method significantly improves the MCI classification, demonstrating its promise in the brain connectome-based individualized diagnosis of brain diseases.
Collapse
|
15
|
Zhao D, Gao Q, Lu Y, Sun D. Learning view-specific labels and label-feature dependence maximization for multi-view multi-label classification. Appl Soft Comput 2022. [DOI: 10.1016/j.asoc.2022.109071] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
16
|
Zeng L, Li H, Xiao T, Shen F, Zhong Z. Graph convolutional network with sample and feature weights for Alzheimer’s disease diagnosis. Inf Process Manag 2022. [DOI: 10.1016/j.ipm.2022.102952] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|
17
|
Liu W, Yuan J, Lyu G, Feng S. Label driven latent subspace learning for multi-view multi-label classification. APPL INTELL 2022. [DOI: 10.1007/s10489-022-03600-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
18
|
Zhang ML, Li YK, Yang H, Liu XY. Towards Class-Imbalance Aware Multi-Label Learning. IEEE TRANSACTIONS ON CYBERNETICS 2022; 52:4459-4471. [PMID: 33206614 DOI: 10.1109/tcyb.2020.3027509] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Multi-label learning deals with training examples each represented by a single instance while associated with multiple class labels. Due to the exponential number of possible label sets to be considered by the predictive model, it is commonly assumed that label correlations should be well exploited to design an effective multi-label learning approach. On the other hand, class-imbalance stands as an intrinsic property of multi-label data which significantly affects the generalization performance of the multi-label predictive model. For each class label, the number of training examples with positive labeling assignment is generally much less than those with negative labeling assignment. To deal with the class-imbalance issue for multi-label learning, a simple yet effective class-imbalance aware learning strategy called cross-coupling aggregation (COCOA) is proposed in this article. Specifically, COCOA works by leveraging the exploitation of label correlations as well as the exploration of class-imbalance simultaneously. For each class label, a number of multiclass imbalance learners are induced by randomly coupling with other labels, whose predictions on the unseen instance are aggregated to determine the corresponding labeling relevancy. Extensive experiments on 18 benchmark datasets clearly validate the effectiveness of COCOA against state-of-the-art multi-label learning approaches especially in terms of imbalance-specific evaluation metrics.
Collapse
|
19
|
MTGCN: A multi-task approach for node classification and link prediction in graph data. Inf Process Manag 2022. [DOI: 10.1016/j.ipm.2022.102902] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
|
20
|
|
21
|
Improved Multiple Vector Representations of Images and Robust Dictionary Learning. ELECTRONICS 2022. [DOI: 10.3390/electronics11060847] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Each sparse representation classifier has different classification accuracy for different samples. It is difficult to achieve good performance with a single feature classification model. In order to balance the large-scale information and global features of images, a robust dictionary learning method based on image multi-vector representation is proposed in this paper. First, this proposed method generates a reasonable virtual image for the original image and obtains the multi-vector representation of all images. Second, the same dictionary learning algorithm is used for each vector representation to obtain multiple sets of image features. The proposed multi-vector representation can provide a good global understanding of the whole image contour and increase the content of dictionary learning. Last, the weighted fusion algorithm is used to classify the test samples. The introduction of influencing factors and the automatic adjustment of the weights of each classifier in the final decision results have a significant indigenous effect on better extracting image features. The study conducted experiments on the proposed algorithm on a number of widely used image databases. A large number of experimental results show that it effectively improves the accuracy of image classification. At the same time, to fully dig and exploit possible representation diversity might be a better way to lead to potential various appearances and high classification accuracy concerning the image.
Collapse
|
22
|
Deploying Machine Learning Techniques for Human Emotion Detection. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:8032673. [PMID: 35154306 PMCID: PMC8828335 DOI: 10.1155/2022/8032673] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/22/2021] [Revised: 10/12/2021] [Accepted: 10/22/2021] [Indexed: 11/17/2022]
Abstract
Emotion recognition is one of the trending research fields. It is involved in several applications. Its most interesting applications include robotic vision and interactive robotic communication. Human emotions can be detected using both speech and visual modalities. Facial expressions can be considered as ideal means for detecting the persons' emotions. This paper presents a real-time approach for implementing emotion detection and deploying it in the robotic vision applications. The proposed approach consists of four phases: preprocessing, key point generation, key point selection and angular encoding, and classification. The main idea is to generate key points using MediaPipe face mesh algorithm, which is based on real-time deep learning. In addition, the generated key points are encoded using a sequence of carefully designed mesh generator and angular encoding modules. Furthermore, feature decomposition is performed using Principal Component Analysis (PCA). This phase is deployed to enhance the accuracy of emotion detection. Finally, the decomposed features are enrolled into a Machine Learning (ML) technique that depends on a Support Vector Machine (SVM), k-Nearest Neighbor (KNN), Naïve Bayes (NB), Logistic Regression (LR), or Random Forest (RF) classifier. Moreover, we deploy a Multilayer Perceptron (MLP) as an efficient deep neural network technique. The presented techniques are evaluated on different datasets with different evaluation metrics. The simulation results reveal that they achieve a superior performance with a human emotion detection accuracy of 97%, which ensures superiority among the efforts in this field.
Collapse
|
23
|
Attribute and label distribution driven multi-label active learning. APPL INTELL 2022. [DOI: 10.1007/s10489-021-03086-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
24
|
Multi-view multi-label-based online method with threefold correlations and dynamic updating multi-region. Neural Comput Appl 2022. [DOI: 10.1007/s00521-021-06766-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
|
25
|
Sparse robust multiview feature selection via adaptive-weighting strategy. INT J MACH LEARN CYB 2021. [DOI: 10.1007/s13042-021-01453-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
26
|
Lin Q, Yang L, Zhong P, Zou H. Robust supervised multi-view feature selection with weighted shared loss and maximum margin criterion. Knowl Based Syst 2021. [DOI: 10.1016/j.knosys.2021.107331] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
27
|
|
28
|
Li Y, Fang Y, Wang J, Zhang H, Hu B. Biomarker Extraction Based on Subspace Learning for the Prediction of Mild Cognitive Impairment Conversion. BIOMED RESEARCH INTERNATIONAL 2021; 2021:5531940. [PMID: 34513992 PMCID: PMC8429015 DOI: 10.1155/2021/5531940] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/03/2021] [Accepted: 08/13/2021] [Indexed: 01/18/2023]
Abstract
Accurate recognition of progressive mild cognitive impairment (MCI) is helpful to reduce the risk of developing Alzheimer's disease (AD). However, it is still challenging to extract effective biomarkers from multivariate brain structural magnetic resonance imaging (MRI) features to accurately differentiate the progressive MCI from stable MCI. We develop novel biomarkers by combining subspace learning methods with the information of AD as well as normal control (NC) subjects for the prediction of MCI conversion using multivariate structural MRI data. Specifically, we first learn two projection matrices to map multivariate structural MRI data into a common label subspace for AD and NC subjects, where the original data structure and the one-to-one correspondence between multiple variables are kept as much as possible. Afterwards, the multivariate structural MRI features of MCI subjects are mapped into a common subspace according to the projection matrices. We then perform the self-weighted operation and weighted fusion on the features in common subspace to extract the novel biomarkers for MCI subjects. The proposed biomarkers are tested on Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset. Experimental results indicate that our proposed biomarkers outperform the competing biomarkers on the discrimination between progressive MCI and stable MCI. And the improvement from the proposed biomarkers is not limited to a particular classifier. Moreover, the results also confirm that the information of AD and NC subjects is conducive to predicting conversion from MCI to AD. In conclusion, we find a good representation of brain features from high-dimensional MRI data, which exhibits promising performance for predicting conversion from MCI to AD.
Collapse
Affiliation(s)
- Ying Li
- Key Laboratory of TCM Data Cloud Service in Universities of Shandong, Shandong Management University, Jinan 250357, China
- School of Information Science and Engineering, Shandong Normal University, Jinan 250358, China
| | - Yixian Fang
- School of Mathematics and Statistics, Qilu University of Technology, Jinan 250353, China
| | | | - Huaxiang Zhang
- School of Information Science and Engineering, Shandong Normal University, Jinan 250358, China
| | - Bin Hu
- School of Information Science and Engineering, Shandong Normal University, Jinan 250358, China
- Key Laboratory of Wearable Computing of Gansu Province, Lanzhou University, Lanzhou 730000, China
| |
Collapse
|
29
|
Sun S, Zong D. LCBM: A Multi-View Probabilistic Model for Multi-Label Classification. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2021; 43:2682-2696. [PMID: 32078533 DOI: 10.1109/tpami.2020.2974203] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Multi-label classification is an important research topic in machine learning, for which exploiting label dependencies is an effective modeling principle. Recently, probabilistic models have shown great potential in discovering dependencies among labels. In this paper, motivated by the recent success of multi-view learning to improve the generalization performance, we propose a novel multi-view probabilistic model named latent conditional Bernoulli mixture (LCBM) for multi-label classification. LCBM is a generative model taking features from different views as inputs, and conditional on the latent subspace shared by the views a Bernoulli mixture model is adopted to build label dependencies. Inside each component of the mixture, the labels have a weak correlation which facilitates computational convenience. The mean field variational inference framework is used to carry out approximate posterior inference in the probabilistic model, where we propose a Gaussian mixture variational autoencoder (GMVAE) for effective posterior approximation. We further develop a scalable stochastic training algorithm for efficiently optimizing the model parameters and variational parameters, and derive an efficient prediction procedure based on greedy search. Experimental results on multiple benchmark datasets show that our approach outperforms other state-of-the-art methods under various metrics.
Collapse
|
30
|
Zhang J, Liu M, Lu K, Gao Y. Group-Wise Learning for Aurora Image Classification With Multiple Representations. IEEE TRANSACTIONS ON CYBERNETICS 2021; 51:4112-4124. [PMID: 30932858 DOI: 10.1109/tcyb.2019.2903591] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
In conventional aurora image classification methods, it is general to employ only one single feature representation to capture the morphological characteristics of aurora images, which is difficult to describe the complicated morphologies of different aurora categories. Although several studies have proposed to use multiple feature representations, the inherent correlation among these representations are usually neglected. To address this problem, we propose a group-wise learning (GWL) method for the automatic aurora image classification using multiple representations. Specifically, we first extract the multiple feature representations for aurora images, and then construct a graph in each of multiple feature spaces. To model the correlation among different representations, we partition multiple graphs into several groups via a clustering algorithm. We further propose a GWL model to automatically estimate class labels for aurora images and optimal weights for the multiple representations in a data-driven manner. Finally, we develop a label fusion approach to make a final classification decision for new testing samples. The proposed GWL method focuses on the diverse properties of multiple feature representations, by clustering the correlated representations into the same group. We evaluate our method on an aurora image data set that contains 12 682 aurora images from 19 days. The experimental results demonstrate that the proposed GWL method achieves approximately 6% improvement in terms of classification accuracy, compared to the methods using a single feature representation.
Collapse
|
31
|
|
32
|
Singh Malan N, Sharma S. Time window and frequency band optimization using regularized neighbourhood component analysis for Multi-View Motor Imagery EEG classification. Biomed Signal Process Control 2021. [DOI: 10.1016/j.bspc.2021.102550] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
33
|
Wu X, Xu X, Liu J, Wang H, Hu B, Nie F. Supervised Feature Selection With Orthogonal Regression and Feature Weighting. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2021; 32:1831-1838. [PMID: 32406845 DOI: 10.1109/tnnls.2020.2991336] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Effective features can improve the performance of a model and help us understand the characteristics and underlying structure of complex data. Previously proposed feature selection methods usually cannot retain more discriminative information. To address this shortcoming, we propose a novel supervised orthogonal least square regression model with feature weighting for feature selection. The optimization problem of the objective function can be solved by employing generalized power iteration and augmented Lagrangian multiplier methods. Experimental results show that the proposed method can more effectively reduce feature dimensionality and obtain better classification results than traditional feature selection methods. The convergence of our iterative method is also proved. Consequently, the effectiveness and superiority of the proposed method are verified both theoretically and experimentally.
Collapse
|
34
|
Bhatti YK, Jamil A, Nida N, Yousaf MH, Viriri S, Velastin SA. Facial Expression Recognition of Instructor Using Deep Features and Extreme Learning Machine. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2021; 2021:5570870. [PMID: 34007266 PMCID: PMC8110428 DOI: 10.1155/2021/5570870] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/17/2021] [Revised: 02/22/2021] [Accepted: 04/12/2021] [Indexed: 11/25/2022]
Abstract
Classroom communication involves teacher's behavior and student's responses. Extensive research has been done on the analysis of student's facial expressions, but the impact of instructor's facial expressions is yet an unexplored area of research. Facial expression recognition has the potential to predict the impact of teacher's emotions in a classroom environment. Intelligent assessment of instructor behavior during lecture delivery not only might improve the learning environment but also could save time and resources utilized in manual assessment strategies. To address the issue of manual assessment, we propose an instructor's facial expression recognition approach within a classroom using a feedforward learning model. First, the face is detected from the acquired lecture videos and key frames are selected, discarding all the redundant frames for effective high-level feature extraction. Then, deep features are extracted using multiple convolution neural networks along with parameter tuning which are then fed to a classifier. For fast learning and good generalization of the algorithm, a regularized extreme learning machine (RELM) classifier is employed which classifies five different expressions of the instructor within the classroom. Experiments are conducted on a newly created instructor's facial expression dataset in classroom environments plus three benchmark facial datasets, i.e., Cohn-Kanade, the Japanese Female Facial Expression (JAFFE) dataset, and the Facial Expression Recognition 2013 (FER2013) dataset. Furthermore, the proposed method is compared with state-of-the-art techniques, traditional classifiers, and convolutional neural models. Experimentation results indicate significant performance gain on parameters such as accuracy, F1-score, and recall.
Collapse
Affiliation(s)
- Yusra Khalid Bhatti
- Department of Computer Engineering, University of Engineering and Technology, Taxila, Pakistan
| | - Afshan Jamil
- Department of Computer Engineering, University of Engineering and Technology, Taxila, Pakistan
| | - Nudrat Nida
- Department of Computer Engineering, University of Engineering and Technology, Taxila, Pakistan
| | - Muhammad Haroon Yousaf
- Department of Computer Engineering, University of Engineering and Technology, Taxila, Pakistan
- Swarm Robotics Lab, National Centre for Robotics and Automation (NCRA), Rawalpindi, Pakistan
| | - Serestina Viriri
- Department of Computer Science, University of Kwazulu Natal, Durban, South Africa
| | - Sergio A. Velastin
- School of Electronic Engineering and Computer Science, Queen Mary University of London, London E1 4NS, UK
- Department of Computer Science and Engineering, Universidad Carlos III de Madrid, Leganés, Madrid 28911, Spain
| |
Collapse
|
35
|
Zhao D, Gao Q, Lu Y, Sun D. Two-step multi-view and multi-label learning with missing label via subspace learning. Appl Soft Comput 2021. [DOI: 10.1016/j.asoc.2021.107120] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
36
|
Zhao D, Gao Q, Lu Y, Sun D, Cheng Y. Consistency and diversity neural network multi-view multi-label learning. Knowl Based Syst 2021. [DOI: 10.1016/j.knosys.2021.106841] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
37
|
Abstract
This paper proposes different classification algorithms—logistic regression, support vector machine, K-nearest neighbors, and random forest—in order to identify which candidates are likely to default for a credit scoring model. Three different feature selection methods are used in order to mitigate the overfitting in the curse of dimensionality of these classification algorithms: one filter method (Chi-squared test and correlation coefficients) and two wrapper methods (forward stepwise selection and backward stepwise selection). The performances of these three methods are discussed using two measures, the mean absolute error and the number of selected features. The methodology is applied for a valuable database of Taiwan. The results suggest that forward stepwise selection yields superior performance in each one of the classification algorithms used. The conclusions obtained are related to those in the literature, and their managerial implications are analyzed.
Collapse
|
38
|
Tao H, Hou C, Yi D, Zhu J, Hu D. Joint Embedding Learning and Low-Rank Approximation: A Framework for Incomplete Multiview Learning. IEEE TRANSACTIONS ON CYBERNETICS 2021; 51:1690-1703. [PMID: 31804950 DOI: 10.1109/tcyb.2019.2953564] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
In real-world applications, not all instances in the multiview data are fully represented. To deal with incomplete data, incomplete multiview learning (IML) rises. In this article, we propose the joint embedding learning and low-rank approximation (JELLA) framework for IML. The JELLA framework approximates the incomplete data by a set of low-rank matrices and learns a full and common embedding by linear transformation. Several existing IML methods can be unified as special cases of the framework. More interestingly, some linear transformation-based complete multiview methods can be adapted to IML directly with the guidance of the framework. Thus, the JELLA framework improves the efficiency of processing incomplete multiview data, and bridges the gap between complete multiview learning and IML. Moreover, the JELLA framework can provide guidance for developing new algorithms. For illustration, within the framework, we propose the IML with the block-diagonal representation (IML-BDR) method. Assuming that the sampled examples have an approximate linear subspace structure, IML-BDR uses the block-diagonal structure prior to learning the full embedding, which would lead to more correct clustering. A convergent alternating iterative algorithm with the successive over-relaxation optimization technique is devised for optimization. The experimental results on various datasets demonstrate the effectiveness of IML-BDR.
Collapse
|
39
|
Kuang Z, Zhang X, Yu J, Li Z, Fan J. Deep embedding of concept ontology for hierarchical fashion recognition. Neurocomputing 2021. [DOI: 10.1016/j.neucom.2020.04.085] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
40
|
Multi-view generalized support vector machine via mining the inherent relationship between views with applications to face and fire smoke recognition. Knowl Based Syst 2020. [DOI: 10.1016/j.knosys.2020.106488] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
41
|
Convolutional Sparse Coded Dynamic Brain Functional Connectivity. Neural Process Lett 2020. [DOI: 10.1007/s11063-020-10295-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
42
|
|
43
|
Jin M. Achievements analysis of mooc English course based on fuzzy statistics and neural network clustering. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2020. [DOI: 10.3233/jifs-189037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
At present, the field of natural language will also introduce in-depth learning, using the concept of word vector, so that the neural network can also complete the work in the field of statistics. It can be said that the neural network has begun to show its advantages in the field of natural language processing. In this paper, the author analyzes the multimedia English course based on fuzzy statistics and neural network clustering. Different factors were classified, and scores were classified according to the number of characteristics of different categories. It can be seen that with the popularization of the Internet, MOOC teaching meets the requirements of the current college English curriculum, is a breakthrough in the traditional teaching mode, improves students’ participation, and enables students to learn independently. It not only conforms to the characteristics of College students, but also improves their learning effect. In the automatic scoring stage, the quantitative text features are extracted by the feature extractor in the pre-processing stage, and then the weights of network connections obtained in the training stage are used to score the weights comprehensively. This model can better reflect students’ autonomous learning ability and language application ability.
Collapse
Affiliation(s)
- Meichen Jin
- School of Foreign Language, University of Science and Technology Liaoning, Liaoning, China
| |
Collapse
|
44
|
|
45
|
|
46
|
Xiaofeng D. Application of deep learning and artificial intelligence algorithm in multimedia music teaching. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2020. [DOI: 10.3233/jifs-179800] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Affiliation(s)
- Du Xiaofeng
- Shandong University of Arts, Jinan, Shandong, China
| |
Collapse
|
47
|
Du T, Wen G, Cai Z, Zheng W, Tan M, Li Y. Spectral clustering algorithm combining local covariance matrix with normalization. Neural Comput Appl 2020. [DOI: 10.1007/s00521-018-3852-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
48
|
Gao L, Cao L, Xu X, Shao J, Song J. Question-Led object attention for visual question answering. Neurocomputing 2020. [DOI: 10.1016/j.neucom.2018.11.102] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
49
|
|
50
|
A new graph-preserving unsupervised feature selection embedding LLE with low-rank constraint and feature-level representation. Artif Intell Rev 2020. [DOI: 10.1007/s10462-019-09749-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|