1
|
He P, Xu X, Chen S. Robust Supervised Spline Embedding. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2025; 36:6829-6842. [PMID: 38870004 DOI: 10.1109/tnnls.2024.3409394] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2024]
Abstract
High-dimensional data present significant challenges such as inadequate sample size, abundance of noise, and the curse of dimensionality, which make many traditional classification algorithms inapplicable. To provide valid inference for such data, it requires finding a noise-free low-dimensional representation that preserves both the underlying manifold structure and discriminative information. However, the existing methods often fail to take full consideration of these requirements. In this article, we introduce a robust supervised spline embedding (RS2E) algorithm for high-dimensional classification. The proposed approach is highlighted in four aspects: 1) it preserves the class-aware submanifold structure in the thin plate spline embedding space; 2) it eliminates noise and outliers to recover the clean manifold by exploiting its intrinsic low complexity; 3) it separates the class-aware submanifolds by maximizing the distance between each data point and the marginal data points of other class-aware submanifolds; and 4) it applies the alternating direction method of multipliers with generalized power iteration to solve the objective function. Promising experimental results on the real-world, generative adversarial network (GAN)-generated and artificially corrupted datasets demonstrate that RS2E outperforms other supervised dimensionality reduction algorithms in terms of classification accuracy.
Collapse
|
2
|
Wang B, Chen M, Li X. Robust Subcluster Search and Mergence Clustering. IEEE TRANSACTIONS ON CYBERNETICS 2024; 54:7616-7628. [PMID: 39231063 DOI: 10.1109/tcyb.2024.3446764] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/06/2024]
Abstract
In recent years, graph-based clustering presents outstanding performance and has been widely investigated. It segments the data similarity graph into multiple subgraphs as final clusters. Many methods integrate graph learning and segmentation into a unified optimization problem to explore the graph structure. However, existing research 1) attempts to derive the final clusters from the learned graph directly, which relies on a highly tight internal distribution within each cluster, and is too strict for the real-world data; 2) generally constructs a holistic full sample graph, which means the outliers are involved in graph learning explicitly, and may corrupt the graph quality. To overcome the above limitations, a new clustering model called robust subcluster search and mergence (RSSM) is established in this article. Inspired by the positive-incentive noise (Pi-Noise), RSSM assumes that the outliers are useful for learning the data structure. Considering a few samples with large errors as outliers, RSSM finds the subcentroids by searching an imbalanced residue distribution. In this way, the subcentroids pull the normal samples together and push the outliers far away. Compared with the traditional clusters, the subclusters indicated by the subcentroids are more explicit, where the normal samples are tightly connected. After that, a subcluster similarity graph is constructed to guide the mergence of subclusters. To sum up, RSSM performs the search and mergence of subclusters simultaneously with the help of outliers, and generates a graph that is more suitable for clustering. Experiments on several datasets demonstrate the rationality and superiority of RSSM.
Collapse
|
3
|
Liu Z, Yang J, Zhong X, Wang W, Chen H, Chang Y. A Novel Composite Graph Neural Network. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:13411-13425. [PMID: 37200114 DOI: 10.1109/tnnls.2023.3268766] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]
Abstract
Graph neural networks (GNNs) have achieved great success in many fields due to their powerful capabilities of processing graph-structured data. However, most GNNs can only be applied to scenarios where graphs are known, but real-world data are often noisy or even do not have available graph structures. Recently, graph learning has attracted increasing attention in dealing with these problems. In this article, we develop a novel approach to improving the robustness of the GNNs, called composite GNN. Different from existing methods, our method uses composite graphs (C-graphs) to characterize both sample and feature relations. The C-graph is a unified graph that unifies these two kinds of relations, where edges between samples represent sample similarities, and each sample has a tree-based feature graph to model feature importance and combination preference. By jointly learning multiaspect C-graphs and neural network parameters, our method improves the performance of semisupervised node classification and ensures robustness. We conduct a series of experiments to evaluate the performance of our method and the variants of our method that only learn sample relations or feature relations. Extensive experimental results on nine benchmark datasets demonstrate that our proposed method achieves the best performance on almost all the datasets and is robust to feature noises.
Collapse
|
4
|
Li J, Shyr Y, Liu Q. aKNNO: single-cell and spatial transcriptomics clustering with an optimized adaptive k-nearest neighbor graph. Genome Biol 2024; 25:203. [PMID: 39090647 PMCID: PMC11293182 DOI: 10.1186/s13059-024-03339-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2023] [Accepted: 07/16/2024] [Indexed: 08/04/2024] Open
Abstract
Typical clustering methods for single-cell and spatial transcriptomics struggle to identify rare cell types, while approaches tailored to detect rare cell types gain this ability at the cost of poorer performance for grouping abundant ones. Here, we develop aKNNO to simultaneously identify abundant and rare cell types based on an adaptive k-nearest neighbor graph with optimization. Benchmarking on 38 simulated and 20 single-cell and spatial transcriptomics datasets demonstrates that aKNNO identifies both abundant and rare cell types more accurately than general and specialized methods. Using only gene expression aKNNO maps abundant and rare cells more precisely compared to integrative approaches.
Collapse
Affiliation(s)
- Jia Li
- Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN, 37203, USA
- Center for Quantitative Sciences, Vanderbilt University Medical Center, Nashville, TN, 37203, USA
| | - Yu Shyr
- Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN, 37203, USA.
- Center for Quantitative Sciences, Vanderbilt University Medical Center, Nashville, TN, 37203, USA.
| | - Qi Liu
- Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN, 37203, USA.
- Center for Quantitative Sciences, Vanderbilt University Medical Center, Nashville, TN, 37203, USA.
| |
Collapse
|
5
|
Chen Y, Zhao YP, Wang S, Chen J, Zhang Z. Partial Tubal Nuclear Norm-Regularized Multiview Subspace Learning. IEEE TRANSACTIONS ON CYBERNETICS 2024; 54:3777-3790. [PMID: 37058384 DOI: 10.1109/tcyb.2023.3263175] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
In this article, a unified multiview subspace learning model, called partial tubal nuclear norm-regularized multiview subspace learning (PTN2MSL), was proposed for unsupervised multiview subspace clustering (MVSC), semisupervised MVSC, and multiview dimension reduction. Unlike most of the existing methods which treat the above three related tasks independently, PTN2MSL integrates the projection learning and the low-rank tensor representation to promote each other and mine their underlying correlations. Moreover, instead of minimizing the tensor nuclear norm which treats all singular values equally and neglects their differences, PTN2MSL develops the partial tubal nuclear norm (PTNN) as a better alternative solution by minimizing the partial sum of tubal singular values. The PTN2MSL method was applied to the above three multiview subspace learning tasks. It demonstrated that these tasks organically benefited from each other and PTN2MSL has achieved better performance in comparison to state-of-the-art methods.
Collapse
|
6
|
Liang Y, Huang D, Wang CD, Yu PS. Multi-View Graph Learning by Joint Modeling of Consistency and Inconsistency. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:2848-2862. [PMID: 35895654 DOI: 10.1109/tnnls.2022.3192445] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Graph learning has emerged as a promising technique for multi-view clustering due to its ability to learn a unified and robust graph from multiple views. However, existing graph learning methods mostly focus on the multi-view consistency issue, yet often neglect the inconsistency between views, which makes them vulnerable to possibly low-quality or noisy datasets. To overcome this limitation, we propose a new multi-view graph learning framework, which for the first time simultaneously and explicitly models multi-view consistency and inconsistency in a unified objective function, through which the consistent and inconsistent parts of each single-view graph as well as the unified graph that fuses the consistent parts can be iteratively learned. Though optimizing the objective function is NP-hard, we design a highly efficient optimization algorithm that can obtain an approximate solution with linear time complexity in the number of edges in the unified graph. Furthermore, our multi-view graph learning approach can be applied to both similarity graphs and dissimilarity graphs, which lead to two graph fusion-based variants in our framework. Experiments on 12 multi-view datasets have demonstrated the robustness and efficiency of the proposed approach. The code is available at https://github.com/youweiliang/Multi-view_Graph_Learning.
Collapse
|
7
|
Li J, Shyr Y, Liu Q. Single-cell and Spatial Transcriptomics Clustering with an Optimized Adaptive K-Nearest Neighbor Graph. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.13.562261. [PMID: 37905097 PMCID: PMC10614787 DOI: 10.1101/2023.10.13.562261] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/02/2023]
Abstract
Single-cell and spatial transcriptomics have been widely used to characterize cellular landscape in complex tissues. To understand cellular heterogeneity, one essential step is to define cell types through unsupervised clustering. While typical clustering methods have difficulty in identifying rare cell types, approaches specifically tailored to detect rare cell types gain their ability at the cost of poorer performance for grouping abundant ones. Here, we developed aKNNO, a method to identify abundant and rare cell types simultaneously based on an adaptive k-nearest neighbor graph with optimization. Benchmarked on 38 simulated and 20 single-cell and spatial transcriptomics datasets, aKNNO identified both abundant and rare cell types accurately. Without sacrificing performance for clustering abundant cell types, aKNNO discovered known and novel rare cell types that those typical and even specifically tailored methods failed to detect. aKNNO, using transcriptome alone, stereotyped fine-grained anatomical structures more precisely than those integrative approaches combining expression with spatial locations and histology image.
Collapse
Affiliation(s)
- Jia Li
- Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN, 37203, USA
- Center for Quantitative Sciences, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
| | - Yu Shyr
- Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN, 37203, USA
- Center for Quantitative Sciences, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
| | - Qi Liu
- Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN, 37203, USA
- Center for Quantitative Sciences, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
| |
Collapse
|
8
|
Cembrowska-Lech D, Krzemińska A, Miller T, Nowakowska A, Adamski C, Radaczyńska M, Mikiciuk G, Mikiciuk M. An Integrated Multi-Omics and Artificial Intelligence Framework for Advance Plant Phenotyping in Horticulture. BIOLOGY 2023; 12:1298. [PMID: 37887008 PMCID: PMC10603917 DOI: 10.3390/biology12101298] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Revised: 09/27/2023] [Accepted: 09/28/2023] [Indexed: 10/28/2023]
Abstract
This review discusses the transformative potential of integrating multi-omics data and artificial intelligence (AI) in advancing horticultural research, specifically plant phenotyping. The traditional methods of plant phenotyping, while valuable, are limited in their ability to capture the complexity of plant biology. The advent of (meta-)genomics, (meta-)transcriptomics, proteomics, and metabolomics has provided an opportunity for a more comprehensive analysis. AI and machine learning (ML) techniques can effectively handle the complexity and volume of multi-omics data, providing meaningful interpretations and predictions. Reflecting the multidisciplinary nature of this area of research, in this review, readers will find a collection of state-of-the-art solutions that are key to the integration of multi-omics data and AI for phenotyping experiments in horticulture, including experimental design considerations with several technical and non-technical challenges, which are discussed along with potential solutions. The future prospects of this integration include precision horticulture, predictive breeding, improved disease and stress response management, sustainable crop management, and exploration of plant biodiversity. The integration of multi-omics and AI holds immense promise for revolutionizing horticultural research and applications, heralding a new era in plant phenotyping.
Collapse
Affiliation(s)
- Danuta Cembrowska-Lech
- Department of Physiology and Biochemistry, Institute of Biology, University of Szczecin, Felczaka 3c, 71-412 Szczecin, Poland;
- Polish Society of Bioinformatics and Data Science BIODATA, Popiełuszki 4c, 71-214 Szczecin, Poland; (A.K.); (T.M.)
| | - Adrianna Krzemińska
- Polish Society of Bioinformatics and Data Science BIODATA, Popiełuszki 4c, 71-214 Szczecin, Poland; (A.K.); (T.M.)
- Institute of Biology, University of Szczecin, Wąska 13, 71-415 Szczecin, Poland;
| | - Tymoteusz Miller
- Polish Society of Bioinformatics and Data Science BIODATA, Popiełuszki 4c, 71-214 Szczecin, Poland; (A.K.); (T.M.)
- Institute of Marine and Environmental Sciences, University of Szczecin, Wąska 13, 71-415 Szczecin, Poland
| | - Anna Nowakowska
- Department of Physiology and Biochemistry, Institute of Biology, University of Szczecin, Felczaka 3c, 71-412 Szczecin, Poland;
| | - Cezary Adamski
- Institute of Biology, University of Szczecin, Wąska 13, 71-415 Szczecin, Poland;
| | | | - Grzegorz Mikiciuk
- Department of Horticulture, Faculty of Environmental Management and Agriculture, West Pomeranian University of Technology in Szczecin, Słowackiego 17, 71-434 Szczecin, Poland;
| | - Małgorzata Mikiciuk
- Department of Bioengineering, Faculty of Environmental Management and Agriculture, West Pomeranian University of Technology in Szczecin, Słowackiego 17, 71-434 Szczecin, Poland;
| |
Collapse
|
9
|
Liu Z, Pan L, Chen G. Link-Information Augmented Twin Autoencoders for Network Denoising. IEEE TRANSACTIONS ON CYBERNETICS 2023; 53:5585-5595. [PMID: 35358055 DOI: 10.1109/tcyb.2022.3160470] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Removing noisy links from an observed network is a task commonly required for preprocessing real-world network data. However, containing both noisy and clean links, the observed network cannot be treated as a trustworthy information source for supervised learning. Therefore, it is necessary but also technically challenging to detect noisy links in the context of data contamination. To address this issue, in the present article, a two-phased computational model is proposed, called link-information augmented twin autoencoders, which is able to deal with: 1) link information augmentation; 2) link-level contrastive denoising; 3) link information correction. Extensive experiments on six real-world networks verify that the proposed model outperforms other comparable methods in removing noisy links from the observed network so as to recover the real network from the corrupted one very accurately. Extended analyses also provide interpretable evidence to support the superiority of the proposed model for the task of network denoising.
Collapse
|
10
|
Yu Y, Liu B, Du S, Song J, Zhang K. Semi-supervised Multi-view Clustering Based on Non-negative Matrix Factorization and Low-Rank Tensor Representation. Neural Process Lett 2023. [DOI: 10.1007/s11063-023-11260-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/08/2023]
|
11
|
Liu BY, Huang L, Wang CD, Lai JH, Yu PS. Multiview Clustering via Proximity Learning in Latent Representation Space. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2023; 34:973-986. [PMID: 34432638 DOI: 10.1109/tnnls.2021.3104846] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Most existing multiview clustering methods are based on the original feature space. However, the feature redundancy and noise in the original feature space limit their clustering performance. Aiming at addressing this problem, some multiview clustering methods learn the latent data representation linearly, while performance may decline if the relation between the latent data representation and the original data is nonlinear. The other methods which nonlinearly learn the latent data representation usually conduct the latent representation learning and clustering separately, resulting in that the latent data representation might be not well adapted to clustering. Furthermore, none of them model the intercluster relation and intracluster correlation of data points, which limits the quality of the learned latent data representation and therefore influences the clustering performance. To solve these problems, this article proposes a novel multiview clustering method via proximity learning in latent representation space, named multiview latent proximity learning (MLPL). For one thing, MLPL learns the latent data representation in a nonlinear manner which takes the intercluster relation and intracluster correlation into consideration simultaneously. For another, through conducting the latent representation learning and consensus proximity learning simultaneously, MLPL learns a consensus proximity matrix with k connected components to output the clustering result directly. Extensive experiments are conducted on seven real-world datasets to demonstrate the effectiveness and superiority of the MLPL method compared with the state-of-the-art multiview clustering methods.
Collapse
|
12
|
Spacecraft anomaly detection with attention temporal convolution networks. Neural Comput Appl 2023. [DOI: 10.1007/s00521-023-08213-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
|
13
|
Ros F, Riad R, Guillaume S. PDBI: a Partitioning Davies-Bouldin Index for Clustering Evaluation. Neurocomputing 2023. [DOI: 10.1016/j.neucom.2023.01.043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
|
14
|
Zhou S, Ou Q, Liu X, Wang S, Liu L, Wang S, Zhu E, Yin J, Xu X. Multiple Kernel Clustering With Compressed Subspace Alignment. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2023; 34:252-263. [PMID: 34242173 DOI: 10.1109/tnnls.2021.3093426] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Multiple kernel clustering (MKC) has recently achieved remarkable progress in fusing multisource information to boost the clustering performance. However, the O(n2) memory consumption and O(n3) computational complexity prohibit these methods from being applied into median- or large-scale applications, where n denotes the number of samples. To address these issues, we carefully redesign the formulation of subspace segmentation-based MKC, which reduces the memory and computational complexity to O(n) and O(n2) , respectively. The proposed algorithm adopts a novel sampling strategy to enhance the performance and accelerate the speed of MKC. Specifically, we first mathematically model the sampling process and then learn it simultaneously during the procedure of information fusion. By this way, the generated anchor point set can better serve data reconstruction across different views, leading to improved discriminative capability of the reconstruction matrix and boosted clustering performance. Although the integrated sampling process makes the proposed algorithm less efficient than the linear complexity algorithms, the elaborate formulation makes our algorithm straightforward for parallelization. Through the acceleration of GPU and multicore techniques, our algorithm achieves superior performance against the compared state-of-the-art methods on six datasets with comparable time cost to the linear complexity algorithms.
Collapse
|
15
|
Guo L, Zhang X, Zhang R, Wang Q, Xue X, Liu Z. Robust graph representation clustering based on adaptive data correction. APPL INTELL 2022. [DOI: 10.1007/s10489-022-04268-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
|
16
|
Asymmetric and robust loss function driven least squares support vector machine. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.109990] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
17
|
Wu D, Nie F, Dong X, Wang R, Li X. Parameter-Free Consensus Embedding Learning for Multiview Graph-Based Clustering. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2022; 33:7944-7950. [PMID: 34185650 DOI: 10.1109/tnnls.2021.3087162] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Finding a consensus embedding from multiple views is the mainstream task in multiview graph-based clustering, in which the key problem is to handle the inconsistence among multiple views. In this article, we consider clustering effectiveness and practical applicability collectively, and propose a parameter-free model to alleviate the inconsistence of multiple views cleverly. To be specific, the proposed model considers the diversities of multiple views as two-layers. The first layer considers the inconsistence among different features of each view and the second layer considers linking the preembeddings of multiple views attentively. By this way, a consensus embedding can be learned via kernel method effectively and the whole learning procedure is parameter-free. To solve the optimization problem involved in the proposed model, we propose an alternative algorithm which is efficient and easy to implement in practice. In the experiments, we evaluate the proposed model on synthetic and real datasets and the experimental results demonstrate its effectiveness.
Collapse
|
18
|
Eybpoosh K, Rezghi M, Heydari A. A novel conformal deformation based sparse subspace clustering. INT J MACH LEARN CYB 2022. [DOI: 10.1007/s13042-022-01712-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
19
|
Roshanfekr B, Amirmazlaghani M, Rahmati M. Learning graph from graph signals: An approach based on sensitivity analysis over a deep learning framework. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.110159] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
20
|
Wang L, Geng X. The Real Eigenpairs of Symmetric Tensors and Its Application to Independent Component Analysis. IEEE TRANSACTIONS ON CYBERNETICS 2022; 52:10137-10150. [PMID: 33750718 DOI: 10.1109/tcyb.2021.3055238] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
It has been proved that the determination of independent components (ICs) in the independent component analysis (ICA) can be attributed to calculating the eigenpairs of high-order statistical tensors of the data. However, previous works can only obtain approximate solutions, which may affect the accuracy of the ICs. In addition, the number of ICs would need to be set manually. Recently, an algorithm based on semidefinite programming (SDP) has been proposed, which utilizes the first-order gradient information of the Lagrangian function and can obtain all the accurate real eigenpairs. In this article, for the first time, we introduce this into the ICA field, which tends to further improve the accuracy of the ICs. Note that the number of eigenpairs of symmetric tensors is usually larger than the number of ICs, indicating that the results directly obtained by SDP are redundant. Thus, in practice, it is necessary to introduce second-order derivative information to identify local extremum solutions. Therefore, originating from the SDP method, we present a new modified version, called modified SDP (MSDP), which incorporates the concept of the projected Hessian matrix into SDP and, thus, can intellectually exclude redundant ICs and select true ICs. Some cases that have been tested in the experiments demonstrate its effectiveness. Experiments on the image/sound blind separation and real multi/hyperspectral image also show its superiority in improving the accuracy of ICs and automatically determining the number of ICs. In addition, the results on hyperspectral simulation and real data also demonstrate that MSDP is also capable of dealing with cases, where the number of features is less than the number of ICs.
Collapse
|
21
|
|
22
|
He J, Chen H, Li T, Wan J. Multi-view latent structure learning with rank recovery. APPL INTELL 2022. [DOI: 10.1007/s10489-022-04141-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
23
|
JSMix: a holistic algorithm for learning with label noise. Neural Comput Appl 2022. [DOI: 10.1007/s00521-022-07770-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
24
|
|
25
|
Kang Z, Lin Z, Zhu X, Xu W. Structured Graph Learning for Scalable Subspace Clustering: From Single View to Multiview. IEEE TRANSACTIONS ON CYBERNETICS 2022; 52:8976-8986. [PMID: 33729977 DOI: 10.1109/tcyb.2021.3061660] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Graph-based subspace clustering methods have exhibited promising performance. However, they still suffer some of these drawbacks: they encounter the expensive time overhead, they fail to explore the explicit clusters, and cannot generalize to unseen data points. In this work, we propose a scalable graph learning framework, seeking to address the above three challenges simultaneously. Specifically, it is based on the ideas of anchor points and bipartite graph. Rather than building an n×n graph, where n is the number of samples, we construct a bipartite graph to depict the relationship between samples and anchor points. Meanwhile, a connectivity constraint is employed to ensure that the connected components indicate clusters directly. We further establish the connection between our method and the K -means clustering. Moreover, a model to process multiview data is also proposed, which is linearly scaled with respect to n . Extensive experiments demonstrate the efficiency and effectiveness of our approach with respect to many state-of-the-art clustering methods.
Collapse
|
26
|
Chen Y, Xiao X, Hua Z, Zhou Y. Adaptive Transition Probability Matrix Learning for Multiview Spectral Clustering. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2022; 33:4712-4726. [PMID: 33651701 DOI: 10.1109/tnnls.2021.3059874] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Multiview clustering as an important unsupervised method has been gathering a great deal of attention. However, most multiview clustering methods exploit the self-representation property to capture the relationship among data, resulting in high computation cost in calculating the self-representation coefficients. In addition, they usually employ different regularizers to learn the representation tensor or matrix from which a transition probability matrix is constructed in a separate step, such as the one proposed by Wu et al.. Thus, an optimal transition probability matrix cannot be guaranteed. To solve these issues, we propose a unified model for multiview spectral clustering by directly learning an adaptive transition probability matrix (MCA2M), rather than an individual representation matrix of each view. Different from the one proposed by Wu et al., MCA2M utilizes the one-step strategy to directly learn the transition probability matrix under the robust principal component analysis framework. Unlike existing methods using the absolute symmetrization operation to guarantee the nonnegativity and symmetry of the affinity matrix, the transition probability matrix learned from MCA2M is nonnegative and symmetric without any postprocessing. An alternating optimization algorithm is designed based on the efficient alternating direction method of multipliers. Extensive experiments on several real-world databases demonstrate that the proposed method outperforms the state-of-the-art methods.
Collapse
|
27
|
Li J, Tao Z, Wu Y, Zhong B, Fu Y. Large-Scale Subspace Clustering by Independent Distributed and Parallel Coding. IEEE TRANSACTIONS ON CYBERNETICS 2022; 52:9090-9100. [PMID: 33635812 DOI: 10.1109/tcyb.2021.3052056] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Subspace clustering is a popular method to discover underlying low-dimensional structures of high-dimensional multimedia data (e.g., images, videos, and texts). In this article, we consider a large-scale subspace clustering (LS2C) problem, that is, partitioning million data points with a millon dimensions. To address this, we explore an independent distributed and parallel framework by dividing big data/variable matrices and regularization by both columns and rows. Specifically, LS2C is independently decomposed into many subproblems by distributing those matrices into different machines by columns since the regularization of the code matrix is equal to a sum of that of its submatrices (e.g., square-of-Frobenius/ l1 -norm). Consensus optimization is designed to solve these subproblems in a parallel way for saving communication costs. Moreover, we provide theoretical guarantees that LS2C can recover consensus subspace representations of high-dimensional data points under broad conditions. Compared with the state-of-the-art LS2C methods, our approach achieves better clustering results in public datasets, including a million images and videos.
Collapse
|
28
|
Fuzzy Support Vector Machine with Graph for Classifying Imbalanced Datasets. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2022.09.139] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
29
|
Lai J, Chen H, Li W, Li T, Wan J. Semi-supervised feature selection via adaptive structure learning and constrained graph learning. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.109243] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
30
|
Self-Adaptive Clustering of Dynamic Multi-Graph Learning. Neural Process Lett 2022. [DOI: 10.1007/s11063-020-10405-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
31
|
Shi M, Tang Y, Zhu X, Zhuang Y, Lin M, Liu J. Feature-Attention Graph Convolutional Networks for Noise Resilient Learning. IEEE TRANSACTIONS ON CYBERNETICS 2022; 52:7719-7731. [PMID: 35104237 DOI: 10.1109/tcyb.2022.3143798] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Noise and inconsistency commonly exist in real-world information networks, due to the inherent error-prone nature of human or user privacy concerns. To date, tremendous efforts have been made to advance feature learning from networks, including the most recent graph convolutional networks (GCNs) or attention GCN, by integrating node content and topology structures. However, all existing methods consider networks as error-free sources and treat feature content in each node as independent and equally important to model node relations. Noisy node content, combined with sparse features, provides essential challenges for existing methods to be used in real-world noisy networks. In this article, we propose feature-based attention GCN (FA-GCN), a feature-attention graph convolution learning framework, to handle networks with noisy and sparse node content. To tackle noise and sparse content in each node, FA-GCN first employs a long short-term memory (LSTM) network to learn dense representation for each node feature. To model interactions between neighboring nodes, a feature-attention mechanism is introduced to allow neighboring nodes to learn and vary feature importance, with respect to their connections. By using a spectral-based graph convolution aggregation process, each node is allowed to concentrate more on the most determining neighborhood features aligned with the corresponding learning task. Experiments and validations, w.r.t. different noise levels, demonstrate that FA-GCN achieves better performance than the state-of-the-art methods in both noise-free and noisy network environments.
Collapse
|
32
|
Semi-supervised Learning with Graph Convolutional Networks Based on Hypergraph. Neural Process Lett 2022. [DOI: 10.1007/s11063-021-10487-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
33
|
Wang Y, Gao C, Zhou J. Geometrical structure preservation joint with self-expression maintenance for adaptive graph learning. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2022.06.045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
34
|
Research for an Adaptive Classifier Based on Dynamic Graph Learning. Neural Process Lett 2022. [DOI: 10.1007/s11063-021-10452-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
35
|
Guo W, Wang Z, Du W. Pseudolabel‐guided multiview consensus graph learning for semisupervised classification. INT J INTELL SYST 2022. [DOI: 10.1002/int.22958] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Affiliation(s)
- Wei Guo
- Key Laboratory of Smart Manufacturing in Energy Chemical Process Ministry of Education, East China University of Science and Technology Shanghai China
- Department of Computer Science and Engineering East China University of Science and Technology Shanghai China
| | - Zhe Wang
- Key Laboratory of Smart Manufacturing in Energy Chemical Process Ministry of Education, East China University of Science and Technology Shanghai China
- Department of Computer Science and Engineering East China University of Science and Technology Shanghai China
| | - Wenli Du
- Key Laboratory of Smart Manufacturing in Energy Chemical Process Ministry of Education, East China University of Science and Technology Shanghai China
| |
Collapse
|
36
|
Huang D, Hu J, Li T, Du S, Chen H. Consistency regularization for deep semi-supervised clustering with pairwise constraints. INT J MACH LEARN CYB 2022. [DOI: 10.1007/s13042-022-01599-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
|
37
|
Yang B, Wu J, Sun A, Gao N, Zhang X. Robust landmark graph-based clustering for high-dimensional data. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2022.05.011] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
38
|
Cai Y, Huang JZ, Yin J. A new method to build the adaptive k-nearest neighbors similarity graph matrix for spectral clustering. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2022.04.030] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
39
|
A Clustering Ensemble Framework with Integration of Data Characteristics and Structure Information: A Graph Neural Networks Approach. MATHEMATICS 2022. [DOI: 10.3390/math10111834] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Clustering ensemble is a research hotspot of data mining that aggregates several base clustering results to generate a single output clustering with improved robustness and stability. However, the validity of the ensemble result is usually affected by unreliability in the generation and integration of base clusterings. In order to address this issue, we develop a clustering ensemble framework viewed from graph neural networks that generates an ensemble result by integrating data characteristics and structure information. In this framework, we extract structure information from base clustering results of the data set by using a coupling affinity measure After that, we combine structure information with data characteristics by using a graph neural network (GNN) to learn their joint embeddings in latent space. Then, we employ a Gaussian mixture model (GMM) to predict the final cluster assignment in the latent space. Finally, we construct the GNN and GMM as a unified optimization model to integrate the objectives of graph embedding and consensus clustering. Our framework can not only elegantly combine information in feature space and structure space, but can also achieve suitable representations for final cluster partitioning. Thus, it can produce an outstanding result. Experimental results on six synthetic benchmark data sets and six real world data sets show that the proposed framework yields a better performance compared to 12 reference algorithms that are developed based on either clustering ensemble architecture or a deep clustering strategy.
Collapse
|
40
|
Liu Z, Jin W, Mu Y. Learning robust graph for clustering. INT J INTELL SYST 2022. [DOI: 10.1002/int.22901] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Affiliation(s)
- Zheng Liu
- College of Control Science and Engineering, Research Center for Analytical Instrumentation, Institute of Cyber‐Systems and Control, State Key Laboratory of Industrial Control Technology Zhejiang University Hangzhou China
| | - Wei Jin
- College of Control Science and Engineering, Research Center for Analytical Instrumentation, Institute of Cyber‐Systems and Control, State Key Laboratory of Industrial Control Technology Zhejiang University Hangzhou China
- College of Control Science and Engineering Huzhou Institute of Zhejiang University Huzhou China
| | - Ying Mu
- College of Control Science and Engineering, Research Center for Analytical Instrumentation, Institute of Cyber‐Systems and Control, State Key Laboratory of Industrial Control Technology Zhejiang University Hangzhou China
| |
Collapse
|
41
|
|
42
|
Dai X, Zhang K, Li J, Xiong J, Zhang N, Li H. Robust semi-supervised non-negative matrix factorization for binary subspace learning. COMPLEX INTELL SYST 2022. [DOI: 10.1007/s40747-021-00285-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
AbstractNon-negative matrix factorization and its extensions were applied to various areas (i.e., dimensionality reduction, clustering, etc.). When the original data are corrupted by outliers and noise, most of non-negative matrix factorization methods cannot achieve robust factorization and learn a subspace with binary codes. This paper puts forward a robust semi-supervised non-negative matrix factorization method for binary subspace learning, called RSNMF, for image clustering. For better clustering performance on the dataset contaminated by outliers and noise, we propose a weighted constraint on the noise matrix and impose manifold learning into non-negative matrix factorization. Moreover, we utilize the discrete hashing learning method to constrain the learned subspace, which can achieve a binary subspace from the original data. Experimental results validate the robustness and effectiveness of RSNMF in binary subspace learning and image clustering on the face dataset corrupted by Salt and Pepper noise and Contiguous Occlusion.
Collapse
|
43
|
Farswan A, Gupta R, Gupta A. ARCANE-ROG: Algorithm for Reconstruction of Cancer Evolution from single-cell data using Robust Graph Learning. J Biomed Inform 2022; 129:104055. [PMID: 35337943 DOI: 10.1016/j.jbi.2022.104055] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2021] [Revised: 02/17/2022] [Accepted: 03/12/2022] [Indexed: 11/27/2022]
Abstract
Tumor heterogeneity, marked by the presence of divergent clonal subpopulations of tumor cells, impedes the treatment response in cancer patients. Single-cell sequencing technology provides substantial prospects to gain an in-depth understanding of the cellular phenotypic variability driving tumor progression. A comprehensive insight into the intra-tumor heterogeneity may further assist in dealing with the treatment-resistant clones in cancer patients, thereby improving their overall survival. However, this task is hampered due to the challenges associated with the single-cell data, such as false positives, false negatives and missing bases, and the increase in their size. As a result, the computational cost of the existing methods increases, thereby limiting their usage. In this work, we propose a robust graph learning-based method, ARCANE-ROG (Algorithm for Reconstruction of CANcer Evolution via RObust Graph learning), for inferring clonal evolution from single-cell datasets. The first step of the proposed method is a joint framework of denoising with data imputation for the noisy and incomplete matrix while simultaneously learning an adjacency graph. Both the operations in the joint framework boost each other such that the overall performance of the denoising algorithm is improved. In the second step, an optimal number of clusters are identified via the Leiden method. In the last step, clonal evolution trees are inferred via a minimum spanning tree algorithm. The method has been benchmarked against a state-of-the-art method, RobustClone, using simulated datasets of varying sizes and five real datasets. The performance of our proposed method is found to be significantly superior (p-value < 0.05) in terms of reconstruction error, False Positive to False Negative (FPFN) ratio, tree distance error and V-measure compared to the other method. Overall, the proposed method is an improvement over the existing methods as it enhances cluster assignment and inference on clonal hierarchies.
Collapse
Affiliation(s)
- Akanksha Farswan
- SBILab, Department of ECE, Indraprastha Institute of Information Technology, New Delhi, India
| | - Ritu Gupta
- Laboratory Oncology Unit, Dr. B.R.A. IRCH, AIIMS, New Delhi, India.
| | - Anubha Gupta
- SBILab, Department of ECE, Indraprastha Institute of Information Technology, New Delhi, India.
| |
Collapse
|
44
|
Chao G, Sun S, Bi J. A Survey on Multi-View Clustering. IEEE TRANSACTIONS ON ARTIFICIAL INTELLIGENCE 2022; 2:146-168. [PMID: 35308425 DOI: 10.1109/tai.2021.3065894] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
Clustering is a machine learning paradigm of dividing sample subjects into a number of groups such that subjects in the same groups are more similar to those in other groups. With advances in information acquisition technologies, samples can frequently be viewed from different angles or in different modalities, generating multi-view data. Multi-view clustering, that clusters subjects into subgroups using multi-view data, has attracted more and more attentions. Although MVC methods have been developed rapidly, there has not been enough survey to summarize and analyze the current progress. Therefore, we propose a novel taxonomy of the MVC approaches. Similar to other machine learning methods, we categorize them into generative and discriminative classes. In discriminative class, based on the way of view integration, we split it further into five groups: Common Eigenvector Matrix, Common Coefficient Matrix, Common Indicator Matrix, Direct Combination and Combination After Projection. Furthermore, we relate MVC to other topics: multi-view representation, ensemble clustering, multi-task clustering, multi-view supervised and semi-supervised learning. Several representative real-world applications are elaborated for practitioners. Some benchmark multi-view datasets are introduced and representative MVC algorithms from each group are empirically evaluated to analyze how they perform on benchmark datasets. To promote future development of MVC approaches, we point out several open problems that may require further investigation and thorough examination.
Collapse
Affiliation(s)
- Guoqing Chao
- School of Computer Science and Technology, Harbin Institute of Technology, Weihai 264209, PR China
| | - Shiliang Sun
- School of Computer Science and Technology, East China Normal University, Shanghai, Shanghai 200062 China
| | - Jinbo Bi
- Department of Computer Science, University of Connecticut, Storrs, CT 06269 USA
| |
Collapse
|
45
|
Guo W, Wang Z, Ma M, Chen L, Yang H, Li D, Du W. Semi‐supervised multiple empirical kernel learning with pseudo empirical loss and similarity regularization. INT J INTELL SYST 2022. [DOI: 10.1002/int.22690] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Affiliation(s)
- Wei Guo
- Key Laboratory of Smart Manufacturing in Energy Chemical Process, Ministry of Education East China University of Science and Technology Shanghai People's Republic of China
- Department of Computer Science and Engineering East China University of Science and Technology Shanghai People's Republic of China
| | - Zhe Wang
- Key Laboratory of Smart Manufacturing in Energy Chemical Process, Ministry of Education East China University of Science and Technology Shanghai People's Republic of China
- Department of Computer Science and Engineering East China University of Science and Technology Shanghai People's Republic of China
| | - Menghao Ma
- Key Laboratory of Smart Manufacturing in Energy Chemical Process, Ministry of Education East China University of Science and Technology Shanghai People's Republic of China
- Department of Computer Science and Engineering East China University of Science and Technology Shanghai People's Republic of China
| | - Lilong Chen
- Department of Computer Science and Engineering East China University of Science and Technology Shanghai People's Republic of China
| | - Hai Yang
- Department of Computer Science and Engineering East China University of Science and Technology Shanghai People's Republic of China
| | - Dongdong Li
- Department of Computer Science and Engineering East China University of Science and Technology Shanghai People's Republic of China
| | - Wenli Du
- Key Laboratory of Smart Manufacturing in Energy Chemical Process, Ministry of Education East China University of Science and Technology Shanghai People's Republic of China
| |
Collapse
|
46
|
Mi Y, Ren Z, Xu Z, Li H, Sun Q, Chen H, Dai J. Multi-view clustering with dual tensors. Neural Comput Appl 2022. [DOI: 10.1007/s00521-022-06927-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
47
|
Dai Y, Shou L, Gong M, Xia X, Kang Z, Xu Z, Jiang D. Graph Fusion Network for Text Classification. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2021.107659] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
48
|
Li X, Zhang H, Wang R, Nie F. Multiview Clustering: A Scalable and Parameter-Free Bipartite Graph Fusion Method. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2022; 44:330-344. [PMID: 32750830 DOI: 10.1109/tpami.2020.3011148] [Citation(s) in RCA: 50] [Impact Index Per Article: 16.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Multiview clustering partitions data into different groups according to their heterogeneous features. Most existing methods degenerate the applicability of models due to their intractable hyper-parameters triggered by various regularization terms. Moreover, traditional spectral based methods always encounter the expensive time overheads and fail in exploring the explicit clusters from graphs. In this paper, we present a scalable and parameter-free graph fusion framework for multiview clustering, seeking for a joint graph compatible across multiple views in a self-supervised weighting manner. Our formulation coalesces multiple view-wise graphs straightforward and learns the weights as well as the joint graph interactively, which could actively release the model from any weight-related hyper-parameters. Meanwhile, we manipulate the joint graph by a connectivity constraint such that the connected components indicate clusters directly. The designed algorithm is initialization-independent and time-economical which obtains the stable performance and scales well with the data size. Substantial experiments on toy data as well as real datasets are conducted that verify the superiority of the proposed method compared to the state-of-the-arts over the clustering performance and time expenditure.
Collapse
|
49
|
Self-Supervised Convolutional Subspace Clustering Network with the Block Diagonal Regularizer. Neural Process Lett 2021. [DOI: 10.1007/s11063-021-10563-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
50
|
|