1
|
Zhang H, Zhou R, Zhao S, Jing L, Chen Y. TCH: A novel multi-view dimensionality reduction method based on triple contrastive heads. Neural Netw 2025; 188:107459. [PMID: 40249996 DOI: 10.1016/j.neunet.2025.107459] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2023] [Revised: 08/01/2024] [Accepted: 04/01/2025] [Indexed: 04/20/2025]
Abstract
Multi-view dimensionality reduction (MvDR) is a potent approach for addressing the high-dimensional challenges in multi-view data. Recently, contrastive learning (CL) has gained considerable attention due to its superior performance. However, most CL-based methods focus on promoting consistency between any two cross views from the perspective of subspace samples, which extract features containing redundant information and fail to capture view-specific discriminative information. In this study, we propose feature- and recovery-level contrastive losses to eliminate redundant information and capture view-specific discriminative information, respectively. Based on this, we construct a novel MvDR method based on triple contrastive heads (TCH). This method combines sample-, feature-, and recovery-level contrastive losses to extract sufficient yet minimal subspace discriminative information in accordance with the information bottleneck principle. Furthermore, the relationship between TCH and mutual information is revealed, which provides the theoretical support for the outstanding performance of our method. Our experiments on five real-world datasets show that the proposed method outperforms existing methods.
Collapse
Affiliation(s)
- Hongjie Zhang
- School of Mathematical Sciences, Tiangong University, Tianjin 300387, PR China
| | - Ruojin Zhou
- College of Information and Electrical Engineering, China Agricultural University, Beijing 100083, PR China; Key Laboratory of Smart Farming Technologies for Aquatic Animal and Livestock, Ministry of Agriculture and Rural Affairs, Beijing 100083, PR China
| | - Siyu Zhao
- College of Science, China Agricultural University, Beijing 100083, PR China
| | - Ling Jing
- College of Information and Electrical Engineering, China Agricultural University, Beijing 100083, PR China; College of Science, China Agricultural University, Beijing 100083, PR China.
| | - Yingyi Chen
- College of Information and Electrical Engineering, China Agricultural University, Beijing 100083, PR China; Key Laboratory of Smart Farming Technologies for Aquatic Animal and Livestock, Ministry of Agriculture and Rural Affairs, Beijing 100083, PR China; Beijing Engineering and Technology Research Center for Internet of Things in Agriculture, Beijing 100083, PR China; National Innovation Center for Digital Fishery, China Agricultural University, Beijing 100083, PR China.
| |
Collapse
|
2
|
Chen Y, Wu W, Ou-Yang L, Wang R, Kwong S. GRESS: Grouping Belief-Based Deep Contrastive Subspace Clustering. IEEE TRANSACTIONS ON CYBERNETICS 2025; 55:148-160. [PMID: 39437281 DOI: 10.1109/tcyb.2024.3475034] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2024]
Abstract
The self-expressive coefficient plays a crucial role in the self-expressiveness-based subspace clustering method. To enhance the precision of the self-expressive coefficient, we propose a novel deep subspace clustering method, named grouping belief-based deep contrastive subspace clustering (GRESS), which integrates the clustering information and higher-order relationship into the coefficient matrix. Specifically, we develop a deep contrastive subspace clustering module to enhance the learning of both self-expressive coefficients and cluster representations simultaneously. This approach enables the derivation of relatively noiseless self-expressive similarities and cluster-based similarities. To enable interaction between these two types of similarities, we propose a unique grouping belief-based affinity refinement module. This module leverages grouping belief to uncover the higher-order relationships within the similarity matrix, and integrates the well-designed noisy similarity suppression and similarity increment regularization to eliminate redundant connections while complete absent information. Extensive experimental results on four benchmark datasets validate the superiority of our proposed method GRESS over several state-of-the-art methods.
Collapse
|
3
|
Qin Y, Pu N, Sebe N, Feng G. Latent Space Learning-Based Ensemble Clustering. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2025; 34:1259-1270. [PMID: 40031529 DOI: 10.1109/tip.2025.3540297] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/05/2025]
Abstract
Ensemble clustering fuses a set of base clusterings and shows promising capability in achieving more robust and better clustering results. The existing methods usually realize ensemble clustering by adopting a co-association matrix to measure how many times two data points are categorized into the same cluster based on the base clusterings. Though great progress has been achieved, the obtained co-association matrix is constructed based on the combination of different connective matrices or its variants. These methods ignore exploring the inherent latent space shared by multiple connective matrices and learning the corresponding co-association matrices according to this latent space. Moreover, these methods neglect to learn discriminative connective matrices, explore the high-order relation among these connective matrices and consider the latent space in a unified framework. In this paper, we propose a Latent spacE leArning baseD Ensemble Clustering (LEADEC), which introduces the latent space shared by different connective matrices and learns the corresponding connective matrices according to this latent space. Specifically, we factorize the original multiple connective matrices into a consensus latent space representation and the specific connective matrices. Meanwhile, the orthogonal constraint is imposed to make the latent space representation more discriminative. In addition, we collect the obtained connective matrices based on the latent space into a tensor with three orders to investigate the high-order relations among these connective matrices. The connective matrices learning, the high-order relation investigation among connective matrices and the latent space representation learning are integrated into a unified framework. Experiments on seven benchmark datasets confirm the superiority of LEADEC compared with the existing representive methods.
Collapse
|
4
|
Zhao S, Fei L, Wen J, Zhang B, Zhao P, Li S. Structure Suture Learning-Based Robust Multiview Palmprint Recognition. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:8401-8413. [PMID: 37015591 DOI: 10.1109/tnnls.2022.3227473] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Low-quality palmprint images will degrade the recognition performance, when they are captured under the open, unconstraint, and low-illumination conditions. Moreover, the traditional single-view palmprint representation methods have been difficult to express the characteristics of each palm strongly, where the palmprint characteristics become weak. To tackle these issues, in this article, we propose a structure suture learning-based robust multiview palmprint recognition method (SSL_RMPR), which comprehensively presents the salient palmprint features from multiple views. Unlike the existing multiview palmprint representation methods, SSL_RMPR introduces a structure suture learning strategy to produce an elastic nearest neighbor graph (ENNG) on the reconstruction errors that simultaneously exploit the label information and the latent consensus structure of the multiview data, such that the discriminant palmprint representation can be adaptively enhanced. Meanwhile, a low-rank reconstruction term integrating with the projection matrix learning is proposed, in such a manner that the robustness of the projection matrix can be improved. Particularly, since no extra structure capture term is imposed into the proposed model, the complexity of the model can be greatly reduced. Experimental results have proven the superiority of the proposed SSL_RMPR by achieving the best recognition performances on a number of real-world palmprint databases.
Collapse
|
5
|
Sun Y, Wang X, Peng D, Ren Z, Shen X. Hierarchical Hashing Learning for Image Set Classification. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2023; 32:1732-1744. [PMID: 37028051 DOI: 10.1109/tip.2023.3251025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
With the development of video network, image set classification (ISC) has received a lot of attention and can be used for various practical applications, such as video based recognition, action recognition, and so on. Although the existing ISC methods have obtained promising performance, they often have extreme high complexity. Due to the superiority in storage space and complexity cost, learning to hash becomes a powerful solution scheme. However, existing hashing methods often ignore complex structural information and hierarchical semantics of the original features. They usually adopt a single-layer hashing strategy to transform high-dimensional data into short-length binary codes in one step. This sudden drop of dimension could result in the loss of advantageous discriminative information. In addition, they do not take full advantage of intrinsic semantic knowledge from whole gallery sets. To tackle these problems, in this paper, we propose a novel Hierarchical Hashing Learning (HHL) for ISC. Specifically, a coarse-to-fine hierarchical hashing scheme is proposed that utilizes a two-layer hash function to gradually refine the beneficial discriminative information in a layer-wise fashion. Besides, to alleviate the effects of redundant and corrupted features, we impose the $\ell _{2,1}$ norm on the layer-wise hash function. Moreover, we adopt a bidirectional semantic representation with the orthogonal constraint to keep intrinsic semantic information of all samples in whole image sets adequately. Comprehensive experiments demonstrate HHL acquires significant improvements in accuracy and running time. We will release the demo code on https://github.com/sunyuan-cs.
Collapse
|
6
|
Li J, Zhang X, Wang J, Wang X, Tan Z, Sun H. Projection-based coupled tensor learning for robust multi-view clustering. Inf Sci (N Y) 2023. [DOI: 10.1016/j.ins.2023.03.072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/13/2023]
|
7
|
Ruan W, Sun L. Robust latent discriminant adaptive graph preserving learning for image feature extraction. Knowl Based Syst 2023. [DOI: 10.1016/j.knosys.2023.110487] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/19/2023]
|
8
|
Dai J, Song H, Luo Y, Ren Z, Yang J. Robust multi-view low-rank embedding clustering. Neural Comput Appl 2022. [DOI: 10.1007/s00521-022-08137-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
|
9
|
Robust dimensionality reduction method based on relaxed energy and structure preserving embedding for multiview clustering. Inf Sci (N Y) 2022. [DOI: 10.1016/j.ins.2022.11.026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
|
10
|
Multiple kernel-based anchor graph coupled low-rank tensor learning for incomplete multi-view clustering. APPL INTELL 2022. [DOI: 10.1007/s10489-022-03735-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
AbstractIncomplete Multi-View Clustering (IMVC) attempts to give an optimal clustering solution for incomplete multi-view data that suffer from missing instances in certain views. However, most existing IMVC methods still have various drawbacks in practical applications, such as arbitrary incomplete scenarios cannot be handled; the computational cost is relatively high; most valuable nonlinear relations among samples are often ignored; complementary information among views is not sufficiently exploited. To address the above issues, in this paper, we present a novel and flexible unified graph learning framework, called Multiple Kernel-based Anchor Graph coupled low-rank Tensor learning for Incomplete Multi-View Clustering (MKAGT_IMVC), whose goal is to adaptively learn the optimal unified similarity matrix from all incomplete views. Specifically, according to the characteristics of incomplete multi-view data, MKAGT_IMVC innovatively improves an anchor selection strategy. Then, a novel cross-view anchor graph fusion mechanism is introduced to construct multiple fused complete anchor graphs, which captures more the intra-view and inter-view nonlinear relations. Moreover, a graph learning model combining low-rank tensor constraint and consensus graph constraint is developed, where all fused complete anchor graphs are regarded as prior knowledge to initialize this model. Extensive experiments conducted on eight incomplete multi-view datasets clearly show that our method delivers superior performance relative to some state-of-the-art methods in terms of clustering ability and time-consuming.
Collapse
|
11
|
Clustering via multiple kernel k-means coupled graph and enhanced tensor learning. APPL INTELL 2022. [DOI: 10.1007/s10489-022-03679-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
12
|
Mi Y, Ren Z, Xu Z, Li H, Sun Q, Chen H, Dai J. Multi-view clustering with dual tensors. Neural Comput Appl 2022. [DOI: 10.1007/s00521-022-06927-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
13
|
Abstract
Rice sheath blight is one of the main diseases in rice production. The traditional detection method, which needs manual recognition, is usually inefficient and slow. In this study, a recognition method for identifying rice sheath blight based on a backpropagation (BP) neural network is posed. Firstly, the sample image is smoothed by median filtering and histogram equalization, and the edge of the lesion is segmented using a Sobel operator, which largely reduces the background information and significantly improves the image quality. Then, the corresponding feature parameters of the image are extracted based on color and texture features. Finally, a BP neural network is built for training and testing with excellent tunability and easy optimization. The results demonstrate that when the number of hidden layer nodes is set to 90, the recognition accuracy of the BP neural network can reach up to 85.8%. Based on the color and texture features of the rice sheath blight image, the recognition algorithm constructed with a BP neural network has high accuracy and can effectively make up for the deficiency of manual recognition.
Collapse
|
14
|
Lin Z, Gao W, Jia J, Huang F. CapsNet meets ORB: A deformation‐tolerant baseline for recognizing distorted targets. INT J INTELL SYST 2021. [DOI: 10.1002/int.22677] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Affiliation(s)
- Zhongqi Lin
- College of Information and Electrical Engineering China Agricultural University Beijing China
- Key Laboratory of Agricultural Informatization Standardization Ministry of Agriculture and Rural Affairs Beijing China
| | - Wanlin Gao
- College of Information and Electrical Engineering China Agricultural University Beijing China
- Key Laboratory of Agricultural Informatization Standardization Ministry of Agriculture and Rural Affairs Beijing China
| | - Jingdun Jia
- Key Laboratory of Agricultural Informatization Standardization Ministry of Agriculture and Rural Affairs Beijing China
| | - Feng Huang
- College of Science, China Agricultural University Beijing China
| |
Collapse
|
15
|
Ren Z, Yang SX, Sun Q, Wang T. Consensus Affinity Graph Learning for Multiple Kernel Clustering. IEEE TRANSACTIONS ON CYBERNETICS 2021; 51:3273-3284. [PMID: 32584777 DOI: 10.1109/tcyb.2020.3000947] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Significant attention to multiple kernel graph-based clustering (MKGC) has emerged in recent years, primarily due to the superiority of multiple kernel learning (MKL) and the outstanding performance of graph-based clustering. However, many existing MKGC methods design a fat model that poses challenges for computational cost and clustering performance, as they learn both an affinity graph and an extra consensus kernel cumbersomely. To tackle this challenging problem, this article proposes a new MKGC method to learn a consensus affinity graph directly. By using the self-expressiveness graph learning and an adaptive local structure learning term, the local manifold structure of the data in kernel space is preserved for learning multiple candidate affinity graphs from a kernel pool first. After that, these candidate affinity graphs are synthesized to learn a consensus affinity graph via a thin autoweighted fusion model, in which a self-tuned Laplacian rank constraint and a top- k neighbors sparse strategy are introduced to improve the quality of the consensus affinity graph for accurate clustering purposes. The experimental results on ten benchmark datasets and two synthetic datasets show that the proposed method consistently and significantly outperforms the state-of-the-art methods.
Collapse
|
16
|
|
17
|
Lv J, Kang Z, Lu X, Xu Z. Pseudo-Supervised Deep Subspace Clustering. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2021; 30:5252-5263. [PMID: 34033539 DOI: 10.1109/tip.2021.3079800] [Citation(s) in RCA: 36] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Auto-Encoder (AE)-based deep subspace clustering (DSC) methods have achieved impressive performance due to the powerful representation extracted using deep neural networks while prioritizing categorical separability. However, self-reconstruction loss of an AE ignores rich useful relation information and might lead to indiscriminative representation, which inevitably degrades the clustering performance. It is also challenging to learn high-level similarity without feeding semantic labels. Another unsolved problem facing DSC is the huge memory cost due to n×n similarity matrix, which is incurred by the self-expression layer between an encoder and decoder. To tackle these problems, we use pairwise similarity to weigh the reconstruction loss to capture local structure information, while a similarity is learned by the self-expression layer. Pseudo-graphs and pseudo-labels, which allow benefiting from uncertain knowledge acquired during network training, are further employed to supervise similarity learning. Joint learning and iterative training facilitate to obtain an overall optimal solution. Extensive experiments on benchmark datasets demonstrate the superiority of our approach. By combining with the k -nearest neighbors algorithm, we further show that our method can address the large-scale and out-of-sample problems. The source code of our method is available: https://github.com/sckangz/SelfsupervisedSC.
Collapse
|
18
|
Mi Y, Ren Z, Mukherjee M, Huang Y, Sun Q, Chen L. Diversity and consistency embedding learning for multi-view subspace clustering. APPL INTELL 2021. [DOI: 10.1007/s10489-020-02126-z] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
19
|
Xiao X, Chen Y, Gong YJ, Zhou Y. Low-Rank Preserving t-Linear Projection for Robust Image Feature Extraction. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2020; 30:108-120. [PMID: 33090953 DOI: 10.1109/tip.2020.3031813] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
As the cornerstone for joint dimension reduction and feature extraction, extensive linear projection algorithms were proposed to fit various requirements. When being applied to image data, however, existing methods suffer from representation deficiency since the multi-way structure of the data is (partially) neglected. To solve this problem, we propose a novel Low-Rank Preserving t-Linear Projection (LRP-tP) model that preserves the intrinsic structure of the image data using t-product-based operations. The proposed model advances in four aspects: 1) LRP-tP learns the t-linear projection directly from the tensorial dataset so as to exploit the correlation among the multi-way data structure simultaneously; 2) to cope with the widely spread data errors, e.g., noise and corruptions, the robustness of LRP-tP is enhanced via self-representation learning; 3) LRP-tP is endowed with good discriminative ability by integrating the empirical classification error into the learning procedure; 4) an adaptive graph considering the similarity and locality of the data is jointly learned to precisely portray the data affinity. We devise an efficient algorithm to solve the proposed LRP-tP model using the alternating direction method of multipliers. Extensive experiments on image feature extraction have demonstrated the superiority of LRP-tP compared to the state-of-the-arts.
Collapse
|