1
|
Cai R, Zhu Y, Chen X, Fang Y, Wu M, Qiao J, Hao Z. On the probability of necessity and sufficiency of explaining Graph Neural Networks: A lower bound optimization approach. Neural Netw 2025; 184:107065. [PMID: 39733700 DOI: 10.1016/j.neunet.2024.107065] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2023] [Revised: 12/06/2024] [Accepted: 12/15/2024] [Indexed: 12/31/2024]
Abstract
The explainability of Graph Neural Networks (GNNs) is critical to various GNN applications, yet it remains a significant challenge. A convincing explanation should be both necessary and sufficient simultaneously. However, existing GNN explaining approaches focus on only one of the two aspects, necessity or sufficiency, or a heuristic trade-off between the two. Theoretically, the Probability of Necessity and Sufficiency (PNS) holds the potential to identify the most necessary and sufficient explanation since it can mathematically quantify the necessity and sufficiency of an explanation. Nevertheless, the difficulty of obtaining PNS due to non-monotonicity and the challenge of counterfactual estimation limit its wide use. To address the non-identifiability of PNS, we resort to a lower bound of PNS that can be optimized via counterfactual estimation, and propose a framework of Necessary and Sufficient Explanation for GNN (NSEG) via optimizing that lower bound. Specifically, we depict the GNN as a structural causal model (SCM), and estimate the probability of counterfactual via the intervention under the SCM. Additionally, we leverage continuous masks with a sampling strategy to optimize the lower bound to enhance the scalability. Empirical results demonstrate that NSEG outperforms state-of-the-art methods, consistently generating the most necessary and sufficient explanations. The implementation of our NSEG is available at https://github.com/EthanChu7/NSEG.
Collapse
Affiliation(s)
- Ruichu Cai
- School of Computer Science, Guangdong University of Technology, Guangzhou 510006, China; Peng Cheng Laboratory, Shenzhen 518066, China.
| | - Yuxuan Zhu
- School of Computer Science, Guangdong University of Technology, Guangzhou 510006, China.
| | - Xuexin Chen
- School of Computer Science, Guangdong University of Technology, Guangzhou 510006, China.
| | - Yuan Fang
- School of Computing and Information Systems, Singapore Management University, 178902, Singapore.
| | - Min Wu
- Institute for Infocomm Research (I(2) R), A*STAR, 138632, Singapore.
| | - Jie Qiao
- School of Computer Science, Guangdong University of Technology, Guangzhou 510006, China; Peng Cheng Laboratory, Shenzhen 518066, China.
| | - Zhifeng Hao
- College of Science, Shantou University, Shantou 515063, China.
| |
Collapse
|
2
|
Cao Y, Xu X, Cheng Y, Sun C, Du Z, Gao L, Shen W. Personalizing Vision-Language Models With Hybrid Prompts for Zero-Shot Anomaly Detection. IEEE TRANSACTIONS ON CYBERNETICS 2025; 55:1917-1929. [PMID: 40031813 DOI: 10.1109/tcyb.2025.3536165] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/05/2025]
Abstract
Zero-shot anomaly detection (ZSAD) aims to develop a foundational model capable of detecting anomalies across arbitrary categories without relying on reference images. However, since "abnormality" is inherently defined in relation to "normality" within specific categories, detecting anomalies without reference images describing the corresponding normal context remains a significant challenge. As an alternative to reference images, this study explores the use of widely available product standards to characterize normal contexts and potential abnormal states. Specifically, this study introduces AnomalyVLM, which leverages generalized pretrained vision-language models (VLMs) to interpret these standards and detect anomalies. Given the current limitations of VLMs in comprehending complex textual information, AnomalyVLM generates hybrid prompts-comprising prompts for abnormal regions, symbolic rules, and region numbers-from the standards to facilitate more effective understanding. These hybrid prompts are incorporated into various stages of the anomaly detection process within the selected VLMs, including an anomaly region generator and an anomaly region refiner. By utilizing hybrid prompts, VLMs are personalized as anomaly detectors for specific categories, offering users flexibility and control in detecting anomalies across novel categories without the need for training data. Experimental results on four public industrial anomaly detection datasets, as well as a practical automotive part inspection task, highlight the superior performance and enhanced generalization capability of AnomalyVLM, especially in texture categories. An online demo of AnomalyVLM is available at https://github.com/caoyunkang/Segment-Any-Anomaly.
Collapse
|
3
|
Chen J, Wang Z. One-Shot Any-Scene Crowd Counting With Local-to-Global Guidance. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2024; 33:6622-6632. [PMID: 38963733 DOI: 10.1109/tip.2024.3420713] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/06/2024]
Abstract
Due to different installation angles, heights, and positions of the camera installation in real-world scenes, it is difficult for crowd counting models to work in unseen surveillance scenes. In this paper, we are interested in accurate crowd counting based on the data collected by any surveillance camera, that is to count the crowd from any scene given only one annotated image from that scene. To this end, we firstly pose crowd counting as a one-shot learning task. Through the metric-learning, we propose a simple yet effective method that firstly estimates crowd characteristics and then transfers them to guide the model to count the crowd. Specifically, to fully capture these crowd characteristics of the target scene, we devise the Multi-Prototype Learner to learn the prototypes of foreground and density from the limited support image using the Expectation-Maximization algorithm. To learn the adaptation capability for any unseen scene, estimated multi prototypes are proposed to guide the crowd counting of query images in a local-to-global way. CNN is utilized to activate the local features. And transformer is introduced to correlate global features. Extensive experiments on three surveillance datasets suggest that our method outperforms the SOTA methods in the few-shot crowd counting.
Collapse
|
4
|
Zeng H, Ye Z, Zhang D. Dynamic event-triggering-based distributed model predictive control of heterogeneous connected vehicle platoon under DoS attacks. ISA TRANSACTIONS 2024; 153:1-12. [PMID: 39034230 DOI: 10.1016/j.isatra.2024.07.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/24/2024] [Revised: 06/11/2024] [Accepted: 07/05/2024] [Indexed: 07/23/2024]
Abstract
This paper is concerned with the distributed model predictive control (DMPC) for heterogeneous connected vehicle platoon (CVP) under denial-of-service (DoS) attacks. Firstly, a dynamic event-triggering mechanism (DETM) based on the information interaction between vehicles is proposed to reduce the communication and computational burdens. Due to the fact that the triggering moment for each vehicle cannot be synchronized and DoS attacks can break the communication between vehicles, a packet replenishment mechanism is designed to ensure the integrity and effectiveness of information interaction. Then, the effect of external disturbance is handled by adding robustness constraints to the DMPC algorithm. In addition, the recursive feasibility of the DMPC algorithm and input-to-state practical stability (ISPS) of the CVP control system are demonstrated. Finally, the effectiveness of the algorithm is verified by simulation and comparison results.
Collapse
Affiliation(s)
- Hao Zeng
- Research Center of Automation and Artificial Intelligence, Zhejiang University of Technology, Hangzhou, 310023, PR China
| | - Zehua Ye
- Research Center of Automation and Artificial Intelligence, Zhejiang University of Technology, Hangzhou, 310023, PR China
| | - Dan Zhang
- Research Center of Automation and Artificial Intelligence, Zhejiang University of Technology, Hangzhou, 310023, PR China.
| |
Collapse
|
5
|
Chen Z, Wu Z, Zhong L, Plant C, Wang S, Guo W. Attributed Multi-Order Graph Convolutional Network for Heterogeneous Graphs. Neural Netw 2024; 174:106225. [PMID: 38471260 DOI: 10.1016/j.neunet.2024.106225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Revised: 01/17/2024] [Accepted: 03/02/2024] [Indexed: 03/14/2024]
Abstract
Heterogeneous graph neural networks play a crucial role in discovering discriminative node embeddings and relations from multi-relational networks. One of the key challenges in heterogeneous graph learning lies in designing learnable meta-paths, which significantly impact the quality of learned embeddings. In this paper, we propose an Attributed Multi-Order Graph Convolutional Network (AMOGCN), which automatically explores meta-paths that involve multi-hop neighbors by aggregating multi-order adjacency matrices. The proposed model first constructs different orders of adjacency matrices from manually designed node connections. Next, AMOGCN fuses these various orders of adjacency matrices to create an intact multi-order adjacency matrix. This process is supervised by the node semantic information, which is extracted from the node homophily evaluated by attributes. Eventually, we employ a one-layer simplifying graph convolutional network with the learned multi-order adjacency matrix, which is equivalent to the cross-hop node information propagation with multi-layer graph neural networks. Substantial experiments reveal that AMOGCN achieves superior semi-supervised classification performance compared with state-of-the-art competitors.
Collapse
Affiliation(s)
- Zhaoliang Chen
- College of Computer and Data Science, Fuzhou University, Fuzhou 350116, China; Fujian Provincial Key Laboratory of Network Computing and Intelligent Information Processing, Fuzhou University, Fuzhou 350116, China
| | - Zhihao Wu
- College of Computer and Data Science, Fuzhou University, Fuzhou 350116, China; Fujian Provincial Key Laboratory of Network Computing and Intelligent Information Processing, Fuzhou University, Fuzhou 350116, China
| | - Luying Zhong
- College of Computer and Data Science, Fuzhou University, Fuzhou 350116, China; Fujian Provincial Key Laboratory of Network Computing and Intelligent Information Processing, Fuzhou University, Fuzhou 350116, China
| | - Claudia Plant
- Faculty of Computer Science, University of Vienna, Vienna 1090, Austria; ds:UniVie, Vienna 1090, Austria
| | - Shiping Wang
- College of Computer and Data Science, Fuzhou University, Fuzhou 350116, China; Fujian Provincial Key Laboratory of Network Computing and Intelligent Information Processing, Fuzhou University, Fuzhou 350116, China
| | - Wenzhong Guo
- College of Computer and Data Science, Fuzhou University, Fuzhou 350116, China; Fujian Provincial Key Laboratory of Network Computing and Intelligent Information Processing, Fuzhou University, Fuzhou 350116, China.
| |
Collapse
|
6
|
Wang Z, Chen J, Gong M, Shao Z. Higher-order neurodynamical equation for simplex prediction. Neural Netw 2024; 173:106185. [PMID: 38387202 DOI: 10.1016/j.neunet.2024.106185] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2023] [Revised: 02/01/2024] [Accepted: 02/15/2024] [Indexed: 02/24/2024]
Abstract
It is demonstrated that higher-order patterns beyond pairwise relations can significantly enhance the learning capability of existing graph-based models, and simplex is one of the primary form for graphically representing higher-order patterns. Predicting unknown (disappeared) simplices in real-world complex networks can provide us with deeper insights, thereby assisting us in making better decisions. Nevertheless, previous efforts to predict simplices suffer from two issues: (i) they mainly focus on 2- or 3-simplices, and there are few models available for predicting simplices of arbitrary orders, and (ii) they lack the ability to analyze and learn the features of simplices from the perspective of dynamics. In this paper, we present a Higher-order Neurodynamical Equation for Simplex Prediction of arbitrary order (HNESP), which is a framework that combines neural networks and neurodynamics. Specifically, HNESP simulates the dynamical coupling process of nodes in simplicial complexes through different relations (i.e., strong pairwise relation, weak pairwise relation, and simplex) to learn node-level representations, while explaining the learning mechanism of neural networks from neurodynamics. To enrich the higher-order information contained in simplices, we exploit the entropy and normalized multivariate mutual information of different sub-structures of simplices to acquire simplex-level representations. Furthermore, simplex-level representations and multi-layer perceptron are used to quantify the existence probability of simplices. The effectiveness of HNESP is demonstrated by extensive simulations on seven higher-order benchmarks. Experimental results show that HNESP improves the AUC values of the state-of-the-art baselines by an average of 8.32%. Our implementations will be publicly available at: https://github.com/jianruichen/HNESP.
Collapse
Affiliation(s)
- Zhihui Wang
- Key Laboratory of Modern Teaching Technology, Ministry of Education, Xi'an, China; School of Computer Science, Shaanxi Normal University, Xi'an, China.
| | - Jianrui Chen
- Key Laboratory of Modern Teaching Technology, Ministry of Education, Xi'an, China; School of Computer Science, Shaanxi Normal University, Xi'an, China.
| | - Maoguo Gong
- Key Laboratory of Intelligent Perception and Image Understanding of Ministry of Education, Xi'an, China; School of Electronic Engineering, Xidian University, Xi'an, China.
| | - Zhongshi Shao
- Key Laboratory of Modern Teaching Technology, Ministry of Education, Xi'an, China; School of Computer Science, Shaanxi Normal University, Xi'an, China.
| |
Collapse
|
7
|
Shen X, Li X. Deep-learning methods for unveiling large-scale single-cell transcriptomes. Cancer Biol Med 2024; 20:j.issn.2095-3941.2023.0436. [PMID: 38318925 PMCID: PMC10845931 DOI: 10.20892/j.issn.2095-3941.2023.0436] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2023] [Accepted: 12/20/2023] [Indexed: 02/07/2024] Open
Affiliation(s)
- Xilin Shen
- Tianjin Cancer Institute, Key Laboratory of Cancer Prevention and Therapy, Tianjin’s Clinical Research Center for Cancer, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute & Hospital, Tianjin Medical University, Tianjin 300060, China
| | - Xiangchun Li
- Tianjin Cancer Institute, Key Laboratory of Cancer Prevention and Therapy, Tianjin’s Clinical Research Center for Cancer, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute & Hospital, Tianjin Medical University, Tianjin 300060, China
| |
Collapse
|