1
|
Sun H, Mao S, Cai W, Cui Y, Chen D, Yao D, Guo D. BISNN: bio-information-fused spiking neural networks for enhanced EEG-based emotion recognition. Cogn Neurodyn 2025; 19:52. [PMID: 40129877 PMCID: PMC11929665 DOI: 10.1007/s11571-025-10239-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2024] [Revised: 02/19/2025] [Accepted: 03/04/2025] [Indexed: 03/26/2025] Open
Abstract
Spiking neural networks (SNNs), known for their rich spatio-temporal dynamics, have recently gained considerable attention in EEG-based emotion recognition. However, conventional model training approaches often fail to fully exploit the capabilities of SNNs, posing challenges for effective EEG data analysis. In this work, we propose a novel bio-information-fused SNN (BISNN) model to enhance EEG-based emotion recognition. The BISNN model incorporates biologically plausible intrinsic parameters into spiking neurons and is initialized with a structurally equivalent pre-trained ANN model. By constructing a bio-information-fused loss function, the BISNN model enables simultaneous training under dual constraints. Extensive experiments on benchmark EEG-based emotion datasets demonstrate that the BISNN model achieves competitive performance compared to state-of-the-art methods. Additionally, ablation studies investigating various components further elucidate the mechanisms underlying the model's effectiveness and evolution, aligning well with previous findings.
Collapse
Affiliation(s)
- Hongze Sun
- The Clinical Hospital of Chengdu Brain Science Institute, MOE Key Lab for NeuroInformation, China-Cuba Belt and Road Joint Laboratory on Neurotechnology and Brain-Apparatus Communication, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, 611731 China
| | - Shifeng Mao
- The Clinical Hospital of Chengdu Brain Science Institute, MOE Key Lab for NeuroInformation, China-Cuba Belt and Road Joint Laboratory on Neurotechnology and Brain-Apparatus Communication, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, 611731 China
| | - Wuque Cai
- The Clinical Hospital of Chengdu Brain Science Institute, MOE Key Lab for NeuroInformation, China-Cuba Belt and Road Joint Laboratory on Neurotechnology and Brain-Apparatus Communication, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, 611731 China
| | - Yan Cui
- The Clinical Hospital of Chengdu Brain Science Institute, MOE Key Lab for NeuroInformation, China-Cuba Belt and Road Joint Laboratory on Neurotechnology and Brain-Apparatus Communication, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, 611731 China
- Department of Neurosurgery, Sichuan Provincial People’s Hospital, Chengdu, 610072 China
| | - Duo Chen
- The Clinical Hospital of Chengdu Brain Science Institute, MOE Key Lab for NeuroInformation, China-Cuba Belt and Road Joint Laboratory on Neurotechnology and Brain-Apparatus Communication, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, 611731 China
- Chongqing University of Education, Chongqing University Industrial Technology Research Institute, Chongqing, 400065 China
| | - Dezhong Yao
- The Clinical Hospital of Chengdu Brain Science Institute, MOE Key Lab for NeuroInformation, China-Cuba Belt and Road Joint Laboratory on Neurotechnology and Brain-Apparatus Communication, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, 611731 China
| | - Daqing Guo
- The Clinical Hospital of Chengdu Brain Science Institute, MOE Key Lab for NeuroInformation, China-Cuba Belt and Road Joint Laboratory on Neurotechnology and Brain-Apparatus Communication, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, 611731 China
- Department of Neurosurgery, Sichuan Provincial People’s Hospital, Chengdu, 610072 China
| |
Collapse
|
2
|
Wang S, Zhang D, Belatreche A, Xiao Y, Qing H, Wei W, Zhang M, Yang Y. Ternary spike-based neuromorphic signal processing system. Neural Netw 2025; 187:107333. [PMID: 40081275 DOI: 10.1016/j.neunet.2025.107333] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2024] [Revised: 11/11/2024] [Accepted: 02/27/2025] [Indexed: 03/15/2025]
Abstract
Deep Neural Networks (DNNs) have been successfully implemented across various signal processing fields, resulting in significant enhancements in performance. However, DNNs generally require substantial computational resources, leading to significant economic costs and posing challenges for their deployment on resource-constrained edge devices. In this study, we take advantage of spiking neural networks (SNNs) and quantization technologies to develop an energy-efficient and lightweight neuromorphic signal processing system. Our system is characterized by two principal innovations: a threshold-adaptive encoding (TAE) method and a quantized ternary SNN (QT-SNN). The TAE method can efficiently encode time-varying analog signals into sparse ternary spike trains, thereby reducing energy and memory demands for signal processing. QT-SNN, compatible with ternary spike trains from the TAE method, quantifies both membrane potentials and synaptic weights to reduce memory requirements while maintaining performance. Extensive experiments are conducted on two typical signal-processing tasks: speech and electroencephalogram recognition. The results demonstrate that our neuromorphic signal processing system achieves state-of-the-art (SOTA) performance with a 94% reduced memory requirement. Furthermore, through theoretical energy consumption analysis, our system shows 7.5× energy saving compared to other SNN works. The efficiency and efficacy of the proposed system highlight its potential as a promising avenue for energy-efficient signal processing.
Collapse
Affiliation(s)
- Shuai Wang
- Department of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, 611731, China
| | - Dehao Zhang
- Department of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, 611731, China
| | - Ammar Belatreche
- Department of Computer and Information Sciences, Faculty of Engineering and Environment, Northumbria University, Newcastle upon Tyne NE1 8ST, UK
| | - Yichen Xiao
- Department of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, 611731, China
| | - Hongyu Qing
- Department of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, 611731, China
| | - Wenjie Wei
- Department of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, 611731, China
| | - Malu Zhang
- Department of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, 611731, China.
| | - Yang Yang
- Department of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, 611731, China
| |
Collapse
|
3
|
Li Y, Guan X, Yue W, Huang Y, Zhang B, Duan P. A Reinforced, Event-Driven, and Attention-Based Convolution Spiking Neural Network for Multivariate Time Series Prediction. Biomimetics (Basel) 2025; 10:240. [PMID: 40277639 PMCID: PMC12024570 DOI: 10.3390/biomimetics10040240] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2025] [Revised: 04/03/2025] [Accepted: 04/08/2025] [Indexed: 04/26/2025] Open
Abstract
Despite spiking neural networks (SNNs) inherently exceling at processing time series due to their rich spatio-temporal information and efficient event-driven computing, the challenge of extracting complex correlations between variables in multivariate time series (MTS) remains to be addressed. This paper proposes a reinforced, event-driven, and attention-based convolution SNN model (REAT-CSNN) with three novel features. First, a joint Gramian Angular Field and Rate (GAFR) coding scheme is proposed to convert MTS into spike images, preserving the inherent features in MTS, such as the temporal patterns and spatio-temporal correlations between time series. Second, an advanced LIF-pooling strategy is developed, which is then theoretically and empirically proved to be effective in preserving more features from the regions of interest in spike images than average-pooling strategies. Third, a convolutional block attention mechanism (CBAM) is redesigned to support spike-based input, enhancing event-driven characteristics in weighting operations while maintaining outstanding capability to capture the information encoded in spike images. Experiments on multiple MTS data sets, such as stocks and PM2.5 data sets, demonstrate that our model rivals, and even surpasses, some CNN- and RNN-based techniques, with up to 3% better performance, while consuming significantly less energy.
Collapse
Affiliation(s)
- Ying Li
- School of Software, Northeastern University Shenyang, Shenyang 110167, China; (Y.L.); (X.G.); (Y.H.); (B.Z.)
| | - Xikang Guan
- School of Software, Northeastern University Shenyang, Shenyang 110167, China; (Y.L.); (X.G.); (Y.H.); (B.Z.)
| | - Wenwei Yue
- State Key Lab of Integrated Services Networks, Xidian University, Xi’an 710071, China;
| | - Yongsheng Huang
- School of Software, Northeastern University Shenyang, Shenyang 110167, China; (Y.L.); (X.G.); (Y.H.); (B.Z.)
| | - Bin Zhang
- School of Software, Northeastern University Shenyang, Shenyang 110167, China; (Y.L.); (X.G.); (Y.H.); (B.Z.)
| | - Peibo Duan
- School of Software, Northeastern University Shenyang, Shenyang 110167, China; (Y.L.); (X.G.); (Y.H.); (B.Z.)
| |
Collapse
|
4
|
Wu K, E S, Yang N, Zhang A, Yan X, Mu C, Song Y. A novel approach to enhancing biomedical signal recognition via hybrid high-order information bottleneck driven spiking neural networks. Neural Netw 2025; 183:106976. [PMID: 39644595 DOI: 10.1016/j.neunet.2024.106976] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2024] [Revised: 10/16/2024] [Accepted: 11/06/2024] [Indexed: 12/09/2024]
Abstract
Biomedical signals, encapsulating vital physiological information, are pivotal in elucidating human traits and conditions, serving as a cornerstone for advancing human-machine interfaces. Nonetheless, the fidelity of biomedical signal interpretation is frequently compromised by pervasive noise sources such as skin, motion, and equipment interference, posing formidable challenges to precision recognition tasks. Concurrently, the burgeoning adoption of intelligent wearable devices illuminates a societal shift towards enhancing life and work through technological integration. This surge in popularity underscores the imperative for efficient, noise-resilient biomedical signal recognition methodologies, a quest that is both challenging and profoundly impactful. This study proposes a novel approach to enhancing biomedical signal recognition. The proposed approach employs a hierarchical information bottleneck mechanism within SNNs, quantifying the mutual information in different orders based on the depth of information flow in the network. Subsequently, these mutual information, together with the network's output and category labels, are restructured based on information theory principles to form the loss function used for training. A series of theoretical analyses and substantial experimental results have shown that this method can effectively compress noise in the data, and on the premise of low computational cost, it can also significantly outperform its vanilla counterpart in terms of classification performance.
Collapse
Affiliation(s)
- Kunlun Wu
- School of Artificial Intelligence, Anhui University, Hefei, 237090, China
| | - Shunzhuo E
- Suzhou High School of Jiangsu Province, Suzhou, 215011, China
| | - Ning Yang
- State Key Laboratory of Analog and Mixed-Signal VLSI, Institute of Microelectronics, University of Macau, Taipa, 999078, Macau
| | - Anguo Zhang
- School of Artificial Intelligence, Anhui University, Hefei, 237090, China.
| | - Xiaorong Yan
- Department of Neurosurgery, First Affiliated Hospital of Fujian Medical University, Fuzhou, 350506, China.
| | - Chaoxu Mu
- School of Artificial Intelligence, Anhui University, Hefei, 237090, China; School of Electrical and Information Engineering, Tianjin University, Tianjin, 300072, China
| | - Yongduan Song
- School of Artificial Intelligence, Anhui University, Hefei, 237090, China; Chongqing Key Laboratory of Autonomous Systems, Institute of Artificial Intelligence, School of Automation, Chongqing University, Chongqing, 400044, China
| |
Collapse
|
5
|
Yu W, Yang N, Wang Z, Li HC, Zhang A, Mu C, Pun SH. Fault-Tolerant Attitude Tracking Control Driven by Spiking NNs for Unmanned Aerial Vehicles. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2025; 36:3773-3785. [PMID: 38113157 DOI: 10.1109/tnnls.2023.3342078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/21/2023]
Abstract
In this article, we proposed a novel fault-tolerant control scheme for quadrotor unmanned aerial vehicles (UAVs) based on spiking neural networks (SNNs), which leverages the inherent features of neural network computing to significantly enhance the reliability and robustness of UAV flight control. Traditional control methods are known to be inadequate in dealing with complex and real-time sensor data, which results in poor performance and reduced robustness in fault-tolerant control. In contrast, the temporal processing, parallelism, and nonlinear capacity of SNNs enable the fault-tolerant control scheme to process vast amounts of sensory data with the ability to accurately identify and respond to faults. Furthermore, SNNs can learn and adjust to new environments and fault conditions, providing effective and adaptive flight control. The proposed SNN-based fault-tolerant control scheme demonstrates significant improvements in control accuracy and robustness compared with conventional methods, indicating its potential applicability and suitability for a range of UAV flight control scenarios.
Collapse
|
6
|
Limbacher T, Ozdenizci O, Legenstein R. Memory-Dependent Computation and Learning in Spiking Neural Networks Through Hebbian Plasticity. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2025; 36:2551-2562. [PMID: 38113154 DOI: 10.1109/tnnls.2023.3341446] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/21/2023]
Abstract
Spiking neural networks (SNNs) are the basis for many energy-efficient neuromorphic hardware systems. While there has been substantial progress in SNN research, artificial SNNs still lack many capabilities of their biological counterparts. In biological neural systems, memory is a key component that enables the retention of information over a huge range of temporal scales, ranging from hundreds of milliseconds up to years. While Hebbian plasticity is believed to play a pivotal role in biological memory, it has so far been analyzed mostly in the context of pattern completion and unsupervised learning in artificial and SNNs. Here, we propose that Hebbian plasticity is fundamental for computations in biological and artificial spiking neural systems. We introduce a novel memory-augmented SNN architecture that is enriched by Hebbian synaptic plasticity. We show that Hebbian enrichment renders SNNs surprisingly versatile in terms of their computational as well as learning capabilities. It improves their abilities for out-of-distribution generalization, one-shot learning, cross-modal generative association, language processing, and reward-based learning. This suggests that powerful cognitive neuromorphic systems can be built based on this principle.
Collapse
|
7
|
Sun J, Zhai Y, Liu P, Wang Y. Memristor-Based Neural Network Circuit of Associative Memory With Overshadowing and Emotion Congruent Effect. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2025; 36:3618-3630. [PMID: 38194385 DOI: 10.1109/tnnls.2023.3348553] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/11/2024]
Abstract
Most memristor-based neural network circuits consider only a single pattern of overshadowing or emotion, but the relationship between overshadowing and emotion is ignored. In this article, a memristor-based neural network circuit of associative memory with overshadowing and emotion congruent effect is designed, and overshadowing under multiple emotions is taken into account. The designed circuit mainly consists of an emotion module, a memory module, an inhibition module, and a feedback module. The generation and recovery of different emotions are realized by the emotion module. The functions of overshadowing under different emotions and recovery from overshadowing are achieved by the inhibition module and the memory module. Finally, the blocking caused by long-term overshadowing is implemented by the feedback module. The proposed circuit can be applied to bionic emotional robots and offers some references for brain-like systems.
Collapse
|
8
|
Zhang J, Zhang M, Wang Y, Liu Q, Yin B, Li H, Yang X. Spiking Neural Networks with Adaptive Membrane Time Constant for Event-Based Tracking. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2025; PP:1009-1021. [PMID: 40031251 DOI: 10.1109/tip.2025.3533213] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/05/2025]
Abstract
The brain-inspired Spiking Neural Networks (SNNs) work in an event-driven manner and have an implicit recurrence in neuronal membrane potential to memorize information over time, which are inherently suitable to handle temporal event-based streams. Despite their temporal nature and recent approaches advancements, these methods have predominantly been assessed on event-based classification tasks. In this paper, we explore the utility of SNNs for event-based tracking tasks. Specifically, we propose a brain-inspired adaptive Leaky Integrate-and-Fire neuron (BA-LIF) that can adaptively adjust the membrane time constant according to the inputs, thereby accelerating the leakage of meaningless noise features and reducing the decay of valuable information. SNNs composed of our proposed BA-LIF neurons can achieve high performance without a careful and time-consuming trial-by-error initialization on the membrane time constant. The adaptive capability of our network is further improved by introducing an extra temporal feature aggregator (TFA) that assigns attention weights over the temporal dimension. Extensive experiments on various event-based tracking datasets validate the effectiveness of our proposed method. We further validate the generalization capability of our method by applying it to other event-classification tasks.
Collapse
|
9
|
Xie X, Chua Y, Liu G, Zhang M, Luo G, Tang H. Event-Driven Spiking Learning Algorithm Using Aggregated Labels. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:17596-17607. [PMID: 37651489 DOI: 10.1109/tnnls.2023.3306749] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/02/2023]
Abstract
Traditional spiking learning algorithm aims to train neurons to spike at a specific time or on a particular frequency, which requires precise time and frequency labels in the training process. While in reality, usually only aggregated labels of sequential patterns are provided. The aggregate-label (AL) learning is proposed to discover these predictive features in distracting background streams only by aggregated spikes. It has achieved much success recently, but it is still computationally intensive and has limited use in deep networks. To address these issues, we propose an event-driven spiking aggregate learning algorithm (SALA) in this article. Specifically, to reduce the computational complexity, we improve the conventional spike-threshold-surface (STS) calculation in AL learning by analytical calculating voltage peak values in spiking neurons. Then we derive the algorithm to multilayers by event-driven strategy using aggregated spikes. We conduct comprehensive experiments on various tasks including temporal clue recognition, segmented and continuous speech recognition, and neuromorphic image classification. The experimental results demonstrate that the new STS method improves the efficiency of AL learning significantly, and the proposed algorithm outperforms the conventional spiking algorithm in various temporal clue recognition tasks.
Collapse
|
10
|
Zhou S, Yang B, Yuan M, Jiang R, Yan R, Pan G, Tang H. Enhancing SNN-based spatio-temporal learning: A benchmark dataset and Cross-Modality Attention model. Neural Netw 2024; 180:106677. [PMID: 39260008 DOI: 10.1016/j.neunet.2024.106677] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2023] [Revised: 08/24/2024] [Accepted: 08/29/2024] [Indexed: 09/13/2024]
Abstract
Spiking Neural Networks (SNNs), renowned for their low power consumption, brain-inspired architecture, and spatio-temporal representation capabilities, have garnered considerable attention in recent years. Similar to Artificial Neural Networks (ANNs), high-quality benchmark datasets are of great importance to the advances of SNNs. However, our analysis indicates that many prevalent neuromorphic datasets lack strong temporal correlation, preventing SNNs from fully exploiting their spatio-temporal representation capabilities. Meanwhile, the integration of event and frame modalities offers more comprehensive visual spatio-temporal information. Yet, the SNN-based cross-modality fusion remains underexplored. In this work, we present a neuromorphic dataset called DVS-SLR that can better exploit the inherent spatio-temporal properties of SNNs. Compared to existing datasets, it offers advantages in terms of higher temporal correlation, larger scale, and more varied scenarios. In addition, our neuromorphic dataset contains corresponding frame data, which can be used for developing SNN-based fusion methods. By virtue of the dual-modal feature of the dataset, we propose a Cross-Modality Attention (CMA) based fusion method. The CMA model efficiently utilizes the unique advantages of each modality, allowing for SNNs to learn both temporal and spatial attention scores from the spatio-temporal features of event and frame modalities, subsequently allocating these scores across modalities to enhance their synergy. Experimental results demonstrate that our method not only improves recognition accuracy but also ensures robustness across diverse scenarios.
Collapse
Affiliation(s)
- Shibo Zhou
- Research Center for Data Hub and Security, Zhejiang Lab, Hangzhou, China.
| | - Bo Yang
- College of Computer Science and Technology, Zhejiang University, Hangzhou, China.
| | - Mengwen Yuan
- Research Center for High Efficiency Computing System, Zhejiang Lab, Hangzhou, China.
| | - Runhao Jiang
- College of Computer Science and Technology, Zhejiang University, Hangzhou, China.
| | - Rui Yan
- College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou, China.
| | - Gang Pan
- College of Computer Science and Technology, Zhejiang University, Hangzhou, China; The State Key Lab of Brain-Machine Intelligence, Zhejiang University, Hangzhou, China.
| | - Huajin Tang
- College of Computer Science and Technology, Zhejiang University, Hangzhou, China; The State Key Lab of Brain-Machine Intelligence, Zhejiang University, Hangzhou, China.
| |
Collapse
|
11
|
Zhang A, Shi J, Wu J, Zhou Y, Yu W. Low Latency and Sparse Computing Spiking Neural Networks With Self-Driven Adaptive Threshold Plasticity. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:17177-17188. [PMID: 37581976 DOI: 10.1109/tnnls.2023.3300514] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/17/2023]
Abstract
Spiking neural networks (SNNs) have captivated the attention worldwide owing to their compelling advantages in low power consumption, high biological plausibility, and strong robustness. However, the intrinsic latency associated with SNNs during inference poses a significant challenge, impeding their further development and application. This latency is caused by the need for spiking neurons to collect electrical stimuli and generate spikes only when their membrane potential exceeds a firing threshold. Considering the firing threshold plays a crucial role in SNN performance, this article proposes a self-driven adaptive threshold plasticity (SATP) mechanism, wherein neurons autonomously adjust the firing thresholds based on their individual state information using unsupervised learning rules, of which the adjustment is triggered by their own firing events. SATP is based on the principle of maximizing the information contained in the output spike rate distribution of each neuron. This article derives the mathematical expression of SATP and provides extensive experimental results, demonstrating that SATP effectively reduces SNN inference latency, further reduces the computation density while improving computational accuracy, so that SATP facilitates SNN models to be with low latency, sparse computing, and high accuracy.
Collapse
|
12
|
Nomura K, Nishi Y. Synchronized stepwise control of firing and learning thresholds in a spiking randomly connected neural network toward hardware implementation. Front Neurosci 2024; 18:1402646. [PMID: 39605789 PMCID: PMC11599226 DOI: 10.3389/fnins.2024.1402646] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2024] [Accepted: 10/22/2024] [Indexed: 11/29/2024] Open
Abstract
Spiking randomly connected neural network (RNN) hardware is promising as ultimately low power devices for temporal data processing at the edge. Although the potential of RNNs for temporal data processing has been demonstrated, randomness of the network architecture often causes performance degradation. To mitigate such degradation, self-organization mechanism using intrinsic plasticity (IP) and synaptic plasticity (SP) should be implemented in the spiking RNN. Therefore, we propose hardware-oriented models of these functions. To implement the function of IP, a variable firing threshold is introduced to each excitatory neuron in the RNN that changes stepwise in accordance with its activity. We also define other thresholds for SP that synchronize with the firing threshold, which determine the direction of stepwise synaptic update that is executed on receiving a pre-synaptic spike. To discuss the effectiveness of our model, we perform simulations of temporal data learning and anomaly detection using publicly available electrocardiograms (ECGs) with a spiking RNN. We observe that the spiking RNN with our IP and SP models realizes the true positive rate of 1 with the false positive rate being suppressed at 0 successfully, which does not occur otherwise. Furthermore, we find that these thresholds as well as the synaptic weights can be reduced to binary if the RNN architecture is appropriately designed. This contributes to minimization of the circuit of the neuronal system having IP and SP.
Collapse
|
13
|
Cai W, Sun H, Liu R, Cui Y, Wang J, Xia Y, Yao D, Guo D. A Spatial-Channel-Temporal-Fused Attention for Spiking Neural Networks. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:14315-14329. [PMID: 37256807 DOI: 10.1109/tnnls.2023.3278265] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]
Abstract
Spiking neural networks (SNNs) mimic brain computational strategies, and exhibit substantial capabilities in spatiotemporal information processing. As an essential factor for human perception, visual attention refers to the dynamic process for selecting salient regions in biological vision systems. Although visual attention mechanisms have achieved great success in computer vision applications, they are rarely introduced into SNNs. Inspired by experimental observations on predictive attentional remapping, we propose a new spatial-channel-temporal-fused attention (SCTFA) module that can guide SNNs to efficiently capture underlying target regions by utilizing accumulated historical spatial-channel information in the present study. Through a systematic evaluation on three event stream datasets (DVS Gesture, SL-Animals-DVS, and MNIST-DVS), we demonstrate that the SNN with the SCTFA module (SCTFA-SNN) not only significantly outperforms the baseline SNN (BL-SNN) and two other SNN models with degenerated attention modules, but also achieves competitive accuracy with the existing state-of-the-art (SOTA) methods. Additionally, our detailed analysis shows that the proposed SCTFA-SNN model has strong robustness to noise and outstanding stability when faced with incomplete data, while maintaining acceptable complexity and efficiency. Overall, these findings indicate that incorporating appropriate cognitive mechanisms of the brain may provide a promising approach to elevate the capabilities of SNNs.
Collapse
|
14
|
Zhang A, Zhang Q, Zhao K. Editorial: Information theory meets deep neural networks: theory and applications. Front Neurosci 2024; 18:1448517. [PMID: 39077426 PMCID: PMC11284159 DOI: 10.3389/fnins.2024.1448517] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2024] [Accepted: 06/27/2024] [Indexed: 07/31/2024] Open
Affiliation(s)
- Anguo Zhang
- Institute of Microelectronics, University of Macau, Taipa, China
| | - Qichun Zhang
- School of Creative and Digital Industries, Buckinghamshire New University, Bradford, United Kingdom
| | - Kai Zhao
- School of Automation, Chongqing University, Chongqing, China
| |
Collapse
|
15
|
Zhang W, Geng H, Li P. Composing recurrent spiking neural networks using locally-recurrent motifs and risk-mitigating architectural optimization. Front Neurosci 2024; 18:1412559. [PMID: 38966757 PMCID: PMC11222634 DOI: 10.3389/fnins.2024.1412559] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2024] [Accepted: 06/03/2024] [Indexed: 07/06/2024] Open
Abstract
In neural circuits, recurrent connectivity plays a crucial role in network function and stability. However, existing recurrent spiking neural networks (RSNNs) are often constructed by random connections without optimization. While RSNNs can produce rich dynamics that are critical for memory formation and learning, systemic architectural optimization of RSNNs is still an open challenge. We aim to enable systematic design of large RSNNs via a new scalable RSNN architecture and automated architectural optimization. We compose RSNNs based on a layer architecture called Sparsely-Connected Recurrent Motif Layer (SC-ML) that consists of multiple small recurrent motifs wired together by sparse lateral connections. The small size of the motifs and sparse inter-motif connectivity leads to an RSNN architecture scalable to large network sizes. We further propose a method called Hybrid Risk-Mitigating Architectural Search (HRMAS) to systematically optimize the topology of the proposed recurrent motifs and SC-ML layer architecture. HRMAS is an alternating two-step optimization process by which we mitigate the risk of network instability and performance degradation caused by architectural change by introducing a novel biologically-inspired "self-repairing" mechanism through intrinsic plasticity. The intrinsic plasticity is introduced to the second step of each HRMAS iteration and acts as unsupervised fast self-adaptation to structural and synaptic weight modifications introduced by the first step during the RSNN architectural "evolution." We demonstrate that the proposed automatic architecture optimization leads to significant performance gains over existing manually designed RSNNs: we achieve 96.44% on TI46-Alpha, 94.66% on N-TIDIGITS, 90.28% on DVS-Gesture, and 98.72% on N-MNIST. To the best of the authors' knowledge, this is the first work to perform systematic architecture optimization on RSNNs.
Collapse
Affiliation(s)
| | | | - Peng Li
- Department of Electrical and Computer Engineering, University of California, Santa Barbara, Santa Barbara, CA, United States
| |
Collapse
|
16
|
Li J, Peng C. Weighted residual network for SAR automatic target recognition with data augmentation. Front Neurorobot 2023; 17:1298653. [PMID: 38169785 PMCID: PMC10758409 DOI: 10.3389/fnbot.2023.1298653] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2023] [Accepted: 11/30/2023] [Indexed: 01/05/2024] Open
Abstract
Introduction Decades of research have been dedicated to overcoming the obstacles inherent in synthetic aperture radar (SAR) automatic target recognition (ATR). The rise of deep learning technologies has brought a wave of new possibilities, demonstrating significant progress in the field. However, challenges like the susceptibility of SAR images to noise, the requirement for large-scale training datasets, and the often protracted duration of model training still persist. Methods This paper introduces a novel data augmentation strategy to address these issues. Our method involves the intentional addition and subsequent removal of speckle noise to artificially enlarge the scope of training data through noise perturbation. Furthermore, we propose a modified network architecture named weighted ResNet, which incorporates residual strain controls for enhanced performance. This network is designed to be computationally efficient and to minimize the amount of training data required. Results Through rigorous experimental analysis, our research confirms that the proposed data augmentation method, when used in conjunction with the weighted ResNet model, significantly reduces the time needed for training. It also improves the SAR ATR capabilities. Discussion Compared to existing models and methods tested, the combination of our data augmentation scheme and the weighted ResNet framework achieves higher computational efficiency and better recognition accuracy in SAR ATR applications. This suggests that our approach could be a valuable advancement in the field of SAR image analysis.
Collapse
Affiliation(s)
| | - Cheng Peng
- School of Electrical and Mechanical Engineering, Hefei Technology College, Hefei, China
| |
Collapse
|
17
|
Shang F, Lan Y, Yang J, Li E, Kang X. Robust data hiding for JPEG images with invertible neural network. Neural Netw 2023; 163:219-232. [PMID: 37062180 DOI: 10.1016/j.neunet.2023.03.037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2022] [Revised: 03/15/2023] [Accepted: 03/28/2023] [Indexed: 04/03/2023]
Abstract
JPEG compression will cause severe distortion to the shared compressed image, which brings great challenges to extracting messages correctly from the stego image. To address such challenges, we propose a novel end-to-end robust data hiding scheme for JPEG images. The embedding and extracting secret messages on the quantized discrete cosine transform (DCT) coefficients are implemented by the bi-directional process of the invertible neural network (INN), which can provide intrinsic robustness against lossy JPEG compression. We design a JPEG compression attack module to simulate the JPEG compression process, which helps the network automatically learn how to recover the secret message from JPEG compressed image. Experimental results have demonstrated that our method achieves strong robustness against lossy JPEG compression, and also significantly improves the security compared with the existing data hiding methods on the premise of ensuring image quality and high capacity. For example, the detection error of our method against XuNet has been increased by 3.45% over the existing data hiding methods.
Collapse
|
18
|
Yu C, Gu Z, Li D, Wang G, Wang A, Li E. STSC-SNN: Spatio-Temporal Synaptic Connection with temporal convolution and attention for spiking neural networks. Front Neurosci 2022; 16:1079357. [PMID: 36620452 PMCID: PMC9817103 DOI: 10.3389/fnins.2022.1079357] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2022] [Accepted: 12/08/2022] [Indexed: 12/25/2022] Open
Abstract
Spiking neural networks (SNNs), as one of the algorithmic models in neuromorphic computing, have gained a great deal of research attention owing to temporal information processing capability, low power consumption, and high biological plausibility. The potential to efficiently extract spatio-temporal features makes it suitable for processing event streams. However, existing synaptic structures in SNNs are almost full-connections or spatial 2D convolution, neither of which can extract temporal dependencies adequately. In this work, we take inspiration from biological synapses and propose a Spatio-Temporal Synaptic Connection SNN (STSC-SNN) model to enhance the spatio-temporal receptive fields of synaptic connections, thereby establishing temporal dependencies across layers. Specifically, we incorporate temporal convolution and attention mechanisms to implement synaptic filtering and gating functions. We show that endowing synaptic models with temporal dependencies can improve the performance of SNNs on classification tasks. In addition, we investigate the impact of performance via varied spatial-temporal receptive fields and reevaluate the temporal modules in SNNs. Our approach is tested on neuromorphic datasets, including DVS128 Gesture (gesture recognition), N-MNIST, CIFAR10-DVS (image classification), and SHD (speech digit recognition). The results show that the proposed model outperforms the state-of-the-art accuracy on nearly all datasets.
Collapse
Affiliation(s)
- Chengting Yu
- College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou, China
- Zhejiang University - University of Illinois at Urbana-Champaign Institute, Zhejiang University, Haining, China
| | - Zheming Gu
- College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou, China
| | - Da Li
- College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou, China
| | - Gaoang Wang
- Zhejiang University - University of Illinois at Urbana-Champaign Institute, Zhejiang University, Haining, China
| | - Aili Wang
- College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou, China
- Zhejiang University - University of Illinois at Urbana-Champaign Institute, Zhejiang University, Haining, China
| | - Erping Li
- College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou, China
- Zhejiang University - University of Illinois at Urbana-Champaign Institute, Zhejiang University, Haining, China
| |
Collapse
|
19
|
Hendrikse SCF, Treur J, Wilderjans TF, Dikker S, Koole SL. Becoming Attuned to Each Other Over Time: A Computational Neural Agent Model for the Role of Time Lags in Subjective Synchrony Detection and Related Behavioral Adaptivity. Brain Inform 2022. [DOI: 10.1007/978-3-031-15037-1_30] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/15/2022] Open
|