1
|
Yedjour H, Yedjour D. A spatiotemporal energy model based on spiking neurons for human motion perception. Cogn Neurodyn 2024; 18:2015-2029. [PMID: 39104665 PMCID: PMC11297886 DOI: 10.1007/s11571-024-10068-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Revised: 11/30/2023] [Accepted: 01/09/2024] [Indexed: 08/07/2024] Open
Abstract
Inspired by the motion processing pathway, this paper proposes a bio-inspired feedforward spiking network model based on Hodgkin-Huxley neurons for human motion perception. The proposed network mimics the mechanisms of direction selectivity found in simple and complex cells of the primary visual cortex. Simple cells' receptive fields are modeled using Gabor energy filters, while complex cells' receptive fields are constructed by integrating the responses of simple cells in an energy model. To generate the motion map, the spiking output of the network integrates motion information encoded by the responses of complex cells with various preferred directions. Simulation results demonstrate that the spiking neuron-based network effectively replicates the directional selectivity operation of the visual cortex when presented with a sequence of time-varying images. We evaluate the proposed model against state-of-the-art spiking neuron-based motion detection models using publicly available datasets. The results highlight the model's capability to extract motion energy from diverse video sequences, akin to human visual motion perception models. Additionally, we showcase the application of the proposed model in motion segmentation tasks and compare its performance with state-of-the-art motion-based segmentation models using challenging video segmentation benchmarks. The results indicate competitive performance. The motion maps generated by the proposed model can be utilized for action recognition in input videos.
Collapse
Affiliation(s)
- Hayat Yedjour
- Faculty of Mathematics and Computer Science, Department of Computer Science, Université des Sciences et de la Technologie d’Oran Mohamed Boudiaf, USTO-MB, EL M’naouer, BP 1505, 31000 Oran, Algeria
| | - Dounia Yedjour
- Faculty of Mathematics and Computer Science, Department of Computer Science, Université des Sciences et de la Technologie d’Oran Mohamed Boudiaf, USTO-MB, EL M’naouer, BP 1505, 31000 Oran, Algeria
| |
Collapse
|
2
|
Zhou L, Lin C, Pang X, Yang H, Pan Y, Zhang Y. Learning parallel and hierarchical mechanisms for edge detection. Front Neurosci 2023; 17:1194713. [PMID: 37559703 PMCID: PMC10407095 DOI: 10.3389/fnins.2023.1194713] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Accepted: 07/03/2023] [Indexed: 08/11/2023] Open
Abstract
Edge detection is one of the fundamental components of advanced computer vision tasks, and it is essential to preserve computational resources while ensuring a certain level of performance. In this paper, we propose a lightweight edge detection network called the Parallel and Hierarchical Network (PHNet), which draws inspiration from the parallel processing and hierarchical processing mechanisms of visual information in the visual cortex neurons and is implemented via a convolutional neural network (CNN). Specifically, we designed an encoding network with parallel and hierarchical processing based on the visual information transmission pathway of the "retina-LGN-V1" and meticulously modeled the receptive fields of the cells involved in the pathway. Empirical evaluation demonstrates that, despite a minimal parameter count of only 0.2 M, the proposed model achieves a remarkable ODS score of 0.781 on the BSDS500 dataset and ODS score of 0.863 on the MBDD dataset. These results underscore the efficacy of the proposed network in attaining superior edge detection performance at a low computational cost. Moreover, we believe that this study, which combines computational vision and biological vision, can provide new insights into edge detection model research.
Collapse
Affiliation(s)
- Ling Zhou
- Key Laboratory of AI and Information Processing (Hechi University), Education Department of Guangxi Zhuang Autonomous Region, Hechi University, Yizhou, China
| | - Chuan Lin
- Key Laboratory of AI and Information Processing (Hechi University), Education Department of Guangxi Zhuang Autonomous Region, Hechi University, Yizhou, China
- School of Automation, Guangxi University of Science and Technology, Liuzhou, China
- Guangxi Key Laboratory of Automobile Components and Vehicle Technology, Guangxi University of Science and Technology, Liuzhou, China
| | - Xintao Pang
- Key Laboratory of AI and Information Processing (Hechi University), Education Department of Guangxi Zhuang Autonomous Region, Hechi University, Yizhou, China
- School of Automation, Guangxi University of Science and Technology, Liuzhou, China
- Guangxi Key Laboratory of Automobile Components and Vehicle Technology, Guangxi University of Science and Technology, Liuzhou, China
| | - Hao Yang
- School of Automation, Guangxi University of Science and Technology, Liuzhou, China
- Guangxi Key Laboratory of Automobile Components and Vehicle Technology, Guangxi University of Science and Technology, Liuzhou, China
| | - Yongcai Pan
- School of Automation, Guangxi University of Science and Technology, Liuzhou, China
- Guangxi Key Laboratory of Automobile Components and Vehicle Technology, Guangxi University of Science and Technology, Liuzhou, China
| | - Yuwei Zhang
- School of Automation, Guangxi University of Science and Technology, Liuzhou, China
- Guangxi Key Laboratory of Automobile Components and Vehicle Technology, Guangxi University of Science and Technology, Liuzhou, China
| |
Collapse
|
3
|
Yao Y, Zhang Z, Peng B, Tang J. Bio-Inspired Network for Diagnosing Liver Steatosis in Ultrasound Images. Bioengineering (Basel) 2023; 10:768. [PMID: 37508795 PMCID: PMC10376777 DOI: 10.3390/bioengineering10070768] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Revised: 06/15/2023] [Accepted: 06/23/2023] [Indexed: 07/30/2023] Open
Abstract
Using ultrasound imaging to diagnose liver steatosis is of great significance for preventing diseases such as cirrhosis and liver cancer. Accurate diagnosis under conditions of low quality, noise and poor resolutions is still a challenging task. Physiological studies have shown that the visual cortex of the biological visual system has selective attention neural mechanisms and feedback regulation of high features to low features. When processing visual information, these cortical regions selectively focus on more sensitive information and ignore unimportant details, which can effectively extract important features from visual information. Inspired by this, we propose a new diagnostic network for hepatic steatosis. In order to simulate the selection mechanism and feedback regulation of the visual cortex in the ventral pathway, it consists of a receptive field feature extraction module, parallel attention module and feedback connection. The receptive field feature extraction module corresponds to the inhibition of the non-classical receptive field of V1 neurons on the classical receptive field. It processes the input image to suppress the unimportant background texture. Two types of attention are adopted in the parallel attention module to process the same visual information and extract different important features for fusion, which improves the overall performance of the model. In addition, we construct a new dataset of fatty liver ultrasound images and validate the proposed model on this dataset. The experimental results show that the network has good performance in terms of sensitivity, specificity and accuracy for the diagnosis of fatty liver disease.
Collapse
Affiliation(s)
- Yuan Yao
- General Practice Medical Center, West China Hospital, Sichuan University, Chengdu 610044, China
| | - Zhenguang Zhang
- School of Automation, Guangxi University of Science and Technology, Liuzhou 545006, China
| | - Bo Peng
- School of Computing and Artificial Intelligent, Southwest Jiaotong University, Chengdu 611756, China
| | - Jin Tang
- Tiaodenghe Community Health Service Center, Chengdu 610066, China
| |
Collapse
|
4
|
Al-Amaren A, Ahmad MO, Swamy M. A low-complexity residual deep neural network for image edge detection. APPL INTELL 2022. [DOI: 10.1007/s10489-022-04062-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|