1
|
Liu X, Tang S, Ye M, Lu T, Duan L. Semantic discrete decoder based on adaptive pixel clustering for monocular depth estimation. Neural Netw 2025; 189:107565. [PMID: 40414148 DOI: 10.1016/j.neunet.2025.107565] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2024] [Revised: 03/01/2025] [Accepted: 04/29/2025] [Indexed: 05/27/2025]
Abstract
Monocular depth estimation (MDE) has long been a popular and challenging task. Currently, mainstream methods mainly include regression methods based on geometric constraints and ordinal regression methods based on discretized depth intervals. However, they both overlook the fact that depth values within objects often exhibit some degree of continuity, while depth values between objects exhibit varying degrees of discontinuity. Based on this, we propose a more general approach to monocular depth estimation called APCDepth. This method does not treat MDE as an ordinal regression task but rather as a continuous regression task to ensure the continuity of depth values within objects. To focus on the discontinuity of depth values between objects, we propose an Adaptive Pixel Clustering (APC) module to semantically discretize encoder deep features, and align the discretized feature maps to a larger resolution using our proposed Cross-Semantic Alignment (CSA) module. Additionally, to tackle the quadratic complexity issue introduced by Transformers as decoders in depth estimation, we propose a Deformable Feature Pyramid Network (DeFPN) with sparse attention for multi-scale feature fusion. Furthermore, experimental results on the KITTI and NYU datasets validate the effectiveness of APCDepth and demonstrate outstanding performance.
Collapse
Affiliation(s)
- Xuanxuan Liu
- Shenzhen Institute for Advanced Study, University of Electronic Science and Technology of China, Shenzhen 518110, China.
| | - Shuai Tang
- Institute of Future Technology, South China University of Technology, Guangdong 511442, China.
| | - Mingzhi Ye
- Shenzhen Institute for Advanced Study, University of Electronic Science and Technology of China, Shenzhen 518110, China.
| | - Tongwei Lu
- School of Computer Science and Engineering, Wuhan Institute of Technology, Wuhan 430205, China.
| | - Lixin Duan
- Shenzhen Institute for Advanced Study, University of Electronic Science and Technology of China, Shenzhen 518110, China.
| |
Collapse
|
2
|
Bai Y, Shao S, Zhang J, Zhao X, Fang C, Wang T, Wang Y, Zhao H. A Review of Brain-Inspired Cognition and Navigation Technology for Mobile Robots. CYBORG AND BIONIC SYSTEMS 2024; 5:0128. [PMID: 38938902 PMCID: PMC11210290 DOI: 10.34133/cbsystems.0128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Accepted: 04/23/2024] [Indexed: 06/29/2024] Open
Abstract
Brain-inspired navigation technologies combine environmental perception, spatial cognition, and target navigation to create a comprehensive navigation research system. Researchers have used various sensors to gather environmental data and enhance environmental perception using multimodal information fusion. In spatial cognition, a neural network model is used to simulate the navigation mechanism of the animal brain and to construct an environmental cognition map. However, existing models face challenges in achieving high navigation success rate and efficiency. In addition, the limited incorporation of navigation mechanisms borrowed from animal brains necessitates further exploration. On the basis of the brain-inspired navigation process, this paper launched a systematic study on brain-inspired environment perception, brain-inspired spatial cognition, and goal-based navigation in brain-inspired navigation, which provides a new classification of brain-inspired cognition and navigation techniques and a theoretical basis for subsequent experimental studies. In the future, brain-inspired navigation technology should learn from more perfect brain-inspired mechanisms to improve its generalization ability and be simultaneously applied to large-scale distributed intelligent body cluster navigation. The multidisciplinary nature of brain-inspired navigation technology presents challenges, and multidisciplinary scholars must cooperate to promote the development of this technology.
Collapse
Affiliation(s)
- Yanan Bai
- School of Computer Science and Engineering,
Northeastern University, Shenyang 110819, China
- State Key Laboratory of Robotics, Shenyang Institute of Automation,
Chinese Academy of Sciences, Shenyang 110016, China
- Institutes for Robotics and Intelligent Manufacturing,
Chinese Academy of Sciences, Shenyang 110169, China
| | - Shiliang Shao
- State Key Laboratory of Robotics, Shenyang Institute of Automation,
Chinese Academy of Sciences, Shenyang 110016, China
- Institutes for Robotics and Intelligent Manufacturing,
Chinese Academy of Sciences, Shenyang 110169, China
| | - Jin Zhang
- School of Computer Science and Engineering,
Northeastern University, Shenyang 110819, China
- State Key Laboratory of Robotics, Shenyang Institute of Automation,
Chinese Academy of Sciences, Shenyang 110016, China
- Institutes for Robotics and Intelligent Manufacturing,
Chinese Academy of Sciences, Shenyang 110169, China
| | - Xianzhe Zhao
- School of Computer Science and Engineering,
Northeastern University, Shenyang 110819, China
- State Key Laboratory of Robotics, Shenyang Institute of Automation,
Chinese Academy of Sciences, Shenyang 110016, China
- Institutes for Robotics and Intelligent Manufacturing,
Chinese Academy of Sciences, Shenyang 110169, China
| | - Chuxi Fang
- School of Computer Science and Engineering,
Northeastern University, Shenyang 110819, China
- State Key Laboratory of Robotics, Shenyang Institute of Automation,
Chinese Academy of Sciences, Shenyang 110016, China
- Institutes for Robotics and Intelligent Manufacturing,
Chinese Academy of Sciences, Shenyang 110169, China
| | - Ting Wang
- State Key Laboratory of Robotics, Shenyang Institute of Automation,
Chinese Academy of Sciences, Shenyang 110016, China
- Institutes for Robotics and Intelligent Manufacturing,
Chinese Academy of Sciences, Shenyang 110169, China
| | - Yongliang Wang
- Department of Artificial Intelligence,
University of Groningen, Groningen 9747 AG, Netherlands
| | - Hai Zhao
- School of Computer Science and Engineering,
Northeastern University, Shenyang 110819, China
| |
Collapse
|
3
|
Wang J, Qi Y. Multi-level feature fusion and joint refinement for simultaneous object pose estimation and camera localization. Neural Netw 2024; 174:106238. [PMID: 38508048 DOI: 10.1016/j.neunet.2024.106238] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2023] [Revised: 01/22/2024] [Accepted: 03/13/2024] [Indexed: 03/22/2024]
Abstract
Object pose estimation and camera localization are critical in various applications. However, achieving algorithm universality, which refers to category-level pose estimation and scene-independent camera localization, presents challenges for both techniques. Although the two tasks keep close relationships due to spatial geometry constraints, different tasks require distinct feature extractions. This paper pays attention to a unified RGB-D based framework that simultaneously performs category-level object pose estimation and scene-independent camera localization. The framework consists of a pose estimation branch called SLO-ObjNet, a localization branch called SLO-LocNet, a pose confidence calculation process and object-level optimization. At the start, we obtain the initial camera and object results from SLO-LocNet and SLO-ObjNet. In these two networks, we design there-level feature fusion modules as well as the loss function to achieve feature sharing between two tasks. Then the proposed approach involves a confidence calculation process to determine the accuracy of object poses obtained. Additionally, an object-level Bundle Adjustment (BA) optimization algorithm is further used to improve the precision of these techniques. The BA algorithm establishes relationships among feature points, objects, and cameras with the usage of camera-point, camera-object, and object-point metrics. To evaluate the performance of this approach, experiments are conducted on localization and pose estimation datasets including REAL275, CAMERA25, LineMOD, YCB-Video, 7 Scenes, ScanNet and TUM RGB-D. The results show that this approach outperforms existing methods in terms of both estimation and localization accuracy. Additionally, SLO-LocNet and SLO-ObjNet are trained on ScanNet data and tested on 7 Scenes and TUM RGB-D datasets to demonstrate its universality performance. Finally, we also highlight the positive effects of fusion modules, loss function, confidence process and BA for improving overall performance.
Collapse
Affiliation(s)
- Junyi Wang
- School of Computer Science and Technology, Shandong University, Qingdao, China; Qingdao Research Institute of Beihang University, Qingdao, China.
| | - Yue Qi
- State Key Laboratory of Virtual Reality Technology and Systems, Beihang University, Beijing, China; Qingdao Research Institute of Beihang University, Qingdao, China.
| |
Collapse
|
4
|
Liu X, Zhang T, Liu M. Joint estimation of pose, depth, and optical flow with a competition-cooperation transformer network. Neural Netw 2024; 171:263-275. [PMID: 38103436 DOI: 10.1016/j.neunet.2023.12.020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Revised: 10/31/2023] [Accepted: 12/12/2023] [Indexed: 12/19/2023]
Abstract
Estimating depth, ego-motion, and optical flow from consecutive frames is a critical task in robot navigation and has received significant attention in recent years. In this study, we propose PDF-Former, an unsupervised joint estimation network comprising a full transformer-based framework, as well as a competition and cooperation mechanism. The transformer framework captures global feature dependencies and is customized for different task types, thereby improving the performance of sequential tasks. The competition and cooperation mechanisms enable the network to obtain additional supervisory information at different training stages. Specifically, the competition mechanism is implemented early in training to achieve iterative optimization of 6 DOF poses (rotation and translation information from the target image to the two reference images), the depth of target image, and optical flow (from the target image to the two reference images) estimation in a competitive manner. In contrast, the cooperation mechanism is implemented later in training to facilitate the transmission of results among the three networks and mutually optimize the estimation results. We conducted experiments on the KITTI dataset, and the results indicate that PDF-Former has significant potential to enhance the accuracy and robustness of sequential tasks in robot navigation.
Collapse
Affiliation(s)
- Xiaochen Liu
- School of Instrument Science & Engineering, Southeast University, Nanjing, 210096, China; Key Laboratory of Micro-Inertial Instrument and Advanced Navigation Technology, Ministry of Education, Southeast University, Nanjing, 210096, Jiangsu, China
| | - Tao Zhang
- School of Instrument Science & Engineering, Southeast University, Nanjing, 210096, China; Key Laboratory of Micro-Inertial Instrument and Advanced Navigation Technology, Ministry of Education, Southeast University, Nanjing, 210096, Jiangsu, China.
| | - Mingming Liu
- Department of Orthopedic Surgery, The Second People's Hospital of Lianyungang, Lianyungang, 222003, Jiangsu, China; Department of Orthopedic Surgery, The First People's Hospital of Xining, Xining, 810000, Qinghai, China.
| |
Collapse
|
5
|
Newman PM, Qi Y, Mou W, McNamara TP. Statistically Optimal Cue Integration During Human Spatial Navigation. Psychon Bull Rev 2023; 30:1621-1642. [PMID: 37038031 DOI: 10.3758/s13423-023-02254-w] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/08/2023] [Indexed: 04/12/2023]
Abstract
In 2007, Cheng and colleagues published their influential review wherein they analyzed the literature on spatial cue interaction during navigation through a Bayesian lens, and concluded that models of optimal cue integration often applied in psychophysical studies could explain cue interaction during navigation. Since then, numerous empirical investigations have been conducted to assess the degree to which human navigators are optimal when integrating multiple spatial cues during a variety of navigation-related tasks. In the current review, we discuss the literature on human cue integration during navigation that has been published since Cheng et al.'s original review. Evidence from most studies demonstrate optimal navigation behavior when humans are presented with multiple spatial cues. However, applications of optimal cue integration models vary in their underlying assumptions (e.g., uninformative priors and decision rules). Furthermore, cue integration behavior depends in part on the nature of the cues being integrated and the navigational task (e.g., homing versus non-home goal localization). We discuss the implications of these models and suggest directions for future research.
Collapse
Affiliation(s)
- Phillip M Newman
- Department of Psychology, Vanderbilt University, 301 Wilson Hall, 111 21st Avenue South, Nashville, TN, 37240, USA.
| | - Yafei Qi
- Department of Psychology, P-217 Biological Sciences Building, University of Alberta, Edmonton, Alberta, T6G 2R3, Canada
| | - Weimin Mou
- Department of Psychology, P-217 Biological Sciences Building, University of Alberta, Edmonton, Alberta, T6G 2R3, Canada
| | - Timothy P McNamara
- Department of Psychology, Vanderbilt University, 301 Wilson Hall, 111 21st Avenue South, Nashville, TN, 37240, USA
| |
Collapse
|
6
|
Liu X, Tang J, Shen C, Wang C, Zhao D, Guo X, Li J, Liu J. Brain-like position measurement method based on improved optical flow algorithm. ISA TRANSACTIONS 2023:S0019-0578(23)00412-3. [PMID: 37730462 DOI: 10.1016/j.isatra.2023.09.005] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/10/2018] [Revised: 04/22/2021] [Accepted: 09/05/2023] [Indexed: 09/22/2023]
Abstract
In this paper, a brain-like navigation scheme based on fuzzy kernel C-means (FKCM) clustering assisted pyramid Lucas Kanade (LK) optical flow algorithm is developed to measure the position of vehicle. The Speed Cell and Place Cell in animals' brain are introduced to construct the brain-like navigation mechanism which involves the optical flow method and image template matching to imitate the cells above-mentioned separately. To eliminate the singular values during optical flow calculation, the output of pyramid LK algorithm is clustered by FKCM algorithm firstly. Then, the velocity is calculated and integrated to get the position of the vehicle, and the brain-like navigation scheme is introduced to correct the position measurement errors by eliminating the accumulated errors resulting from velocity integration. The prominent advantages of the presented method are: (i) a pure visual brain-like position measurement method based on the concept of speed cells and place cells is proposed, making visual navigation more accurate and intelligent; (ii) the FKCM algorithm is used to eliminate the singular value of the pyramid LK algorithm, which improves the calculated velocity accuracy. Also, experimental comparison with classical pyramid LK algorithm is given to illustrate the superiority of the proposed method in position measurement.
Collapse
Affiliation(s)
- Xiaochen Liu
- Key Laboratory of Instrumentation Science & Dynamic Measurement, Ministry of Education, School of Instrument and Electronics, North University of China, Taiyuan 030051, PR China; Key Laboratory of Micro-Inertial Instrument and Advanced Navigation Technology, Ministry of Education, School of Instrument Science & Engineering, Southeast University, Nanjing 210096, PR China
| | - Jun Tang
- Key Laboratory of Instrumentation Science & Dynamic Measurement, Ministry of Education, School of Instrument and Electronics, North University of China, Taiyuan 030051, PR China
| | - Chong Shen
- Key Laboratory of Instrumentation Science & Dynamic Measurement, Ministry of Education, School of Instrument and Electronics, North University of China, Taiyuan 030051, PR China.
| | - Chenguang Wang
- Key Laboratory of Instrumentation Science & Dynamic Measurement, Ministry of Education, School of Instrument and Electronics, North University of China, Taiyuan 030051, PR China
| | - Donghua Zhao
- Key Laboratory of Instrumentation Science & Dynamic Measurement, Ministry of Education, School of Instrument and Electronics, North University of China, Taiyuan 030051, PR China
| | - Xiaoting Guo
- Key Laboratory of Instrumentation Science & Dynamic Measurement, Ministry of Education, School of Instrument and Electronics, North University of China, Taiyuan 030051, PR China
| | - Jie Li
- Key Laboratory of Instrumentation Science & Dynamic Measurement, Ministry of Education, School of Instrument and Electronics, North University of China, Taiyuan 030051, PR China
| | - Jun Liu
- Key Laboratory of Instrumentation Science & Dynamic Measurement, Ministry of Education, School of Instrument and Electronics, North University of China, Taiyuan 030051, PR China
| |
Collapse
|
7
|
Zhang Y, Shi K, Luo X, Chen Y, Wang Y, Qu H. A biologically inspired auto-associative network with sparse temporal population coding. Neural Netw 2023; 166:670-682. [PMID: 37604076 DOI: 10.1016/j.neunet.2023.07.040] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Revised: 06/25/2023] [Accepted: 07/26/2023] [Indexed: 08/23/2023]
Abstract
Associative system has attracted increasing attention for it can store basic information and then infer details to match perception with an efficient self-organization algorithm. However, the implementation of the associative system with the application of real-world data is relatively difficult. To address this issue, we propose a novel biologically inspired auto-associative (BIAA) network to explore the structure, encoding and formation of associative memory as well as to extend the ability to real-world application. Our network is constructed by imitating the organization of the cortical minicolumns where each minicolumn contains plenty of parallel biological spiking neurons. To allow the network to learn and predict one symbol per theta cycle, we incorporate synaptic delay and theta oscillation into the neuron dynamic process. Subsequently, we design a sparse temporal population (STP) coding scheme that allows each input symbol to be represented as stable, unique, and easily recallable sparsely distributed representations. By combining associative learning dynamics with the STP coding, our network realizes efficient storage and inference in an ordered manner. Experimental results indicate that the proposed network successfully performs sequence retrieval from partial text and sequence recovery from distorted information. BIAA network provides new insight into introducing biologically inspired mechanisms into associative system and has enormous potential for hardware and software applications.
Collapse
Affiliation(s)
- Ya Zhang
- Department of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 610054, PR China
| | - Kexin Shi
- Department of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 610054, PR China
| | - Xiaoling Luo
- Department of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 610054, PR China
| | - Yi Chen
- Department of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 610054, PR China
| | - Yucheng Wang
- Department of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 610054, PR China
| | - Hong Qu
- Department of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 610054, PR China.
| |
Collapse
|
8
|
Li H, Luo B, Song W, Yang C. Predictive hierarchical reinforcement learning for path-efficient mapless navigation with moving target. Neural Netw 2023; 165:677-688. [PMID: 37385022 DOI: 10.1016/j.neunet.2023.06.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Revised: 05/18/2023] [Accepted: 06/04/2023] [Indexed: 07/01/2023]
Abstract
Deep reinforcement learning (DRL) has been proven as a powerful approach for robot navigation over the past few years. DRL-based navigation does not require the pre-construction of a map, instead, high-performance navigation skills can be learned from trial-and-error experiences. However, recent DRL-based approaches mostly focus on a fixed navigation target. It is noted that when navigating to a moving target without maps, the performance of the standard RL structure drops dramatically on both the success rate and path efficiency. To address the mapless navigation problem with moving target, the predictive hierarchical DRL (pH-DRL) framework is proposed by integrating the long-term trajectory prediction to provide a cost-effective solution. In the proposed framework, the lower-level policy of the RL agent learns robot control actions to a specified goal, and the higher-level policy learns to make long-range planning of shorter navigation routes by sufficiently exploiting the predicted trajectories. By means of making decisions over two level of policies, the pH-DRL framework is robust to the unavoidable errors in long-term predictions. With the application of deep deterministic policy gradient (DDPG) for policy optimization, the pH-DDPG algorithm is developed based on the pH-DRL structure. Finally, through comparative experiments on the Gazebo simulator with several variants of the DDPG algorithm, the results demonstrate that the pH-DDPG outperforms other algorithms and achieves a high success rate and efficiency even though the target moves fast and randomly.
Collapse
Affiliation(s)
- Hanxiao Li
- School of Automation, Central South University, Changsha 410083, China.
| | - Biao Luo
- School of Automation, Central South University, Changsha 410083, China.
| | - Wei Song
- Research Center for Intelligent Robotics, Research Institute of Interdisciplinary Innovation, Zhejiang Laboratory, Hangzhou 311100, China.
| | - Chunhua Yang
- School of Automation, Central South University, Changsha 410083, China.
| |
Collapse
|
9
|
Fu R, Xu D, Li W, Shi P. Single-trial motor imagery electroencephalogram intention recognition by optimal discriminant hyperplane and interpretable discriminative rectangle mixture model. Cogn Neurodyn 2022; 16:1073-1085. [PMID: 36237407 PMCID: PMC9508315 DOI: 10.1007/s11571-021-09768-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2021] [Revised: 11/24/2021] [Accepted: 12/05/2021] [Indexed: 11/03/2022] Open
Abstract
Spatial filtering is widely used in brain-computer interface (BCI) systems to augmented signal characteristics of electroencephalogram (EEG) signals. In this study, a spatial domain filtering based EEG feature extraction method, optimal discriminant hyperplane-common spatial subspace decomposition (ODH-CSSD) is proposed. Specifically, the multi-dimensional EEG features were extracted from the original EEG signals by common space subspace decomposition (CSSD) algorithm, and the optimal feature criterion was established to find the multi-dimensional optimal projection space. A classic method of data dimension optimizing is using the eigenvectors of a lumped covariance matrix corresponding to the maximum eigenvalues. Then, the cost function is defined as the extreme value of the discriminant criterion, and the orthogonal N discriminant vectors corresponding to the N extreme value of the criterion are solved and constructed into the N-dimensional optimal feature space. Finally, the multi-dimensional EEG features are projected into the N-dimensional optimal projection space to obtain the optimal N-dimensional EEG features. Moreover, this study involves the extraction of two-dimensional and three-dimensional optimal EEG features from motor imagery EEG datasets, and the optimal EEG features are identified using the interpretable discriminative rectangular mixture model (DRMM). Experimental results show that the accuracy of DRMM to identify two-dimensional optimal features is more than 0.91, and the highest accuracy even reaches 0.975. Meanwhile, DRMM has the most stable recognition accuracy for two-dimensional optimal features, and its average clustering accuracy reaches 0.942, the gap between the accuracy of the DRMM with the accuracy of the FCM and K-means can reach 0.26. And the optimal three-dimensional features, for most subjects, the clustering accuracy of DRMM is higher than that of FCM and K-means. In general, the decision rectangle obtained by DRMM can clearly explain the difference of each cluster, notably, the optimization of multidimensional EEG features by optimal projection is superior to Fisher's ratio, and this method provides an alternative for the application of BCI. Supplementary Information The online version contains supplementary material available at 10.1007/s11571-021-09768-w.
Collapse
Affiliation(s)
- Rongrong Fu
- Measurement Technology and Instrumentation Key Lab of Hebei Province, Yanshan University, Qinhuangdao, 066004 China
| | - Dong Xu
- Measurement Technology and Instrumentation Key Lab of Hebei Province, Yanshan University, Qinhuangdao, 066004 China
| | - Weishuai Li
- Measurement Technology and Instrumentation Key Lab of Hebei Province, Yanshan University, Qinhuangdao, 066004 China
| | - Peiming Shi
- Measurement Technology and Instrumentation Key Lab of Hebei Province, Yanshan University, Qinhuangdao, 066004 China
| |
Collapse
|
10
|
Navigational strategy of a biped robot using regression-adaptive PSO approach. Soft comput 2022. [DOI: 10.1007/s00500-022-07084-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
11
|
Bing Z, Sewisy AE, Zhuang G, Walter F, Morin FO, Huang K, Knoll A. Toward Cognitive Navigation: Design and Implementation of a Biologically Inspired Head Direction Cell Network. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2022; 33:2147-2158. [PMID: 34860654 DOI: 10.1109/tnnls.2021.3128380] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
As a vital cognitive function of animals, the navigation skill is first built on the accurate perception of the directional heading in the environment. Head direction cells (HDCs), found in the limbic system of animals, are proven to play an important role in identifying the directional heading allocentrically in the horizontal plane, independent of the animal's location and the ambient conditions of the environment. However, practical HDC models that can be implemented in robotic applications are rarely investigated, especially those that are biologically plausible and yet applicable to the real world. In this article, we propose a computational HDC network that is consistent with several neurophysiological findings concerning biological HDCs and then implement it in robotic navigation tasks. The HDC network keeps a representation of the directional heading only relying on the angular velocity as an input. We examine the proposed HDC model in extensive simulations and real-world experiments and demonstrate its excellent performance in terms of accuracy and real-time capability.
Collapse
|
12
|
Kudithipudi D, Aguilar-Simon M, Babb J, Bazhenov M, Blackiston D, Bongard J, Brna AP, Chakravarthi Raja S, Cheney N, Clune J, Daram A, Fusi S, Helfer P, Kay L, Ketz N, Kira Z, Kolouri S, Krichmar JL, Kriegman S, Levin M, Madireddy S, Manicka S, Marjaninejad A, McNaughton B, Miikkulainen R, Navratilova Z, Pandit T, Parker A, Pilly PK, Risi S, Sejnowski TJ, Soltoggio A, Soures N, Tolias AS, Urbina-Meléndez D, Valero-Cuevas FJ, van de Ven GM, Vogelstein JT, Wang F, Weiss R, Yanguas-Gil A, Zou X, Siegelmann H. Biological underpinnings for lifelong learning machines. NAT MACH INTELL 2022. [DOI: 10.1038/s42256-022-00452-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
|
13
|
Zeng T, Si B, Li X. Entorhinal-hippocampal interactions lead to globally coherent representations of space. CURRENT RESEARCH IN NEUROBIOLOGY 2022; 3:100035. [PMID: 36685760 PMCID: PMC9846457 DOI: 10.1016/j.crneur.2022.100035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2021] [Revised: 02/08/2022] [Accepted: 03/09/2022] [Indexed: 01/25/2023] Open
Abstract
The firing maps of grid cells in the entorhinal cortex are thought to provide an efficient metric system capable of supporting spatial inference in all environments. However, whether spatial representations of grid cells are determined by local environment cues or are organized into globally coherent patterns remains undetermined. We propose a navigation model containing a path integration system in the entorhinal cortex and a cognitive map system in the hippocampus. In the path integration system, grid cell network and head direction (HD) cell network integrate movement and visual information, and form attractor states to represent the positions and head directions of the animal. In the cognitive map system, a topological map is constructed capturing the attractor states of the path integration system as nodes and the transitions between attractor states as links. On loop closure, when the animal revisits a familiar place, the topological map is calibrated to minimize odometry errors. The change of the topological map is mapped back to the path integration system, to correct the states of the grid cells and the HD cells. The proposed model was tested on iRat, a rat-like miniature robot, in a realistic maze. Experimental results showed that, after familiarization of the environment, both grid cells and HD cells develop globally coherent firing maps by map calibration and activity correction. These results demonstrate that the hippocampus and the entorhinal cortex work together to form globally coherent metric representations of the environment. The underlying mechanisms of the hippocampal-entorhinal circuit in capturing the structure of the environment from sequences of experience are critical for understanding episodic memory.
Collapse
Affiliation(s)
- Taiping Zeng
- International Research Center for Neurointelligence (WPI-IRCN), UTIAS, The University of Tokyo, Tokyo 113-0033, Japan
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China
- Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, Fudan University, Ministry of Education, China
| | - Bailu Si
- School of Systems Science, Beijing Normal University, Beijing, 100875, China
- Peng Cheng Laboratory, Shenzhen, 518055, China
| | - Xiaoli Li
- State Key Laboratory of Cognitive Neuroscience and Learning, Beijing Normal University, Beijing, 100875, China
| |
Collapse
|
14
|
Zeng T, Si B, Feng J. A theory of geometry representations for spatial navigation. Prog Neurobiol 2022; 211:102228. [DOI: 10.1016/j.pneurobio.2022.102228] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2021] [Revised: 01/17/2022] [Accepted: 01/20/2022] [Indexed: 11/29/2022]
|
15
|
Shikauchi Y, Miyakoshi M, Makeig S, Iversen JR. Bayesian models of human navigation behaviour in an augmented reality audiomaze. Eur J Neurosci 2020; 54:8308-8317. [PMID: 33237612 PMCID: PMC9292259 DOI: 10.1111/ejn.15061] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2020] [Revised: 11/19/2020] [Accepted: 11/19/2020] [Indexed: 11/29/2022]
Abstract
We investigated Bayesian modelling of human whole‐body motion capture data recorded during an exploratory real‐space navigation task in an “Audiomaze” environment (see the companion paper by Miyakoshi et al. in the same volume) to study the effect of map learning on navigation behaviour. There were three models, a feedback‐only model (no map learning), a map resetting model (single‐trial limited map learning), and a map updating model (map learning accumulated across three trials). The estimated behavioural variables included step sizes and turning angles. Results showed that the estimated step sizes were constantly more accurate using the map learning models than the feedback‐only model. The same effect was confirmed for turning angle estimates, but only for data from the third trial. We interpreted these results as Bayesian evidence of human map learning on navigation behaviour. Furthermore, separating the participants into groups of egocentric and allocentric navigators revealed an advantage for the map updating model in estimating step sizes, but only for the allocentric navigators. This interaction indicated that the allocentric navigators may take more advantage of map learning than do egocentric navigators. We discuss relationships of these results to simultaneous localization and mapping (SLAM) problem.
Collapse
Affiliation(s)
- Yumi Shikauchi
- JSPS Research Fellow, Tokyo, Japan.,Rhythm-based Brain Information Processing Unit, CBS-TOYOTA Collaboration Center, RIKEN Center for Brain Science, Saitama, Japan
| | - Makoto Miyakoshi
- Swartz Center for Neural Computation, Institute for Neural Computation, University of California San Diego, San Diego, CA, USA
| | - Scott Makeig
- Swartz Center for Neural Computation, Institute for Neural Computation, University of California San Diego, San Diego, CA, USA
| | - John R Iversen
- Swartz Center for Neural Computation, Institute for Neural Computation, University of California San Diego, San Diego, CA, USA
| |
Collapse
|
16
|
Zeng T, Si B. A brain-inspired compact cognitive mapping system. Cogn Neurodyn 2020; 15:91-101. [PMID: 33786082 DOI: 10.1007/s11571-020-09621-6] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2020] [Revised: 07/07/2020] [Accepted: 07/20/2020] [Indexed: 11/25/2022] Open
Abstract
In many simultaneous localization and mapping (SLAM) systems, the map of the environment grows over time as the robot explores the environment. The ever-growing map prevents long-term mapping, especially in large-scale environments. In this paper, we develop a compact cognitive mapping approach inspired by neurobiological experiments. Mimicking the firing activities of neighborhood cells, neighborhood fields determined by movement information, i.e. translation and rotation, are modeled to describe one of the distinct segments of the explored environment. The vertices with low neighborhood field activities are avoided to be added into the cognitive map. The optimization of the cognitive map is formulated as a robust non-linear least squares problem constrained by the transitions between vertices, and is numerically solved efficiently. According to the cognitive decision-making of place familiarity, loop closure edges are clustered depending on time intervals, and then batch global optimization of the cognitive map is performed to satisfy the combined constraint of the whole cluster. After the loop closure process, scene integration is performed, in which revisited vertices are removed subsequently to further reduce the size of the cognitive map. The compact cognitive mapping approach is tested on a monocular visual SLAM system in a naturalistic maze for a biomimetic animated robot. Our results demonstrate that the proposed method largely restricts the growth of the size of the cognitive map over time, and meanwhile, the compact cognitive map correctly represents the overall layout of the environment. The compact cognitive mapping method is well suitable for the representation of large-scale environments to achieve long-term robot navigation.
Collapse
Affiliation(s)
- Taiping Zeng
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China
- Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence (Fudan University), Ministry of Education, Shanghai, China
| | - Bailu Si
- School of Systems Science, Beijing Normal University, Beijing, 100875 China
| |
Collapse
|
17
|
Zeng T, Si B. Video data for the cognitive mapping process of NeuroBayesSLAM system. Data Brief 2020; 30:105637. [PMID: 32420426 PMCID: PMC7215087 DOI: 10.1016/j.dib.2020.105637] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2020] [Accepted: 04/22/2020] [Indexed: 11/09/2022] Open
Abstract
Simultaneous localization and mapping (SLAM), which addresses the problem of constructing a spatial map of an unknown environment while simultaneously determining the mobile robot's position relative to this map, is regarded as one of the key technologies in mobile robot navigation. This data article presents four raw video files, demonstrating the mapping and localization processes of NeuroBayesSLAM, a neurobiologically inspired SLAM system, on two publicly available datasets, namely the St Lucia suburb dataset and the iRat Australia dataset. The cognitive mapping process was recorded by a free screen recorder software on ubuntu Linux system. Neural activities of the head-direction cells and the grid cells, the local view templates of visual scenes, and experience map were included. These data envision the possibility of transferring the multisensory integration mechanism found in the spatial memory circuits of the mammalian brain to develop intelligent cognitive mapping systems for indoor and large outdoor environments as in the research article “NeuroBayesSLAM: Neurobiologically Inspired Bayesian Integration of Multisensory Information for Robot Navigation” Zeng et al., 2020.
Collapse
Affiliation(s)
- Taiping Zeng
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China.,Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence (Fudan University), Ministry of Education, China
| | - Bailu Si
- School of Systems Science, Beijing Normal University, 100875, China
| |
Collapse
|