1
|
Lin N, Wang S, Li Y, Wang B, Shi S, He Y, Zhang W, Yu Y, Zhang Y, Zhang X, Wong K, Wang S, Chen X, Jiang H, Zhang X, Lin P, Xu X, Qi X, Wang Z, Shang D, Liu Q, Liu M. Resistive memory-based zero-shot liquid state machine for multimodal event data learning. NATURE COMPUTATIONAL SCIENCE 2025; 5:37-47. [PMID: 39789264 DOI: 10.1038/s43588-024-00751-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/20/2023] [Accepted: 11/25/2024] [Indexed: 01/12/2025]
Abstract
The human brain is a complex spiking neural network (SNN) capable of learning multimodal signals in a zero-shot manner by generalizing existing knowledge. Remarkably, it maintains minimal power consumption through event-based signal propagation. However, replicating the human brain in neuromorphic hardware presents both hardware and software challenges. Hardware limitations, such as the slowdown of Moore's law and Von Neumann bottleneck, hinder the efficiency of digital computers. In addition, SNNs are characterized by their software training complexities. Here, to this end, we propose a hardware-software co-design on a 40 nm 256 kB in-memory computing macro that physically integrates a fixed and random liquid state machine SNN encoder with trainable artificial neural network projections. We showcase the zero-shot learning of multimodal events on the N-MNIST and N-TIDIGITS datasets, including visual and audio data association, as well as neural and visual data alignment for brain-machine interfaces. Our co-design achieves classification accuracy comparable to fully optimized software models, resulting in a 152.83- and 393.07-fold reduction in training costs compared with state-of-the-art spiking recurrent neural network-based contrastive learning and prototypical networks, and a 23.34- and 160-fold improvement in energy efficiency compared with cutting-edge digital hardware, respectively. These proof-of-principle prototypes demonstrate zero-shot multimodal events learning capability for emerging efficient and compact neuromorphic hardware.
Collapse
Affiliation(s)
- Ning Lin
- Department of Electrical and Electronic Engineering, University of Hong Kong, Hong Kong, China
- Key Lab of Fabrication Technologies for Integrated Circuits and Key Laboratory of Microelectronic Devices and Integrated Technology, Institute of Microelectronics of the Chinese Academy of Sciences, Beijing, China
- The School of Microelectronics, Southern University of Science and Technology, Shenzhen, China
- ACCESS - AI Chip Center for Emerging Smart Systems, InnoHK Centers, Hong Kong Science Park, Hong Kong, China
| | - Shaocong Wang
- Department of Electrical and Electronic Engineering, University of Hong Kong, Hong Kong, China
- Key Lab of Fabrication Technologies for Integrated Circuits and Key Laboratory of Microelectronic Devices and Integrated Technology, Institute of Microelectronics of the Chinese Academy of Sciences, Beijing, China
- ACCESS - AI Chip Center for Emerging Smart Systems, InnoHK Centers, Hong Kong Science Park, Hong Kong, China
| | - Yi Li
- Department of Electrical and Electronic Engineering, University of Hong Kong, Hong Kong, China
- Key Lab of Fabrication Technologies for Integrated Circuits and Key Laboratory of Microelectronic Devices and Integrated Technology, Institute of Microelectronics of the Chinese Academy of Sciences, Beijing, China
| | - Bo Wang
- Department of Electrical and Electronic Engineering, University of Hong Kong, Hong Kong, China
- ACCESS - AI Chip Center for Emerging Smart Systems, InnoHK Centers, Hong Kong Science Park, Hong Kong, China
| | - Shuhui Shi
- Department of Electrical and Electronic Engineering, University of Hong Kong, Hong Kong, China
- ACCESS - AI Chip Center for Emerging Smart Systems, InnoHK Centers, Hong Kong Science Park, Hong Kong, China
| | - Yangu He
- Department of Electrical and Electronic Engineering, University of Hong Kong, Hong Kong, China
- ACCESS - AI Chip Center for Emerging Smart Systems, InnoHK Centers, Hong Kong Science Park, Hong Kong, China
| | - Woyu Zhang
- Key Lab of Fabrication Technologies for Integrated Circuits and Key Laboratory of Microelectronic Devices and Integrated Technology, Institute of Microelectronics of the Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Yifei Yu
- Department of Electrical and Electronic Engineering, University of Hong Kong, Hong Kong, China
- ACCESS - AI Chip Center for Emerging Smart Systems, InnoHK Centers, Hong Kong Science Park, Hong Kong, China
| | - Yue Zhang
- Department of Electrical and Electronic Engineering, University of Hong Kong, Hong Kong, China
- ACCESS - AI Chip Center for Emerging Smart Systems, InnoHK Centers, Hong Kong Science Park, Hong Kong, China
| | - Xinyuan Zhang
- Department of Electrical and Electronic Engineering, University of Hong Kong, Hong Kong, China
- ACCESS - AI Chip Center for Emerging Smart Systems, InnoHK Centers, Hong Kong Science Park, Hong Kong, China
| | - Kwunhang Wong
- Department of Electrical and Electronic Engineering, University of Hong Kong, Hong Kong, China
- ACCESS - AI Chip Center for Emerging Smart Systems, InnoHK Centers, Hong Kong Science Park, Hong Kong, China
| | - Songqi Wang
- Department of Electrical and Electronic Engineering, University of Hong Kong, Hong Kong, China
- ACCESS - AI Chip Center for Emerging Smart Systems, InnoHK Centers, Hong Kong Science Park, Hong Kong, China
| | - Xiaoming Chen
- Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
| | - Hao Jiang
- Frontier Institute of Chip and System, Fudan University, Shanghai, China
| | - Xumeng Zhang
- Frontier Institute of Chip and System, Fudan University, Shanghai, China
| | - Peng Lin
- College of Computer Science and Technology, Zhejiang University, Hangzhou, China
| | - Xiaoxin Xu
- Key Lab of Fabrication Technologies for Integrated Circuits and Key Laboratory of Microelectronic Devices and Integrated Technology, Institute of Microelectronics of the Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Xiaojuan Qi
- Department of Electrical and Electronic Engineering, University of Hong Kong, Hong Kong, China
| | - Zhongrui Wang
- The School of Microelectronics, Southern University of Science and Technology, Shenzhen, China.
- ACCESS - AI Chip Center for Emerging Smart Systems, InnoHK Centers, Hong Kong Science Park, Hong Kong, China.
| | - Dashan Shang
- Key Lab of Fabrication Technologies for Integrated Circuits and Key Laboratory of Microelectronic Devices and Integrated Technology, Institute of Microelectronics of the Chinese Academy of Sciences, Beijing, China.
- University of Chinese Academy of Sciences, Beijing, China.
| | - Qi Liu
- Frontier Institute of Chip and System, Fudan University, Shanghai, China
| | - Ming Liu
- Key Lab of Fabrication Technologies for Integrated Circuits and Key Laboratory of Microelectronic Devices and Integrated Technology, Institute of Microelectronics of the Chinese Academy of Sciences, Beijing, China
- Frontier Institute of Chip and System, Fudan University, Shanghai, China
| |
Collapse
|
2
|
Ahsan R, Chae HU, Jalal SAA, Wu Z, Tao J, Das S, Liu H, Wu JB, Cronin SB, Wang H, Sideris C, Kapadia R. Ultralow Power In-Sensor Neuronal Computing with Oscillatory Retinal Neurons for Frequency-Multiplexed, Parallel Machine Vision. ACS NANO 2024; 18:23785-23796. [PMID: 39140995 DOI: 10.1021/acsnano.4c09055] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/15/2024]
Abstract
In-sensor and near-sensor computing architectures enable multiply accumulate operations to be carried out directly at the point of sensing. In-sensor architectures offer dramatic power and speed improvements over traditional von Neumann architectures by eliminating multiple analog-to-digital conversions, data storage, and data movement operations. Current in-sensor processing approaches rely on tunable sensors or additional weighting elements to perform linear functions such as multiply accumulate operations as the sensor acquires data. This work implements in-sensor computing with an oscillatory retinal neuron device that converts incident optical signals into voltage oscillations. A computing scheme is introduced based on the frequency shift of coupled oscillators that enables parallel, frequency multiplexed, nonlinear operations on the inputs. An experimentally implemented 3 × 3 focal plane array of coupled neurons shows that functions approximating edge detection, thresholding, and segmentation occur in parallel. An example of inference on handwritten digits from the MNIST database is also experimentally demonstrated with a 3 × 3 array of coupled neurons feeding into a single hidden layer neural network, approximating a liquid-state machine. Finally, the equivalent energy consumption to carry out image processing operations, including peripherals such as the Fourier transform circuits, is projected to be <20 fJ/OP, possibly reaching as low as 15 aJ/OP.
Collapse
Affiliation(s)
- Ragib Ahsan
- Department of Electrical and Computer Engineering, University of Southern California, Los Angeles, California 90089, United States
| | - Hyun Uk Chae
- Department of Electrical and Computer Engineering, University of Southern California, Los Angeles, California 90089, United States
| | - Seyedeh Atiyeh Abbasi Jalal
- Department of Electrical and Computer Engineering, University of Southern California, Los Angeles, California 90089, United States
| | - Zezhi Wu
- Department of Electrical and Computer Engineering, University of Southern California, Los Angeles, California 90089, United States
| | - Jun Tao
- Department of Electrical and Computer Engineering, University of Southern California, Los Angeles, California 90089, United States
| | - Subrata Das
- Department of Electrical and Computer Engineering, University of Southern California, Los Angeles, California 90089, United States
| | - Hefei Liu
- Department of Electrical and Computer Engineering, University of Southern California, Los Angeles, California 90089, United States
| | - Jiang-Bin Wu
- Department of Electrical and Computer Engineering, University of Southern California, Los Angeles, California 90089, United States
| | - Stephen B Cronin
- Department of Electrical and Computer Engineering, University of Southern California, Los Angeles, California 90089, United States
| | - Han Wang
- Department of Electrical and Computer Engineering, University of Southern California, Los Angeles, California 90089, United States
| | - Constantine Sideris
- Department of Electrical and Computer Engineering, University of Southern California, Los Angeles, California 90089, United States
| | - Rehan Kapadia
- Department of Electrical and Computer Engineering, University of Southern California, Los Angeles, California 90089, United States
| |
Collapse
|
3
|
Schoepe T, Janotte E, Milde MB, Bertrand OJN, Egelhaaf M, Chicca E. Finding the gap: neuromorphic motion-vision in dense environments. Nat Commun 2024; 15:817. [PMID: 38280859 PMCID: PMC10821932 DOI: 10.1038/s41467-024-45063-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2021] [Accepted: 01/15/2024] [Indexed: 01/29/2024] Open
Abstract
Animals have evolved mechanisms to travel safely and efficiently within different habitats. On a journey in dense terrains animals avoid collisions and cross narrow passages while controlling an overall course. Multiple hypotheses target how animals solve challenges faced during such travel. Here we show that a single mechanism enables safe and efficient travel. We developed a robot inspired by insects. It has remarkable capabilities to travel in dense terrain, avoiding collisions, crossing gaps and selecting safe passages. These capabilities are accomplished by a neuromorphic network steering the robot toward regions of low apparent motion. Our system leverages knowledge about vision processing and obstacle avoidance in insects. Our results demonstrate how insects might safely travel through diverse habitats. We anticipate our system to be a working hypothesis to study insects' travels in dense terrains. Furthermore, it illustrates that we can design novel hardware systems by understanding the underlying mechanisms driving behaviour.
Collapse
Affiliation(s)
- Thorben Schoepe
- Peter Grünberg Institut 15, Forschungszentrum Jülich, Aachen, Germany.
- Faculty of Technology and Cognitive Interaction Technology Center of Excellence (CITEC), Bielefeld University, Bielefeld, Germany.
- Bio-Inspired Circuits and Systems (BICS) Lab. Zernike Institute for Advanced Materials (Zernike Inst Adv Mat), University of Groningen, Groningen, Netherlands.
- CogniGron (Groningen Cognitive Systems and Materials Center), University of Groningen, Groningen, Netherlands.
| | - Ella Janotte
- Event Driven Perception for Robotics, Italian Institute of Technology, iCub facility, Genoa, Italy
| | - Moritz B Milde
- International Centre for Neuromorphic Systems, MARCS Institute, Western Sydney University, Penrith, Australia
| | | | - Martin Egelhaaf
- Neurobiology, Faculty of Biology, Bielefeld University, Bielefeld, Germany
| | - Elisabetta Chicca
- Faculty of Technology and Cognitive Interaction Technology Center of Excellence (CITEC), Bielefeld University, Bielefeld, Germany
- Bio-Inspired Circuits and Systems (BICS) Lab. Zernike Institute for Advanced Materials (Zernike Inst Adv Mat), University of Groningen, Groningen, Netherlands
- CogniGron (Groningen Cognitive Systems and Materials Center), University of Groningen, Groningen, Netherlands
| |
Collapse
|
4
|
Donati E, Valle G. Neuromorphic hardware for somatosensory neuroprostheses. Nat Commun 2024; 15:556. [PMID: 38228580 PMCID: PMC10791662 DOI: 10.1038/s41467-024-44723-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2022] [Accepted: 01/03/2024] [Indexed: 01/18/2024] Open
Abstract
In individuals with sensory-motor impairments, missing limb functions can be restored using neuroprosthetic devices that directly interface with the nervous system. However, restoring the natural tactile experience through electrical neural stimulation requires complex encoding strategies. Indeed, they are presently limited in effectively conveying or restoring tactile sensations by bandwidth constraints. Neuromorphic technology, which mimics the natural behavior of neurons and synapses, holds promise for replicating the encoding of natural touch, potentially informing neurostimulation design. In this perspective, we propose that incorporating neuromorphic technologies into neuroprostheses could be an effective approach for developing more natural human-machine interfaces, potentially leading to advancements in device performance, acceptability, and embeddability. We also highlight ongoing challenges and the required actions to facilitate the future integration of these advanced technologies.
Collapse
Affiliation(s)
- Elisa Donati
- Institute of Neuroinformatics, University of Zurich and ETH Zurich, Zurich, Switzerland.
| | - Giacomo Valle
- Department of Organismal Biology and Anatomy, University of Chicago, Chicago, IL, USA.
| |
Collapse
|
5
|
Aboumerhi K, Güemes A, Liu H, Tenore F, Etienne-Cummings R. Neuromorphic applications in medicine. J Neural Eng 2023; 20:041004. [PMID: 37531951 DOI: 10.1088/1741-2552/aceca3] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2023] [Accepted: 08/02/2023] [Indexed: 08/04/2023]
Abstract
In recent years, there has been a growing demand for miniaturization, low power consumption, quick treatments, and non-invasive clinical strategies in the healthcare industry. To meet these demands, healthcare professionals are seeking new technological paradigms that can improve diagnostic accuracy while ensuring patient compliance. Neuromorphic engineering, which uses neural models in hardware and software to replicate brain-like behaviors, can help usher in a new era of medicine by delivering low power, low latency, small footprint, and high bandwidth solutions. This paper provides an overview of recent neuromorphic advancements in medicine, including medical imaging and cancer diagnosis, processing of biosignals for diagnosis, and biomedical interfaces, such as motor, cognitive, and perception prostheses. For each section, we provide examples of how brain-inspired models can successfully compete with conventional artificial intelligence algorithms, demonstrating the potential of neuromorphic engineering to meet demands and improve patient outcomes. Lastly, we discuss current struggles in fitting neuromorphic hardware with non-neuromorphic technologies and propose potential solutions for future bottlenecks in hardware compatibility.
Collapse
Affiliation(s)
- Khaled Aboumerhi
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, MD, United States of America
| | - Amparo Güemes
- Electrical Engineering Division, Department of Engineering, University of Cambridge, 9 JJ Thomson Ave, Cambridge CB3 0FA, United Kingdom
| | - Hongtao Liu
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, MD, United States of America
| | - Francesco Tenore
- Research and Exploratory Development Department, The Johns Hopkins University Applied Physics Laboratory, Laurel, MD, United States of America
| | - Ralph Etienne-Cummings
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, MD, United States of America
| |
Collapse
|
6
|
Gutierrez-Galan D, Rios-Navarro A, Dominguez-Morales JP, Duran-Lopez L, Jimenez-Moreno G, Jimenez-Fernandez A. Interfacing PDM MEMS Microphones with PFM Spiking Systems: Application for Neuromorphic Auditory Sensors. Neural Process Lett 2022. [DOI: 10.1007/s11063-022-10936-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
Abstract
AbstractNeuromorphic computation processes sensors output in the spiking domain, which presents constraints in many cases when converting information to spikes, loosing, as example, temporal accuracy. This paper presents a spike-based system to adapt audio information from low-power pulse-density modulation (PDM) microelectromechanical systems microphones into rate coded spike frequencies. These spikes could be directly used by the neuromorphic auditory sensor (NAS) for frequency decomposition in different bands, avoiding the analog or digital conversion to spike streams. This improves the time response of the NAS, allowing its use in more time restrictive applications. This adaptation was conducted in VHDL as an interface for PDM microphones, converting their pulses into temporal distributed spikes following a pulse-frequency modulation scheme with an accurate inter-spike-interval, known as PDM to spikes interface (PSI). We introduce a new architecture of spike-based band-pass filter to reject DC components and distribute spikes in time. This was tested in two scenarios, first as a stand-alone circuit for its characterization, and then integrated with a NAS for verification. The PSI achieves a total harmonic distortion of $$-$$
-
46.18 dB and a signal-to-noise ratio of 63.47 dB, demands less than 1% of the resources of a Spartan-6 FPGA and its power consumption is around 7 mW.
Collapse
|
7
|
Moro F, Hardy E, Fain B, Dalgaty T, Clémençon P, De Prà A, Esmanhotto E, Castellani N, Blard F, Gardien F, Mesquida T, Rummens F, Esseni D, Casas J, Indiveri G, Payvand M, Vianello E. Neuromorphic object localization using resistive memories and ultrasonic transducers. Nat Commun 2022; 13:3506. [PMID: 35717413 PMCID: PMC9206646 DOI: 10.1038/s41467-022-31157-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Accepted: 06/03/2022] [Indexed: 11/25/2022] Open
Abstract
Real-world sensory-processing applications require compact, low-latency, and low-power computing systems. Enabled by their in-memory event-driven computing abilities, hybrid memristive-Complementary Metal-Oxide Semiconductor neuromorphic architectures provide an ideal hardware substrate for such tasks. To demonstrate the full potential of such systems, we propose and experimentally demonstrate an end-to-end sensory processing solution for a real-world object localization application. Drawing inspiration from the barn owl's neuroanatomy, we developed a bio-inspired, event-driven object localization system that couples state-of-the-art piezoelectric micromachined ultrasound transducer sensors to a neuromorphic resistive memories-based computational map. We present measurement results from the fabricated system comprising resistive memories-based coincidence detectors, delay line circuits, and a full-custom ultrasound sensor. We use these experimental results to calibrate our system-level simulations. These simulations are then used to estimate the angular resolution and energy efficiency of the object localization model. The results reveal the potential of our approach, evaluated in orders of magnitude greater energy efficiency than a microcontroller performing the same task.
Collapse
Affiliation(s)
- Filippo Moro
- CEA, LETI, Université Grenoble Alpes, 38054, Grenoble, France.
| | - Emmanuel Hardy
- CEA, LETI, Université Grenoble Alpes, 38054, Grenoble, France
| | - Bruno Fain
- CEA, LETI, Université Grenoble Alpes, 38054, Grenoble, France
| | - Thomas Dalgaty
- CEA, LETI, Université Grenoble Alpes, 38054, Grenoble, France
- CEA, LIST, Université Grenoble Alpes, 38054, Grenoble, France
| | - Paul Clémençon
- CEA, LETI, Université Grenoble Alpes, 38054, Grenoble, France
- Insect Biology Research Institute, Université de Tours, 37020, Tours, France
| | - Alessio De Prà
- CEA, LETI, Université Grenoble Alpes, 38054, Grenoble, France
- DPIA, Università degli Studi di Udine, 33100, Udine, Italy
| | | | | | - François Blard
- CEA, LETI, Université Grenoble Alpes, 38054, Grenoble, France
| | | | - Thomas Mesquida
- CEA, LIST, Université Grenoble Alpes, 38054, Grenoble, France
| | | | - David Esseni
- DPIA, Università degli Studi di Udine, 33100, Udine, Italy
| | - Jérôme Casas
- Insect Biology Research Institute, Université de Tours, 37020, Tours, France
| | - Giacomo Indiveri
- Institute for Neuroinformatics, University of Zürich and ETH Zürich, 8057, Zürich, Switzerland
| | - Melika Payvand
- Institute for Neuroinformatics, University of Zürich and ETH Zürich, 8057, Zürich, Switzerland
| | - Elisa Vianello
- CEA, LETI, Université Grenoble Alpes, 38054, Grenoble, France.
| |
Collapse
|
8
|
Gutierrez-Galan D, Schoepe T, Dominguez-Morales JP, Jimenez-Fernandez A, Chicca E, Linares-Barranco A. An Event-Based Digital Time Difference Encoder Model Implementation for Neuromorphic Systems. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2022; 33:1959-1973. [PMID: 34495850 DOI: 10.1109/tnnls.2021.3108047] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Neuromorphic systems are a viable alternative to conventional systems for real-time tasks with constrained resources. Their low power consumption, compact hardware realization, and low-latency response characteristics are the key ingredients of such systems. Furthermore, the event-based signal processing approach can be exploited for reducing the computational load and avoiding data loss due to its inherently sparse representation of sensed data and adaptive sampling time. In event-based systems, the information is commonly coded by the number of spikes within a specific temporal window. However, the temporal information of event-based signals can be difficult to extract when using rate coding. In this work, we present a novel digital implementation of the model, called time difference encoder (TDE), for temporal encoding on event-based signals, which translates the time difference between two consecutive input events into a burst of output events. The number of output events along with the time between them encodes the temporal information. The proposed model has been implemented as a digital circuit with a configurable time constant, allowing it to be used in a wide range of sensing tasks that require the encoding of the time difference between events, such as optical flow-based obstacle avoidance, sound source localization, and gas source localization. This proposed bioinspired model offers an alternative to the Jeffress model for the interaural time difference estimation, which is validated in this work with a sound source lateralization proof-of-concept system. The model was simulated and implemented on a field-programmable gate array (FPGA), requiring 122 slice registers of hardware resources and less than 1 mW of power consumption.
Collapse
|
9
|
Vogginger B, Kreutz F, López-Randulfe J, Liu C, Dietrich R, Gonzalez HA, Scholz D, Reeb N, Auge D, Hille J, Arsalan M, Mirus F, Grassmann C, Knoll A, Mayr C. Automotive Radar Processing With Spiking Neural Networks: Concepts and Challenges. Front Neurosci 2022; 16:851774. [PMID: 35431782 PMCID: PMC9012531 DOI: 10.3389/fnins.2022.851774] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Accepted: 03/07/2022] [Indexed: 11/13/2022] Open
Abstract
Frequency-modulated continuous wave radar sensors play an essential role for assisted and autonomous driving as they are robust under all weather and light conditions. However, the rising number of transmitters and receivers for obtaining a higher angular resolution increases the cost for digital signal processing. One promising approach for energy-efficient signal processing is the usage of brain-inspired spiking neural networks (SNNs) implemented on neuromorphic hardware. In this article we perform a step-by-step analysis of automotive radar processing and argue how spiking neural networks could replace or complement the conventional processing. We provide SNN examples for two processing steps and evaluate their accuracy and computational efficiency. For radar target detection, an SNN with temporal coding is competitive to the conventional approach at a low compute overhead. Instead, our SNN for target classification achieves an accuracy close to a reference artificial neural network while requiring 200 times less operations. Finally, we discuss the specific requirements and challenges for SNN-based radar processing on neuromorphic hardware. This study proves the general applicability of SNNs for automotive radar processing and sustains the prospect of energy-efficient realizations in automated vehicles.
Collapse
Affiliation(s)
- Bernhard Vogginger
- Chair of Highly-Parallel VLSI-Systems and Neuro-Microelectronics, Faculty of Electrical and Computer Engineering, Institute of Principles of Electrical and Electronic Engineering, Technische Universität Dresden, Dresden, Germany
- *Correspondence: Bernhard Vogginger
| | - Felix Kreutz
- Chair of Highly-Parallel VLSI-Systems and Neuro-Microelectronics, Faculty of Electrical and Computer Engineering, Institute of Principles of Electrical and Electronic Engineering, Technische Universität Dresden, Dresden, Germany
- Infineon Technologies Dresden GmbH & Co., KG, Dresden, Germany
| | | | - Chen Liu
- Chair of Highly-Parallel VLSI-Systems and Neuro-Microelectronics, Faculty of Electrical and Computer Engineering, Institute of Principles of Electrical and Electronic Engineering, Technische Universität Dresden, Dresden, Germany
| | - Robin Dietrich
- Department of Informatics, Technical University of Munich, Munich, Germany
| | - Hector A. Gonzalez
- Chair of Highly-Parallel VLSI-Systems and Neuro-Microelectronics, Faculty of Electrical and Computer Engineering, Institute of Principles of Electrical and Electronic Engineering, Technische Universität Dresden, Dresden, Germany
| | - Daniel Scholz
- Chair of Highly-Parallel VLSI-Systems and Neuro-Microelectronics, Faculty of Electrical and Computer Engineering, Institute of Principles of Electrical and Electronic Engineering, Technische Universität Dresden, Dresden, Germany
- Infineon Technologies Dresden GmbH & Co., KG, Dresden, Germany
| | - Nico Reeb
- Department of Informatics, Technical University of Munich, Munich, Germany
| | - Daniel Auge
- Department of Informatics, Technical University of Munich, Munich, Germany
- Infineon Technologies AG, Munich, Germany
| | - Julian Hille
- Department of Informatics, Technical University of Munich, Munich, Germany
- Infineon Technologies AG, Munich, Germany
| | | | - Florian Mirus
- BMW Group, Research, New Technologies, Garching, Germany
| | | | - Alois Knoll
- Department of Informatics, Technical University of Munich, Munich, Germany
| | - Christian Mayr
- Chair of Highly-Parallel VLSI-Systems and Neuro-Microelectronics, Faculty of Electrical and Computer Engineering, Institute of Principles of Electrical and Electronic Engineering, Technische Universität Dresden, Dresden, Germany
- Centre for Tactile Internet (CeTI) With Human-In-The-Loop, Cluster of Excellence, Technische Universität Dresden, Dresden, Germany
| |
Collapse
|
10
|
Abstract
The design of robots that interact autonomously with the environment and exhibit complex behaviours is an open challenge that can benefit from understanding what makes living beings fit to act in the world. Neuromorphic engineering studies neural computational principles to develop technologies that can provide a computing substrate for building compact and low-power processing systems. We discuss why endowing robots with neuromorphic technologies - from perception to motor control - represents a promising approach for the creation of robots which can seamlessly integrate in society. We present initial attempts in this direction, highlight open challenges, and propose actions required to overcome current limitations.
Collapse
Affiliation(s)
- Chiara Bartolozzi
- Event-Driven Perception for Robotics, Istituto Italiano di Tecnologia, via San Quirico 19D, 16163, Genova, Italy.
| | - Giacomo Indiveri
- Institute of Neuroinformatics, University of Zurich and ETH Zurich, Winterthurerstr. 190, 8057, Zurich, Switzerland
| | - Elisa Donati
- Institute of Neuroinformatics, University of Zurich and ETH Zurich, Winterthurerstr. 190, 8057, Zurich, Switzerland
| |
Collapse
|
11
|
Deng B, Fan Y, Wang J, Yang S. Reconstruction of a Fully Paralleled Auditory Spiking Neural Network and FPGA Implementation. IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS 2021; 15:1320-1331. [PMID: 34699367 DOI: 10.1109/tbcas.2021.3122549] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
This paper presents a field-programmable gate array (FPGA) implementation of an auditory system, which is biologically inspired and has the advantages of robustness and anti-noise ability. We propose an FPGA implementation of an eleven-channel hierarchical spiking neuron network (SNN) model, which has a sparsely connected architecture with low power consumption. According to the mechanism of the auditory pathway in human brain, spiking trains generated by the cochlea are analyzed in the hierarchical SNN, and the specific word can be identified by a Bayesian classifier. Modified leaky integrate-and-fire (LIF) model is used to realize the hierarchical SNN, which achieves both high efficiency and low hardware consumption. The hierarchical SNN implemented on FPGA enables the auditory system to be operated at high speed and can be interfaced and applied with external machines and sensors. A set of speech from different speakers mixed with noise are used as input to test the performance our system, and the experimental results show that the system can classify words in a biologically plausible way with the presence of noise. The method of our system is flexible and the system can be modified into desirable scale. These confirm that the proposed biologically plausible auditory system provides a better method for on-chip speech recognition. Compare to the state-of-the-art, our auditory system achieves a higher speed with a maximum frequency of 65.03 MHz and a lower energy consumption of 276.83 μJ for a single operation. It can be applied in the field of brain-computer interface and intelligent robots.
Collapse
|
12
|
Real-time detection of bursts in neuronal cultures using a neuromorphic auditory sensor and spiking neural networks. Neurocomputing 2021. [DOI: 10.1016/j.neucom.2021.03.109] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
13
|
Dominguez-Morales JP, Gutierrez-Galan D, Rios-Navarro A, Duran-Lopez L, Dominguez-Morales M, Jimenez-Fernandez A. pyNAVIS: An open-source cross-platform software for spike-based neuromorphic audio information processing. Neurocomputing 2021. [DOI: 10.1016/j.neucom.2021.03.121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
14
|
Bhardwaj A, Khyam MO, Müller R. Biomimetic detection of dynamic signatures in foliage echoes. BIOINSPIRATION & BIOMIMETICS 2021; 16:046026. [PMID: 33862609 DOI: 10.1088/1748-3190/abf910] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/11/2020] [Accepted: 04/16/2021] [Indexed: 06/12/2023]
Abstract
Certain bat species (familyRhinolophidae) dynamically deform their emission baffles (noseleaves) and reception baffles (pinnae) during echolocation. Prior research using numerical models, laboratory characterizations, and experiments with simple targets have suggested that this dynamics may manifest itself in time-variant echo signatures. Since the pronounced random nature of echoes from natural targets such as foliage has not been reflected in these experiments, we have collected a large number (>55 000) of foliage echoes outdoors with a sonar head that mimics the dynamic periphery in bats. The echo data was processed with a custom auditory processing model to create spike-based echo representations. Deep-learning classifiers were able to estimate the dynamic state of the periphery, i.e., static or dynamic, based on single echoes with accuracies of up to 80%. This suggests that the effects of the peripheral dynamics are present in the bat brains and could hence be used by the animals. The best classification performances were obtained for data obtained within a spatially confined area. Hence, if the bat brains suffer from the same generalization issues, they would have to have a way to adapt their neural echo processing to such local fluctuations to exploit the dynamic effects successfully.
Collapse
Affiliation(s)
- Ananya Bhardwaj
- Department of Mechanical Engineering, Virginia Tech, Blacksburg 24060, United States of America
| | - M Omar Khyam
- Department of Mechanical Engineering, Virginia Tech, Blacksburg 24060, United States of America
| | - Rolf Müller
- Department of Mechanical Engineering, Virginia Tech, Blacksburg 24060, United States of America
| |
Collapse
|
15
|
López-Randulfe J, Duswald T, Bing Z, Knoll A. Spiking Neural Network for Fourier Transform and Object Detection for Automotive Radar. Front Neurorobot 2021; 15:688344. [PMID: 34163347 PMCID: PMC8216499 DOI: 10.3389/fnbot.2021.688344] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Accepted: 05/06/2021] [Indexed: 11/13/2022] Open
Abstract
The development of advanced autonomous driving applications is hindered by the complex temporal structure of sensory data, as well as by the limited computational and energy resources of their on-board systems. Currently, neuromorphic engineering is a rapidly growing field that aims to design information processing systems similar to the human brain by leveraging novel algorithms based on spiking neural networks (SNNs). These systems are well-suited to recognize temporal patterns in data while maintaining a low energy consumption and offering highly parallel architectures for fast computation. However, the lack of effective algorithms for SNNs impedes their wide usage in mobile robot applications. This paper addresses the problem of radar signal processing by introducing a novel SNN that substitutes the discrete Fourier transform and constant false-alarm rate algorithm for raw radar data, where the weights and architecture of the SNN are derived from the original algorithms. We demonstrate that our proposed SNN can achieve competitive results compared to that of the original algorithms in simulated driving scenarios while retaining its spike-based nature.
Collapse
Affiliation(s)
| | - Tobias Duswald
- Department of Informatics, Technical University of Munich, Munich, Germany
| | - Zhenshan Bing
- Department of Informatics, Technical University of Munich, Munich, Germany
| | - Alois Knoll
- Department of Informatics, Technical University of Munich, Munich, Germany
| |
Collapse
|
16
|
Tayarani-Najaran MH, Schmuker M. Event-Based Sensing and Signal Processing in the Visual, Auditory, and Olfactory Domain: A Review. Front Neural Circuits 2021; 15:610446. [PMID: 34135736 PMCID: PMC8203204 DOI: 10.3389/fncir.2021.610446] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2020] [Accepted: 04/27/2021] [Indexed: 11/13/2022] Open
Abstract
The nervous systems converts the physical quantities sensed by its primary receptors into trains of events that are then processed in the brain. The unmatched efficiency in information processing has long inspired engineers to seek brain-like approaches to sensing and signal processing. The key principle pursued in neuromorphic sensing is to shed the traditional approach of periodic sampling in favor of an event-driven scheme that mimicks sampling as it occurs in the nervous system, where events are preferably emitted upon the change of the sensed stimulus. In this paper we highlight the advantages and challenges of event-based sensing and signal processing in the visual, auditory and olfactory domains. We also provide a survey of the literature covering neuromorphic sensing and signal processing in all three modalities. Our aim is to facilitate research in event-based sensing and signal processing by providing a comprehensive overview of the research performed previously as well as highlighting conceptual advantages, current progress and future challenges in the field.
Collapse
Affiliation(s)
| | - Michael Schmuker
- School of Physics, Engineering and Computer Science, University of Hertfordshire, Hatfield, United Kingdom
| |
Collapse
|
17
|
Gutierrez-Galan D, Dominguez-Morales J, Jimenez-Fernandez A, Linares-Barranco A, Jimenez-Moreno G. OpenNAS: Open Source Neuromorphic Auditory Sensor HDL code generator for FPGA implementations. Neurocomputing 2021. [DOI: 10.1016/j.neucom.2020.12.062] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
18
|
Lele A, Fang Y, Ting J, Raychowdhury A. An End-to-end Spiking Neural Network Platform for Edge Robotics: From Event-Cameras to Central Pattern Generation. IEEE Trans Cogn Dev Syst 2021. [DOI: 10.1109/tcds.2021.3097675] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
19
|
Linares-Barranco A, Perez-Peña F, Jimenez-Fernandez A, Chicca E. ED-BioRob: A Neuromorphic Robotic Arm With FPGA-Based Infrastructure for Bio-Inspired Spiking Motor Controllers. Front Neurorobot 2020; 14:590163. [PMID: 33328951 PMCID: PMC7735321 DOI: 10.3389/fnbot.2020.590163] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2020] [Accepted: 10/27/2020] [Indexed: 11/13/2022] Open
Abstract
Compared to classic robotics, biological nervous systems respond to stimuli in a fast and efficient way regarding the body motor actions. Decision making, once the sensory information arrives to the brain, is in the order of ms, while the whole process from sensing to movement requires tens of ms. Classic robotic systems usually require complex computational abilities. Key differences between biological systems and robotic machines lie in the way information is coded and transmitted. A neuron is the "basic" element that constitutes biological nervous systems. Neurons communicate in an event-driven way through small currents or ionic pulses (spikes). When neurons are arranged in networks, they allow not only for the processing of sensory information, but also for the actuation over the muscles in the same spiking manner. This paper presents the application of a classic motor control model (proportional-integral-derivative) developed with the biological spike processing principle, including the motor actuation with time enlarged spikes instead of the classic pulse-width-modulation. This closed-loop control model, called spike-based PID controller (sPID), was improved and adapted for a dual FPGA-based system to control the four joints of a bioinspired light robot (BioRob X5), called event-driven BioRob (ED-BioRob). The use of spiking signals allowed the system to achieve a current consumption bellow 1A for the entire 4 DoF working at the same time. Furthermore, the robot joints commands can be received from a population of silicon-neurons running on the Dynap-SE platform. Thus, our proposal aims to bridge the gap between a general purpose processing analog neuromorphic hardware and the spiking actuation of a robotic platform.
Collapse
Affiliation(s)
- Alejandro Linares-Barranco
- Robotics and Technology of Computers Lab (ETSII-EPS), Universidad de Sevilla, Sevilla, Spain.,Smart Computer Systems Researh and Engineering Lab (SCORE), Research Institute of Computer Engineering (I3US), Universidad de Sevilla, Sevilla, Spain
| | | | - Angel Jimenez-Fernandez
- Robotics and Technology of Computers Lab (ETSII-EPS), Universidad de Sevilla, Sevilla, Spain.,Smart Computer Systems Researh and Engineering Lab (SCORE), Research Institute of Computer Engineering (I3US), Universidad de Sevilla, Sevilla, Spain
| | - Elisabetta Chicca
- Faculty of Technology and Cognitive Interaction Technology Center of Excellence (CITEC) - Bielefeld University, Bielefeld, Germany.,Bio-Inspired Circuits and Systems Lab (BICS), Zernike Institute for Advanced Materials, University of Groningen, Groningen, Netherlands.,Groningen Cognitive Systems and Materials Center (CogniGron), University of Groningen, Groningen, Netherlands
| |
Collapse
|
20
|
Tapiador-Morales R, Maro JM, Jimenez-Fernandez A, Jimenez-Moreno G, Benosman R, Linares-Barranco A. Event-Based Gesture Recognition through a Hierarchy of Time-Surfaces for FPGA. SENSORS (BASEL, SWITZERLAND) 2020; 20:E3404. [PMID: 32560238 PMCID: PMC7349403 DOI: 10.3390/s20123404] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/21/2020] [Revised: 06/06/2020] [Accepted: 06/12/2020] [Indexed: 11/16/2022]
Abstract
Neuromorphic vision sensors detect changes in luminosity taking inspiration from mammalian retina and providing a stream of events with high temporal resolution, also known as Dynamic Vision Sensors (DVS). This continuous stream of events can be used to extract spatio-temporal patterns from a scene. A time-surface represents a spatio-temporal context for a given spatial radius around an incoming event from a sensor at a specific time history. Time-surfaces can be organized in a hierarchical way to extract features from input events using the Hierarchy Of Time-Surfaces algorithm, hereinafter HOTS. HOTS can be organized in consecutive layers to extract combination of features in a similar way as some deep-learning algorithms do. This work introduces a novel FPGA architecture for accelerating HOTS network. This architecture is mainly based on block-RAM memory and the non-restoring square root algorithm, requiring basic components and enabling it for low-power low-latency embedded applications. The presented architecture has been tested on a Zynq 7100 platform at 100 MHz. The results show that the latencies are in the range of 1 μ s to 6.7 μ s, requiring a maximum dynamic power consumption of 77 mW. This system was tested with a gesture recognition dataset, obtaining an accuracy loss for 16-bit precision of only 1.2% with respect to the original software HOTS.
Collapse
Affiliation(s)
- Ricardo Tapiador-Morales
- Robotics and Technology of Computers Lab (ETSII-EPS), University of Seville, 41089 Sevilla, Spain; (A.J.-F.); (G.J.-M.); (A.L.-B.)
- aiCTX AG, 8092 Zurich, Switzerland
| | - Jean-Matthieu Maro
- Neuromorphic Vision and Natural Computation, Sorbonne Université, 75006 Paris, France; (J.-M.M.); (R.B.)
| | - Angel Jimenez-Fernandez
- Robotics and Technology of Computers Lab (ETSII-EPS), University of Seville, 41089 Sevilla, Spain; (A.J.-F.); (G.J.-M.); (A.L.-B.)
- SCORE Lab, Research Institute of Computer Engineering (I3US), University of Seville, 41089 Seville, Spain
| | - Gabriel Jimenez-Moreno
- Robotics and Technology of Computers Lab (ETSII-EPS), University of Seville, 41089 Sevilla, Spain; (A.J.-F.); (G.J.-M.); (A.L.-B.)
- SCORE Lab, Research Institute of Computer Engineering (I3US), University of Seville, 41089 Seville, Spain
| | - Ryad Benosman
- Neuromorphic Vision and Natural Computation, Sorbonne Université, 75006 Paris, France; (J.-M.M.); (R.B.)
| | - Alejandro Linares-Barranco
- Robotics and Technology of Computers Lab (ETSII-EPS), University of Seville, 41089 Sevilla, Spain; (A.J.-F.); (G.J.-M.); (A.L.-B.)
- SCORE Lab, Research Institute of Computer Engineering (I3US), University of Seville, 41089 Seville, Spain
| |
Collapse
|
21
|
Li C, Peng T, Zhu Y. A Novel Approach for Acoustic Signal Processing of a Drum Shearer Based on Improved Variational Mode Decomposition and Cluster Analysis. SENSORS 2020; 20:s20102949. [PMID: 32456034 PMCID: PMC7288331 DOI: 10.3390/s20102949] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/08/2020] [Revised: 05/02/2020] [Accepted: 05/11/2020] [Indexed: 11/30/2022]
Abstract
During operation, the acoustic signal of the drum shearer contains a wealth of information. The monitoring or diagnosis system based on acoustic signal has obvious advantages. However, the signal is challenging to extract and recognize. Therefore, this paper proposes an approach for acoustic signal processing of a shearer based on the parameter optimized variational mode decomposition (VMD) method and a clustering algorithm. First, the particle swarm optimization (PSO) algorithm searched for the best parameter combination of the VMD. According to the results, the approach determined the number of modes and penalty parameters for VMD. Then the improved VMD algorithm decomposed the acoustic signal. It selected the ideal component through the minimum envelope entropy. The PSO was designed to optimize the clustering analysis, and the minimum envelope entropy of the acoustic signal was regarded as the feature for classification. We then use a shearer simulation platform to collect the acoustic signal and use the approach proposed in this paper to process and classify the signal. The experimental results show that the approach proposed can effectively extract the features of the acoustic signal of the shearer. The recognition accuracy of the acoustic signal was high, which has practical application value.
Collapse
|
22
|
Gutierrez-Galan D, Dominguez-Morales JP, Perez-Peña F, Jimenez-Fernandez A, Linares-Barranco A. Neuropod: A real-time neuromorphic spiking CPG applied to robotics. Neurocomputing 2020. [DOI: 10.1016/j.neucom.2019.11.007] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
23
|
Stereo Matching in Address-Event-Representation (AER) Bio-Inspired Binocular Systems in a Field-Programmable Gate Array (FPGA). ELECTRONICS 2019. [DOI: 10.3390/electronics8040410] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
In stereo-vision processing, the image-matching step is essential for results, although it involves a very high computational cost. Moreover, the more information is processed, the more time is spent by the matching algorithm, and the more inefficient it is. Spike-based processing is a relatively new approach that implements processing methods by manipulating spikes one by one at the time they are transmitted, like a human brain. The mammal nervous system can solve much more complex problems, such as visual recognition by manipulating neuron spikes. The spike-based philosophy for visual information processing based on the neuro-inspired address-event-representation (AER) is currently achieving very high performance. The aim of this work was to study the viability of a matching mechanism in stereo-vision systems, using AER codification and its implementation in a field-programmable gate array (FPGA). Some studies have been done before in an AER system with monitored data using a computer; however, this kind of mechanism has not been implemented directly on hardware. To this end, an epipolar geometry basis applied to AER systems was studied and implemented, with other restrictions, in order to achieve good results in a real-time scenario. The results and conclusions are shown, and the viability of its implementation is proven.
Collapse
|
24
|
Tapiador-Morales R, Linares-Barranco A, Jimenez-Fernandez A, Jimenez-Moreno G. Neuromorphic LIF Row-by-Row Multiconvolution Processor for FPGA. IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS 2019; 13:159-169. [PMID: 30418884 DOI: 10.1109/tbcas.2018.2880012] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Deep Learning algorithms have become state-of-the-art methods for multiple fields, including computer vision, speech recognition, natural language processing, and audio recognition, among others. In image vision, convolutional neural networks (CNN) stand out. This kind of network is expensive in terms of computational resources due to the large number of operations required to process a frame. In recent years, several frame-based chip solutions to deploy CNN for real time have been developed. Despite the good results in power and accuracy given by these solutions, the number of operations is still high, due the complexity of the current network models. However, it is possible to reduce the number of operations using different computer vision techniques other than frame-based, e.g., neuromorphic event-based techniques. There exist several neuromorphic vision sensors whose pixels detect changes in luminosity. Inspired in the leaky integrate-and-fire (LIF) neuron, we propose in this manuscript an event-based field-programmable gate array (FPGA) multiconvolution system. Its main novelty is the combination of a memory arbiter for efficient memory access to allow row-by-row kernel processing. This system is able to convolve 64 filters across multiple kernel sizes, from 1 × 1 to 7 × 7, with latencies of 1.3 μs and 9.01 μs, respectively, generating a continuous flow of output events. The proposed architecture will easily fit spike-based CNNs.
Collapse
|
25
|
Venieris SI, Bouganis CS. fpgaConvNet: Mapping Regular and Irregular Convolutional Neural Networks on FPGAs. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2019; 30:326-342. [PMID: 29994725 DOI: 10.1109/tnnls.2018.2844093] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Since neural networks renaissance, convolutional neural networks (ConvNets) have demonstrated a state-of-the-art performance in several emerging artificial intelligence tasks. The deployment of ConvNets in real-life applications requires power-efficient designs that meet the application-level performance needs. In this context, field-programmable gate arrays (FPGAs) can provide a potential platform that can be tailored to application-specific requirements. However, with the complexity of ConvNet models increasing rapidly, the ConvNet-to-FPGA design space becomes prohibitively large. This paper presents fpgaConvNet, an end-to-end framework for the optimized mapping of ConvNets on FPGAs. The proposed framework comprises an automated design methodology based on the synchronous dataflow (SDF) paradigm and defines a set of SDF transformations in order to efficiently navigate the architectural design space. By proposing a systematic multiobjective optimization formulation, the presented framework is able to generate hardware designs that are cooptimized for the ConvNet workload, the target device, and the application's performance metric of interest. Quantitative evaluation shows that the proposed methodology yields hardware designs that improve the performance by up to 6.65× over highly optimized graphics processing unit designs for the same power constraints and achieve up to 2.94× higher performance density compared with the state-of-the-art FPGA-based ConvNet architectures.
Collapse
|
26
|
Abstract
The medical field is creating large amount of data that physicians are unable to decipher and use efficiently. Moreover, rule-based expert systems are inefficient in solving complicated medical tasks or for creating insights using big data. Deep learning has emerged as a more accurate and effective technology in a wide range of medical problems such as diagnosis, prediction, and intervention. Deep learning is a representation learning method that consists of layers that transform data nonlinearly, thus, revealing hierarchical relationships and structures. In this review, we survey deep learning application papers that use structured data, and signal and imaging modalities from cardiology. We discuss the advantages and limitations of applying deep learning in cardiology that also apply in medicine in general, while proposing certain directions as the most viable for clinical use.
Collapse
|
27
|
Harczos T, Klefenz FM. Modeling Pitch Perception With an Active Auditory Model Extended by Octopus Cells. Front Neurosci 2018; 12:660. [PMID: 30319340 PMCID: PMC6167605 DOI: 10.3389/fnins.2018.00660] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2018] [Accepted: 09/04/2018] [Indexed: 11/13/2022] Open
Abstract
Pitch is an essential category for musical sensations. Models of pitch perception are vividly discussed up to date. Most of them rely on definitions of mathematical methods in the spectral or temporal domain. Our proposed pitch perception model is composed of an active auditory model extended by octopus cells. The active auditory model is the same as used in the Stimulation based on Auditory Modeling (SAM), a successful cochlear implant sound processing strategy extended here by modeling the functional behavior of the octopus cells in the ventral cochlear nucleus and by modeling their connections to the auditory nerve fibers (ANFs). The neurophysiological parameterization of the extended model is fully described in the time domain. The model is based on latency-phase en- and decoding as octopus cells are latency-phase rectifiers in their local receptive fields. Pitch is ubiquitously represented by cascaded firing sweeps of octopus cells. Based on the firing patterns of octopus cells, inter-spike interval histograms can be aggregated, in which the place of the global maximum is assumed to encode the pitch.
Collapse
Affiliation(s)
- Tamas Harczos
- Fraunhofer Institute for Digital Media Technology, Ilmenau, Germany
- Auditory Neuroscience and Optogenetics Laboratory, German Primate Center, Goettingen, Germany
- Institut für Mikroelektronik- und Mechatronik-Systeme gGmbH, Ilmenau, Germany
| | | |
Collapse
|
28
|
Sherfey JS, Ardid S, Hass J, Hasselmo ME, Kopell NJ. Flexible resonance in prefrontal networks with strong feedback inhibition. PLoS Comput Biol 2018; 14:e1006357. [PMID: 30091975 PMCID: PMC6103521 DOI: 10.1371/journal.pcbi.1006357] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2017] [Revised: 08/21/2018] [Accepted: 07/11/2018] [Indexed: 11/24/2022] Open
Abstract
Oscillations are ubiquitous features of brain dynamics that undergo task-related changes in synchrony, power, and frequency. The impact of those changes on target networks is poorly understood. In this work, we used a biophysically detailed model of prefrontal cortex (PFC) to explore the effects of varying the spike rate, synchrony, and waveform of strong oscillatory inputs on the behavior of cortical networks driven by them. Interacting populations of excitatory and inhibitory neurons with strong feedback inhibition are inhibition-based network oscillators that exhibit resonance (i.e., larger responses to preferred input frequencies). We quantified network responses in terms of mean firing rates and the population frequency of network oscillation; and characterized their behavior in terms of the natural response to asynchronous input and the resonant response to oscillatory inputs. We show that strong feedback inhibition causes the PFC to generate internal (natural) oscillations in the beta/gamma frequency range (>15 Hz) and to maximize principal cell spiking in response to external oscillations at slightly higher frequencies. Importantly, we found that the fastest oscillation frequency that can be relayed by the network maximizes local inhibition and is equal to a frequency even higher than that which maximizes the firing rate of excitatory cells; we call this phenomenon population frequency resonance. This form of resonance is shown to determine the optimal driving frequency for suppressing responses to asynchronous activity. Lastly, we demonstrate that the natural and resonant frequencies can be tuned by changes in neuronal excitability, the duration of feedback inhibition, and dynamic properties of the input. Our results predict that PFC networks are tuned for generating and selectively responding to beta- and gamma-rhythmic signals due to the natural and resonant properties of inhibition-based oscillators. They also suggest strategies for optimizing transcranial stimulation and using oscillatory networks in neuromorphic engineering.
Collapse
Affiliation(s)
- Jason S. Sherfey
- Department of Mathematics and Statistics, Boston University, Boston, Massachusetts, United States of America
- Department of Psychological and Brain Sciences, Center for Systems Neuroscience, Boston University, Massachusetts, United States of America
| | - Salva Ardid
- Department of Mathematics and Statistics, Boston University, Boston, Massachusetts, United States of America
| | - Joachim Hass
- Department of Theoretical Neuroscience, Bernstein Center for Computational Neuroscience, Central Institute of Mental Health, Heidelberg University, Mannheim, Germany
- Faculty of Applied Psychology, SRH University for Applied Sciences Heidelberg, Heidelberg, Germany
| | - Michael E. Hasselmo
- Department of Psychological and Brain Sciences, Center for Systems Neuroscience, Boston University, Massachusetts, United States of America
| | - Nancy J. Kopell
- Department of Mathematics and Statistics, Boston University, Boston, Massachusetts, United States of America
| |
Collapse
|
29
|
Xu Y, Thakur CS, Singh RK, Hamilton TJ, Wang RM, van Schaik A. A FPGA Implementation of the CAR-FAC Cochlear Model. Front Neurosci 2018; 12:198. [PMID: 29692700 PMCID: PMC5902704 DOI: 10.3389/fnins.2018.00198] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2017] [Accepted: 03/12/2018] [Indexed: 11/19/2022] Open
Abstract
This paper presents a digital implementation of the Cascade of Asymmetric Resonators with Fast-Acting Compression (CAR-FAC) cochlear model. The CAR part simulates the basilar membrane's (BM) response to sound. The FAC part models the outer hair cell (OHC), the inner hair cell (IHC), and the medial olivocochlear efferent system functions. The FAC feeds back to the CAR by moving the poles and zeros of the CAR resonators automatically. We have implemented a 70-section, 44.1 kHz sampling rate CAR-FAC system on an Altera Cyclone V Field Programmable Gate Array (FPGA) with 18% ALM utilization by using time-multiplexing and pipeline parallelizing techniques and present measurement results here. The fully digital reconfigurable CAR-FAC system is stable, scalable, easy to use, and provides an excellent input stage to more complex machine hearing tasks such as sound localization, sound segregation, speech recognition, and so on.
Collapse
Affiliation(s)
| | | | | | | | | | - André van Schaik
- MARCS Institute, Western Sydney University, Sydney, NSW, Australia
| |
Collapse
|
30
|
Dominguez-Morales JP, Jimenez-Fernandez AF, Dominguez-Morales MJ, Jimenez-Moreno G. Deep Neural Networks for the Recognition and Classification of Heart Murmurs Using Neuromorphic Auditory Sensors. IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS 2018; 12:24-34. [PMID: 28952948 DOI: 10.1109/tbcas.2017.2751545] [Citation(s) in RCA: 44] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
Auscultation is one of the most used techniques for detecting cardiovascular diseases, which is one of the main causes of death in the world. Heart murmurs are the most common abnormal finding when a patient visits the physician for auscultation. These heart sounds can either be innocent, which are harmless, or abnormal, which may be a sign of a more serious heart condition. However, the accuracy rate of primary care physicians and expert cardiologists when auscultating is not good enough to avoid most of both type-I (healthy patients are sent for echocardiogram) and type-II (pathological patients are sent home without medication or treatment) errors made. In this paper, the authors present a novel convolutional neural network based tool for classifying between healthy people and pathological patients using a neuromorphic auditory sensor for FPGA that is able to decompose the audio into frequency bands in real time. For this purpose, different networks have been trained with the heart murmur information contained in heart sound recordings obtained from nine different heart sound databases sourced from multiple research groups. These samples are segmented and preprocessed using the neuromorphic auditory sensor to decompose their audio information into frequency bands and, after that, sonogram images with the same size are generated. These images have been used to train and test different convolutional neural network architectures. The best results have been obtained with a modified version of the AlexNet model, achieving 97% accuracy (specificity: 95.12%, sensitivity: 93.20%, PhysioNet/CinC Challenge 2016 score: 0.9416). This tool could aid cardiologists and primary care physicians in the auscultation process, improving the decision making task and reducing type-I and type-II errors.
Collapse
|