1
|
Kuc R. Brain-inspired sensorimotor echolocation system for confident landmark recognition. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2022; 152:1272. [PMID: 36182295 DOI: 10.1121/10.0013833] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/15/2022] [Accepted: 08/11/2022] [Indexed: 06/16/2023]
Abstract
A landmark is a familiar target in terms of the echoes that it can produce and is important for echolocation-based navigation by bats, robots, and blind humans. A brain-inspired system (BIS) achieves confident recognition, defined as classification to an arbitrarily small error probability (PE), by employing a voting process with an echo sequence. The BIS contains sensory neurons implemented with binary single-layer perceptrons trained to classify echo spectrograms with PE and generate excitatory and inhibitory votes in face neurons until a landmark-specific face neuron achieves recognition by reaching a confidence vote level (CVL). A discrete random step process models the vote count to show the recognition probability can achieve any desired accuracy by decreasing PE or increasing CVL. A hierarchical approach first classifies surface reflector and volume scatterer target categories and then uses that result to classify two subcategories that form four landmarks. The BIS models blind human echolocation to recognize four human-made and foliage landmarks by acquiring suitably sized and dense audible echo sequences. The sensorimotor BIS employs landmark-specific CVL values and a 2.7° view increment to acquire echo sequences that achieve zero-error recognition of each landmark independent of the initial view.
Collapse
Affiliation(s)
- Roman Kuc
- Department of Electrical Engineering and Wu Tsai Institute, Yale University, New Haven, Connecticut 06511, USA
| |
Collapse
|
2
|
Ming C, Haro S, Simmons AM, Simmons JA. A comprehensive computational model of animal biosonar signal processing. PLoS Comput Biol 2021; 17:e1008677. [PMID: 33596199 PMCID: PMC7888678 DOI: 10.1371/journal.pcbi.1008677] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2020] [Accepted: 01/07/2021] [Indexed: 11/23/2022] Open
Abstract
Computational models of animal biosonar seek to identify critical aspects of echo processing responsible for the superior, real-time performance of echolocating bats and dolphins in target tracking and clutter rejection. The Spectrogram Correlation and Transformation (SCAT) model replicates aspects of biosonar imaging in both species by processing wideband biosonar sounds and echoes with auditory mechanisms identified from experiments with bats. The model acquires broadband biosonar broadcasts and echoes, represents them as time-frequency spectrograms using parallel bandpass filters, translates the filtered signals into ten parallel amplitude threshold levels, and then operates on the resulting time-of-occurrence values at each frequency to estimate overall echo range delay. It uses the structure of the echo spectrum by depicting it as a series of local frequency nulls arranged regularly along the frequency axis of the spectrograms after dechirping them relative to the broadcast. Computations take place entirely on the timing of threshold-crossing events for each echo relative to threshold-events for the broadcast. Threshold-crossing times take into account amplitude-latency trading, a physiological feature absent from conventional digital signal processing. Amplitude-latency trading transposes the profile of amplitudes across frequencies into a profile of time-registrations across frequencies. Target shape is extracted from the spacing of the object's individual acoustic reflecting points, or glints, using the mutual interference pattern of peaks and nulls in the echo spectrum. These are merged with the overall range-delay estimate to produce a delay-based reconstruction of the object's distance as well as its glints. Clutter echoes indiscriminately activate multiple parts in the null-detecting system, which then produces the equivalent glint-delay spacings in images, thus blurring the overall echo-delay estimates by adding spurious glint delays to the image. Blurring acts as an anticorrelation process that rejects clutter intrusion into perceptions.
Collapse
Affiliation(s)
- Chen Ming
- Department of Neuroscience and Carney Institute for Brain Science, Brown University Providence, United States of America
| | - Stephanie Haro
- Speech and Hearing Biosciences and Technology, Harvard University, Boston, United States of America
| | - Andrea Megela Simmons
- Department of Cognitive, Linguistic and Psychological Sciences and Carney Institute for Brain Science, Brown University Providence, United States of America
| | - James A. Simmons
- Department of Neuroscience and Carney Institute for Brain Science, Brown University Providence, United States of America
| |
Collapse
|
3
|
Kuc R. Artificial neural network classification of foliage targets from spectrograms of sequential echoes using a biomimetic audible sonar. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020; 148:3270. [PMID: 33261369 DOI: 10.1121/10.0002651] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/27/2020] [Accepted: 10/28/2020] [Indexed: 06/12/2023]
Abstract
Classifying foliage targets using echolocation is important for recognizing landmarks by bats using ultrasonic emissions and blind human echolocators (BEs) using palatal clicks. Previous attempts to classify foliage used ultrasonic frequencies and single sensor (monaural) detection. Motivated by the echolocation capabilities of BEs, a biomimetic sonar emitting audible clicks acquired 5600 binaural echoes from five sequential emissions that probed two foliage targets at aspect angles separated by 18°. Echo spectrograms formed feature vector inputs to artificial neural networks (ANNs) for classifying two targets, Ficus benjamina and Schefflera arboricola, with leaf areas that differ by a factor of four. Classification performances of ANNs without and with hidden layers were analyzed using tenfold cross-validation. Performance improved with input feature size, with binaural echo classification outperforming that using monaural echoes for the same number of emissions and for the same number of echoes. Linear classification accuracy was comparable to that using nonlinear classification with both achieving fewer than 1% errors with binaural spectrogram features from five sequential emissions. This result was better by a factor of 20 compared to previous classification of these targets using only the time envelopes of the same echoes.
Collapse
Affiliation(s)
- Roman Kuc
- Department of Electrical Engineering, Yale University, New Haven, Connecticut 06511, USA
| |
Collapse
|
4
|
Mazar O, Yovel Y. A sensorimotor model shows why a spectral jamming avoidance response does not help bats deal with jamming. eLife 2020; 9:55539. [PMID: 32718437 PMCID: PMC7406351 DOI: 10.7554/elife.55539] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2020] [Accepted: 07/21/2020] [Indexed: 12/05/2022] Open
Abstract
For decades, researchers have speculated how echolocating bats deal with masking by conspecific calls when flying in aggregations. To date, only a few attempts have been made to mathematically quantify the probability of jamming, or its effects. We developed a comprehensive sensorimotor predator-prey simulation, modeling numerous bats foraging in proximity. We used this model to examine the effectiveness of a spectral Jamming Avoidance Response (JAR) as a solution for the masking problem. We found that foraging performance deteriorates when bats forage near conspecifics, however, applying a JAR does not improve insect sensing or capture. Because bats constantly adjust their echolocation to the performed task (even when flying alone), further shifting the signals' frequencies does not mitigate jamming. Our simulations explain how bats can hunt successfully in a group despite competition and despite potential masking. This research demonstrates the advantages of a modeling approach when examining a complex biological system.
Collapse
Affiliation(s)
- Omer Mazar
- Sagol School of Neuroscience, Tel Aviv University, Tel Aviv, Israel
| | - Yossi Yovel
- Sagol School of Neuroscience, Tel Aviv University, Tel Aviv, Israel.,Department of Zoology, Tel Aviv University, Tel Aviv, Israel
| |
Collapse
|
5
|
Abstract
The wide frequency spectrum of FM bat biosonar sounds enables accurate perception of echo delay (target distance) by contributing numerous delay estimates across frequencies. However, bats require the lowest frequencies in the broadcast to be present in echoes for all higher frequencies to contribute, too. By incorporating this feature into an existing auditory model of FM biosonar, the model can reject echoes that lack the lowest frequencies in the most recent broadcast, thus suppressing echoes of an earlier broadcast that has slightly higher low-end frequencies. This biologically inspired method adopts the bat’s frequency-hopping technique to suppress pulse-echo ambiguity in wideband systems, a serious problem for man-made wideband radar and sonar systems. Big brown bats transmit wideband FM biosonar sounds that sweep from 55 to 25 kHz (first harmonic, FM1) and from 110 to 50 kHz (second harmonic, FM2). FM1 is required to perceive echo delay for target ranging; FM2 contributes only if corresponding FM1 frequencies are present. We show that echoes need only the lowest FM1 broadcast frequencies of 25 to 30 kHz for delay perception. If these frequencies are removed, no delay is perceived. Bats begin echo processing at the lowest frequencies and accumulate perceptual acuity over successively higher frequencies, but they cannot proceed without the low-frequency starting point in their broadcasts. This reveals a solution to pulse-echo ambiguity, a serious problem for radar or sonar. In dense, extended biosonar scenes, bats have to emit sounds rapidly to avoid collisions with near objects. But if a new broadcast is emitted when echoes of the previous broadcast still are arriving, echoes from both broadcasts intermingle, creating ambiguity about which echo corresponds to which broadcast. Frequency hopping by several kilohertz from one broadcast to the next can segregate overlapping narrowband echo streams, but wideband FM echoes ordinarily do not segregate because their spectra still overlap. By starting echo processing at the lowest frequencies in frequency-hopped broadcasts, echoes of the higher hopped broadcast are prevented from being accepted by lower hopped broadcasts, and ambiguity is avoided. The bat-inspired spectrogram correlation and transformation (SCAT) model also begins at the lowest frequencies; echoes that lack them are eliminated from processing of delay and no longer cause ambiguity.
Collapse
|
6
|
Kuc R. Artificial neural network classification of surface reflectors and volume scatterers using sequential echoes acquired with a biomimetic audible sonar. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020; 147:2357. [PMID: 32359283 DOI: 10.1121/10.0001083] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/18/2019] [Accepted: 03/25/2020] [Indexed: 06/11/2023]
Abstract
This paper investigates classifying two target groups, surface reflectors (SR) and volume scatterers (VS), using echo envelope features. SR targets have convex surface patches that exhibit echo persistence over aspect angle, while VS targets are composed of random range-distributed and oriented reflectors producing echoes that become uncorrelated with small changes in aspect angle. The SR target group contains single-post (P1) and multiple-post (PM) types and the VS group contains Ficus benjamina (F) and Schefflera arboricola (S) foliage types with leaf areas that differ by a factor of 4. A biomimetic sonar emitting audible clicks acquired sequences of up to three binaural echoes from target views separated by 18°. Two artificial neural networks performing linear and nonlinear classification first differentiated SR/VS target groups and then P1/PM and F/S types. Classification performance improved with echo number, from a single monaural echo to three pairs of binaural echoes, demonstrating the benefit of sequential echoes. Linear and nonlinear classification of SR/VS targets achieved a minimum generalization error probability PEG = 0.003. Nonlinear P1/PM classification achieved PEG = 0.009 that was four times smaller than linear classification. Nonlinear F/S classification achieved PEG = 0.220, indicating that envelope features by themselves are inadequate to accurately differentiate foliage targets.
Collapse
Affiliation(s)
- Roman Kuc
- Department of Electrical Engineering, Yale University, New Haven, Connecticut 06511, USA
| |
Collapse
|
7
|
Fiorillo AS, Pullano SA, Critello CD. Spiral-Shaped Biologically-Inspired Ultrasonic Sensor. IEEE TRANSACTIONS ON ULTRASONICS, FERROELECTRICS, AND FREQUENCY CONTROL 2020; 67:635-642. [PMID: 31647427 DOI: 10.1109/tuffc.2019.2948817] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Up to now, low-frequency ultrasonic transducers have been manufactured using different materials and technologies and have been inspired by the biological world, mainly by the biosonar of dolphins and bats. Our research moves in this context, which is dedicated to investigating the feasibility of developing a piezopolymer sensor capable of covering the wide frequency range of a bat's biosonar. We propose an ultrasonic sensor manufactured using a sheet of polyvinylidene fluoride curved according to a logarithmic spiral geometry as it is present in biological models of the cochlea. Experiments were carried out both in transmission and reception, and demonstrated that a spiral-shaped transducer can transmit and receive ultrasonic signals similar to the specific vocalizations of most of the bats in the range between 20 and 80 kHz. The resonant frequencies of the transducer were evaluated through a finite element analysis, in agreement with experimental data covering the entire broadband. During transmission, the sound pressure level showed a maximum value of 90 dB, while during reception, the sensitivity spanned from t103.8 up to t89.1 dB. Directivity measurements demonstrated omnidirectional properties both on horizontal and vertical planes, representing a breakthrough in the field of broadband ultrasonic sensors.
Collapse
|
8
|
Kuc R. Comparing phase-sensitive and phase-insensitive echolocation target images using a monaural audible sonar. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018; 143:2379. [PMID: 29716281 DOI: 10.1121/1.5033903] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
This paper describes phase-sensitive and phase-insensitive processing of monaural echolocation waveforms to generate target maps. Composite waveforms containing both the emission and echoes are processed to estimate the target impulse response using an audible sonar. Phase-sensitive processing yields the composite signal envelope, while phase-insensitive processing that starts with the composite waveform power spectrum yields the envelope of the autocorrelation function. Analysis and experimental verification show that multiple echoes form an autocorrelation function that produces near-range phantom-reflector artifacts. These artifacts interfere with true target echoes when the first true echo occurs at a time that is less than the total duration of the target echoes. Initial comparison of phase-sensitive and phase-insensitive maps indicates that both display important target features, indicating that phase is not vital. A closer comparison illustrates the improved resolution of phase-sensitive processing, the near-range phantom-reflectors produced by phase-insensitive processing, and echo interference and multiple reflection artifacts that were independent of the processing.
Collapse
Affiliation(s)
- Roman Kuc
- Department of Electrical Engineering, Yale University, New Haven, Connecticut 06511, USA
| |
Collapse
|
9
|
Simmons JA. Noise interference with echo delay discrimination in bat biosonar. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2017; 142:2942. [PMID: 29195421 DOI: 10.1121/1.5010159] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
Echolocating big brown bats (Eptesicus fuscus) were trained in a two-choice task to discriminate differences in the delay of electronic echoes at 1.7 ms delay (30 cm simulated range). Difference thresholds (∼45 μs) were comparable to previously published results. At selected above-threshold differences (116 and 232 μs delay), performance was measured in the presence of wideband random noise at increasing amplitudes in 10-dB steps to determine the noise level that prevented discrimination. Performance eventually failed, but the bats increased the amplitude and duration of their broadcasts to compensate for increasing noise, which allowed performance to persist at noise levels about 25 dB higher than without compensation. In the 232-μs delay discrimination condition, echo signal-to-noise ratio (2E/N0) was 8-10 dB at the noise level that depressed performance to chance. Predicted echo-delay accuracy using big brown bat signals follows the Cramér-Rao bound for signal-to-noise ratios above 15 dB, but worsens below 15 dB due to side-peak ambiguity. At 2E/N0 = 7-10 dB, predicted Cramér-Rao delay accuracy would be about 1 μs; considering side-peak ambiguity it would be about 200-300 μs. The bats' 232 μs performance reflects the intrusion of side-peak ambiguity into delay accuracy at low signal-to-noise ratios.
Collapse
Affiliation(s)
- J A Simmons
- Department of Neuroscience, Brown University, 185 Meeting Street, Providence, Rhode Island 02912, USA
| |
Collapse
|
10
|
Abstract
Bats fail to perceive vertical mirroring structures, such as large windows, and collide with them
Collapse
Affiliation(s)
- Peter Stilz
- Department of Animal Physiology, Institute for Neurobiology, University of Tübingen, Germany.
| |
Collapse
|
11
|
Hague DA, Buck JR, Bilik I. A deterministic compressive sensing model for bat biosonar. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2012; 132:4041-4052. [PMID: 23231133 DOI: 10.1121/1.4756953] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Abstract
The big brown bat (Eptesicus fuscus) uses frequency modulated (FM) echolocation calls to accurately estimate range and resolve closely spaced objects in clutter and noise. They resolve glints spaced down to 2 μs in time delay which surpasses what traditional signal processing techniques can achieve using the same echolocation call. The Matched Filter (MF) attains 10-12 μs resolution while the Inverse Filter (IF) achieves higher resolution at the cost of significantly degraded detection performance. Recent work by Fontaine and Peremans [J. Acoustic. Soc. Am. 125, 3052-3059 (2009)] demonstrated that a sparse representation of bat echolocation calls coupled with a decimating sensing method facilitates distinguishing closely spaced objects over realistic SNRs. Their work raises the intriguing question of whether sensing approaches structured more like a mammalian auditory system contains the necessary information for the hyper-resolution observed in behavioral tests. This research estimates sparse echo signatures using a gammatone filterbank decimation sensing method which loosely models the processing of the bat's auditory system. The decimated filterbank outputs are processed with [script-l](1) minimization. Simulations demonstrate that this model maintains higher resolution than the MF and significantly better detection performance than the IF for SNRs of 5-45 dB while undersampling the return signal by a factor of six.
Collapse
Affiliation(s)
- David A Hague
- Department of Electrical and Computer Engineering, University of Massachusetts Dartmouth, North Dartmouth, Massachusetts 02747-2300, USA.
| | | | | |
Collapse
|
12
|
Cheng Bin-bin, Zhang Hai, Zhang Xiaoping, Li Hesheng. Bats' acoustic detection system and echolocation bionics. 2012 IEEE RADAR CONFERENCE 2012. [DOI: 10.1109/radar.2012.6212280] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/01/2023]
|
13
|
Yovel Y, Geva-Sagiv M, Ulanovsky N. Click-based echolocation in bats: not so primitive after all. J Comp Physiol A Neuroethol Sens Neural Behav Physiol 2011; 197:515-30. [DOI: 10.1007/s00359-011-0639-4] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2011] [Revised: 03/14/2011] [Accepted: 03/15/2011] [Indexed: 10/18/2022]
|
14
|
Bates ME, Simmons JA. Perception of echo delay is disrupted by small temporal misalignment of echo harmonics in bat sonar. J Exp Biol 2011; 214:394-401. [PMID: 21228198 PMCID: PMC3020147 DOI: 10.1242/jeb.048983] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/07/2010] [Indexed: 11/20/2022]
Abstract
Echolocating big brown bats emit ultrasonic frequency-modulated (FM) biosonar sounds containing two prominent downward-sweeping harmonics (FM1 and FM2) and perceive target distance from echo delay. In naturally occurring echoes, FM1 and FM2 are delayed by the same amount. Even though echoes from targets located off-axis or far away are lowpass filtered, which weakens FM2 relative to FM1, their delays remain the same. We show here that misalignment of FM2 with FM1 by only 2.6 μs is sufficient to significantly disrupt acuity, which then persists for larger misalignments up to 300 μs. However, when FM2 is eliminated entirely rather than just misaligned, acuity is effectively restored. For naturally occurring, lowpass-filtered echoes, neuronal responses to weakened FM2 are retarded relative to FM1 because of amplitude-latency trading, which misaligns the harmonics in the bat's internal auditory representations. Electronically delaying FM2 relative to FM1 mimics the retarded neuronal responses for FM2 relative to FM1 caused by amplitude-latency trading. Echoes with either electronically or physiologically misaligned harmonics are not perceived as having a clearly defined delay. This virtual collapse of delay acuity may suppress interference from off-axis or distant clutter through degradation of delay images for clutter in contrast to sharp images for nearer, frontal targets.
Collapse
Affiliation(s)
- Mary E Bates
- Department of Psychology, Brown University, Providence, RI 02912, USA.
| | | |
Collapse
|
15
|
Yovel Y, Franz MO, Stilz P, Schnitzler HU. Complex echo classification by echo-locating bats: a review. J Comp Physiol A Neuroethol Sens Neural Behav Physiol 2010; 197:475-90. [PMID: 20848111 DOI: 10.1007/s00359-010-0584-7] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2010] [Revised: 08/11/2010] [Accepted: 08/28/2010] [Indexed: 11/29/2022]
Abstract
Echo-locating bats constantly emit ultrasonic pulses and analyze the returning echoes to detect, localize, and classify objects in their surroundings. Echo classification is essential for bats' everyday life; for instance, it enables bats to use acoustical landmarks for navigation and to recognize food sources from other objects. Most of the research of echo based object classification in echo-locating bats was done in the context of simple artificial objects. These objects might represent prey, flower, or fruit and are characterized by simple echoes with a single up to several reflectors. Bats, however, must also be able to use echoes that return from complex structures such as plants or other types of background. Such echoes are characterized by superpositions of many reflections that can only be described using a stochastic statistical approach. Scientists have only lately started to address the issue of complex echo classification by echo-locating bats. Some behavioral evidence showing that bats can classify complex echoes has been accumulated and several hypotheses have been suggested as to how they do so. Here, we present a first review of this data. We raise some hypotheses regarding possible interpretations of the data and point out necessary future directions that should be pursued.
Collapse
Affiliation(s)
- Yossi Yovel
- Department of Neurobiology, Weizmann Institute of Science, Rehovot, Israel.
| | | | | | | |
Collapse
|
16
|
Park M, Allen R. Pattern-matching analysis of fine echo delays by the spectrogram correlation and transformation receiver. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2010; 128:1490-1500. [PMID: 20815484 DOI: 10.1121/1.3466844] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]
Abstract
Among a few previous attempts to model the outstanding echolocation capability of bats, the work by Saillant et al. [J. Acoust. Soc. Am. 94, 2691-2712 (1993)] is, arguably, one of the most frequently referenced studies in which the predictions of spectrogram correlation and transformation (SCAT) model were compared to the results of relevant behavioral experiments. The SCAT model consists of cochlear, spectrogram correlation and spectrogram transformation blocks, where the latter two processes estimate the overall and the fine time delays between the animal's call and the echoes, given the neural representation of the acoustic signals generated by the cochlear block. This paper first provides a rigorous account of the spectrogram transformation (ST) block. By approximating the neural signals in analytic forms, many aspects of the ST block are explained and discussed in relation to the predictive scope of the model. Furthermore, based on these analytical arguments, the ST block is investigated from a different point of view, interpreted as a pattern-matching process which may operate at the high level of the animal's auditory pathway.
Collapse
Affiliation(s)
- Munhum Park
- Philips Research Laboratories Eindhoven, Professor Holstlaan 4, 5656 AE Eindhoven, The Netherlands.
| | | |
Collapse
|
17
|
Fontaine B, Peremans H. Determining biosonar images using sparse representations. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2009; 125:3052-3059. [PMID: 19425648 DOI: 10.1121/1.3101485] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
Echolocating bats are thought to be able to create an image of their environment by emitting pulses and analyzing the reflected echoes. In this paper, the theory of sparse representations and its more recent further development into compressed sensing are applied to this biosonar image formation task. Considering the target image representation as sparse allows formulation of this inverse problem as a convex optimization problem for which well defined and efficient solution methods have been established. The resulting technique, referred to as L1-minimization, is applied to simulated data to analyze its performance relative to delay accuracy and delay resolution experiments. This method performs comparably to the coherent receiver for the delay accuracy experiments, is quite robust to noise, and can reconstruct complex target impulse responses as generated by many closely spaced reflectors with different reflection strengths. This same technique, in addition to reconstructing biosonar target images, can be used to simultaneously localize these complex targets by interpreting location cues induced by the bat's head related transfer function. Finally, a tentative explanation is proposed for specific bat behavioral experiments in terms of the properties of target images as reconstructed by the L1-minimization method.
Collapse
Affiliation(s)
- Bertrand Fontaine
- Active Perception Laboratory, Universiteit Antwerpen, 13 Prinsstraat, 2000 Antwerpen, Belgium.
| | | |
Collapse
|
18
|
Wiegrebe L. An autocorrelation model of bat sonar. BIOLOGICAL CYBERNETICS 2008; 98:587-595. [PMID: 18491168 DOI: 10.1007/s00422-008-0216-2] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/08/2007] [Accepted: 01/21/2008] [Indexed: 05/26/2023]
Abstract
Their sonar system allows echolocating bats to navigate with high skill through a complex, three- dimensional environment at high speed and low light. The auditory analysis of the echoes of their ultrasonic sounds requires a detailed comparison of the emission and echoes. Here an auditory model of bat sonar is introduced and evaluated against a set of psychophysical phantom-target, echo-acoustic experiments. The model consists of a relatively detailed simulation of auditory peripheral processing in the bat, Phyllostomus discolor, followed by a functional module consisting of a strobed, normalised, autocorrelation in each frequency channel. The model output is accumulated in a sonar image buffer. The model evaluation is based on the comparison of the image-buffer contents generated in individually simulated psychophysical trials. The model provides reasonably good predictions for both temporal and spectral behavioural sonar processing in terms of sonar delay-, roughness, and phase sensitivity and in terms of sensitivity to the temporal separations in two-front targets and the classification of spectrally divergent phantom targets.
Collapse
Affiliation(s)
- Lutz Wiegrebe
- Biozentrum, Ludwig-Maximilians-Universität München, München, Germany.
| |
Collapse
|
19
|
|
20
|
Boonman A, Ostwald J. A modeling approach to explain pulse design in bats. BIOLOGICAL CYBERNETICS 2007; 97:159-72. [PMID: 17610077 DOI: 10.1007/s00422-007-0164-2] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/20/2006] [Accepted: 05/08/2007] [Indexed: 05/16/2023]
Abstract
In this modeling study we wanted to find out why bats of the family Vespertilionidae (and probably also members of other families of bats) use pulses with a certain bandwidth and duration. Previous studies have only speculated on the function of bandwidth and pulse duration in bat echolocation or addressed this problem by assuming that bats optimize echolocation parameters to achieve very fine acuities in receiving single echoes. Here, we take a different approach by assuming that bats in nature rarely receive single echoes from each pulse emission, but rather many highly overlapping echoes. Some echolocation tasks require individual echoes to be separated to reconstruct reflection points in space. We used an established hearing model to investigate how the parameters bandwidth and pulse duration influence the separation of overlapping echoes. Our findings corroborate the following previously unknown or unsubstantiated facts: 1. Broadening the bandwidth improves the bat's lower resolution limit. 2. Increasing the sweep rate (defined by bandwidth and pulse duration) improves acuity of each extracted echo. 3. Decreasing the sweep rate improves the probability of frequency channels being activated. Since facts 2 and 3 affect sweep rate in an opposing fashion, an optimum sweep rate will exist, depending on the quality of the returning echoes and the requirements of the bat to improve acuity. The existence of an optimal sweep rate explains why bats are likely to use certain combinations of bandwidth and pulse duration to obtain such sweep rates.
Collapse
Affiliation(s)
- Arjan Boonman
- INCM - CNRS UMR6193, 31 Chemin Joseph Aiguier, 13402 Marseille Cedex 20, France.
| | | |
Collapse
|
21
|
Holderied MW, von Helversen O. 'Binaural echo disparity' as a potential indicator of object orientation and cue for object recognition in echolocating nectar-feeding bats. ACTA ACUST UNITED AC 2006; 209:3457-68. [PMID: 16916981 DOI: 10.1242/jeb.02386] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Echolocating bats emit ultrasonic calls through their mouth or their nostrils and receive echoes from objects with both their ears. Information conveyed in the echoes is the basis for their three-dimensional acoustic perception of the surroundings. The direction of an object is encoded in binaural echo differences, i.e. on the one hand in the different arrival times of its echo at the two ears, and on the other hand in spectral differences through direction-dependent frequency filtering of head and pinnae. Insufficient attention has been paid, however, to the fact that three-dimensional objects produce structured spatial echo fields, and that the position of the ear in this field determines the echo it receives. We were interested to determine whether the two ears, in addition to direction-specific echo differences, receive object-specific echo disparities that might be useful for the bat. Our measurements with an artificial bat head, which consisted of two microphones and a small ultrasound loudspeaker arranged to resemble a bat's ears and mouth, revealed that echoes at the two ears differed largely depending on the shape and orientation of the echo-giving object. Binaural echo disparities of a bat-pollinated flower did indeed carry information about the orientation and, to a lesser extent, the shape of the flower. During flower approach such object-specific binaural echo disparities even exceed the binaural differences encoding direction of echo incidence, because the echo from the flower in front undergoes the same directional filtering by the two symmetrical ears. Nectar-feeding bats could use these object-specific binaural echo disparities not only to determine the object's orientation relative to the approaching bat, facilitating flight planning, but also to improve object recognition through spatial reconstruction of details of the object creating the echo. Our results suggest that the evaluation of binaural echo disparity has a greater importance for these tasks than has previously been assumed.
Collapse
Affiliation(s)
- Marc W Holderied
- Institut für Zoologie II, Universität Erlangen-Nürnberg, Staudtstrasse 5, 91058 Erlangen, Germany.
| | | |
Collapse
|
22
|
Simmons JA, Neretti N, Intrator N, Altes RA, Ferragamo MJ, Sanderson MI. Delay accuracy in bat sonar is related to the reciprocal of normalized echo bandwidth, or Q. Proc Natl Acad Sci U S A 2004; 101:3638-43. [PMID: 14990794 PMCID: PMC373515 DOI: 10.1073/pnas.0308279101] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2003] [Indexed: 11/18/2022] Open
Abstract
Big brown bats (Eptesicus fuscus) emit wideband, frequency-modulated biosonar sounds and perceive the distance to objects from the delay of echoes. Bats remember delays and patterns of delay from one broadcast to the next, and they may rely on delays to perceive target scenes. While emitting a series of broadcasts, they can detect very small changes in delay based on their estimates of delay for successive echoes, which are derived from an auditory time/frequency representation of frequency-modulated sounds. To understand how bats perceive objects, we need to know how information distributed across the time/frequency surface is brought together to estimate delay. To assess this transformation, we measured how alteration of the frequency content of echoes affects the sharpness of the bat's delay estimates from the distribution of errors in a psychophysical task for detecting changes in delay. For unrestricted echo frequency content and high echo signal-to-noise ratio, bats can detect extremely small changes in delay of about 10 ns. When echo bandwidth is restricted by filtering out low or high frequencies, the bat's delay acuity declines in relation to the reciprocal of relative echo bandwidth, expressed as Q, which also is the relative width of the target impulse response in cycles rather than time. This normalized-time dimension may be efficient for target classification if it leads to target shape being displayed independent of size. This relation may originate from cochlear transduction by parallel frequency channels with active amplification, which creates the auditory time/frequency representation itself.
Collapse
Affiliation(s)
- James A Simmons
- Department of Neuroscience and Physics, Brown University, Providence, RI 02912, USA.
| | | | | | | | | | | |
Collapse
|
23
|
Sanderson MI, Neretti N, Intrator N, Simmons JA. Evaluation of an auditory model for echo delay accuracy in wideband biosonar. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2003; 114:1648-1659. [PMID: 14514218 DOI: 10.1121/1.1598195] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
In a psychophysical task with echoes that jitter in delay, big brown bats can detect changes as small as 10-20 ns at an echo signal-to-noise ratio of approximately 49 dB and 40 ns at approximately 36 dB. This performance is possible to achieve with ideal coherent processing of the wideband echoes, but it is widely assumed that the bat's peripheral auditory system is incapable of encoding signal waveforms to represent delay with the requisite precision or phase at ultrasonic frequencies. This assumption was examined by modeling inner-ear transduction with a bank of parallel bandpass filters followed by low-pass smoothing. Several versions of the filterbank model were tested to learn how the smoothing filters, which are the most critical parameter for controlling the coherence of the representation, affect replication of the bat's performance. When tested at a signal-to-noise ratio of 36 dB, the model achieved a delay acuity of 83 ns using a second-order smoothing filter with a cutoff frequency of 8 kHz. The same model achieved a delay acuity of 17 ns when tested with a signal-to-noise ratio of 50 dB. Jitter detection thresholds were an order of magnitude worse than the bat for fifth-order smoothing or for lower cutoff frequencies. Most surprising is that effectively coherent reception is possible with filter cutoff frequencies well below any of the ultrasonic frequencies contained in the bat's sonar sounds. The results suggest that only a modest rise in the frequency response of smoothing in the bat's inner ear can confer full phase sensitivity on subsequent processing and account for the bat's fine acuity or delay.
Collapse
Affiliation(s)
- Mark I Sanderson
- Department of Neuroscience, Brown University, Providence, Rhode Island 02912, USA.
| | | | | | | |
Collapse
|
24
|
Neretti N, Sanderson MI, Intrator N, Simmons JA. Time-frequency model for echo-delay resolution in wideband biosonar. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2003; 113:2137-2145. [PMID: 12703724 DOI: 10.1121/1.1554693] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
A time/frequency model of the bat's auditory system was developed to examine the basis for the fine (approximately 2 micros) echo-delay resolution of big brown bats (Eptesicus fuscus), and its performance at resolving closely spaced FM sonar echoes in the bat's 20-100-kHz band at different signal-to-noise ratios was computed. The model uses parallel bandpass filters spaced over this band to generate envelopes that individually can have much lower bandwidth than the bat's ultrasonic sonar sounds and still achieve fine delay resolution. Because fine delay separations are inside the integration time of the model's filters (approximately 250-300 micros), resolving them means using interference patterns along the frequency dimension (spectral peaks and notches). The low bandwidth content of the filter outputs is suitable for relay of information to higher auditory areas that have intrinsically poor temporal response properties. If implemented in fully parallel analog-digital hardware, the model is computationally extremely efficient and would improve resolution in military and industrial sonar receivers.
Collapse
Affiliation(s)
- Nicola Neretti
- Brain Sciences, Brown University, Providence, Rhode Island 02912, USA.
| | | | | | | |
Collapse
|
25
|
Boonman AM, Parsons S, Jones G. The influence of flight speed on the ranging performance of bats using frequency modulated echolocation pulses. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2003; 113:617-628. [PMID: 12558297 DOI: 10.1121/1.1528175] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Many species of bat use ultrasonic frequency modulated (FM) pulses to measure the distance to objects by timing the emission and reception of each pulse. Echolocation is mainly used in flight. Since the flight speed of bats often exceeds 1% of the speed of sound, Doppler effects will lead to compression of the time between emission and reception as well as an elevation of the echo frequencies, resulting in a distortion of the perceived range. This paper describes the consequences of these Doppler effects on the ranging performance of bats using different pulse designs. The consequences of Doppler effects on ranging performance described in this paper assume bats to have a very accurate ranging resolution, which is feasible with a filterbank receiver. By modeling two receiver types, it was first established that the effects of Doppler compression are virtually independent of the receiver type. Then, used a cross-correlation model was used to investigate the effect of flight speed on Doppler tolerance and range-Doppler coupling separately. This paper further shows how pulse duration, bandwidth, function type, and harmonics influence Doppler tolerance and range-Doppler coupling. The influence of each signal parameter is illustrated using calls of several bat species. It is argued that range-Doppler coupling is a significant source of error in bat echolocation, and various strategies bats could employ to deal with this problem, including the use of range rate information are discussed.
Collapse
Affiliation(s)
- Arjan M Boonman
- School of Biological Sciences, University of Bristol, Woodland Road, Bristol BS8 1UG, United Kingdom.
| | | | | |
Collapse
|
26
|
Simmons JA, Wotton JM, Ferragamo MJ, Moss CF. Transformation of external-ear spectral cues into perceived delays by the big brown bat, Eptesicus fuscus. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2002; 111:2771-2782. [PMID: 12083212 DOI: 10.1121/1.1466869] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
The external-ear transfer function for big brown bats (Eptesicus fuscus) contains two prominent notches that vary from 30 to 55 kHz and from 70 to 100 kHz, respectively, as sound-source elevation moves from -40 to +10 degrees. These notches resemble a higher-frequency version of external-ear cues for vertical localization in humans and other mammals. However, they also resemble interference notches created in echoes when reflected sounds overlap at short time separations of 30-50 micros. Psychophysical experiments have shown that bats actually perceive small time separations from interference notches, and here we used the same technique to test whether external-ear notches are recognized as a corresponding time separation, too. The bats' performance reveals the elevation dependence of a time-separation estimate at 25-45 micros in perceived delay. Convergence of target-shape and external-ear cues onto echo spectra creates ambiguity about whether a particular notch relates to the object or to its location, which the bat could resolve by ignoring the presence of notches at external-ear frequencies. Instead, the bat registers the frequencies of notches caused by the external ear along with notches caused by the target's structure and employs spectrogram correlation and transformation (SCAT) to convert them all into a family of delay estimates that includes elevation.
Collapse
Affiliation(s)
- James A Simmons
- Department of Neuroscience, Brown University, Providence, Rhode Island 02912, USA.
| | | | | | | |
Collapse
|
27
|
Sanderson MI, Simmons JA. Selectivity for echo spectral interference and delay in the auditory cortex of the big brown bat Eptesicus fuscus. J Neurophysiol 2002; 87:2823-34. [PMID: 12037185 DOI: 10.1152/jn.00628.2001] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
The acoustic environment for an echolocating bat can contain multiple objects that reflect echoes so closely separated in time that they are almost completely overlapping. This results in a single echo with a spectrum characterized by deep notches due to interference. The object of this study was to document the possible selectivity, or lack thereof, of auditory neurons to the temporal separation of biosonar signals on a coarse (ms) and fine (micros) temporal scale. We recorded single-unit activity from the auditory cortex of big brown bats while presenting four protocol designs using wideband FM signals. The protocols simulated a pair of partially overlapping echoes where the separation between the first and second echo varied between 0 and 72 micros, a pulse followed by a single echo at varying delay from 0 to 30 ms, a pulse followed at a fixed delay by a pair of partially overlapping echoes that had a varying temporal separation of 0-72 micros, and a pulse followed, with a varying delay between 0 and 30 ms, by a pair of echoes that themselves had a fixed temporal separation on a microsecond time scale. About half of the cortical units showed increased spike counts to pairs of partially overlapping echoes at particular separations (6-72 micros) compared with a baseline stimulus at 0-micros separation. For many neurons tested with a pulse followed by two overlapping echoes, we observed a sensitivity to the coarse delay between the pulse and pair of overlapping echoes and to the separation between the two echoes themselves. The sensitivity to the partial overlap between the two echoes was not tuned to a single temporal separation. For bats, this means that the absolute range to the closest reflector and range between reflectors may be jointly encoded across a small population of single units. There are several possible neuronal mechanisms for encoding the separation between two nearby echoes based on the sensitivity to spectral notches.
Collapse
Affiliation(s)
- Mark I Sanderson
- Department of Neuroscience, Brown University, Providence, Rhode Island 02912, USA.
| | | |
Collapse
|
28
|
Matsuo I, Tani J, Yano M. A model of echolocation of multiple targets in 3D space from a single emission. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2001; 110:607-624. [PMID: 11508986 DOI: 10.1121/1.1377294] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
Bats, using frequency-modulated echolocation sounds, can capture a moving target in real 3D space. The process by which they are able to accomplish this, however, is not completely understood. This work offers and analyzes a model for description of one mechanism that may play a role in the echolocation process of real bats. This mechanism allows for the localization of targets in 3D space from the echoes produced by a single emission. It is impossible to locate multiple targets in 3D space by using only the delay time between an emission and the resulting echoes received at two points (i.e., two ears). To locate multiple targets in 3D space requires directional information for each target. The frequency of the spectral notch, which is the frequency corresponding to the minimum of the external ear's transfer function, provides a crucial cue for directional localization. The spectrum of the echoes from nearly equidistant targets includes spectral components of both the interference between the echoes and the interference resulting from the physical process of reception at the external ear. Thus, in order to extract the spectral component associated with the external ear, this component must first be distinguished from the spectral components associated with the interference of echoes from nearly equidistant targets. In the model presented, a computation that consists of the deconvolution of the spectrum is used to extract the external-ear-dependent component in the time domain. This model describes one mechanism that can be used to locate multiple targets in 3D space.
Collapse
Affiliation(s)
- I Matsuo
- Research Institute of Electrical Communication, Tohoku University, Sendai, Japan.
| | | | | |
Collapse
|
29
|
Sanderson MI, Simmons JA. Neural responses to overlapping FM sounds in the inferior colliculus of echolocating bats. J Neurophysiol 2000; 83:1840-55. [PMID: 10758096 DOI: 10.1152/jn.2000.83.4.1840] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
The big brown bat, Eptesicus fuscus, navigates and hunts prey with echolocation, a modality that uses the temporal and spectral differences between vocalizations and echoes from objects to build spatial images. Closely spaced surfaces ("glints") return overlapping echoes if two echoes return within the integration time of the cochlea ( approximately 300-400 micros). The overlap results in spectral interference that provides information about target structure or texture. Previous studies have shown that two acoustic events separated in time by less than approximately 500 micros evoke only a single response from neural elements in the auditory brain stem. How does the auditory system encode multiple echoes in time when only a single response is available? We presented paired FM stimuli with delay separations from 0 to 24 micros to big brown bats and recorded local field potentials (LFPs) and single-unit responses from the inferior colliculus (IC). These stimuli have one or two interference notches positioned in their spectrum as a function of two-glint separation. For the majority of single units, response counts decreased for two-glint separations when the resulting FM signal had a spectral notch positioned at the cell's best frequency (BF). The smallest two-glint separation that reliably evoked a decrease in spike count was 6 micros. In addition, first-spike latency increased for two-glint stimuli with notches positioned nearby BF. The N(4) potential of averaged LFPs showed a decrease in amplitude for two-glint separations that had a spectral notch near the BF of the recording site. Derived LFPs were computed by subtracting a common-mode signal from each LFP evoked by the two-glint FM stimuli. The derived LFP records show clear changes in both the amplitude and latency as a function of two-glint separation. These observations in relation with the single-unit data suggest that both response amplitude and latency can carry information about two-glint separation in the auditory system of E. fuscus.
Collapse
Affiliation(s)
- M I Sanderson
- Department of Neuroscience, Brown University, Providence, Rhode Island 02912, USA
| | | |
Collapse
|