1
|
Woźniak S, Kowalczyk K. Reverberation-Robust Self-Calibration and Synchronization of Distributed Microphone Arrays by Mitigating Heteroscedasticity and Outlier Occurrence in TDoA Measurements. SENSORS (BASEL, SWITZERLAND) 2023; 24:114. [PMID: 38202976 PMCID: PMC10781177 DOI: 10.3390/s24010114] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/02/2023] [Revised: 12/18/2023] [Accepted: 12/20/2023] [Indexed: 01/12/2024]
Abstract
The network of distributed microphone arrays is usually established in an ad hoc manner; hence, network parameters such as the mutual positioning and rotation of the arrays, positions of sources, and synchronization of their recording onset times are initially unknown. In this article, we consider the problem of passively jointly self-calibrating and synchronizing distributed arrays in reverberant rooms. We use a typical two-step approach where, initially, the relative geometry of the network is estimated using Direction of Arrival (DoA) measurements. Subsequently, the absolute scale and synchronization parameters are estimated using Time Difference of Arrival (TDoA) measurements. This article presents methods to improve the robustness and accuracy of estimation of the absolute geometric scaling and synchronization parameters in reverberant conditions, in which TDoA measurements do not follow a normal distribution; furthermore, outliers often occur. To remedy these issues, we propose a Weighted Least Squares (WLS) estimator and schema for weighting the TDoA measurements to increase the estimation accuracy from heteroscedastic TDoA measurements. In addition, we propose an iterative reweighing algorithm with a binary weight to detect and reject TDoA outliers, which exploits the residuals of the parametric model in the least absolute value minimization. A numerical evaluation shows significant improvements in the proposed method over the state of the art in terms of the relative scaling error and mean absolute value of the synchronization parameters.
Collapse
Affiliation(s)
- Szymon Woźniak
- Faculty of Computer Science, Electronics and Telecommunications, Institute of Electronics, AGH University of Kraków, al. Adama Mickiewicza 30, 30-059 Kraków, Poland;
| | | |
Collapse
|
2
|
Dibiase M, De Marchi L. An Optimal Shaped Sensor Array Derivation. MICROMACHINES 2023; 14:1154. [PMID: 37374739 DOI: 10.3390/mi14061154] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Revised: 05/25/2023] [Accepted: 05/28/2023] [Indexed: 06/29/2023]
Abstract
In Structural Health Monitoring (SHM) applications, the Direction of Arrival (DoA) estimation of Guided Waves (GW) on sensor arrays is often used as a fundamental means to locate Acoustic Sources (AS) generated by damages growth or undesired impacts in thin-wall structures (e.g., plates or shells). In this paper, we consider the problem of designing the arrangement and shape of piezo-sensors in planar clusters in order to optimize the DoA estimation performance in noise-affected measurements. We assume that: (i) the wave propagation velocity is unknown, (ii) the DoA is estimated via the time delays of wavefronts between sensors, and (iii) the maximum value of the time delays is limited. The optimality criterion is derived basing on the Theory of Measurements. The sensor array design is so that the DoA variance is minimized in an average sense by exploiting the Calculus of Variations. In this way, considering a three-sensor cluster and a monitored angles sector of 90°, the optimal time delays-DoA relations are derived. A suitable re-shaping procedure is used to impose such relations and, at the same time, to induce the same spatial filtering effect between sensors so that the sensor acquired signals are equal except for a time-shift. In order to achieve the last aim, the sensors shape is realized by exploiting a technique called Error Diffusion, which is able to emulate piezo-load functions with continuously modulated values. In this way, the Shaped Sensors Optimal Cluster (SS-OC) is derived. A numerical assessment via Green's functions simulations shows improved performance in DoA estimation by means of the SS-OC when compared to clusters realized with conventional piezo-disk transducers.
Collapse
Affiliation(s)
- Marco Dibiase
- Department of Computer Science and Engineering, University of Bologna, 40136 Bologna, Italy
| | - Luca De Marchi
- Department of Electrical, Electronic, and Information Engineering "Guglielmo Marconi", University of Bologna, 40136 Bologna, Italy
| |
Collapse
|
3
|
Dibiase M, Mohammadgholiha M, De Marchi L. Optimal Array Design and Directive Sensors for Guided Waves DoA Estimation. SENSORS (BASEL, SWITZERLAND) 2022; 22:780. [PMID: 35161527 PMCID: PMC8838149 DOI: 10.3390/s22030780] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/13/2021] [Revised: 01/08/2022] [Accepted: 01/14/2022] [Indexed: 06/14/2023]
Abstract
The estimation of Direction of Arrival (DoA) of guided ultrasonic waves is an important task in many Structural Health Monitoring (SHM) applications. The aim is to locate sources of elastic waves which can be generated by impacts or defects in the inspected structures. In this paper, the array geometry and the shape of the piezo-sensors are designed to optimize the DoA estimation on a pre-defined angular sector, from acquisitions affected by noise and interference. In the proposed approach, the DoA of a wave generated by a single source is considered as a random variable that is uniformly distributed in a given range. The wave velocity is assumed to be unknown and the DoA estimation is performed by measuring the Differences in Time of Arrival (DToAs) of wavefronts impinging on the sensors. The optimization procedure of sensors positioning is based on the computation of the DoA and wave velocity parameters Cramér-Rao Matrix Bound (CRMB) with a Bayesian approach. An efficient DoA estimator is found based on the DToAs Gauss-Markov estimator for a three sensors array. Moreover, a novel directive sensor for guided waves is introduced to cancel out undesired Acoustic Sources impinging from DoAs out of the given angles range. Numerical results show the capability to filter directional interference of the novel sensor and a considerably improved DoA estimation performance provided by the optimized sensor cluster in the pre-defined angular sector, as compared to conventional approaches.
Collapse
Affiliation(s)
- Marco Dibiase
- Department of Computer Science and Engineering, University of Bologna, 40136 Bologna, Italy;
| | - Masoud Mohammadgholiha
- Department of Electrical, Electronic, and Information Engineering “Guglielmo Marconi”, University of Bologna, 40136 Bologna, Italy;
| | - Luca De Marchi
- Department of Electrical, Electronic, and Information Engineering “Guglielmo Marconi”, University of Bologna, 40136 Bologna, Italy;
| |
Collapse
|
4
|
Rhinehart TA, Chronister LM, Devlin T, Kitzes J. Acoustic localization of terrestrial wildlife: Current practices and future opportunities. Ecol Evol 2020; 10:6794-6818. [PMID: 32724552 PMCID: PMC7381569 DOI: 10.1002/ece3.6216] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2019] [Revised: 03/02/2020] [Accepted: 03/04/2020] [Indexed: 01/17/2023] Open
Abstract
Autonomous acoustic recorders are an increasingly popular method for low-disturbance, large-scale monitoring of sound-producing animals, such as birds, anurans, bats, and other mammals. A specialized use of autonomous recording units (ARUs) is acoustic localization, in which a vocalizing animal is located spatially, usually by quantifying the time delay of arrival of its sound at an array of time-synchronized microphones. To describe trends in the literature, identify considerations for field biologists who wish to use these systems, and suggest advancements that will improve the field of acoustic localization, we comprehensively review published applications of wildlife localization in terrestrial environments. We describe the wide variety of methods used to complete the five steps of acoustic localization: (1) define the research question, (2) obtain or build a time-synchronizing microphone array, (3) deploy the array to record sounds in the field, (4) process recordings captured in the field, and (5) determine animal location using position estimation algorithms. We find eight general purposes in ecology and animal behavior for localization systems: assessing individual animals' positions or movements, localizing multiple individuals simultaneously to study their interactions, determining animals' individual identities, quantifying sound amplitude or directionality, selecting subsets of sounds for further acoustic analysis, calculating species abundance, inferring territory boundaries or habitat use, and separating animal sounds from background noise to improve species classification. We find that the labor-intensive steps of processing recordings and estimating animal positions have not yet been automated. In the near future, we expect that increased availability of recording hardware, development of automated and open-source localization software, and improvement of automated sound classification algorithms will broaden the use of acoustic localization. With these three advances, ecologists will be better able to embrace acoustic localization, enabling low-disturbance, large-scale collection of animal position data.
Collapse
Affiliation(s)
- Tessa A. Rhinehart
- Department of Biological SciencesUniversity of PittsburghPittsburghPAUSA
| | | | - Trieste Devlin
- Department of Biological SciencesUniversity of PittsburghPittsburghPAUSA
| | - Justin Kitzes
- Department of Biological SciencesUniversity of PittsburghPittsburghPAUSA
| |
Collapse
|
5
|
Time Difference of Arrival (TDoA) Localization Combining Weighted Least Squares and Firefly Algorithm. SENSORS 2019; 19:s19112554. [PMID: 31167498 PMCID: PMC6603714 DOI: 10.3390/s19112554] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/11/2019] [Revised: 05/28/2019] [Accepted: 06/02/2019] [Indexed: 11/26/2022]
Abstract
Time difference of arrival (TDoA) based on a group of sensor nodes with known locations has been widely used to locate targets. Two-step weighted least squares (TSWLS), constrained weighted least squares (CWLS), and Newton–Raphson (NR) iteration are commonly used passive location methods, among which the initial position is needed and the complexity is high. This paper proposes a hybrid firefly algorithm (hybrid-FA) method, combining the weighted least squares (WLS) algorithm and FA, which can reduce computation as well as achieve high accuracy. The WLS algorithm is performed first, the result of which is used to restrict the search region for the FA method. Simulations showed that the hybrid-FA method required far fewer iterations than the FA method alone to achieve the same accuracy. Additionally, two experiments were conducted to compare the results of hybrid-FA with other methods. The findings indicated that the root-mean-square error (RMSE) and mean distance error of the hybrid-FA method were lower than that of the NR, TSWLS, and genetic algorithm (GA). On the whole, the hybrid-FA outperformed the NR, TSWLS, and GA for TDoA measurement.
Collapse
|
6
|
A Bias Compensation Method for Distributed Moving Source Localization Using TDOA and FDOA with Sensor Location Errors. SENSORS 2018; 18:s18113747. [PMID: 30400239 PMCID: PMC6263974 DOI: 10.3390/s18113747] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/04/2018] [Revised: 10/26/2018] [Accepted: 10/30/2018] [Indexed: 11/17/2022]
Abstract
Current bias compensation methods for distributed localization consider the time difference of arrival (TDOA) and frequency difference of arrival (FDOA) measurements noise, but ignore the negative influence by the sensor location uncertainties on source localization accuracy. Therefore, a new bias compensation method for distributed localization is proposed to improve the localization accuracy in this paper. This paper derives the theoretical bias of maximum likelihood estimation when the sensor location errors and positioning measurements noise both exist. Using the rough estimate result by MLE to subtract the theoretical bias can obtain a more accurate source location estimation. Theoretical analysis and simulation results indicate that the theoretical bias derived in this paper matches well with the actual bias in moderate noise level so that it can prove the correctness of the theoretical derivation. Furthermore, after bias compensation, the estimate accuracy of the proposed method achieves a certain improvement compared with existing methods.
Collapse
|
7
|
A survey on sound source localization in robotics: From binaural to array processing methods. COMPUT SPEECH LANG 2015. [DOI: 10.1016/j.csl.2015.03.003] [Citation(s) in RCA: 90] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
8
|
Escolano J, Xiang N, Perez-Lorenzo JM, Cobos M, Lopez JJ. A Bayesian direction-of-arrival model for an undetermined number of sources using a two-microphone array. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2014; 135:742-753. [PMID: 25234883 DOI: 10.1121/1.4861356] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
Sound source localization using a two-microphone array is an active area of research, with considerable potential for use with video conferencing, mobile devices, and robotics. Based on the observed time-differences of arrival between sound signals, a probability distribution of the location of the sources is considered to estimate the actual source positions. However, these algorithms assume a given number of sound sources. This paper describes an updated research account on the solution presented in Escolano et al. [J. Acoust. Am. Soc. 132(3), 1257-1260 (2012)], where nested sampling is used to explore a probability distribution of the source position using a Laplacian mixture model, which allows both the number and position of speech sources to be inferred. This paper presents different experimental setups and scenarios to demonstrate the viability of the proposed method, which is compared with some of the most popular sampling methods, demonstrating that nested sampling is an accurate tool for speech localization.
Collapse
Affiliation(s)
| | - Ning Xiang
- Graduate Program in Architectural Acoustics, School of Architecture, Rensselaer Polytechnic Institute, Troy, New York 12180
| | - Jose M Perez-Lorenzo
- Multimedia and Multimodal Processing Research Group, University of Jaén, 23700, Linares, Spain
| | - Maximo Cobos
- Computer Science Department, University of Valencia, 46100, Burjassot, Spain
| | - Jose J Lopez
- Institute for Telecommunication and Multimedia Applications, Universidad Politécnica de Valencia, 46021, Valencia, Spain
| |
Collapse
|
9
|
|
10
|
Brutti A, Nesta F. Tracking of multidimensional TDOA for multiple sources with distributed microphone pairs. COMPUT SPEECH LANG 2013. [DOI: 10.1016/j.csl.2012.08.002] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
11
|
Escolano J, Perez-Lorenzo JM, Xiang N, Cobos M, López JJ. A Bayesian inference model for speech localization (L). THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2012; 132:1257-1260. [PMID: 22978853 DOI: 10.1121/1.4740489] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Abstract
The localization of active speakers with microphone arrays is an active research line with a considerable interest in many acoustic areas. Many algorithms for source localization are based on the computation of the Generalized Cross-Correlation function between microphone pairs employing phase transform weighting. Unfortunately, the performance of these methods is severely reduced when wall reflections and multiple sound sources are present in the acoustic environment. As a result, estimating the number of active sound sources and their actual directions becomes a challenging task. To effectively tackle this problem, a Bayesian inference framework is proposed. Based on a nested sampling algorithm, a mixture model and its parameters are estimated, indicating both the number of sources-model selection-and their angle of arrival-parameter estimation, respectively. A set of measured data demonstrates the accuracy of the proposed model.
Collapse
Affiliation(s)
- José Escolano
- Multimedia and Multimodal Processing Research Group, University of Jaén, 23700, Linares, Spain.
| | | | | | | | | |
Collapse
|
12
|
Nesta F, Omologo M. Generalized State Coherence Transform for Multidimensional TDOA Estimation of Multiple Sources. ACTA ACUST UNITED AC 2012. [DOI: 10.1109/tasl.2011.2160168] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
13
|
Li H, Huang J, Guo M, Zhao Q. Spatial Localization of Concurrent Multiple Sound Sources Using Phase Candidate Histogram. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS 2011. [DOI: 10.20965/jaciii.2011.p1277] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Mobile robots communicating with people would benefit from being able to detect sound sources to help localize interesting events in real-life settings. We propose using a spherical robot with four microphones to determine the spatial locations of multiple sound sources in ordinary rooms. The arrival temporal disparities from phase difference histograms are used to calculate the time differences. A precedence effect model suppresses the influence of echoes in reverberant environments. To integrate spatial cues of different microphones, we map the correlation between different microphone pairs on a 3D map corresponding to the azimuth and elevation of sound source direction. Results of experiments indicate that our proposed system provides sound source distribution very clearly and precisely, even concurrently in reverberant environments with the Echo Avoidance (EA) model.
Collapse
|
14
|
Duong NQK, Vincent E, Gribonval R. Under-Determined Reverberant Audio Source Separation Using a Full-Rank Spatial Covariance Model. ACTA ACUST UNITED AC 2010. [DOI: 10.1109/tasl.2010.2050716] [Citation(s) in RCA: 286] [Impact Index Per Article: 20.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
15
|
Gaubitch ND, Ward DB, Naylor PA. Statistical analysis of the autoregressive modeling of reverberant speech. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2006; 120:4031-9. [PMID: 17225429 DOI: 10.1121/1.2356840] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]
Abstract
Hands-free speech input is required in many modern telecommunication applications that employ autoregressive (AR) techniques such as linear predictive coding. When the hands-free input is obtained in enclosed reverberant spaces such as typical office rooms, the speech signal is distorted by the room transfer function. This paper utilizes theoretical results from statistical room acoustics to analyze the AR modeling of speech under these reverberant conditions. Three cases are considered: (i) AR coefficients calculated from a single observation; (ii) AR coefficients calculated jointly from an M-channel observation (M > 1); and (iii) AR coefficients calculated from the output of a delay-and sum beamformer. The statistical analysis, with supporting simulations, shows that the spatial expectation of the AR coefficients for cases (i) and (ii) are approximately equal to those from the original speech, while for case (iii) there is a discrepancy due to spatial correlation between the microphones which can be significant. It is subsequently demonstrated that at each individual source-microphone position (without spatial expectation), the M-channel AR coefficients from case (ii) provide the best approximation to the clean speech coefficients when microphones are closely spaced (<0.3m).
Collapse
Affiliation(s)
- Nikolay D Gaubitch
- Department of Electrical and Electronic Engineering, Imperial College London, Exhibition Road, London SW7 2AZ, United Kingdom.
| | | | | |
Collapse
|
16
|
Trivedi M, Huang K, Mikic I. Dynamic Context Capture and Distributed Video Arrays for Intelligent Spaces. ACTA ACUST UNITED AC 2005. [DOI: 10.1109/tsmca.2004.838480] [Citation(s) in RCA: 58] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|