1
|
Osses Vecchi A, Varnet L, Carney LH, Dau T, Bruce IC, Verhulst S, Majdak P. A comparative study of eight human auditory models of monaural processing. ACTA ACUSTICA. EUROPEAN ACOUSTICS ASSOCIATION 2022; 6:17. [PMID: 36325461 PMCID: PMC9625898 DOI: 10.1051/aacus/2022008] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/19/2023]
Abstract
A number of auditory models have been developed using diverging approaches, either physiological or perceptual, but they share comparable stages of signal processing, as they are inspired by the same constitutive parts of the auditory system. We compare eight monaural models that are openly accessible in the Auditory Modelling Toolbox. We discuss the considerations required to make the model outputs comparable to each other, as well as the results for the following model processing stages or their equivalents: Outer and middle ear, cochlear filter bank, inner hair cell, auditory nerve synapse, cochlear nucleus, and inferior colliculus. The discussion includes a list of recommendations for future applications of auditory models.
Collapse
Affiliation(s)
- Alejandro Osses Vecchi
- Laboratoire des systèmes perceptifs, Département d’études cognitives, École Normale Supérieure, PSL University, CNRS, 75005 Paris, France
| | - Léo Varnet
- Laboratoire des systèmes perceptifs, Département d’études cognitives, École Normale Supérieure, PSL University, CNRS, 75005 Paris, France
| | - Laurel H. Carney
- Departments of Biomedical Engineering and Neuroscience, University of Rochester, Rochester, NY 14642, USA
| | - Torsten Dau
- Hearing Systems Section, Department of Health Technology, Technical University of Denmark, DK-2800 Kgs. Lyngby, Denmark
| | - Ian C. Bruce
- Department of Electrical and Computer Engineering, McMaster University, Hamilton, ON L8S 4K1, Canada
| | - Sarah Verhulst
- Hearing Technology group, WAVES, Department of Information Technology, Ghent University, 9000 Ghent, Belgium
| | - Piotr Majdak
- Acoustics Research Institute, Austrian Academy of Sciences, 1040 Vienna, Austria
| |
Collapse
|
2
|
Neidhardt A, Schneiderwind C, Klein F. Perceptual Matching of Room Acoustics for Auditory Augmented Reality in Small Rooms - Literature Review and Theoretical Framework. Trends Hear 2022; 26:23312165221092919. [PMID: 35505625 PMCID: PMC9073123 DOI: 10.1177/23312165221092919] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2021] [Revised: 02/28/2022] [Accepted: 03/22/2022] [Indexed: 12/04/2022] Open
Abstract
For the realization of auditory augmented reality (AAR), it is important that the room acoustical properties of the virtual elements are perceived in agreement with the acoustics of the actual environment. This perceptual matching of room acoustics is the subject reviewed in this paper. Realizations of AAR that fulfill the listeners' expectations were achieved based on pre-characterization of the room acoustics, for example, by measuring acoustic impulse responses or creating detailed room models for acoustic simulations. For future applications, the goal is to realize an online adaptation in (close to) real-time. Perfect physical matching is hard to achieve with these practical constraints. For this reason, an understanding of the essential psychoacoustic cues is of interest and will help to explore options for simplifications. This paper reviews a broad selection of previous studies and derives a theoretical framework to examine possibilities for psychoacoustical optimization of room acoustical matching.
Collapse
|
3
|
Osses Vecchi A, Kohlrausch A. Perceptual similarity between piano notes: Simulations with a template-based perception model. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021; 149:3534. [PMID: 34241098 DOI: 10.1121/10.0004818] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/31/2020] [Accepted: 04/11/2021] [Indexed: 06/13/2023]
Abstract
In this paper, the auditory model developed by Dau, Kollmeier, and Kohlrausch [(1997). J. Acoust. Soc. Am. 102, 2892-2905] was used to simulate the perceptual similarity between complex sounds. As complex sounds, a set of piano recordings was used, whose perceptual similarity has recently been measured by Osses, Kohlrausch, and Chaigne [(2019). J. Acoust. Soc. Am. 146, 1024-1035] using a three-alternative forced-choice discrimination task in noise. To simulate this discrimination task, the auditory model required a new back-end stage, the central processor, which is preceded by several processing stages that are to a greater or lesser extent inspired by physiological aspects of the normal-hearing system. Therefore, a comprehensive review of the model parameters as used in the literature is given, indicating the fixed set of parameter values that is used in all simulations. Due to the perceptual relevance of the piano note onsets, this review includes an in-depth description of the auditory adaptation stage, the adaptation loops. A moderate to high correlation was found between the simulation results and existing experimental data.
Collapse
Affiliation(s)
- Alejandro Osses Vecchi
- Human-Technology Interaction Group, Department of Industrial Engineering and Innovation Sciences, Eindhoven University of Technology, 5600MB Eindhoven, The Netherlands
| | - Armin Kohlrausch
- Human-Technology Interaction Group, Department of Industrial Engineering and Innovation Sciences, Eindhoven University of Technology, 5600MB Eindhoven, The Netherlands
| |
Collapse
|
4
|
Osses Vecchi A, McLachlan G, Kohlrausch A. Assessing the perceived reverberation in different rooms for a set of musical instrument sounds. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020; 148:EL93. [PMID: 32752774 DOI: 10.1121/10.0001416] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/12/2020] [Accepted: 05/27/2020] [Indexed: 06/11/2023]
Abstract
Previous research has shown that the perceived reverberation in a room, or reverberance, depends on the sound source that is being listened to. In a study by Osses Vecchi, Kohlrausch, Lachenmayr, and Mommertz [(2017). J. Acoust. Soc. Am. 141(4), EL381-EL387], reverberance estimates obtained from an auditory model for 23 musical instrument sounds in 8 rooms predicted a sound-source dependency. As a follow-up to that study, a listening experiment with 24 participants was conducted using a subset of the original sounds with the purpose of mapping each test sound onto a reverberance scale. Consistent with the literature, the experimental reverberance estimates were significantly dependent on the instrument sound being listened to, but on the top of that, the estimates were significantly correlated with simulated reverberance estimates for the test stimuli as well as for the previously reported long-duration sounds.
Collapse
Affiliation(s)
- Alejandro Osses Vecchi
- Human-Technology Interaction group, Department of Industrial Engineering and Innovation Sciences, Eindhoven University of Technology, Eindhoven, the , ,
| | - Glen McLachlan
- Human-Technology Interaction group, Department of Industrial Engineering and Innovation Sciences, Eindhoven University of Technology, Eindhoven, the , ,
| | - Armin Kohlrausch
- Human-Technology Interaction group, Department of Industrial Engineering and Innovation Sciences, Eindhoven University of Technology, Eindhoven, the , ,
| |
Collapse
|
5
|
Lee D, van Dorp Schuitman J, Qiu X, Burnett I. Development of a clarity parameter using a time-varying loudness model. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018; 143:3455. [PMID: 29960417 DOI: 10.1121/1.5040480] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
The perceived sound clarity is often estimated with the clarity index, which is calculated on the basis of physical acoustic measures that can correlate weakly to the way humans perceive sound for certain test conditions. Therefore, this study proposes a clarity parameter based on a binaural room impulse response processed with a time-varying loudness model. The proposed parameter is validated by calculating the correlation coefficient with subject responses collected from previous listening experiments. Results show that the parameter outperforms the clarity index in most of the tested conditions, but its performance is less robust than parameter for clarity (PCLA).
Collapse
Affiliation(s)
- Doheon Lee
- Faculty of Engineering and Information Technology, University of Technology Sydney, New South Wales 2007, Australia
| | | | - Xiaojun Qiu
- Faculty of Engineering and Information Technology, University of Technology Sydney, New South Wales 2007, Australia
| | - Ian Burnett
- Faculty of Engineering and Information Technology, University of Technology Sydney, New South Wales 2007, Australia
| |
Collapse
|
6
|
Lawless MS, Vigeant MC. Effects of test method and participant musical training on preference ratings of stimuli with different reverberation times. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2017; 142:2258. [PMID: 29092592 DOI: 10.1121/1.5006065] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
Selecting an appropriate listening test design for concert hall research depends on several factors, including listening test method and participant critical-listening experience. Although expert listeners afford more reliable data, their perceptions may not be broadly representative. The present paper contains two studies that examined the validity and reliability of the data obtained from two listening test methods, a successive and a comparative method, and two types of participants, musicians and non-musicians. Participants rated their overall preference of auralizations generated from eight concert hall conditions with a range of reverberation times (0.0-7.2 s). Study 1, with 34 participants, assessed the two methods. The comparative method yielded similar results and reliability as the successive method. Additionally, the comparative method was rated as less difficult and more preferable. For study 2, an additional 37 participants rated the stimuli using the comparative method only. An analysis of variance of the responses from both studies revealed that musicians are better than non-musicians at discerning their preferences across stimuli. This result was confirmed with a k-means clustering analysis on the entire dataset that revealed five preference groups. Four groups exhibited clear preferences to the stimuli, while the fifth group, predominantly comprising non-musicians, demonstrated no clear preference.
Collapse
Affiliation(s)
- Martin S Lawless
- Graduate Program in Acoustics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Michelle C Vigeant
- Graduate Program in Acoustics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| |
Collapse
|
7
|
Lee D, van Dorp Schuitman J, Cabrera D, Qiu X, Burnett I. Comparison of psychoacoustic-based reverberance parameters. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2017; 142:1832. [PMID: 29092554 DOI: 10.1121/1.5005508] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
This study compared psychoacoustic reverberance parameters to each other, as well as to reverberation time (RT) and early decay time (EDT) under various acoustic conditions. The psychoacoustic parameters were loudness-based RT (TN), loudness-based EDT [EDTN; Lee, Cabrera, and Martens, J. Acoust. Soc. Am. 131, 1194-1205 (2012a)], and parameter for reverberance [PREV; van Dorp Schuitman, de Vries, and Lindau., J. Acoust. Soc. Am. 133, 1572-1585 (2013)]. For the comparisons, a wide range of sound pressure levels (SPLs) from 20 dB to 100 dB and RTs from 0.5 s to 5.0 s were evaluated, and two sets of subjective data from the previous studies were used for the cross-validation and comparison. Results of the comparisons show that the psychoacoustic reverberance parameters provided better matches to reverberance than RT and EDT; however, the performance of these psychoacoustic reverberance parameters varied with the SPL range, the type of audio sample, and the reverberation conditions. This study reveals that PREV is the most relevant for estimating a relative change in reverberance between samples when the SPL range is small, while EDTN is useful in estimating the absolute reverberance. This study also suggests the use of PREV and EDTN for speech and music samples, respectively.
Collapse
Affiliation(s)
- Doheon Lee
- Faculty of Engineering and Information Technology, University of Technology, Sydney, New South Wales 2007, Australia
| | | | - Densil Cabrera
- Sydney School of Architecture, Design and Planning, The University of Sydney, Sydney, New South Wales 2006, Australia
| | - Xiaojun Qiu
- Faculty of Engineering and Information Technology, University of Technology, Sydney, New South Wales 2007, Australia
| | - Ian Burnett
- Faculty of Engineering and Information Technology, University of Technology, Sydney, New South Wales 2007, Australia
| |
Collapse
|
8
|
Osses Vecchi A, Kohlrausch A, Lachenmayr W, Mommertz E. Predicting the perceived reverberation in different room acoustic environments using a binaural auditory model. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2017; 141:EL381. [PMID: 28464633 DOI: 10.1121/1.4979853] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
In this paper a binaural auditory model was used to compute reverberance estimates in four simulated halls. For three of the halls different absorption conditions were evaluated. The model estimates (pRev) were obtained using music excerpts of an orchestra consisting of 23 instrument sections and then compared with the room acoustic parameters of reverberation time (T30) and early decay time (EDT) at mid frequencies. Although the results showed that pRev has a higher correlation with EDT rather than with T30, this relationship depends on the properties of the instruments. The simulations show that pRev depends on the presentation level and that for instruments with similar critical-band spectrum, pRev follows a similar trend across acoustic conditions. A computational framework and sound stimuli are provided to encourage the search of experimental evidence of the aspects addressed in this study.
Collapse
Affiliation(s)
- Alejandro Osses Vecchi
- Human-Technology Interaction group, Department of Industrial Engineering and Innovation Sciences, Eindhoven University of Technology, 5600 MB Eindhoven, the Netherlands ,
| | - Armin Kohlrausch
- Human-Technology Interaction group, Department of Industrial Engineering and Innovation Sciences, Eindhoven University of Technology, 5600 MB Eindhoven, the Netherlands ,
| | | | | |
Collapse
|