1
|
Berthomieu G, Koehl V, Paquier M. Does Loudness Relate to the Strength of the Sound Produced by the Source or Received by the Ears? A Review of How Focus Affects Loudness. Front Psychol 2021; 12:583690. [PMID: 33633631 PMCID: PMC7901973 DOI: 10.3389/fpsyg.2021.583690] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2020] [Accepted: 01/04/2020] [Indexed: 11/13/2022] Open
Abstract
Loudness is the magnitude of the auditory sensation that a listener experiences when exposed to a sound. Several sound attributes are reported to affect loudness, such as the sound pressure level at the listener's ears and the spectral content. In addition to these physical attributes of the stimulus, some subjective attributes also appear to affect loudness. When presented with a sound, a listener interacts with an auditory object and can focus on several aspects of the latter. Loudness appears to differ depending on how listeners apprehend this object, notably whether they focus on the sound that reaches their ears or that is produced by the source. The way listeners focus on the auditory object may depend on the stimulus itself. For instance, they might be more likely to focus on the sound emitted by the source if the latter is visible. The instructions given by the experimenters can also explicitly direct the listener's focus on the sound reaching the ears or emitted by the source. The present review aims at understanding how listeners focus on the auditory object depending on the stimuli and instructions they are provided with, and to describe how loudness depends on this focus.
Collapse
|
2
|
Moore BCJ, Glasberg BR, Varathanathan A, Schlittenlacher J. A Loudness Model for Time-Varying Sounds Incorporating Binaural Inhibition. Trends Hear 2018; 20:2331216516682698. [PMID: 28215113 PMCID: PMC5318944 DOI: 10.1177/2331216516682698] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
This article describes a model of loudness for time-varying sounds that incorporates the concept of binaural inhibition, namely, that the signal applied to one ear can reduce the internal response to a signal at the other ear. For each ear, the model includes the following: a filter to allow for the effects of transfer of sound through the outer and middle ear; a short-term spectral analysis with greater frequency resolution at low than at high frequencies; calculation of an excitation pattern, representing the magnitudes of the outputs of the auditory filters as a function of center frequency; application of a compressive nonlinearity to the output of each auditory filter; and smoothing over time of the resulting instantaneous specific loudness pattern using an averaging process resembling an automatic gain control. The resulting short-term specific loudness patterns are used to calculate broadly tuned binaural inhibition functions, the amount of inhibition depending on the relative short-term specific loudness at the two ears. The inhibited specific loudness patterns are summed across frequency to give an estimate of the short-term loudness for each ear. The overall short-term loudness is calculated as the sum of the short-term loudness values for the two ears. The long-term loudness for each ear is calculated by smoothing the short-term loudness for that ear, again by a process resembling automatic gain control, and the overall loudness impression is obtained by summing the long-term loudness across ears. The predictions of the model are more accurate than those of an earlier model that did not incorporate binaural inhibition.
Collapse
Affiliation(s)
- Brian C J Moore
- 1 Department of Experimental Psychology, University of Cambridge, Cambridge, England
| | - Brian R Glasberg
- 1 Department of Experimental Psychology, University of Cambridge, Cambridge, England
| | - Ajanth Varathanathan
- 1 Department of Experimental Psychology, University of Cambridge, Cambridge, England
| | - Josef Schlittenlacher
- 1 Department of Experimental Psychology, University of Cambridge, Cambridge, England
| |
Collapse
|
3
|
Moore BCJ, Jervis M, Harries L, Schlittenlacher J. Testing and refining a loudness model for time-varying sounds incorporating binaural inhibition. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018; 143:1504. [PMID: 29604698 DOI: 10.1121/1.5027246] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
This paper describes some experimental tests and modifications to a model of loudness for time-varying sounds incorporating the concept of binaural inhibition. Experiment 1 examined the loudness of a 100% sinusoidally amplitude-modulated 1000-Hz sinusoidal carrier as a function of the interaural modulation phase difference (IMPD). The IMPD of the test sound was 90° or 180° and that of the comparison sound was 0°. The level difference between the test and the comparison sounds at the point of equal loudness (the LDEL) was estimated for baseline levels of 30 and 70 dB sound pressure level and modulation rates of 1, 2, 4, 8, 16, and 32 Hz. The LDELs were negative (mean = -1.1 and -1.5 dB for IMPDs of 90° and 180°), indicating that non-zero IMPDs led to increased loudness. The original version of the model predicted the general form of the results, but there were some systematic errors. Modifications to the time constants of the model gave a better fit to the data. Experiment 2 assessed the loudness of unintelligible speech-like signals, generated using a noise vocoder, whose spectra and time pattern differed at the two ears. Both the original and modified models gave good fits to the data.
Collapse
Affiliation(s)
- Brian C J Moore
- Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, United Kingdom
| | - Matthew Jervis
- Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, United Kingdom
| | - Luke Harries
- Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, United Kingdom
| | - Josef Schlittenlacher
- Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, United Kingdom
| |
Collapse
|
4
|
Rawool VW, Parrill M. Objective evaluation of binaural summation through acoustic reflex measures. Int J Audiol 2018; 57:493-501. [PMID: 29433374 DOI: 10.1080/14992027.2018.1435914] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Affiliation(s)
- Vishakha W. Rawool
- Department of Communication Sciences & Disorders, West Virginia University, Morgantown, WV, USA
| | - Madaline Parrill
- Department of Communication Sciences & Disorders, West Virginia University, Morgantown, WV, USA
| |
Collapse
|
5
|
Roberts B, Summers RJ, Bailey PJ. Acoustic source characteristics, across-formant integration, and speech intelligibility under competitive conditions. J Exp Psychol Hum Percept Perform 2015; 41:680-91. [PMID: 25751040 PMCID: PMC4445382 DOI: 10.1037/xhp0000038] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
An important aspect of speech perception is the ability to group or select formants using cues in the acoustic source characteristics—for example, fundamental frequency (F0) differences between formants promote their segregation. This study explored the role of more radical differences in source characteristics. Three-formant (F1+F2+F3) synthetic speech analogues were derived from natural sentences. In Experiment 1, F1+F3 were generated by passing a harmonic glottal source (F0 = 140 Hz) through second-order resonators (H1+H3); in Experiment 2, F1+F3 were tonal (sine-wave) analogues (T1+T3). F2 could take either form (H2 or T2). In some conditions, the target formants were presented alone, either monaurally or dichotically (left ear = F1+F3; right ear = F2). In others, they were accompanied by a competitor for F2 (F1+F2C+F3; F2), which listeners must reject to optimize recognition. Competitors (H2C or T2C) were created using the time-reversed frequency and amplitude contours of F2. Dichotic presentation of F2 and F2C ensured that the impact of the competitor arose primarily through informational masking. In the absence of F2C, the effect of a source mismatch between F1+F3 and F2 was relatively modest. When F2C was present, intelligibility was lowest when F2 was tonal and F2C was harmonic, irrespective of which type matched F1+F3. This finding suggests that source type and context, rather than similarity, govern the phonetic contribution of a formant. It is proposed that wideband harmonic analogues are more effective informational maskers than narrowband tonal analogues, and so become dominant in across-frequency integration of phonetic information when placed in competition.
Collapse
Affiliation(s)
- Brian Roberts
- Psychology, School of Life and Health Sciences, Aston University
| | - Robert J Summers
- Psychology, School of Life and Health Sciences, Aston University
| | | |
Collapse
|
6
|
Abstract
This article reviews the evolution of a series of models of loudness developed in Cambridge, UK. The first model, applicable to stationary sounds, was based on modifications of the model developed by Zwicker, including the introduction of a filter to allow for the effects of transfer of sound through the outer and middle ear prior to the calculation of an excitation pattern, and changes in the way that the excitation pattern was calculated. Later, modifications were introduced to the assumed middle-ear transfer function and to the way that specific loudness was calculated from excitation level. These modifications led to a finite calculated loudness at absolute threshold, which made it possible to predict accurately the absolute thresholds of broadband and narrowband sounds, based on the assumption that the absolute threshold corresponds to a fixed small loudness. The model was also modified to give predictions of partial loudness—the loudness of one sound in the presence of another. This allowed predictions of masked thresholds based on the assumption that the masked threshold corresponds to a fixed small partial loudness. Versions of the model for time-varying sounds were developed, which allowed prediction of the masked threshold of any sound in a background of any other sound. More recent extensions incorporate binaural processing to account for the summation of loudness across ears. In parallel, versions of the model for predicting loudness for hearing-impaired ears have been developed and have been applied to the development of methods for fitting multichannel compression hearing aids.
Collapse
Affiliation(s)
- Brian C J Moore
- Department of Experimental Psychology, University of Cambridge, UK
| |
Collapse
|
7
|
Moore BCJ, Gibbs A, Onions G, Glasberg BR. Measurement and modeling of binaural loudness summation for hearing-impaired listeners. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2014; 136:736-747. [PMID: 25096108 DOI: 10.1121/1.4889868] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
The summation of loudness across ears is often studied by measuring the level difference required for equal loudness (LDEL) of monaural and diotic sounds. Typically, the LDEL is ∼5-6 dB, consistent with the idea that a diotic sound is ∼1.5 times as loud as the same sound presented monaurally at the same level, as predicted by the loudness model of Moore and Glasberg [J. Acoust. Soc. Am. 121, 1604-1612 (2007)]. One might expect that the LDEL would be <5-6 dB for hearing-impaired listeners, because loudness recruitment leads to a more rapid change of loudness for a given change in level. However, previous data sometimes showed similar LDEL values for normal-hearing and hearing-impaired listeners. Here, the LDEL was measured for hearing-impaired listeners using narrowband and broadband noises centered at 500 Hz, where audiometric thresholds were near-normal, and at 3000 or 4000 Hz, where audiometric thresholds were elevated. The mean LDEL was 5.6 dB at 500 Hz and 4.2 dB at the higher center frequencies. The results were predicted reasonably well by an extension of the loudness model of Moore and Glasberg.
Collapse
Affiliation(s)
- Brian C J Moore
- Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, England
| | - Alexander Gibbs
- Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, England
| | - Grace Onions
- Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, England
| | - Brian R Glasberg
- Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, England
| |
Collapse
|
8
|
Strelcyk O, Nooraei N, Kalluri S, Edwards B. Restoration of loudness summation and differential loudness growth in hearing-impaired listeners. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2012; 132:2557-2568. [PMID: 23039449 DOI: 10.1121/1.4747018] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Abstract
When normal-hearing (NH) listeners compare the loudness of narrowband and wideband sounds presented at identical sound pressure levels, the wideband sound will most often be perceived as louder than the narrowband sound, a phenomenon referred to as loudness summation. Hearing-impaired (HI) listeners typically show less-than-normal loudness summation, due to reduced cochlear compressive gain and degraded frequency selectivity. In the present study, loudness summation at 1 and 3 kHz was estimated monaurally for five NH and eight HI listeners by matching the loudness of narrowband and wideband noise stimuli. The loudness summation was measured as a function both of noise bandwidth and level. The HI listeners were tested unaided and aided using three different compression systems to investigate the possibility of restoring loudness summation in these listeners. A compression system employing level-dependent compression channels yielded the most promising outcome. The present results inform the development of future loudness models and advanced compensation strategies for the hearing impaired.
Collapse
Affiliation(s)
- Olaf Strelcyk
- Starkey Hearing Research Center, 2150 Shattuck Avenue, Suite 408, Berkeley, California 94704, USA.
| | | | | | | |
Collapse
|
9
|
Moore BCJ. Contributions of von Békésy to psychoacoustics. Hear Res 2012; 293:51-7. [PMID: 22561280 DOI: 10.1016/j.heares.2012.04.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/31/2012] [Revised: 03/30/2012] [Accepted: 04/12/2012] [Indexed: 10/28/2022]
Abstract
This paper reviews the contributions of von Békésy to psychoacoustics, comparing his findings and interpretations to those that have emerged since his work. The areas covered include the perception of pitch for pure tones and complex tones, the effect of frequency on the apparent location of pure tones, estimation of the velocity of the traveling wave on the basilar membrane using judgments of lateralization, and the relative loudness of monaural and diotic sounds. While subsequent research has failed to replicate some of his findings, other findings have stood the test of time. There is no doubt that von Békésy made very substantial contributions to psychoacoustic research.
Collapse
Affiliation(s)
- Brian C J Moore
- Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, United Kingdom.
| |
Collapse
|
10
|
Epstein M, Florentine M. Binaural loudness summation for speech presented via earphones and loudspeaker with and without visual cues. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2012; 131:3981-3988. [PMID: 22559371 PMCID: PMC3356317 DOI: 10.1121/1.3701984] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/19/2011] [Revised: 03/21/2012] [Accepted: 03/21/2012] [Indexed: 05/31/2023]
Abstract
Preliminary data [M. Epstein and M. Florentine, Ear. Hear. 30, 234-237 (2009)] obtained using speech stimuli from a visually present talker heard via loudspeakers in a sound-attenuating chamber indicate little difference in loudness when listening with one or two ears (i.e., significantly reduced binaural loudness summation, BLS), which is known as "binaural loudness constancy." These data challenge current understanding drawn from laboratory measurements that indicate a tone presented binaurally is louder than the same tone presented monaurally. Twelve normal listeners were presented recorded spondees, monaurally and binaurally across a wide range of levels via earphones and a loudspeaker with and without visual cues. Statistical analyses of binaural-to-monaural ratios of magnitude estimates indicate that the amount of BLS is significantly less for speech presented via a loudspeaker with visual cues than for stimuli with any other combination of test parameters (i.e., speech presented via earphones or a loudspeaker without visual cues, and speech presented via earphones with visual cues). These results indicate that the loudness of a visually present talker in daily environments is little affected by switching between binaural and monaural listening. This supports the phenomenon of binaural loudness constancy and underscores the importance of ecological validity in loudness research.
Collapse
Affiliation(s)
- Michael Epstein
- Auditory Modeling and Processing Laboratory, Department of Speech-Language Pathology and Audiology 106A FR, Communications and Digital Signal Processing Center, Department of Electrical and Computer Engineering, Northeastern University, 360 Huntington Avenue, Boston, Massachusetts 02115, USA.
| | | |
Collapse
|