1
|
Biberger T, Ewert SD. Binaural detection thresholds and audio quality of speech and music signals in complex acoustic environments. Front Psychol 2022; 13:994047. [PMID: 36507051 PMCID: PMC9729260 DOI: 10.3389/fpsyg.2022.994047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2022] [Accepted: 09/26/2022] [Indexed: 11/25/2022] Open
Abstract
Every-day acoustical environments are often complex, typically comprising one attended target sound in the presence of interfering sounds (e.g., disturbing conversations) and reverberation. Here we assessed binaural detection thresholds and (supra-threshold) binaural audio quality ratings of four distortions types: spectral ripples, non-linear saturation, intensity and spatial modifications applied to speech, guitar, and noise targets in such complex acoustic environments (CAEs). The target and (up to) two masker sounds were either co-located as if contained in a common audio stream, or were spatially separated as if originating from different sound sources. The amount of reverberation was systematically varied. Masker and reverberation had a significant effect on the distortion-detection thresholds of speech signals. Quality ratings were affected by reverberation, whereas the effect of maskers depended on the distortion. The results suggest that detection thresholds and quality ratings for distorted speech in anechoic conditions are also valid for rooms with mild reverberation, but not for moderate reverberation. Furthermore, for spectral ripples, a significant relationship between the listeners' individual detection thresholds and quality ratings was found. The current results provide baseline data for detection thresholds and audio quality ratings of different distortions of a target sound in CAEs, supporting the future development of binaural auditory models.
Collapse
Affiliation(s)
- Thomas Biberger
- Department of Medical Physics and Acoustics and Cluster of Excellence Hearing4all, University of Oldenburg, Oldenburg, Germany
| | | |
Collapse
|
2
|
Relaño-Iborra H, Dau T. Speech intelligibility prediction based on modulation frequency-selective processing. Hear Res 2022; 426:108610. [PMID: 36163219 DOI: 10.1016/j.heares.2022.108610] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/18/2022] [Revised: 08/22/2022] [Accepted: 09/12/2022] [Indexed: 11/17/2022]
Abstract
Speech intelligibility models can provide insights regarding the auditory processes involved in human speech perception and communication. One successful approach to modelling speech intelligibility has been based on the analysis of the amplitude modulations present in speech as well as competing interferers. This review covers speech intelligibility models that include a modulation-frequency selective processing stage i.e., a modulation filterbank, as part of their front end. The speech-based envelope power spectrum model [sEPSM, Jørgensen and Dau (2011). J. Acoust. Soc. Am. 130(3), 1475-1487], several variants of the sEPSM including modifications with respect to temporal resolution, spectro-temporal processing and binaural processing, as well as the speech-based computational auditory signal processing and perception model [sCASP; Relaño-Iborra et al. J. Acoust. Soc. Am. 146(5), 3306-3317], which is based on an established auditory signal detection and masking model, are discussed. The key processing stages of these models for the prediction of speech intelligibility across a variety of acoustic conditions are addressed in relation to competing modeling approaches. The strengths and weaknesses of the modulation-based analysis are outlined and perspectives presented, particularly in connection with the challenge of predicting the consequences of individual hearing loss on speech intelligibility.
Collapse
Affiliation(s)
- Helia Relaño-Iborra
- Hearing Systems Section, Department of Health Technology, Technical University of Denmark, Kgs. Lyngby 2800, Denmark; Cognitive Systems Section, Department of Applied Mathematics and Computer Science, Technical University of Denmark, Kgs, Lyngby 2800, Denmark.
| | - Torsten Dau
- Hearing Systems Section, Department of Health Technology, Technical University of Denmark, Kgs. Lyngby 2800, Denmark
| |
Collapse
|
3
|
Attia S, King A, Varnet L, Ponsot E, Lorenzi C. Double-pass consistency for amplitude- and frequency-modulation detection in normal-hearing listeners. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021; 150:3631. [PMID: 34852611 DOI: 10.1121/10.0006811] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/08/2020] [Accepted: 10/05/2021] [Indexed: 06/13/2023]
Abstract
Amplitude modulation (AM) and frequency modulation (FM) provide crucial auditory information. If FM is encoded as AM, it should be possible to give a unified account of AM and FM perception both in terms of response consistency and performance. These two aspects of behavior were estimated for normal-hearing participants using a constant-stimuli, forced-choice detection task repeated twice with the same stimuli (double pass). Sinusoidal AM or FM with rates of 2 or 20 Hz were applied to a 500-Hz pure-tone carrier and presented at detection threshold. All stimuli were masked by a modulation noise. Percent agreement of responses across passes and percent-correct detection for the two passes were used to estimate consistency and performance, respectively. These data were simulated using a model implementing peripheral processes, a central modulation filterbank, an additive internal noise, and a template-matching device. Different levels of internal noise were required to reproduce AM and FM data, but a single level could account for the 2- and 20-Hz AM data. As for FM, two levels of internal noise were needed to account for detection at slow and fast rates. Finally, the level of internal noise yielding best predictions increased with the level of the modulation-noise masker. Overall, these results suggest that different sources of internal variability are involved for AM and FM detection at low audio frequencies.
Collapse
Affiliation(s)
- Sarah Attia
- Laboratoire des systèmes perceptifs (CNRS 8248), Département d'études cognitives, Ecole normale supérieure, Université Paris Sciences et Lettres, 29 rue d'Ulm, 75005 Paris, France
| | - Andrew King
- Laboratoire des systèmes perceptifs (CNRS 8248), Département d'études cognitives, Ecole normale supérieure, Université Paris Sciences et Lettres, 29 rue d'Ulm, 75005 Paris, France
| | - Léo Varnet
- Laboratoire des systèmes perceptifs (CNRS 8248), Département d'études cognitives, Ecole normale supérieure, Université Paris Sciences et Lettres, 29 rue d'Ulm, 75005 Paris, France
| | - Emmanuel Ponsot
- Laboratoire des systèmes perceptifs (CNRS 8248), Département d'études cognitives, Ecole normale supérieure, Université Paris Sciences et Lettres, 29 rue d'Ulm, 75005 Paris, France
| | - Christian Lorenzi
- Laboratoire des systèmes perceptifs (CNRS 8248), Département d'études cognitives, Ecole normale supérieure, Université Paris Sciences et Lettres, 29 rue d'Ulm, 75005 Paris, France
| |
Collapse
|
4
|
Ellis RJ, Rönnberg J. Temporal fine structure: associations with cognition and speech-in-noise recognition in adults with normal hearing or hearing impairment. Int J Audiol 2021; 61:778-786. [PMID: 34292115 DOI: 10.1080/14992027.2021.1948119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Abstract
OBJECTIVES To investigate associations between sensitivity to temporal fine structure (TFS) and performance in cognitive and speech-in-noise recognition tests. DESIGN A binaural test of TFS sensitivity (the TFS-LF) was used. Measures of cognition included the reading span, Raven's, and text-reception threshold tests. Measures of speech recognition included the Hearing in noise (HINT) and the Hagerman matrix sentence tests in three signal processing conditions. STUDY SAMPLE Analyses are based on the performance of 324/317 adults with and without hearing impairment. RESULTS Sensitivity to TFS was significantly correlated with both the reading span test and the recognition of speech-in-noise processed using noise reduction, the latter only when limited to participants with hearing impairment. Neither association was significant when the effects of age were partialled out. CONCLUSIONS The findings are consistent with previous research in finding no evidence of a link between sensitivity to TFS and working memory once the effects of age had been partialled out. The results provide some evidence of an influence of signal processing strategy on the association between TFS sensitivity and speech-in-noise recognition. However, further research is necessary to assess the generalisability of the findings before any claims can be made regarding any clinical implications of these findings.
Collapse
Affiliation(s)
- Rachel J Ellis
- Institute for Behavioural Sciences and Learning, Linköping University, Linköping, Sweden.,Swedish Institute for Disability Research, Linköping, Sweden
| | - Jerker Rönnberg
- Institute for Behavioural Sciences and Learning, Linköping University, Linköping, Sweden.,Swedish Institute for Disability Research, Linköping, Sweden
| |
Collapse
|
5
|
Whiteford KL, Kreft HA, Oxenham AJ. The role of cochlear place coding in the perception of frequency modulation. eLife 2020; 9:58468. [PMID: 32996463 PMCID: PMC7556860 DOI: 10.7554/elife.58468] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2020] [Accepted: 09/29/2020] [Indexed: 12/17/2022] Open
Abstract
Natural sounds convey information via frequency and amplitude modulations (FM and AM). Humans are acutely sensitive to the slow rates of FM that are crucial for speech and music. This sensitivity has long been thought to rely on precise stimulus-driven auditory-nerve spike timing (time code), whereas a coarser code, based on variations in the cochlear place of stimulation (place code), represents faster FM rates. We tested this theory in listeners with normal and impaired hearing, spanning a wide range of place-coding fidelity. Contrary to predictions, sensitivity to both slow and fast FM correlated with place-coding fidelity. We also used incoherent AM on two carriers to simulate place coding of FM and observed poorer sensitivity at high carrier frequencies and fast rates, two properties of FM detection previously ascribed to the limits of time coding. The results suggest a unitary place-based neural code for FM across all rates and carrier frequencies.
Collapse
Affiliation(s)
- Kelly L Whiteford
- Department of Psychology, University of Minnesota, Minneapolis, United States
| | - Heather A Kreft
- Department of Psychology, University of Minnesota, Minneapolis, United States
| | - Andrew J Oxenham
- Department of Psychology, University of Minnesota, Minneapolis, United States
| |
Collapse
|
6
|
Effects of aging on event-related potentials to single-cycle binaural beats and diotic amplitude modulation of a tone. Brain Res 2020; 1740:146849. [PMID: 32330517 DOI: 10.1016/j.brainres.2020.146849] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2019] [Revised: 04/08/2020] [Accepted: 04/20/2020] [Indexed: 11/24/2022]
Abstract
Aim of the study is to determine whether the auditory processing of temporal fine structure (TFS) is affected with normal aging, even in the presence of normal audiometric hearing and fine cognitive state; and, if it is, to see whether a comparable effect is also observed in the processing of a diotic change in sound envelope. The event-related potentials (ERPs) to binaural beats (BBs), which are the responses of the binaural mechanisms processing TFS of a sound, and the ERPs to diotic amplitude modulation (AM) stimuli, which are the responses of the monaural mechanisms processing the changes in its envelope, were recorded from thirteen young university students and ten senior but active university professors, all with normal hearing in low frequencies. To obtain directly the specific BB responses without confounding monaural frequency change-evoked responses, we used single-cycle BB stimuli with temporary sub-threshold frequency shifts. BBs of a 250-Hz tone and diotic AM of the same tone with similar perceptual salience were presented with 2-second stimulus onset asynchrony. The N1 components of the ERPs to both stimuli displayed notable age-dependent changes in their scalp topography and significant amplitude reduction and latency prolongation in the elderly. These amplitude and latency changes were at similar rates for the two stimulus types, implying that the auditory TFS and envelope processing mechanisms are proportionally affected by physiological aging. These results may serve as control data in future studies investigating the effect of aging-associated cognitive pathologies on auditory TFS processing.
Collapse
|
7
|
Henry KS, Amburgey KN, Abrams KS, Carney LH. Identifying cues for tone-in-noise detection using decision variable correlation in the budgerigar (Melopsittacus undulatus). THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020; 147:984. [PMID: 32113293 PMCID: PMC7010520 DOI: 10.1121/10.0000621] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/06/2019] [Revised: 01/02/2020] [Accepted: 01/07/2020] [Indexed: 06/05/2023]
Abstract
Previous studies evaluated cues for masked tone detection using reproducible noise waveforms. Human results founded on this approach suggest that tone detection is based on combined energy and envelope (ENV) cues, but detection cues in nonhuman species are less clear. Decision variable correlation (DVC) was used to evaluate tone-in-noise detection cues in the budgerigar, an avian species with human-like behavioral sensitivity to many complex sounds. DVC quantifies a model's ability to predict trial-by-trial variance in behavioral responses. Budgerigars were behaviorally conditioned to detect 500-Hz tones in wideband (WB; 100-3000 Hz) and narrowband (NB; 452-552 Hz) noise. Behavioral responses were obtained using a single-interval, two-alternative discrimination task and two-down, one-up adaptive tracking procedures. Tone-detection thresholds in WB noise were higher than human thresholds, putatively due to broader peripheral frequency tuning, whereas NB thresholds were within ∼1 dB of human results. Budgerigar average hit and false-alarm rates across noise waveforms were consistent, highly correlated across subjects, and correlated to human results. Trial-by-trial behavioral results in NB noise were best explained by a model combining energy and ENV cues. In contrast, WB results were better predicted by ENV-based or multiple-channel energy detector models. These results suggest that budgerigars and humans use similar cues for tone-in-noise detection.
Collapse
Affiliation(s)
- Kenneth S Henry
- Department of Otolaryngology, University of Rochester, Rochester, New York 14642, USA
| | - Kassidy N Amburgey
- Department of Otolaryngology, University of Rochester, Rochester, New York 14642, USA
| | - Kristina S Abrams
- Department of Neuroscience, University of Rochester, Rochester, New York 14642, USA
| | - Laurel H Carney
- Department of Biomedical Engineering, University of Rochester, Rochester, New York 14642, USA
| |
Collapse
|
8
|
Ellis RJ, Rönnberg J. Retracted Article: Temporal fine structure: relations to cognition and aided speech recognition. Int J Audiol 2019; 61:473-481. [PMID: 31613169 DOI: 10.1080/14992027.2019.1672899] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
Retraction statementWe, the Editor and Publisher of the International Journal of Audiology, have retracted the following article.Rachel J. Ellis, and Jerker Rönnberg. 2019. "Temporal fine structure: relations to cognition and aided speech recognition." International Journal of Audiology. doi:10.1080/14992027.2019.1672899.The authors of the above-mentioned article published in the International Journal of Audiology have identified errors in the reported analysis (relating to the inclusion of data that should have been excluded) which impact the validity of the findings. The authors have, therefore, requested that the article be retracted.We have been informed in our decision-making by our policy on publishing ethics and integrity and the COPE guidelines on retractions.The retracted article will remain online to maintain the scholarly record, but it will be digitally watermarked on each page as "Retracted".
Collapse
Affiliation(s)
- Rachel J Ellis
- Department of Behavioural Sciences and Learning, Swedish Institute for Disability Research, Linköping University, Linköping, Sweden
| | - Jerker Rönnberg
- Department of Behavioural Sciences and Learning, Swedish Institute for Disability Research, Linköping University, Linköping, Sweden
| |
Collapse
|
9
|
Teng X, Cogan GB, Poeppel D. Speech fine structure contains critical temporal cues to support speech segmentation. Neuroimage 2019; 202:116152. [PMID: 31484039 DOI: 10.1016/j.neuroimage.2019.116152] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2018] [Revised: 08/10/2019] [Accepted: 08/31/2019] [Indexed: 11/16/2022] Open
Abstract
Segmenting the continuous speech stream into units for further perceptual and linguistic analyses is fundamental to speech recognition. The speech amplitude envelope (SE) has long been considered a fundamental temporal cue for segmenting speech. Does the temporal fine structure (TFS), a significant part of speech signals often considered to contain primarily spectral information, contribute to speech segmentation? Using magnetoencephalography, we show that the TFS entrains cortical responses between 3 and 6 Hz and demonstrate, using mutual information analysis, that (i) the temporal information in the TFS can be reconstructed from a measure of frame-to-frame spectral change and correlates with the SE and (ii) that spectral resolution is key to the extraction of such temporal information. Furthermore, we show behavioural evidence that, when the SE is temporally distorted, the TFS provides cues for speech segmentation and aids speech recognition significantly. Our findings show that it is insufficient to investigate solely the SE to understand temporal speech segmentation, as the SE and the TFS derived from a band-filtering method convey comparable, if not inseparable, temporal information. We argue for a more synthetic view of speech segmentation - the auditory system groups speech signals coherently in both temporal and spectral domains.
Collapse
Affiliation(s)
- Xiangbin Teng
- Department of Neuroscience, Max-Planck-Institute for Empirical Aesthetics, Frankfurt, 60322, Germany.
| | - Gregory B Cogan
- Department of Neurosurgery, Duke University, Durham, NC, USA, 27710
| | - David Poeppel
- Department of Neuroscience, Max-Planck-Institute for Empirical Aesthetics, Frankfurt, 60322, Germany; Department of Psychology, New York University, New York, NY, USA, 10003
| |
Collapse
|
10
|
Hoover EC, Kinney BN, Bell KL, Gallun FJ, Eddins DA. A Comparison of Behavioral Methods for Indexing the Auditory Processing of Temporal Fine Structure Cues. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2019; 62:2018-2034. [PMID: 31145649 PMCID: PMC6808371 DOI: 10.1044/2019_jslhr-h-18-0217] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/30/2018] [Revised: 12/20/2018] [Accepted: 02/13/2019] [Indexed: 05/12/2023]
Abstract
Purpose Growing evidence supports the inclusion of perceptual tests that quantify the processing of temporal fine structure (TFS) in clinical hearing assessment. Many tasks have been used to evaluate TFS in the laboratory that vary greatly in the stimuli used and whether the judgments require monaural or binaural comparisons of TFS. The purpose of this study was to compare laboratory measures of TFS for inclusion in a battery of suprathreshold auditory tests. A subset of available TFS tasks were selected on the basis of potential clinical utility and were evaluated using metrics that focus on characteristics important for clinical use. Method TFS measures were implemented in replication of studies that demonstrated clinical utility. Monaural, diotic, and dichotic measures were evaluated in 11 young listeners with normal hearing. Measures included frequency modulation (FM) tasks, harmonic frequency shift detection, interaural phase difference (TFS-low frequency), interaural time difference (ITD), monaural gap duration discrimination, and tone detection in noise with and without a difference in interaural phase (N0S0, N0Sπ). Data were compared with published results and evaluated with metrics of consistency and efficiency. Results Thresholds obtained were consistent with published data. There was no evidence of predictive relationships among the measures consistent with a homogenous group. The most stable tasks across repeated testing were TFS-low frequency, diotic and dichotic FM, and N0Sπ. Monaural and diotic FM had the lowest normalized variance and were the most efficient accounting for differences in total test duration, followed by ITD. Conclusions Despite a long stimulus duration, FM tasks dominated comparisons of consistency and efficiency. Small differences separated the dichotic tasks FM, ITD, and N0Sπ. Future comparisons following procedural optimization of the tasks will evaluate clinical efficiency in populations with impairment.
Collapse
Affiliation(s)
- Eric C. Hoover
- Department of Communication Sciences and Disorders, University of South Florida, Tampa
| | - Brianna N. Kinney
- Department of Communication Sciences and Disorders, University of South Florida, Tampa
| | - Karen L. Bell
- Department of Communication Sciences and Disorders, University of South Florida, Tampa
| | - Frederick J. Gallun
- National Center for Rehabilitative Auditory Research, Portland VA Medical Center, Oregon
- Department of Otolaryngology–Head and Neck Surgery, Oregon Health and Science University, Portland
| | - David A. Eddins
- Department of Communication Sciences and Disorders, University of South Florida, Tampa
| |
Collapse
|
11
|
Event-related potentials to single-cycle binaural beats and diotic amplitude modulation of a tone. Exp Brain Res 2019; 237:1931-1945. [PMID: 31111175 DOI: 10.1007/s00221-019-05562-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2018] [Accepted: 05/16/2019] [Indexed: 10/26/2022]
Abstract
When two tones with slightly different frequencies are dichotically presented, binaural beats (BBs) are experienced. BBs resulting from the cycling change in interaural phase difference elicit electroencephalographic responses. Because they repeat at short periods, allowing poor recovery of the cortical responses, these steady-state responses have small amplitudes, and their various wave components intermingle and might mask each other. Using single-cycle BBs separated by relatively long inter-onset intervals would be a solution, but introducing a transient interaural frequency shift requires response subtraction which may not be acceptable for non-additive brain responses. The proposed stimulation method employs transient and monaurally subthreshold frequency shifts in opposite directions in the two ears to produce single-cycle BBs of a 250 Hz tone. These shifts are perceived as distinct BBs when presented dichotically, but remain subthreshold when presented monotically. Therefore, no frequency-shift response is elicited, and the specific BB response is obtained with no need for waveform subtraction. We recorded from 19 normal hearing participants the event-related potentials (ERPs) to single-cycle BBs and also to temporary diotic amplitude modulation (AM) with matched perceptual salience. The ERPs to single-cycle BBs presented at 2 s inter-onset intervals had N1-P2 responses with up to seven times larger amplitudes than the conventional steady-state BB responses in the literature. Significant differences were found between the scalp potential distributions of the N1 responses to BB and AM stimuli, suggesting that the cortical sites, where envelope-based level processing and temporal fine structure-based spatial processing of the stimulus take place, are not totally overlapped.
Collapse
|
12
|
King A, Varnet L, Lorenzi C. Accounting for masking of frequency modulation by amplitude modulation with the modulation filter-bank concept. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2019; 145:2277. [PMID: 31046322 DOI: 10.1121/1.5094344] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/31/2018] [Accepted: 02/25/2019] [Indexed: 06/09/2023]
Abstract
Frequency modulation (FM) is assumed to be detected through amplitude modulation (AM) created by cochlear filtering for modulation rates above 10 Hz and carrier frequencies (fc) above 4 kHz. If this is the case, a model of modulation perception based on the concept of AM filters should predict masking effects between AM and FM. To test this, masking effects of sinusoidal AM on sinusoidal FM detection thresholds were assessed on normal-hearing listeners as a function of FM rate, fc, duration, AM rate, AM depth, and phase difference between FM and AM. The data were compared to predictions of a computational model implementing an AM filter-bank. Consistent with model predictions, AM masked FM with some AM-masking-AM features (broad tuning and effect of AM-masker depth). Similar masking was predicted and observed at fc = 0.5 and 5 kHz for a 2 Hz AM masker, inconsistent with the notion that additional (e.g., temporal fine-structure) cues drive slow-rate FM detection at low fc. However, masking was lower than predicted and, unlike model predictions, did not show beating or phase effects. Broadly, the modulation filter-bank concept successfully explained some AM-masking-FM effects, but could not give a complete account of both AM and FM detection.
Collapse
Affiliation(s)
- Andrew King
- Laboratoire des systèmes perceptifs, UMR CNRS 8248, Département d'Etudes Cognitives, École normale supérieure, Université Paris Sciences & Lettres, 29 rue d'Ulm, 75005 Paris, France
| | - Léo Varnet
- Laboratoire des systèmes perceptifs, UMR CNRS 8248, Département d'Etudes Cognitives, École normale supérieure, Université Paris Sciences & Lettres, 29 rue d'Ulm, 75005 Paris, France
| | - Christian Lorenzi
- Laboratoire des systèmes perceptifs, UMR CNRS 8248, Département d'Etudes Cognitives, École normale supérieure, Université Paris Sciences & Lettres, 29 rue d'Ulm, 75005 Paris, France
| |
Collapse
|
13
|
Moore BCJ, Mariathasan S, Sęk AP. Effects of Age and Hearing Loss on the Discrimination of Amplitude and Frequency Modulation for 2- and 10-Hz Rates. Trends Hear 2019; 23:2331216519853963. [PMID: 31250705 PMCID: PMC6600487 DOI: 10.1177/2331216519853963] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2019] [Revised: 05/07/2019] [Accepted: 05/08/2019] [Indexed: 11/16/2022] Open
Abstract
Detection of frequency modulation (FM) with rate = 10 Hz may depend on conversion of FM to amplitude modulation (AM) in the cochlea, while detection of 2-Hz FM may depend on the use of temporal fine structure (TFS) information. TFS processing may worsen with greater age and hearing loss while AM processing probably does not. A two-stage experiment was conducted to test these ideas while controlling for the effects of detection efficiency. Stage 1 measured psychometric functions for the detection of AM alone and FM alone imposed on a 1-kHz carrier, using 2- and 10-Hz rates. Stage 2 assessed the discrimination of AM from FM at the same modulation rate when the detectability of the AM alone and FM alone was equated. Discrimination was better for the 2-Hz than for the 10-Hz rate for all young normal-hearing subjects and for some older subjects with normal hearing at 1 kHz. Other older subjects with normal hearing showed no clear difference in AM-FM discrimination for the 2- and 10-Hz rates, as was the case for most older hearing-impaired subjects. The results suggest that the ability to use TFS cues is reduced for some older people and most hearing-impaired people.
Collapse
Affiliation(s)
- Brian C. J. Moore
- Department of Experimental
Psychology, University of Cambridge, England
| | - Sashi Mariathasan
- Department of Experimental
Psychology, University of Cambridge, England
| | - Aleksander P. Sęk
- Faculty of Physics, Institute of
Acoustics, Adam Mickiewicz University, Poznań, Poland
| |
Collapse
|
14
|
Miller CW, Bernstein JGW, Zhang X, Wu YH, Bentler RA, Tremblay K. The Effects of Static and Moving Spectral Ripple Sensitivity on Unaided and Aided Speech Perception in Noise. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2018; 61:3113-3126. [PMID: 30515519 PMCID: PMC6440313 DOI: 10.1044/2018_jslhr-h-17-0373] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/02/2017] [Revised: 06/06/2018] [Accepted: 08/04/2018] [Indexed: 05/26/2023]
Abstract
PURPOSE This study evaluated whether certain spectral ripple conditions were more informative than others in predicting ecologically relevant unaided and aided speech outcomes. METHOD A quasi-experimental study design was used to evaluate 67 older adult hearing aid users with bilateral, symmetrical hearing loss. Speech perception in noise was tested under conditions of unaided and aided, auditory-only and auditory-visual, and 2 types of noise. Predictors included age, audiometric thresholds, audibility, hearing aid compression, and modulation depth detection thresholds for moving (4-Hz) or static (0-Hz) 2-cycle/octave spectral ripples applied to carriers of broadband noise or 2000-Hz low- or high-pass filtered noise. RESULTS A principal component analysis of the modulation detection data found that broadband and low-pass static and moving ripple detection thresholds loaded onto the first factor whereas high-pass static and moving ripple detection thresholds loaded onto a second factor. A linear mixed model revealed that audibility and the first factor (reflecting broadband and low-pass static and moving ripples) were significantly associated with speech perception performance. Similar results were found for unaided and aided speech scores. The interactions between speech conditions were not significant, suggesting that the relationship between ripples and speech perception was consistent regardless of visual cues or noise condition. High-pass ripple sensitivity was not correlated with speech understanding. CONCLUSIONS The results suggest that, for hearing aid users, poor speech understanding in noise and sensitivity to both static and slow-moving ripples may reflect deficits in the same underlying auditory processing mechanism. Significant factor loadings involving ripple stimuli with low-frequency content may suggest an impaired ability to use temporal fine structure information in the stimulus waveform. Support is provided for the use of spectral ripple testing to predict speech perception outcomes in clinical settings.
Collapse
Affiliation(s)
- Christi W. Miller
- Department of Speech and Hearing Sciences, University of Washington, Seattle
| | - Joshua G. W. Bernstein
- National Military Audiology and Speech Pathology Center, Walter Reed National Military Medical Center, Bethesda, MD
| | - Xuyang Zhang
- Department of Communication Sciences and Disorders, University of Iowa, Iowa City
| | - Yu-Hsiang Wu
- Department of Communication Sciences and Disorders, University of Iowa, Iowa City
| | - Ruth A. Bentler
- Department of Communication Sciences and Disorders, University of Iowa, Iowa City
| | - Kelly Tremblay
- Department of Speech and Hearing Sciences, University of Washington, Seattle
| |
Collapse
|
15
|
Wallaert N, Varnet L, Moore BCJ, Lorenzi C. Sensorineural hearing loss impairs sensitivity but spares temporal integration for detection of frequency modulation. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018; 144:720. [PMID: 30180712 DOI: 10.1121/1.5049364] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/27/2017] [Accepted: 07/20/2018] [Indexed: 05/22/2023]
Abstract
The effect of the number of modulation cycles (N) on frequency-modulation (FM) detection thresholds (FMDTs) was measured with and without interfering amplitude modulation (AM) for hearing-impaired (HI) listeners, using a 500-Hz sinusoidal carrier and FM rates of 2 and 20 Hz. The data were compared with FMDTs for normal-hearing (NH) listeners and AM detection thresholds (AMDTs) for NH and HI listeners [Wallaert, Moore, and Lorenzi (2016). J. Acoust. Soc. 139, 3088-3096; Wallaert, Moore, Ewert, and Lorenzi (2017). J. Acoust. Soc. 141, 971-980]. FMDTs were higher for HI than for NH listeners, but the effect of increasing N was similar across groups. In contrast, AMDTs were lower and the effect of increasing N was greater for HI listeners than for NH listeners. A model of temporal-envelope processing based on a modulation filter-bank and a template-matching decision strategy accounted better for the FMDTs at 20 Hz than at 2 Hz for young NH listeners and predicted greater temporal integration of FM than observed for all groups. These results suggest that different mechanisms underlie AM and FM detection at low rates and that hearing loss impairs FM-detection mechanisms, but preserves the memory and decision processes responsible for temporal integration of FM.
Collapse
Affiliation(s)
- Nicolas Wallaert
- Laboratoire des Systèmes Perceptifs, Département d'Études Cognitives, Ecole Normale Supérieure, Université Paris Sciences & Lettres, Centre National de la Recherche Scientifique, 75005 Paris, France
| | - Léo Varnet
- Laboratoire des Systèmes Perceptifs, Département d'Études Cognitives, Ecole Normale Supérieure, Université Paris Sciences & Lettres, Centre National de la Recherche Scientifique, 75005 Paris, France
| | - Brian C J Moore
- Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, United Kingdom
| | - Christian Lorenzi
- Laboratoire des Systèmes Perceptifs, Département d'Études Cognitives, Ecole Normale Supérieure, Université Paris Sciences & Lettres, Centre National de la Recherche Scientifique, 75005 Paris, France
| |
Collapse
|