1
|
Pearson DV, Shen Y, McAuley JD, Kidd GR. The effect of rhythm on selective listening in multiple-source environments for young and older adults. Hear Res 2023; 435:108789. [PMID: 37276686 PMCID: PMC10460128 DOI: 10.1016/j.heares.2023.108789] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/05/2022] [Revised: 05/03/2023] [Accepted: 05/10/2023] [Indexed: 06/07/2023]
Abstract
Understanding continuous speech with competing background sounds is challenging, particularly for older adults. One stimulus property that may aid listeners understanding of to-be-attended (target) material is temporal regularity (rhythm). In the context of speech-in-noise understanding, McAuley and colleagues recently showed a target rhythm effect whereby recognition of target speech was better when natural speech rhythm of a target talker was intact than when it was temporally altered. The current study replicates the target rhythm effect using a synthetic vowel sequence paradigm in young adults (Experiment 1) and then uses this paradigm to investigate potential age-related changes in the effect of rhythm on recognition (Experiment 2). Listeners identified the last three vowels of temporally regular (isochronous) and irregular (anisochronous) synthetic vowel sequences in quiet and with a competing background sequence of vowel-like harmonic tone complexes presented at various tempos. The results replicated the target rhythm effect whereby temporal regularity in the vowel sequences improved identification accuracy of young listeners compared to irregular vowel sequences. The magnitude of the effect was not found to be influenced by background tempo, but faster background tempos led to greater vowel identification accuracy independent of regularity. Older listeners also demonstrated a target rhythm effect but received less benefit from the temporal regularity of the target sequences than did young listeners. This study highlights the importance of rhythm for understanding age-related differences in selective listening in complex environments and provides a novel paradigm for investigating effects of rhythm on perception.
Collapse
Affiliation(s)
- Dylan V Pearson
- Department of Speech, Language, and Hearing Sciences, Indiana University, United States.
| | - Yi Shen
- Department of Speech and Hearing Sciences, University of Washington, United States
| | - J Devin McAuley
- Department of Psychology, Michigan State University, United States
| | - Gary R Kidd
- Department of Speech, Language, and Hearing Sciences, Indiana University, United States
| |
Collapse
|
2
|
Forward entrainment: Psychophysics, neural correlates, and function. Psychon Bull Rev 2022:10.3758/s13423-022-02220-y. [DOI: 10.3758/s13423-022-02220-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/13/2022] [Indexed: 12/04/2022]
Abstract
AbstractWe define forward entrainment as that part of behavioral or neural entrainment that outlasts the entraining stimulus. In this review, we examine conditions under which one may optimally observe forward entrainment. In Part 1, we review and evaluate studies that have observed forward entrainment using a variety of psychophysical methods (detection, discrimination, and reaction times), different target stimuli (tones, noise, and gaps), different entraining sequences (sinusoidal, rectangular, or sawtooth waveforms), a variety of physiological measures (MEG, EEG, ECoG, CSD), in different modalities (auditory and visual), across modalities (audiovisual and auditory-motor), and in different species. In Part 2, we describe those experimental conditions that place constraints on the magnitude of forward entrainment, including an evaluation of the effects of signal uncertainty and attention, temporal envelope complexity, signal-to-noise ratio (SNR), rhythmic rate, prior experience, and intersubject variability. In Part 3 we theorize on potential mechanisms and propose that forward entrainment may instantiate a dynamic auditory afterimage that lasts a fraction of a second to minimize prediction error in signal processing.
Collapse
|
3
|
Schlittenlacher J, Lim JX, Lawson J, Moore BC. Modulation masking produced by a low-frequency pure tone. Hear Res 2022; 424:108596. [DOI: 10.1016/j.heares.2022.108596] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/25/2022] [Revised: 06/08/2022] [Accepted: 08/01/2022] [Indexed: 11/04/2022]
|
4
|
Lentz JJ, Humes LE, Kidd GR. Differences in Auditory Perception Between Young and Older Adults When Controlling for Differences in Hearing Loss and Cognition. Trends Hear 2022; 26:23312165211066180. [PMID: 34989641 PMCID: PMC8753078 DOI: 10.1177/23312165211066180] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
This study was designed to examine age effects on various auditory perceptual skills using a large group of listeners (155 adults, 121 aged 60-88 years and 34 aged 18-30 years), while controlling for the factors of hearing loss and working memory (WM). All subjects completed 3 measures of WM, 7 psychoacoustic tasks (24 conditions) and a hearing assessment. Psychophysical measures were selected to tap phenomena thought to be mediated by higher-level auditory function and included modulation detection, modulation detection interference, informational masking (IM), masking level difference (MLD), anisochrony detection, harmonic mistuning, and stream segregation. Principal-components analysis (PCA) was applied to each psychoacoustic test. For 6 of the 7 tasks, a single component represented performance across the multiple stimulus conditions well, whereas the modulation-detection interference (MDI) task required two components to do so. The effect of age was analyzed using a general linear model applied to each psychoacoustic component. Once hearing loss and WM were accounted for as covariates in the analyses, estimated marginal mean thresholds were lower for older adults on tasks based on temporal processing. When evaluated separately, hearing loss led to poorer performance on roughly 1/2 the tasks and declines in WM accounted for poorer performance on 6 of the 8 psychoacoustic components. These results make clear the need to interpret age-group differences in performance on psychoacoustic tasks in light of cognitive declines commonly associated with aging, and point to hearing loss and cognitive declines as negatively influencing auditory perceptual skills.
Collapse
Affiliation(s)
- Jennifer J. Lentz
- Department of Speech, Language and Hearing Sciences, Indiana University, Bloomington, IN, USA
| | - Larry E. Humes
- Department of Speech, Language and Hearing Sciences, Indiana University, Bloomington, IN, USA
| | - Gary R. Kidd
- Department of Speech, Language and Hearing Sciences, Indiana University, Bloomington, IN, USA
| |
Collapse
|
5
|
McAuley JD, Shen Y, Smith T, Kidd GR. Effects of speech-rhythm disruption on selective listening with a single background talker. Atten Percept Psychophys 2021; 83:2229-2240. [PMID: 33782913 PMCID: PMC10612531 DOI: 10.3758/s13414-021-02298-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/05/2021] [Indexed: 11/08/2022]
Abstract
Recent work by McAuley et al. (Attention, Perception, & Psychophysics, 82, 3222-3233, 2020) using the Coordinate Response Measure (CRM) paradigm with a multitalker background revealed that altering the natural rhythm of target speech amidst background speech worsens target recognition (a target-rhythm effect), while altering background speech rhythm improves target recognition (a background-rhythm effect). Here, we used a single-talker background to examine the role of specific properties of target and background sound patterns on selective listening without the complexity of multiple background stimuli. Experiment 1 manipulated the sex of the background talker, presented with a male target talker, to assess target and background-rhythm effects with and without a strong pitch cue to aid perceptual segregation. Experiment 2 used a vocoded single-talker background to examine target and background-rhythm effects with envelope-based speech rhythms preserved, but without semantic content or temporal fine structure. While a target-rhythm effect was present with all backgrounds, the background-rhythm effect was only observed for the same-sex background condition. Results provide additional support for a selective entrainment hypothesis, while also showing that the background-rhythm effect is not driven by envelope-based speech rhythm alone, and may be reduced or eliminated when pitch or other acoustic differences provide a strong basis for selective listening.
Collapse
Affiliation(s)
- J Devin McAuley
- Department of Psychology, Michigan State University, East Lansing, MI, 48824, USA.
| | - Yi Shen
- Department of Speech and Hearing Sciences, University of Washington, Seattle, WA, USA
| | - Toni Smith
- Department of Psychology, Michigan State University, East Lansing, MI, 48824, USA
| | - Gary R Kidd
- Department of Speech, Language and Hearing Sciences, Indiana University, Bloomington, IN, USA
| |
Collapse
|
6
|
Jennings SG. The role of the medial olivocochlear reflex in psychophysical masking and intensity resolution in humans: a review. J Neurophysiol 2021; 125:2279-2308. [PMID: 33909513 PMCID: PMC8285664 DOI: 10.1152/jn.00672.2020] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2020] [Revised: 03/16/2021] [Accepted: 04/02/2021] [Indexed: 02/01/2023] Open
Abstract
This review addresses the putative role of the medial olivocochlear (MOC) reflex in psychophysical masking and intensity resolution in humans. A framework for interpreting psychophysical results in terms of the expected influence of the MOC reflex is introduced. This framework is used to review the effects of a precursor or contralateral acoustic stimulation on 1) simultaneous masking of brief tones, 2) behavioral estimates of cochlear gain and frequency resolution in forward masking, 3) the buildup and decay of forward masking, and 4) measures of intensity resolution. Support, or lack thereof, for a role of the MOC reflex in psychophysical perception is discussed in terms of studies on estimates of MOC strength from otoacoustic emissions and the effects of resection of the olivocochlear bundle in patients with vestibular neurectomy. Novel, innovative approaches are needed to resolve the dissatisfying conclusion that current results are unable to definitively confirm or refute the role of the MOC reflex in masking and intensity resolution.
Collapse
Affiliation(s)
- Skyler G Jennings
- Department of Communication Sciences and Disorders, The University of Utah, Salt Lake City, Utah
| |
Collapse
|
7
|
Füllgrabe C, Sęk A, Moore BCJ. Frequency selectivity in the modulation domain estimated using forward masking: Effects of masker modulation depth and masker-signal delay. Hear Res 2021; 405:108244. [PMID: 33878648 DOI: 10.1016/j.heares.2021.108244] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/12/2020] [Revised: 03/23/2021] [Accepted: 03/31/2021] [Indexed: 10/21/2022]
Abstract
The threshold for detecting amplitude modulation (AM) of a sinusoidal or noise carrier is elevated when the signal AM is preceded by masker AM applied to the same carrier. This effect, called AM forward masking, shows selectivity in the AM domain, consistent with the existence of a modulation filter bank (MFB). In this paper we explore the effect of two factors that can influence AM forward masking, using an 8-kHz sinusoidal carrier and a range of masker AM frequencies, fm, both below and above the signal AM frequency, fs, of 40 Hz. The first factor was the time delay, td, between the end of the masker AM and the start of the signal AM. The second was the AM depth, m, of the masker, which was either 1 or 0.25. The AM forward masking patterns in all conditions showed tuning in the AM domain; signal thresholds were highest when fm was close to fs. The amount of AM forward masking decreased with increasing td in a similar way for all fm, so the shapes of the masking patterns did not change markedly with td. Remarkably, the amount of AM forward masking decreased by only about 3 dB (a non-significant effect) when the masker m was decreased from 1 to 0.25. This result appears to be inconsistent with an explanation of AM forward masking in terms of adaptation in a MFB or in terms of a sliding temporal integrator.
Collapse
Affiliation(s)
- Christian Füllgrabe
- School of Sport, Exercise and Health Sciences, Loughborough University, Ashby Road, Loughborough LE11 3TU, United Kingdom.
| | - Aleksander Sęk
- Department of Acoustics, Faculty of Physics, Adam Mickiewicz University, 85 Umultowska, 61-614 Poznan, Poland.
| | - Brian C J Moore
- Cambridge Hearing Group, Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, United Kingdom.
| |
Collapse
|
8
|
Füllgrabe C, Sęk A, Moore BCJ. Forward masking of amplitude modulation across ears and its tuning in the modulation domain. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021; 149:1764. [PMID: 33765781 DOI: 10.1121/10.0003598] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/07/2020] [Accepted: 02/08/2021] [Indexed: 06/12/2023]
Abstract
Frequency selectivity in the amplitude modulation (AM) domain has been demonstrated using both simultaneous AM masking and forward AM masking. This has been explained using the concept of a modulation filter bank (MFB). Here, we assessed whether the MFB occurs before or after the point of binaural interaction in the auditory pathway by using forward masking in the AM domain in an ipsilateral condition (masker AM and signal AM applied to the left ear with an unmodulated carrier in the right ear) and a contralateral condition (masker AM applied to the right ear and signal AM applied to the left ear). The carrier frequency was 8 kHz, the signal AM frequency, fs, was 40 or 80 Hz, and the masker AM frequency ranged from 0.25 to 4 times fs. Contralateral forward AM masking did occur, but it was smaller than ipsilateral AM masking. Tuning in the AM domain was slightly sharper for ipsilateral than for contralateral masking, perhaps reflecting confusion of the signal and masker AM in the ipsilateral condition when their AM frequencies were the same. The results suggest that there might be an MFB both before and after the point in the auditory pathway where binaural interaction occurs.
Collapse
Affiliation(s)
- Christian Füllgrabe
- School of Sport, Exercise and Health Sciences, Loughborough University, Ashby Road, Loughborough LE11 3TU, United Kingdom
| | - Aleksander Sęk
- Cambridge Hearing Group, Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, United Kingdom
| | - Brian C J Moore
- Cambridge Hearing Group, Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, United Kingdom
| |
Collapse
|
9
|
Marrufo-Pérez MI, Eustaquio-Martín A, Fumero MJ, Gorospe JM, Polo R, Gutiérrez Revilla A, Lopez-Poveda EA. Adaptation to noise in amplitude modulation detection without the medial olivocochlear reflex. Hear Res 2019; 377:133-141. [DOI: 10.1016/j.heares.2019.03.017] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/25/2018] [Revised: 03/05/2019] [Accepted: 03/19/2019] [Indexed: 10/27/2022]
|
10
|
Buss E, Lorenzi C, Cabrera L, Leibold LJ, Grose JH. Amplitude modulation detection and modulation masking in school-age children and adults. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2019; 145:2565. [PMID: 31046373 PMCID: PMC6909994 DOI: 10.1121/1.5098950] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/10/2019] [Revised: 04/02/2019] [Accepted: 04/03/2019] [Indexed: 05/30/2023]
Abstract
Two experiments were performed to better understand on- and off-frequency modulation masking in normal-hearing school-age children and adults. Experiment 1 estimated thresholds for detecting 16-, 64- or 256-Hz sinusoidal amplitude modulation (AM) imposed on a 4300-Hz pure tone. Thresholds tended to improve with age, with larger developmental effects for 64- and 256-Hz AM than 16-Hz AM. Detection of 16-Hz AM was also measured with a 1000-Hz off-frequency masker tone carrying 16-Hz AM. Off-frequency modulation masking was larger for younger than older children and adults when the masker was gated with the target, but not when the masker was continuous. Experiment 2 measured detection of 16- or 64-Hz sinusoidal AM carried on a bandpass noise with and without additional on-frequency masker AM. Children and adults demonstrated modulation masking with similar tuning to modulation rate. Rate-dependent age effects for AM detection on a pure-tone carrier are consistent with maturation of temporal resolution, an effect that may be obscured by modulation masking for noise carriers. Children were more susceptible than adults to off-frequency modulation masking for gated stimuli, consistent with maturation in the ability to listen selectively in frequency, but the children were not more susceptible to on-frequency modulation masking than adults.
Collapse
Affiliation(s)
- Emily Buss
- Department of Otolaryngology/Head and Neck Surgery, School of Medicine, University of North Carolina, Chapel Hill, North Carolina 27599-7070, USA
| | - Christian Lorenzi
- Laboratoire des Systèmes Perceptifs, Département d'Études Cognitives, Ecole Normale Supérieure, Universite Paris Sciences et Lettres, Centre National de la Recherche Scientifique, Paris, France
| | - Laurianne Cabrera
- Laboratoire de Psychologie de la Perception, Université Paris Descartes, Centre National de la Recherche Scientifique, Paris, France
| | - Lori J Leibold
- Center for Hearing Research, Boys Town National Research Hospital, Omaha, Nebraska 68131, USA
| | - John H Grose
- Department of Otolaryngology/Head and Neck Surgery, School of Medicine, University of North Carolina, Chapel Hill, North Carolina 27599-7070, USA
| |
Collapse
|
11
|
Goossens T, Vercammen C, Wouters J, van Wieringen A. Neural envelope encoding predicts speech perception performance for normal-hearing and hearing-impaired adults. Hear Res 2018; 370:189-200. [PMID: 30131201 DOI: 10.1016/j.heares.2018.07.012] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/06/2017] [Revised: 07/19/2018] [Accepted: 07/25/2018] [Indexed: 10/28/2022]
Abstract
Peripheral hearing impairment cannot fully account for speech perception difficulties that emerge with advancing age. As the fluctuating speech envelope bears crucial information for speech perception, changes in temporal envelope processing are thought to contribute to degraded speech perception. Previous research has demonstrated changes in neural encoding of envelope modulations throughout the adult lifespan, either due to age or due to hearing impairment. To date, however, it remains unclear whether such age- and hearing-related neural changes are associated with impaired speech perception. In the present study, we investigated the potential relationship between perception of speech in different types of masking sounds and neural envelope encoding for a normal-hearing and hearing-impaired adult population including young (20-30 years), middle-aged (50-60 years), and older (70-80 years) people. Our analyses show that enhanced neural envelope encoding in the cortex and in the brainstem, respectively, is related to worse speech perception for normal-hearing and for hearing-impaired adults. This neural-behavioral correlation is found for the three age groups and appears to be independent of the type of masking noise, i.e., background noise or competing speech. These findings provide promising directions for future research aiming to develop advanced rehabilitation strategies for speech perception difficulties that emerge throughout adult life.
Collapse
Affiliation(s)
- Tine Goossens
- KU Leuven - University of Leuven, Department of Neurosciences, Research Group Experimental ORL, Herestraat 49 bus 721, 3000, Leuven, Belgium.
| | - Charlotte Vercammen
- KU Leuven - University of Leuven, Department of Neurosciences, Research Group Experimental ORL, Herestraat 49 bus 721, 3000, Leuven, Belgium.
| | - Jan Wouters
- KU Leuven - University of Leuven, Department of Neurosciences, Research Group Experimental ORL, Herestraat 49 bus 721, 3000, Leuven, Belgium.
| | - Astrid van Wieringen
- KU Leuven - University of Leuven, Department of Neurosciences, Research Group Experimental ORL, Herestraat 49 bus 721, 3000, Leuven, Belgium.
| |
Collapse
|
12
|
Verhey JL, van de Par S. Binaural frequency selectivity in humans. Eur J Neurosci 2018; 51:1179-1190. [PMID: 29359360 DOI: 10.1111/ejn.13837] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2017] [Revised: 01/09/2018] [Accepted: 01/10/2018] [Indexed: 11/30/2022]
Abstract
Several behavioural studies in humans have shown that listening to sounds with two ears that is binaural hearing, provides the human auditory system with extra information on the sound source that is not available when sounds are only perceived through one ear that is monaurally. Binaural processing involves the analysis of phase and level differences between the two ear signals. As monaural cochlea processing (in each ear) precedes the neural stages responsible for binaural processing properties it is reasonable to assume that properties of the cochlea may also be observed in binaural processing. A main characteristic of cochlea processing is its frequency selectivity. In psychoacoustics, there is an ongoing discussion on the frequency selectivity of the binaural auditory system. While some psychoacoustic experiments seem to indicate poorer frequency selectivity of the binaural system than that of the monaural processing others seem to indicate the same frequency selectivity for monaural and binaural processing. This study provides an overview of these seemingly controversial results and the different explanations that were provided to account for the different results.
Collapse
Affiliation(s)
- Jesko L Verhey
- Department of Experimental Audiology, Otto von Guericke University Magdeburg, Leipziger Str. 44, 39120, Magdeburg, Germany
| | - Steven van de Par
- Acoustic Group, Cluster of Excellence Hearing4All, Carl von Ossietzky University Oldenburg, Oldenburg, Germany
| |
Collapse
|
13
|
Magnified Neural Envelope Coding Predicts Deficits in Speech Perception in Noise. J Neurosci 2017; 37:7727-7736. [PMID: 28694336 PMCID: PMC5551064 DOI: 10.1523/jneurosci.2722-16.2017] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2016] [Revised: 05/09/2017] [Accepted: 05/23/2017] [Indexed: 11/29/2022] Open
Abstract
Verbal communication in noisy backgrounds is challenging. Understanding speech in background noise that fluctuates in intensity over time is particularly difficult for hearing-impaired listeners with a sensorineural hearing loss (SNHL). The reduction in fast-acting cochlear compression associated with SNHL exaggerates the perceived fluctuations in intensity in amplitude-modulated sounds. SNHL-induced changes in the coding of amplitude-modulated sounds may have a detrimental effect on the ability of SNHL listeners to understand speech in the presence of modulated background noise. To date, direct evidence for a link between magnified envelope coding and deficits in speech identification in modulated noise has been absent. Here, magnetoencephalography was used to quantify the effects of SNHL on phase locking to the temporal envelope of modulated noise (envelope coding) in human auditory cortex. Our results show that SNHL enhances the amplitude of envelope coding in posteromedial auditory cortex, whereas it enhances the fidelity of envelope coding in posteromedial and posterolateral auditory cortex. This dissociation was more evident in the right hemisphere, demonstrating functional lateralization in enhanced envelope coding in SNHL listeners. However, enhanced envelope coding was not perceptually beneficial. Our results also show that both hearing thresholds and, to a lesser extent, magnified cortical envelope coding in left posteromedial auditory cortex predict speech identification in modulated background noise. We propose a framework in which magnified envelope coding in posteromedial auditory cortex disrupts the segregation of speech from background noise, leading to deficits in speech perception in modulated background noise. SIGNIFICANCE STATEMENT People with hearing loss struggle to follow conversations in noisy environments. Background noise that fluctuates in intensity over time poses a particular challenge. Using magnetoencephalography, we demonstrate anatomically distinct cortical representations of modulated noise in normal-hearing and hearing-impaired listeners. This work provides the first link among hearing thresholds, the amplitude of cortical representations of modulated sounds, and the ability to understand speech in modulated background noise. In light of previous work, we propose that magnified cortical representations of modulated sounds disrupt the separation of speech from modulated background noise in auditory cortex.
Collapse
|
14
|
Shen Y, Pearson DV. Recognition of synthesized vowel sequences in steady-state and sinusoidally amplitude-modulated noises. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2017; 141:1835. [PMID: 28372131 PMCID: PMC5871221 DOI: 10.1121/1.4978060] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/28/2016] [Revised: 02/21/2017] [Accepted: 02/22/2017] [Indexed: 05/28/2023]
Abstract
Modulation masking is known to impact speech intelligibility, but it is not clear whether the mechanism underlying this phenomenon is an invariant, bottom-up process, or if it is subjected to factors such as perceptual segregation and stimulus uncertainty thereby showing a top-down component. In the main experiment of the current study (Exp. II), listeners' ability to recognize sequences of synthesized vowels (i.e., the target) in sinusoidally amplitude-modulated noises (i.e., the masker) was evaluated. The target and masker were designed to be perceptually distinct to limit the top-down component of modulation masking. The duration of each vowel was either 25 or 100 ms, the rate at which the vowels were presented was either 1 or 6 Hz, and the masker modulation rate was varied between 0.5 and 16 Hz. The selective performance degradation when the target and masker modulation spectra overlap, as would be expected from modulation masking, was not observed. In addition, these results were able to be adequately captured using a model of energetic masking without any modulation processing stages and fitted only using the vowel-recognition performance in steady-state maskers, as obtained from Exp. I. Results suggest that speech modulation masking might not be mediated through an early-sensory mechanism.
Collapse
Affiliation(s)
- Yi Shen
- Department of Speech and Hearing Sciences, Indiana University Bloomington, Bloomington, Indiana 47405, USA
| | - Dylan V Pearson
- Department of Speech and Hearing Sciences, Indiana University Bloomington, Bloomington, Indiana 47405, USA
| |
Collapse
|
15
|
Lau BK, Ruggles DR, Katyal S, Engel SA, Oxenham AJ. Sustained Cortical and Subcortical Measures of Auditory and Visual Plasticity following Short-Term Perceptual Learning. PLoS One 2017; 12:e0168858. [PMID: 28107359 PMCID: PMC5249117 DOI: 10.1371/journal.pone.0168858] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2016] [Accepted: 11/06/2016] [Indexed: 12/02/2022] Open
Abstract
Short-term training can lead to improvements in behavioral discrimination of auditory and visual stimuli, as well as enhanced EEG responses to those stimuli. In the auditory domain, fluency with tonal languages and musical training has been associated with long-term cortical and subcortical plasticity, but less is known about the effects of shorter-term training. This study combined electroencephalography (EEG) and behavioral measures to investigate short-term learning and neural plasticity in both auditory and visual domains. Forty adult participants were divided into four groups. Three groups trained on one of three tasks, involving discrimination of auditory fundamental frequency (F0), auditory amplitude modulation rate (AM), or visual orientation (VIS). The fourth (control) group received no training. Pre- and post-training tests, as well as retention tests 30 days after training, involved behavioral discrimination thresholds, steady-state visually evoked potentials (SSVEP) to the flicker frequencies of visual stimuli, and auditory envelope-following responses simultaneously evoked and measured in response to rapid stimulus F0 (EFR), thought to reflect subcortical generators, and slow amplitude modulation (ASSR), thought to reflect cortical generators. Enhancement of the ASSR was observed in both auditory-trained groups, not specific to the AM-trained group, whereas enhancement of the SSVEP was found only in the visually-trained group. No evidence was found for changes in the EFR. The results suggest that some aspects of neural plasticity can develop rapidly and may generalize across tasks but not across modalities. Behaviorally, the pattern of learning was complex, with significant cross-task and cross-modal learning effects.
Collapse
Affiliation(s)
- Bonnie K. Lau
- Department of Psychology, University of Minnesota, Minneapolis, Minnesota, United States of America
- * E-mail:
| | - Dorea R. Ruggles
- Department of Psychology, University of Minnesota, Minneapolis, Minnesota, United States of America
| | - Sucharit Katyal
- Department of Psychology, University of Minnesota, Minneapolis, Minnesota, United States of America
| | - Stephen A. Engel
- Department of Psychology, University of Minnesota, Minneapolis, Minnesota, United States of America
| | - Andrew J. Oxenham
- Department of Psychology, University of Minnesota, Minneapolis, Minnesota, United States of America
| |
Collapse
|
16
|
Fogerty D, Xu J, Gibbs BE. Modulation masking and glimpsing of natural and vocoded speech during single-talker modulated noise: Effect of the modulation spectrum. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2016; 140:1800. [PMID: 27914381 PMCID: PMC5848862 DOI: 10.1121/1.4962494] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/10/2023]
Abstract
Compared to notionally steady-state noise, modulated maskers provide a perceptual benefit for speech recognition, in part due to preserved speech information during the amplitude dips of the masker. However, overlap in the modulation spectrum between the target speech and the competing modulated masker may potentially result in modulation masking, and thereby offset the release from energetic masking. The current study investigated masking release provided by single-talker modulated noise. The overlap in the modulation spectra of the target speech and the modulated noise masker was varied through time compression or expansion of the competing masker. Younger normal hearing adults listened to sentences that were unprocessed or noise vocoded to primarily limit speech recognition to the preserved temporal envelope cues. For unprocessed speech, results demonstrated improved performance with masker modulation spectrum shifted up or down compared to the target modulation spectrum, except for the most extreme time expansion. For vocoded speech, significant masking release was observed with the slowest masker rate. Perceptual results combined with acoustic analyses of the preserved glimpses of the target speech suggest contributions of modulation masking and cognitive-linguistic processing as factors contributing to performance.
Collapse
Affiliation(s)
- Daniel Fogerty
- Department of Communication Sciences and Disorders, University of South Carolina, Columbia, South Carolina 29208, USA
| | - Jiaqian Xu
- Department of Communication Sciences and Disorders, University of South Carolina, Columbia, South Carolina 29208, USA
| | - Bobby E Gibbs
- Department of Communication Sciences and Disorders, University of South Carolina, Columbia, South Carolina 29208, USA
| |
Collapse
|
17
|
Fogerty D, Xu J, Gibbs BE. Modulation masking and glimpsing of natural and vocoded speech during single-talker modulated noise: Effect of the modulation spectrum. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2016; 140:1800. [PMID: 27914381 DOI: 10.5041466/1.4962494] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
Compared to notionally steady-state noise, modulated maskers provide a perceptual benefit for speech recognition, in part due to preserved speech information during the amplitude dips of the masker. However, overlap in the modulation spectrum between the target speech and the competing modulated masker may potentially result in modulation masking, and thereby offset the release from energetic masking. The current study investigated masking release provided by single-talker modulated noise. The overlap in the modulation spectra of the target speech and the modulated noise masker was varied through time compression or expansion of the competing masker. Younger normal hearing adults listened to sentences that were unprocessed or noise vocoded to primarily limit speech recognition to the preserved temporal envelope cues. For unprocessed speech, results demonstrated improved performance with masker modulation spectrum shifted up or down compared to the target modulation spectrum, except for the most extreme time expansion. For vocoded speech, significant masking release was observed with the slowest masker rate. Perceptual results combined with acoustic analyses of the preserved glimpses of the target speech suggest contributions of modulation masking and cognitive-linguistic processing as factors contributing to performance.
Collapse
Affiliation(s)
- Daniel Fogerty
- Department of Communication Sciences and Disorders, University of South Carolina, Columbia, South Carolina 29208, USA
| | - Jiaqian Xu
- Department of Communication Sciences and Disorders, University of South Carolina, Columbia, South Carolina 29208, USA
| | - Bobby E Gibbs
- Department of Communication Sciences and Disorders, University of South Carolina, Columbia, South Carolina 29208, USA
| |
Collapse
|
18
|
Affiliation(s)
- William A. Yost
- Director of the Parmly Hearing Institute, Professor of Hearing Sciences, and Adjunct Professor of Psychology and of Orolaryngology/Head and Neck Surgery at Loyola University of Chicago. He is the coauthor of fundamentals of Hearing (Holt, Rinehart & Winston, 1985) and coeditor of three other books on hearing
| |
Collapse
|
19
|
Interaction of Object Binding Cues in Binaural Masking Pattern Experiments. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2016. [DOI: 10.1007/978-3-319-25474-6_26] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register]
|
20
|
Parthasarathy A, Lai J, Bartlett EL. Age-Related Changes in Processing Simultaneous Amplitude Modulated Sounds Assessed Using Envelope Following Responses. J Assoc Res Otolaryngol 2016; 17:119-32. [PMID: 26905273 PMCID: PMC4791415 DOI: 10.1007/s10162-016-0554-z] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2015] [Accepted: 01/27/2016] [Indexed: 01/04/2023] Open
Abstract
Listening conditions in the real world involve segregating the stimuli of interest from competing auditory stimuli that differ in their sound level and spectral content. It is in these conditions of complex spectro-temporal processing that listeners with age-related hearing loss experience the most difficulties. Envelope following responses (EFRs) provide objective neurophysiological measures of auditory processing. EFRs were obtained to two simultaneous sinusoidally amplitude modulated (sAM) tones from young and aged Fischer-344 rats. One was held at a fixed suprathreshold sound level (sAM1FL) while the second varied in sound level (sAM2VL) and carrier frequency. EFR amplitudes to sAM1FL in the young decreased with signal-to-noise ratio (SNR), and this reduction was more pronounced when the sAM2VL carrier frequency was spectrally separated from sAM1FL. Aged animals showed similar trends, while having decreased overall response amplitudes compared to the young. These results were replicated using an established computational model of the auditory nerve. The trends observed in the EFRs were shown to be due to the contributions of the low-frequency tails of high-frequency neurons, rather than neurons tuned to the sAM1FL carrier frequency. Modeling changes in threshold and neural loss reproduced some of the changes seen with age, but accuracy improved when combined with an additional decrease representing synaptic loss of auditory nerve neurons. Sound segregation in this case derives primarily from peripheral processing, regardless of age. Contributions by more central neural mechanisms are likely to occur only at low SNRs.
Collapse
Affiliation(s)
- Aravindakshan Parthasarathy
- Department of Biological Sciences, Purdue University Interdisciplinary Life Sciences Program, and the Weldon School of Biomedical Engineering, Purdue University, West Lafayette, IN, USA
- Eaton-Peabody Labs, Massachusetts Eye and Ear Infirmary, Harvard Medical School, Boston, MA, USA
| | - Jesyin Lai
- Department of Biological Sciences, Purdue University Interdisciplinary Life Sciences Program, and the Weldon School of Biomedical Engineering, Purdue University, West Lafayette, IN, USA
| | - Edward L Bartlett
- Department of Biological Sciences, Purdue University Interdisciplinary Life Sciences Program, and the Weldon School of Biomedical Engineering, Purdue University, West Lafayette, IN, USA.
| |
Collapse
|
21
|
Sek A, Baer T, Crinnion W, Springgay A, Moore BCJ. Modulation masking within and across carriers for subjects with normal and impaired hearing. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2015; 138:1143-1153. [PMID: 26328728 DOI: 10.1121/1.4928135] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
The detection of amplitude modulation (AM) of a carrier can be impaired by additional (masker) AM applied to the same carrier (within-carrier modulation masking, MM) or to a different carrier (across-carrier MM). These two types of MM were compared for young normal-hearing and older hearing-impaired subjects. The signal was 4- or 16-Hz sinusoidal AM of a 4000-Hz carrier. Masker AM with depth 0.4 was applied either to the same carrier or to a carrier at 3179 or 2518 Hz. The masker AM rate was 0.25, 0.5, 1, 2, or 4 times the signal rate. The signal AM depth was varied adaptively to determine the threshold. Both within-carrier and across-carrier MM patterns were similar for the two groups, suggesting that the hypothetical modulation filters are not affected by hearing loss or age. The signal AM detection thresholds were also similar for the two groups. Thresholds in the absence of masker AM were lower (better) for the older hearing-impaired than for the young normal-hearing subjects. Since the masked modulation thresholds were similar for the two groups, it seems unlikely that abnormal MM contributes to the difficulties experienced by older hearing-impaired people in understanding speech in background sounds.
Collapse
Affiliation(s)
- Aleksander Sek
- Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, England
| | - Thomas Baer
- Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, England
| | - William Crinnion
- Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, England
| | - Alastair Springgay
- Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, England
| | - Brian C J Moore
- Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, England
| |
Collapse
|
22
|
Humes LE, Kidd GR, Lentz JJ. Auditory and cognitive factors underlying individual differences in aided speech-understanding among older adults. Front Syst Neurosci 2013; 7:55. [PMID: 24098273 PMCID: PMC3787592 DOI: 10.3389/fnsys.2013.00055] [Citation(s) in RCA: 99] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2013] [Accepted: 09/09/2013] [Indexed: 11/13/2022] Open
Abstract
This study was designed to address individual differences in aided speech understanding among a relatively large group of older adults. The group of older adults consisted of 98 adults (50 female and 48 male) ranging in age from 60 to 86 (mean = 69.2). Hearing loss was typical for this age group and about 90% had not worn hearing aids. All subjects completed a battery of tests, including cognitive (6 measures), psychophysical (17 measures), and speech-understanding (9 measures), as well as the Speech, Spatial, and Qualities of Hearing (SSQ) self-report scale. Most of the speech-understanding measures made use of competing speech and the non-speech psychophysical measures were designed to tap phenomena thought to be relevant for the perception of speech in competing speech (e.g., stream segregation, modulation-detection interference). All measures of speech understanding were administered with spectral shaping applied to the speech stimuli to fully restore audibility through at least 4000 Hz. The measures used were demonstrated to be reliable in older adults and, when compared to a reference group of 28 young normal-hearing adults, age-group differences were observed on many of the measures. Principal-components factor analysis was applied successfully to reduce the number of independent and dependent (speech understanding) measures for a multiple-regression analysis. Doing so yielded one global cognitive-processing factor and five non-speech psychoacoustic factors (hearing loss, dichotic signal detection, multi-burst masking, stream segregation, and modulation detection) as potential predictors. To this set of six potential predictor variables were added subject age, Environmental Sound Identification (ESI), and performance on the text-recognition-threshold (TRT) task (a visual analog of interrupted speech recognition). These variables were used to successfully predict one global aided speech-understanding factor, accounting for about 60% of the variance.
Collapse
Affiliation(s)
- Larry E Humes
- Department of Speech and Hearing Sciences, Indiana University Bloomington, IN, USA
| | | | | |
Collapse
|
23
|
Off-frequency BMLD: the role of monaural processing. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2013; 787:293-301. [PMID: 23716235 DOI: 10.1007/978-1-4614-1590-9_33] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
Large binaural masking-level differences (BMLDs) can be observed when a tonal signal with an interaural phase difference of π is presented against a diotic masker. The BMLD is large when the signal is spectrally centered in the masker and decreases strongly for off-frequency signals. No such reduction in BMLD would be expected, if monaural detection were governed by energy cues and binaural detection by changes in interaural cross-correlation. The reduction in BMLD thus suggests either that binaural processing is impaired or, alternatively, that additional monaural cues are available in off-frequency conditions. In this study, a stimulus paradigm is used that is expected to impair the processing of additional monaural cues. In the base experiment, a 25-Hz-wide band of diotic noise centered at 700 Hz served as masker. A target tone was presented at 0, 30, 60, and 100 Hz above the masker center frequency, either interaurally in phase (S 0) or out of phase (S π). In the extended experiment, an additional interference tone was always presented spectrally below the masker at the same frequency distance as the target tone was positioned above the masker. The interferer level was 6 dB below the level of the 65 dB masker. By presenting the interferer, a strong modulation is introduced, which should impair the detectability of the target tone based on the beating of masker and target. Results show a small off-frequency BMLD in the base experiment in line with literature. Adding the interference tone produced an increase in both N 0 S 0 and the N 0 S π thresholds, suggesting that monaural modulation cues were indeed used, but also -suggesting that detection performance in the N 0 S π condition was dominated by monaural processing. Additional conditions with modulated interference tones at 500 Hz further supported our hypothesis that monaural modulation cues contributed to reduced -off-frequency BMLDs.
Collapse
|
24
|
Itoh K, Nakada T. Human brain detects short-time nonlinear predictability in the temporal fine structure of deterministic chaotic sounds. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2013; 87:042916. [PMID: 23679498 DOI: 10.1103/physreve.87.042916] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/13/2012] [Indexed: 06/02/2023]
Abstract
Deterministic nonlinear dynamical processes are ubiquitous in nature. Chaotic sounds generated by such processes may appear irregular and random in waveform, but these sounds are mathematically distinguished from random stochastic sounds in that they contain deterministic short-time predictability in their temporal fine structures. We show that the human brain distinguishes deterministic chaotic sounds from spectrally matched stochastic sounds in neural processing and perception. Deterministic chaotic sounds, even without being attended to, elicited greater cerebral cortical responses than the surrogate control sounds after about 150 ms in latency after sound onset. Listeners also clearly discriminated these sounds in perception. The results support the hypothesis that the human auditory system is sensitive to the subtle short-time predictability embedded in the temporal fine structure of sounds.
Collapse
Affiliation(s)
- Kosuke Itoh
- Center for Integrated Human Brain Science, Brain Research Institute, University of Niigata, 1-757 Asahimachi, Niigata, 951-8585, Japan.
| | | |
Collapse
|
25
|
Stone MA, Füllgrabe C, Mackinnon RC, Moore BCJ. The importance for speech intelligibility of random fluctuations in "steady" background noise. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2011; 130:2874-81. [PMID: 22087916 DOI: 10.1121/1.3641371] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]
Abstract
Spectrally shaped steady noise is commonly used as a masker of speech. The effects of inherent random fluctuations in amplitude of such a noise are typically ignored. Here, the importance of these random fluctuations was assessed by comparing two cases. For one, speech was mixed with steady speech-shaped noise and N-channel tone vocoded, a process referred to as signal-domain mixing (SDM); this preserved the random fluctuations of the noise. For the second, the envelope of speech alone was extracted for each vocoder channel and a constant was added corresponding to the root-mean-square value of the noise envelope for that channel. This is referred to as envelope-domain mixing (EDM); it removed the random fluctuations of the noise. Sinusoidally modulated noise and a single talker were also used as backgrounds, with both SDM and EDM. Speech intelligibility was measured for N = 12, 19, and 30, with the target-to-background ratio fixed at -7 dB. For SDM, performance was best for the speech background and worst for the steady noise. For EDM, this pattern was reversed. Intelligibility with steady noise was consistently very poor for SDM, but near-ceiling for EDM, demonstrating that the random fluctuations in steady noise have a large effect.
Collapse
Affiliation(s)
- Michael A Stone
- Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, England.
| | | | | | | |
Collapse
|
26
|
Fitzgerald MB, Wright BA. Perceptual learning and generalization resulting from training on an auditory amplitude-modulation detection task. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2011; 129:898-906. [PMID: 21361447 PMCID: PMC3070992 DOI: 10.1121/1.3531841] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/23/2009] [Revised: 11/23/2010] [Accepted: 11/28/2010] [Indexed: 05/19/2023]
Abstract
Fluctuations in sound amplitude provide important cues to the identity of many sounds including speech. Of interest here was whether the ability to detect these fluctuations can be improved with practice, and if so whether this learning generalizes to untrained cases. To address these issues, normal-hearing adults (n = 9) were trained to detect sinusoidal amplitude modulation (SAM; 80-Hz rate, 3-4 kHz bandpass carrier) 720 trials/day for 6-7 days and were tested before and after training on related SAM-detection and SAM-rate-discrimination conditions. Controls (n = 9) only participated in the pre- and post-tests. The trained listeners improved more than the controls on the trained condition between the pre- and post-tests, but different subgroups of trained listeners required different amounts of practice to reach asymptotic performance, ranging from 1 (n = 6) to 4-6 (n = 3) sessions. This training-induced learning did not generalize to detection with two untrained carrier spectra (5 kHz low-pass and 0.5-1.5 kHz bandpass) or to rate discrimination with the trained rate and carrier spectrum, but there was some indication that it generalized to detection with two untrained rates (30 and 150 Hz). Thus, practice improved the ability to detect amplitude modulation, but the generalization of this learning to untrained cases was somewhat limited.
Collapse
Affiliation(s)
- Matthew B Fitzgerald
- Department of Otolaryngology, New York University School of Medicine, 550 First Avenue, NBV-5E5, New York, New York 10016, USA.
| | | |
Collapse
|
27
|
Miyazono H, Glasberg BR, Moore BCJ. Perceptual learning of fundamental frequency discrimination: effects of fundamental frequency, harmonic number, and component phase. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2010; 128:3649-3657. [PMID: 21218897 DOI: 10.1121/1.3504713] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
Thresholds (F0DLs) were measured for discrimination of the fundamental frequency (F0) of a group of harmonics (group B) embedded in harmonics with a fixed F0. Miyazono and Moore [(2009). Acoust. Sci. & Tech. 30, 383386] found a large training effect for tones with high harmonics in group B, when the harmonics were added in cosine phase. It is shown here that this effect was due to use of a cue related to pitch pulse asynchrony (PPA). When PPA cues were disrupted by introducing a temporal offset between the envelope peaks of the harmonics in group B and the remaining harmonics, F0DLs increased markedly. Perceptual learning was examined using a training stimulus with cosine-phase harmonics, F0 = 50 Hz, and high harmonics in group B, under conditions where PPA was not useful. Learning occurred, and it transferred to other cosine-phase tones, but not to random-phase tones. A similar experiment with F0 = 100 Hz showed a learning effect which transferred to a cosine-phase tone with mainly high unresolved harmonics, but not to cosine-phase tones with low harmonics, and not to random-phase tones. The learning found here appears to be specific to tones for which F0 discrimination is based on distinct peaks in the temporal envelope.
Collapse
Affiliation(s)
- Hiromitsu Miyazono
- Faculty of Administration, Prefectural University of Kumamoto, 3-1-100 Tsukide, Kumamoto, Japan
| | | | | |
Collapse
|
28
|
Shen Y, Lentz JJ. Effect of fast-acting compression on modulation detection interference for normal hearing and hearing impaired listeners. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2010; 127:3654-65. [PMID: 20550264 DOI: 10.1121/1.3418684] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
To determine the effects of hearing loss and fast-acting compression on auditory grouping based on across-frequency modulation, modulation detection interference (MDI) was measured in listeners with normal hearing and hearing loss. MDI, the increase in the amplitude-modulation detection threshold of a target presented with an interferer distant in frequency, was measured using a 500-Hz target and a 2140-Hz interferer, both modulated with narrow-band noises of the same bandwidth. The two modulated tones were presented at equal loudness levels to listeners with normal hearing and hearing loss in the absence (Exp. 1) and in the presence (Exp. 2) of fast-acting compression applied to the interferer. Modulation detection thresholds increased with increasing modulation depth of the interferer by similar amounts for the two groups of listeners, suggesting that across-frequency grouping based on amplitude modulation is not altered by hearing impairment. Compression provided an additional increase in thresholds for both groups, indicating that compression algorithms might alter across-frequency grouping cues. Partial support for an idea that compression's effect of sharpening the onsets after each envelope valley is provided by a third experiment which found somewhat greater interference produced by square-wave modulation than sine-wave modulation at larger interferer modulation depths.
Collapse
Affiliation(s)
- Yi Shen
- Department of Speech and Hearing Sciences, Indiana University, Bloomington, Indiana 47405-7000, USA.
| | | |
Collapse
|
29
|
He NJ, Mills JH, Ahlstrom JB, Dubno JR. Age-related differences in the temporal modulation transfer function with pure-tone carriers. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2008; 124:3841-9. [PMID: 19206810 PMCID: PMC2676625 DOI: 10.1121/1.2998779] [Citation(s) in RCA: 64] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/09/2023]
Abstract
Detection of amplitude modulation (AM) in 500 and 4000 Hz tonal carriers was measured as a function of modulation frequency from younger and older adults with normal hearing through 4000 Hz. The modulation frequency above which sensitivity to AM increased ("transition frequency") was similar for both groups. Temporal modulation transfer function shapes showed significant age-related differences. For younger subjects, AM detection thresholds were generally constant for low modulation frequencies. For a higher carrier frequency, AM detection thresholds then increased as modulation frequency further increased until the transition frequency. In contrast, AM detection for older subjects continuously increased with increasing modulation frequency, indicating an age-related decline in temporal resolution for faster envelope fluctuations. Significant age-related differences were observed whenever AM detection was dependent on temporal cues. For modulation frequencies above the transition frequency, age-related differences were larger for the lower frequency carrier (where both temporal and spectral cues were available) than for the higher frequency carrier (where AM detection was primarily dependent on spectral cues). These results are consistent with a general age-related decline in the synchronization of neural responses to both the carrier waveform and envelope fluctuation.
Collapse
Affiliation(s)
- Ning-ji He
- Department of Otolaryngology-Head and Neck Surgery, Medical University of South Carolina, MSC 550, Charleston, South Carolina 29425-5500, USA
| | | | | | | |
Collapse
|
30
|
Moore BCJ. Basic auditory processes involved in the analysis of speech sounds. Philos Trans R Soc Lond B Biol Sci 2008; 363:947-63. [PMID: 17827102 PMCID: PMC2606789 DOI: 10.1098/rstb.2007.2152] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
This paper reviews the basic aspects of auditory processing that play a role in the perception of speech. The frequency selectivity of the auditory system, as measured using masking experiments, is described and used to derive the internal representation of the spectrum (the excitation pattern) of speech sounds. The perception of timbre and distinctions in quality between vowels are related to both static and dynamic aspects of the spectra of sounds. The perception of pitch and its role in speech perception are described. Measures of the temporal resolution of the auditory system are described and a model of temporal resolution based on a sliding temporal integrator is outlined. The combined effects of frequency and temporal resolution can be modelled by calculation of the spectro-temporal excitation pattern, which gives good insight into the internal representation of speech sounds. For speech presented in quiet, the resolution of the auditory system in frequency and time usually markedly exceeds the resolution necessary for the identification or discrimination of speech sounds, which partly accounts for the robust nature of speech perception. However, for people with impaired hearing, speech perception is often much less robust.
Collapse
Affiliation(s)
- Brian C J Moore
- Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, UK.
| |
Collapse
|
31
|
Apoux F, Bacon SP. Selectivity of modulation interference for consonant identification in normal-hearing listeners. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2008; 123:1665-1672. [PMID: 18345854 DOI: 10.1121/1.2828067] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
The present study sought to establish whether speech recognition can be disrupted by the presence of amplitude modulation (AM) at a remote spectral region, and whether that disruption depends upon the rate of AM. The goal was to determine whether this paradigm could be used to examine which modulation frequencies in the speech envelope are most important for speech recognition. Consonant identification for a band of speech located in either the low- or high-frequency region was measured in the presence of a band of noise located in the opposite frequency region. The noise was either unmodulated or amplitude modulated by a sinusoid, a band of noise with a fixed absolute bandwidth, or a band of noise with a fixed relative bandwidth. The frequency of the modulator was 4, 16, 32, or 64 Hz. Small amounts of modulation interference were observed for all modulator types, irrespective of the location of the speech band. More important, the interference depended on modulation frequency, clearly supporting the existence of selectivity of modulation interference with speech stimuli. Overall, the results suggest a primary role of envelope fluctuations around 4 and 16 Hz without excluding the possibility of a contribution by faster rates.
Collapse
Affiliation(s)
- Frédéric Apoux
- Psychoacoustics Laboratory, Department of Speech and Hearing Science, Arizona State University, PO Box 870102, Tempe, Arizona 85287-0102, USA.
| | | |
Collapse
|
32
|
Hall JW, Buss E, Grose JH. Spectral integration and wideband analysis in gap detection and overshoot paradigms. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2007; 122:3598-3608. [PMID: 18247767 DOI: 10.1121/1.2799473] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]
Abstract
Several listening conditions show that energy remote from a target frequency can deleteriously affect sensitivity. One interpretation of such results entails a wideband analysis involving a wide predetection filter. The present study tested the hypothesis that both temporal gap detection and overshoot results are consistent with a wideband analysis, as contrasted with statistical combination of information across independent channels. For gap detection, stimuli were random or comodulated 50-Hz-wide noise bands centered on 1000, 1932, 3569, and 6437 Hz. For overshoot, the masker was an 8-kHz low-pass filtered noise, with 5-ms tone bursts presented at the same center frequencies used for gap detection. Signals were presented with either 0- or 250-ms delay after masker onset. In each paradigm, the target was introduced at only one frequency or at all four frequencies. Results from gap detection conditions did not favor a wideband analysis interpretation: Results in the random condition were consistent with an optimal combination of cues across frequency. An across-channel interference effect was also evident when only one of the four bands contained the gap. Although results from the overshoot conditions were consistent with a wideband analysis interpretation, they were more parsimoniously accounted for in terms of statistical combination of information.
Collapse
Affiliation(s)
- Joseph W Hall
- Department of Otolaryngology/Head and Neck Surgery, University of North Carolina School of Medicine, Chapel Hill, North Carolina 27599, USA.
| | | | | |
Collapse
|
33
|
Saoji AA, Eddins DA. Spectral modulation masking patterns reveal tuning to spectral envelope frequency. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2007; 122:1004-13. [PMID: 17672648 DOI: 10.1121/1.2751267] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]
Abstract
Auditory processing appears to include a series of domain-specific filtering operations that include tuning in the audio-frequency domain, followed by tuning in the temporal modulation domain, and perhaps tuning in the spectral modulation domain. To explore the possibility of tuning in the spectral modulation domain, a masking experiment was designed to measure masking patterns in the spectral modulation domain. Spectral modulation transfer functions (SMTFs) were measured for modulation frequencies from 0.25 to 14 cycles/octave superimposed on noise carriers either one octave (800-1600 Hz, 6400-12,800 Hz) or six octaves wide (200-12,800 Hz). The resulting SMTFs showed maximum sensitivity to modulation between 1 and 3 cycles/octave with reduced sensitivity above and below this region. Masked spectral modulation detection thresholds were measured for masker modulation frequencies of 1, 3, and 5 cycles/octave with a fixed modulation depth of 15 dB. The masking patterns obtained for each masker frequency and carrier band revealed tuning (maximum masking) near the masker frequency, which is consistent with the theory that spectral envelope perception is governed by a series of spectral modulation channels tuned to different spectral modulation frequencies.
Collapse
Affiliation(s)
- Aniket A Saoji
- Psychoacoustic Laboratory, Center for Hearing and Deafness, Department of Communicative Disorders and Sciences, State University of New York at Buffalo, Buffalo, New York 14314, USA.
| | | |
Collapse
|
34
|
Hall JW, Buss E, Grose JH. Binaural comodulation masking release: effects of masker interaural correlation. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2006; 120:3878-88. [PMID: 17225415 PMCID: PMC1994906 DOI: 10.1121/1.2357989] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]
Abstract
Binaural detection was examined for a signal presented in a narrow band of noise centered on the on-signal masking band (OSB) or in the presence of flanking noise bands that were random or comodulated with respect to the OSB. The noise had an interaural correlation of 1.0 (No), 0.99 or 0.95. In No noise, random flanking bands worsened Spi detection and comodulated bands improved Spi detection for some listeners but had no effect for other listeners. For the 0.99 or 0.95 interaural correlation conditions, random flanking bands were less detrimental to Spi detection and comodulated flanking bands improved Spi detection for all listeners. Analyses based on signal detection theory indicated that the improvement in Spi thresholds obtained with comodulated bands was not compatible with an optimal combination of monaural and binaural cues or to across-frequency analyses of dynamic interaural phase differences. Two accounts consistent with the improvement in Spi thresholds in comodulated noise were (1) envelope information carried by the flanking bands improves the weighting of binaural cues associated with the signal; (2) the auditory system is sensitive to across-frequency differences in ongoing interaural correlation.
Collapse
Affiliation(s)
- Joseph W Hall
- Department of Otolaryngology/Head and Neck Surgery, University of North Carolina School of Medicine, Chapel Hill, North Carolina 27599, USA.
| | | | | |
Collapse
|
35
|
Fitzgerald MB, Wright BA. A perceptual learning investigation of the pitch elicited by amplitude-modulated noise. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2005; 118:3794-803. [PMID: 16419824 DOI: 10.1121/1.2074687] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/06/2023]
Abstract
Noise that is amplitude modulated at rates ranging from 40 to 850 Hz can elicit a sensation of pitch. Here, the processing of this temporally based pitch was investigated using a perceptual-learning paradigm. Nine listeners were trained (1 hour per day for 6-8 days) to discriminate a standard rate of sinusoidal amplitude modulation (SAM) from a faster rate in a single condition (150 Hz SAM rate, 5 kHz low-pass carrier). All trained listeners improved significantly on that condition. These trained listeners subsequently showed no more improvement than nine untrained controls on pure-tone and rippled-noise discrimination with the same pitch, and on SAM-rate discrimination with a 30 Hz rate, although they did show some improvement with a 300 Hz rate. In addition, most trained, but not control, listeners were worse at detecting SAM at 150 Hz after, compared to before training. These results indicate that listeners can learn to improve their ability to discriminate SAM rate with multiple-hour training and that the mechanism that is modified by learning encodes (1) the pitch of SAM noise but not that of pure tones and rippled noise, (2) different SAM rates separately, and (3) differences in SAM rate more effectively than cues for SAM detection.
Collapse
Affiliation(s)
- Matthew B Fitzgerald
- Department of Communication Sciences and Disorders and Northwestern University Institute for Neuroscience, Northwestern University, 2240 Campus Drive, Evanston, Illinois 60208-3550, USA.
| | | |
Collapse
|
36
|
Füllgrabe C, Berthommier F, Lorenzi C. Masking release for consonant features in temporally fluctuating background noise. Hear Res 2005; 211:74-84. [PMID: 16289579 DOI: 10.1016/j.heares.2005.09.001] [Citation(s) in RCA: 65] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/05/2005] [Revised: 09/05/2005] [Accepted: 09/14/2005] [Indexed: 10/25/2022]
Abstract
Consonant identification was measured for normal-hearing listeners using Vowel-Consonant-Vowel stimuli that were either unprocessed or spectrally degraded to force listeners to use temporal-envelope cues. Stimuli were embedded in a steady state or fluctuating noise masker and presented at a fixed signal-to-noise ratio. Fluctuations in the maskers were obtained by applying sinusoidal modulation to: (i) the amplitude of the noise (1st-order SAM masker) or (ii) the modulation depth of a 1st-order SAM noise (2nd-order SAM masker). The frequencies of the amplitude variation fm and the depth variation f'm were systematically varied. Consistent with previous studies, identification scores obtained with unprocessed speech were highest in an 8-Hz, 1st-order SAM masker. Reception of voicing and manner also peaked around fm=8 Hz, while the reception of place of articulation was maximal at a higher frequency (fm=32 Hz). When 2nd-order SAM maskers were used, identification scores and received information for each consonant feature were found to be independent of f'm. They decreased progressively with increasing carrier modulation frequency fm, and ranged between those obtained with the steady state and the 1st-order SAM maskers. Finally, the results obtained with spectrally degraded speech were similar across all types of maskers, although an 8% improvement in the reception of voicing was observed for modulated maskers with fm < 64 Hz compared to the steady-state masker. These data provide additional evidence that listeners take advantage of temporal minima in fluctuating background noises, and suggest that: (i) minima of different durations are required for an optimal reception of the three consonant features and (ii) complex (i.e., 2nd-order) envelope fluctuations in background noise do not degrade speech identification by interfering with speech-envelope processing.
Collapse
Affiliation(s)
- Christian Füllgrabe
- Laboratoire de Psychologie Expérimentale - UMR CNRS 8581, Institut de Psychologie, Université René Descartes - Paris 5, 71 Avenue Vaillant, 92774 Boulogne-Billancourt, France.
| | | | | |
Collapse
|
37
|
Chatterjee M, Oba SI. Across- and within-channel envelope interactions in cochlear implant listeners. J Assoc Res Otolaryngol 2005; 5:360-75. [PMID: 15675001 PMCID: PMC2504569 DOI: 10.1007/s10162-004-4050-5] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022] Open
Abstract
The effects of modulated maskers on detection thresholds of a 50-Hz sinusoidal amplitude modulation (SAM) in a signal carrier were measured in nine cochlear implant (CI) listeners as a function of masker envelope type and for different masker-signal electrode separations. Both signal and masker were 200-ms-long pulse trains, presented concurrently in an interleaved stimulation mode. Masker envelopes were SAM at 20, 50, (0- and pi-phase re: the signal modulator), and 125 Hz, as well as noise amplitude modulated (NAM), all with a fixed 20% modulation depth. Comparisons were made against steady-state maskers that had an amplitude equal to the mean amplitude of the modulated maskers or to their peak amplitude (SS(peak)). Modulation thresholds were larger in the presence of the dynamic maskers versus the SS(peak) maskers; however, there was significant intersubject variability in the pattern of results. Effects of relative phase between masker and signal were not consistent across subjects. Envelope masking (the dB difference in modulation detection thresholds between modulated and SS(peak) maskers) was generally larger for the lower-modulation-frequency maskers than the 125-Hz masker. The spatial distribution of masked modulation detection thresholds was found to be considerably different from spatial forward-masking patterns obtained in the same subjects. Finally, modulation thresholds measured for a very wide separation between the masker and signal showed significant envelope masking. These results suggest that, as has been shown in acoustic stimulation, central, across-channel temporal processing mechanisms also occur in electrical stimulation.
Collapse
Affiliation(s)
- Monita Chatterjee
- Department of Auditory Implants and Perception, House Ear Institute, Los Angeles, CA 90057, USA.
| | | |
Collapse
|
38
|
Lyzenga J, Moore BCJ. Effect of frequency-modulation coherence for inharmonic stimuli: frequency-modulation phase discrimination and identification of artificial double vowels. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2005; 117:1314-1325. [PMID: 15807020 DOI: 10.1121/1.1856251] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
The ability to compare patterns of frequency modulation (FM) in separate frequency regions was explored. In experiment 1, listeners had to distinguish whether the FM applied to two nonharmonically related sinusoidal carriers was in phase or out of phase. The FM rate was the same for each carrier. The starting phase of the modulation was randomized for each stimulus in a three alternative, forced-choice (3AFC) trial. Subjects were sensitive to relative FM phase for modulation rates of 2 and 4 Hz, but not for higher rates. In experiment 2, vowel identification was compared for artificial single and double vowels. The vowels were constructed from complex tones with components spaced at 2-ERB(N) (equivalent rectangular bandwidth) intervals, by increasing the levels of three components by 15 dB, to create three "formants." In the double vowels, the components of the two vowels were interleaved, to give 1-ERB(N) spacing. The three "formant" components were frequency modulated at 2, 4, or 8 Hz, with either the same or different rates for the two vowels. The identification of double vowels was not improved by a difference in FM rate across vowels, suggesting that differences in FM rate do not support perceptual segregation of inharmonic stimuli.
Collapse
Affiliation(s)
- Johannes Lyzenga
- Department of Experimental Psychology, University of Cambridge, Cambridge CB2 3EB, United Kingdom
| | | |
Collapse
|
39
|
Kluk K, Moore BCJ. Factors affecting psychophysical tuning curves for hearing-impaired subjects with high-frequency dead regions. Hear Res 2005; 200:115-31. [PMID: 15668043 DOI: 10.1016/j.heares.2004.09.003] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/09/2004] [Accepted: 09/08/2004] [Indexed: 11/19/2022]
Abstract
A dead region (DR) is a region of the cochlea where there are no functioning inner hair cells and/or neurons. DRs can be detected using the threshold-equalizing-noise (TEN) test, but psychophysical tuning curves (PTCs) are sometimes used to give a more precise estimate of the edge frequency of a DR; a shifted tip of the PTC indicates a DR. We show here that the shapes of PTCs for hearing-impaired subjects can be influenced by the detection of beats and simple difference tones (SDTs). As a result, PTCs can have tips at f(s), even when f(s) falls in a DR. PTCs were measured for subjects with mild to moderate low-frequency and severe high-frequency hearing loss using sinusoidal and narrowband noise maskers (80-, 160-, 320-Hz wide): (1) in quiet; (2) in the presence of additional lowpass filtered noise (LF noise) designed to mask SDTs; (3) in the presence of a pair of low-frequency tones designed to interfere with the detection of beats (MDI tones). In condition (1), the PTCs were often W-shaped, with a sharp tip at f(s). This occurred less for the wider noise bandwidths. For subjects with good low-frequency hearing, the LF noise often reduced or eliminated the tip at f(s), suggesting that this tip was partly caused by detection of SDTs. For the sinusoidal and 80-Hz wide noise maskers, the addition of the MDI tones reduced the masker level required for threshold for masker frequencies adjacent to f(s), for nearly all subjects, suggesting a strong influence of beat detection. To minimize the influence of beats, we recommend using noise maskers with a bandwidth of 160 or (preferably) 320 Hz. In cases of near-normal hearing at low frequencies, we recommend using an additional LF noise to mask SDTs.
Collapse
Affiliation(s)
- Karolina Kluk
- Department of Experimental Psychology, University of Cambridge, Cambridge CB2 3EB, UK.
| | | |
Collapse
|
40
|
Grose JH, Hall JW, Buss E. Across-channel spectral processing. INTERNATIONAL REVIEW OF NEUROBIOLOGY 2005; 70:87-119. [PMID: 16472632 DOI: 10.1016/s0074-7742(05)70003-9] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Affiliation(s)
- John H Grose
- Department of Otolaryngology/Head and Neck Surgery, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | | | | |
Collapse
|
41
|
Stickney GS, Zeng FG, Litovsky R, Assmann P. Cochlear implant speech recognition with speech maskers. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2004; 116:1081-91. [PMID: 15376674 DOI: 10.1121/1.1772399] [Citation(s) in RCA: 134] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2023]
Abstract
Speech recognition performance was measured in normal-hearing and cochlear-implant listeners with maskers consisting of either steady-state speech-spectrum-shaped noise or a competing sentence. Target sentences from a male talker were presented in the presence of one of three competing talkers (same male, different male, or female) or speech-spectrum-shaped noise generated from this talker at several target-to-masker ratios. For the normal-hearing listeners, target-masker combinations were processed through a noise-excited vocoder designed to simulate a cochlear implant. With unprocessed stimuli, a normal-hearing control group maintained high levels of intelligibility down to target-to-masker ratios as low as 0 dB and showed a release from masking, producing better performance with single-talker maskers than with steady-state noise. In contrast, no masking release was observed in either implant or normal-hearing subjects listening through an implant simulation. The performance of the simulation and implant groups did not improve when the single-talker masker was a different talker compared to the same talker as the target speech, as was found in the normal-hearing control. These results are interpreted as evidence for a significant role of informational masking and modulation interference in cochlear implant speech recognition with fluctuating maskers. This informational masking may originate from increased target-masker similarity when spectral resolution is reduced.
Collapse
Affiliation(s)
- Ginger S Stickney
- University of California, Department of Otolaryngology--Head and Neck Surgery, Irvine, California 92697-1275, USA.
| | | | | | | |
Collapse
|
42
|
Kluk K, Moore BCJ. Factors affecting psychophysical tuning curves for normally hearing subjects. Hear Res 2004; 194:118-34. [PMID: 15276683 DOI: 10.1016/j.heares.2004.04.012] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/27/2004] [Accepted: 04/21/2004] [Indexed: 11/22/2022]
Abstract
These experiments were conducted to clarify the influence of beats and combination products on psychophysical tuning curves (PTCs) for normally hearing subjects. PTCs for 1- and 4-kHz sinusoidal signals were determined using as maskers a sinusoidal tone and 80-, 160-, and 320-Hz wide bands of noise. PTCs obtained using the sinusoidal masker showed distinct irregularities, particularly for masker frequencies close to the signal frequency. The PTCs determined for the noise maskers were more regular. The broader the masker, the more regular were the shapes of the PTCs. To reduce the detectability of beats produced by the interaction of the signal and masker, a pair of low-frequency tones, called "Modulation detection interference (MDI) tones", was used to introduce beats at the same rate. The MDI tones reduced the threshold level of the sinusoidal masker by up to 20 dB for frequencies within 300 Hz of the signal frequency; a similar but smaller effect was found when an 80-Hz wide masker was used. Adding a lowpass filtered (LF) noise to the sinusoidal or narrowband noise masker did not affect the low-frequency sides of the PTCs, suggesting no influence of combination products. The LF noise did affect the high-frequency sides of the PTCs, but this can be attributed to it reducing off-frequency listening. To achieve a PTC whose shape around the tip is minimally affected by beats, we propose using a noise masker with a bandwidth approximately equal to the bandwidth of the auditory filter for which the PTC is measured.
Collapse
Affiliation(s)
- Karolina Kluk
- Department of Experimental Psychology, University of Cambridge, Cambridge CB2 3EB, UK.
| | | |
Collapse
|
43
|
Gockel H, Carlyon RP, Plack CJ. Across-frequency interference effects in fundamental frequency discrimination: questioning evidence for two pitch mechanisms. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2004; 116:1092-1104. [PMID: 15376675 DOI: 10.1121/1.1766021] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Carlyon and Shackleton [J. Acoust. Soc. Am. 95, 3541-3554 (1994)] presented an influential study supporting the existence of two pitch mechanisms, one for complex tones containing resolved and one for complex tones containing only unresolved components. The current experiments provide an alternative explanation for their finding, namely the existence of across-frequency interference in fundamental frequency (F0) discrimination. Sensitivity (d') was measured for F0 discrimination between two sequentially presented 400 ms complex (target) tones containing only unresolved components. In experiment 1, the target was filtered between 1375 and 15,000 Hz, had a nominal F0 of 88 Hz, and was presented either alone or with an additional complex tone ("interferer"). The interferer was filtered between 125-625 Hz, and its F0 varied between 88 and 114.4 Hz across blocks. Sensitivity was significantly reduced in the presence of the interferer, and this effect decreased as its F0 was moved progressively further from that of the target. Experiment 2 showed that increasing the level of a synchronously gated lowpass noise that spectrally overlapped with the interferer reduced this "pitch discrimination interference (PDI)". In experiment 3A, the target was filtered between 3900 and 5400 Hz and had an F0 of either 88 or 250 Hz. It was presented either alone or with an interferer, filtered between 1375 and 1875 Hz with an F0 corresponding to the nominal target F0. PDI was larger in the presence of the resolved (250 Hz F0) than in the presence of the unresolved (88 Hz F0) interferer, presumably because the pitch of the former was more salient than that of the latter. Experiments 4A and 4B showed that PDI was reduced but not eliminated when the interferer was gated on 200 ms before and off 200 ms after the target, and that some PDI was observed with a continuous interferer. The current findings provide an alternative interpretation of a study supposedly providing strong evidence for the existence of two pitch mechanisms.
Collapse
Affiliation(s)
- Hedwig Gockel
- MRC Cognition and Brain Sciences Unit, Cambridge CB2 2EF, United Kingdom.
| | | | | |
Collapse
|
44
|
Abstract
Amplitude modulation (AM) is a temporal feature of most natural acoustic signals. A long psychophysical tradition has shown that AM is important in a variety of perceptual tasks, over a range of time scales. Technical possibilities in stimulus synthesis have reinvigorated this field and brought the modulation dimension back into focus. We address the question whether specialized neural mechanisms exist to extract AM information, and thus whether consideration of the modulation domain is essential in understanding the neural architecture of the auditory system. The available evidence suggests that this is the case. Peripheral neural structures not only transmit envelope information in the form of neural activity synchronized to the modulation waveform but are often tuned so that they only respond over a limited range of modulation frequencies. Ascending the auditory neuraxis, AM tuning persists but increasingly takes the form of tuning in average firing rate, rather than synchronization, to modulation frequency. There is a decrease in the highest modulation frequencies that influence the neural response, either in average rate or synchronization, as one records at higher and higher levels along the neuraxis. In parallel, there is an increasing tolerance of modulation tuning for other stimulus parameters such as sound pressure level, modulation depth, and type of carrier. At several anatomical levels, consideration of modulation response properties assists the prediction of neural responses to complex natural stimuli. Finally, some evidence exists for a topographic ordering of neurons according to modulation tuning. The picture that emerges is that temporal modulations are a critical stimulus attribute that assists us in the detection, discrimination, identification, parsing, and localization of acoustic sources and that this wide-ranging role is reflected in dedicated physiological properties at different anatomical levels.
Collapse
Affiliation(s)
- P X Joris
- Laboratory of Auditory Neurophysiology, K.U. Leuven, Campus Gasthuisberg, B-3000 Leuven, Belgium.
| | | | | |
Collapse
|
45
|
Saberi K, Tirtabudi P, Petrosyan A, Perrott DR, Strybel TZ. Concurrent motion detection based on dynamic changes in interaural delay. Hear Res 2002; 174:149-57. [PMID: 12433406 DOI: 10.1016/s0378-5955(02)00652-4] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
The ability to detect a dynamic change in the interaural delay of a pure tone in the presence of a distracter tone of a different frequency was investigated in four conditions: (1) a control condition in which no distracter tone was present, (2) the distracter tone was stationary (fixed interaural delay), (3) the distracter had an interaural delay that changed in the same direction as that of the target tone, i.e., concurrent auditory motion in the same direction, and (4) the distracter had an interaural delay that changed in a direction opposite to that of the target tone, i.e., concurrent auditory motion in opposite directions. In a cued single-interval two-alternative forced-choice design, the observer had to determine if the target tone had a constant or dynamic interaural delay. The target was a 500-Hz tone and the distracter was a tone with a frequency of 300, 510, 550, 600, 800, or 1000 Hz. Detection was also examined for a range of stimulus durations, rates of change in interaural delay (i.e., velocity), and extent of change in interaural time difference (i.e., 'distance'). Results showed that the best performance (highest d') was associated with the no-distracter condition, followed by the stationary-distracter, opposite-direction, and same-direction conditions, respectively. Detection improved with increasing frequency difference between distracter and target tones, but was nonetheless lower than that associated with the no-distracter condition, even when the distracter frequency was several critical bands removed from the target frequency.
Collapse
Affiliation(s)
- Kourosh Saberi
- Department of Cognitive Sciences, University of California, Irvine, CA 92697, USA.
| | | | | | | | | |
Collapse
|
46
|
Gockel H, Carlyon RP, Deeks JM. Effect of modulator asynchrony of sinusoidal and noise modulators on frequency and amplitude modulation detection interference. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2002; 112:2975-2984. [PMID: 12509019 DOI: 10.1121/1.1512291] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
The effect on modulation detection interference (MDI) of timing of gating of the modulation of target and interferer, with synchronously gated carriers, was investigated in three experiments. In a two-interval, two-alternative forced choice adaptive procedure, listeners had to detect 15 Hz sinusoidal amplitude modulation (AM) or frequency modulation (FM) imposed for 200 ms in the temporal center of a 600 ms target sinusoidal carrier. In the first experiment, 15 Hz sinusoidal FM was imposed in phase on both target and interferer carriers. Thresholds were lower for nonoverlapping than for synchronous modulation of target and interferer, but MDI still occurred for the former. Thresholds were significantly higher when the modulators were gated synchronously than when the interferer modulator was gated on before and off after that of the target. This contrasts with the findings of Oxenham and Dau [J. Acoust. Soc. Am. 110, 402-408 (2001)], who reported no effect of modulation asynchrony on AM detection thresholds, using a narrowband noise modulator. Using FM, experiment 2 showed that for temporally overlapping modulation of target and interferer, modulator asynchrony had no significant effect when the interferer was modulated by a narrowband noise. Experiment 3 showed that, for AM, synchronous gating of modulation of the target and interferer produced lower thresholds than asynchronous gating, especially for sinusoidal modulation of the interferer. Results are discussed in terms of specific cues available for periodic modulation, and differences between perceptual grouping on the basis of common AM and FM.
Collapse
Affiliation(s)
- Hedwig Gockel
- CNBH, Department of Physiology, University of Cambridge, Downing Street, Cambridge CB2 3EG, United Kingdom.
| | | | | |
Collapse
|
47
|
Abstract
Natural sounds often contain energy over a broad spectral range and consequently overlap in frequency when they occur simultaneously; however, such sounds under normal circumstances can be distinguished perceptually (e.g., the cocktail party effect). Sound components arising from different sources have distinct (i.e., incoherent) modulations, and incoherence appears to be one important cue used by the auditory system to segregate sounds into separately perceived acoustic objects. Here we show that, in the primary auditory cortex of awake marmoset monkeys, many neurons responsive to amplitude- or frequency-modulated tones at a particular carrier frequency [the characteristic frequency (CF)] also demonstrate sensitivity to the relative modulation phase between two otherwise identically modulated tones: one at CF and one at a different carrier frequency. Changes in relative modulation phase reflect alterations in temporal coherence between the two tones, and the most common neuronal response was found to be a maximum of suppression for the coherent condition. Coherence sensitivity was generally found in a narrow frequency range in the inhibitory portions of the frequency response areas (FRA), indicating that only some off-CF neuronal inputs into these cortical neurons interact with on-CF inputs on the same time scales. Over the population of neurons studied, carrier frequencies showing coherence sensitivity were found to coincide with the carrier frequencies of inhibition, implying that inhibitory inputs create the effect. The lack of strong coherence-induced facilitation also supports this interpretation. Coherence sensitivity was found to be greatest for modulation frequencies of 16-128 Hz, which is higher than the phase-locking capability of most cortical neurons, implying that subcortical neurons could play a role in the phenomenon. Collectively, these results reveal that auditory cortical neurons receive some off-CF inputs temporally matched and some temporally unmatched to the on-CF input(s) and respond in a fashion that could be utilized by the auditory system to segregate natural sounds containing similar spectral components (such as vocalizations from multiple conspecifics) based on stimulus coherence.
Collapse
Affiliation(s)
- Dennis L Barbour
- Laboratory of Auditory Neurophysiology, Department of Biomedical Engineering, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA.
| | | |
Collapse
|
48
|
Bacon SP, Opie JM. Modulation detection interference in listeners with normal and impaired hearing. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2002; 45:392-402. [PMID: 12003520 DOI: 10.1044/1092-4388(2002/031)] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
Listeners were asked to detect amplitude modulation (AM) of a target (or signal) carrier that was presented in isolation or in the presence of an additional (masker) carrier. The signal was modulated at a rate of 10 Hz, and the masker was unmodulated or was modulated at a rate of 2, 10, or 40 Hz. Nine listeners had normal hearing, 4 had a bilateral hearing loss, and 4 had a unilateral hearing loss; those with a unilateral loss were tested in both ears. The listeners with a hearing loss had normal hearing at 1 kHz and a 30- to 40-dB loss at 4 kHz. The carrier frequencies were 984 and 3952 Hz. In one set of conditions, the lower frequency carrier was the signal and the higher frequency carrier was the masker. In the other set, the reverse was true. For the impaired ears, the carriers were presented at 70 dB SPL. For the normal ears, either the carriers were both presented at 70 dB SPL or the higher frequency carrier was reduced to 40 dB SPL to simulate the lower sensation level experienced by the impaired ears. There was considerable individual variability in the results, and there was no clear effect of hearing loss. These results suggest that a mild, presumably cochlear hearing loss does not affect the ability to process AM in one frequency region in the presence of competing AM from another region.
Collapse
Affiliation(s)
- Sid P Bacon
- Department of Speech and Hearing Science, Arizona State University, Tempe 85287-1908, USA.
| | | |
Collapse
|
49
|
Turgeon M, Bregman AS, Ahad PA. Rhythmic masking release: contribution of cues for perceptual organization to the cross-spectral fusion of concurrent narrow-band noises. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2002; 111:1819-1831. [PMID: 12002866 DOI: 10.1121/1.1453450] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
The contribution of temporal asynchrony, spatial separation, and frequency separation to the cross-spectral fusion of temporally contiguous brief narrow-band noise bursts was studied using the Rhythmic Masking Release paradigm (RMR). RMR involves the discrimination of one of two possible rhythms, despite perceptual masking of the rhythm by an irregular sequence of sounds identical to the rhythmic bursts, interleaved among them. The release of the rhythm from masking can be induced by causing the fusion of the irregular interfering sounds with concurrent "flanking" sounds situated in different frequency regions. The accuracy and the rated clarity of the identified rhythm in a 2-AFC procedure were employed to estimate the degree of fusion of the interferring sounds with flanking sounds. The results suggest that while synchrony fully fuses short-duration noise bursts across frequency and across space (i.e., across ears and loudspeakers), an asynchrony of 20-40 ms produces no fusion. Intermediate asynchronies of 10-20 ms produce partial fusion, where the presence of other cues is critical for unambiguous grouping. Though frequency and spatial separation reduced fusion, neither of these manipulations was sufficient to abolish it. For the parameters varied in this study, stimulus onset asynchrony was the dominant cue determining fusion, but there were additive effects of the other cues. Temporal synchrony appears to be critical in determining whether brief sounds with abrupt onsets and offsets are heard as one event or more than one.
Collapse
Affiliation(s)
- Martine Turgeon
- Psychology Department, McGill University, Montreal, Quebec, Canada.
| | | | | |
Collapse
|
50
|
Schulze H, Hess A, Ohl FW, Scheich H. Superposition of horseshoe-like periodicity and linear tonotopic maps in auditory cortex of the Mongolian gerbil. Eur J Neurosci 2002; 15:1077-84. [PMID: 11918666 DOI: 10.1046/j.1460-9568.2002.01935.x] [Citation(s) in RCA: 49] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
The segregation of an individual sound from a mixture of concurrent sounds, the so-called cocktail-party phenomenon, is a fundamental and largely unexplained capability of the auditory system. Speaker recognition involves grouping of the various spectral (frequency) components of an individual's voice and segregating them from other competing voices. The important parameter for grouping may be the periodicity of sound waves because the spectral components of a given voice have one periodicity, viz. fundamental frequency, as their common denominator. To determine the relationship between the representations of spectral content and periodicity in the primary auditory cortex (AI), we used optical recording of intrinsic signals and electrophysiological mapping in Mongolian gerbils (Meriones unguiculatus). We found that periodicity maps as an almost circular gradient superimposed on the linear tonotopic gradient in the low frequency part of AI. This geometry of the periodicity map may imply competitive signal processing in support of the theory of "winner-takes-all".
Collapse
Affiliation(s)
- Holger Schulze
- Leibniz Institute for Neurobiology, Brenneckestrasse 6, 39118 Magdeburg, Germany.
| | | | | | | |
Collapse
|