1
Pandey PR, Herrmann B. The Influence of Semantic Context on the Intelligibility Benefit From Speech Glimpses in Younger and Older Adults. J Speech Lang Hear Res. 2025;68:2499-2516. PMID: 40233803. DOI: 10.1044/2025_jslhr-24-00588.
Abstract
PURPOSE: Speech is often masked by background sound that fluctuates over time. Fluctuations in masker intensity can reveal glimpses of speech that support speech intelligibility, but older adults have frequently been shown to benefit less from speech glimpses than younger adults when listening to sentences. Recent work, however, suggests that older adults may leverage speech glimpses as much as, or more than, younger adults when listening to naturalistic stories, potentially because of the availability of semantic context in stories. The current study directly investigated whether semantic context helps older adults benefit from speech glimpses released by a fluctuating (modulated) masker more than younger adults.
METHOD: In two experiments, we reduced and extended the semantic information of sentence stimuli in modulated and unmodulated speech maskers for younger and older adults. Speech intelligibility was assessed.
RESULTS: Semantic context improved speech intelligibility in both younger and older adults. Both age groups also showed better speech intelligibility for a modulated than an unmodulated (stationary) masker, but the benefit from the speech glimpses was reduced in older compared to younger adults. Semantic context amplified the benefit gained from the speech glimpses, but there was no indication that this amplification led to a greater benefit in older adults; if anything, younger adults benefited more.
CONCLUSIONS: The current results suggest that the deficit in the masking-release benefit in older adults generalizes to situations in which extended speech context is available. That previous research found a greater benefit in older than younger adults during story listening may indicate that other factors, such as thematic knowledge, motivation, or cognition, amplify the benefit from speech glimpses under naturalistic listening conditions.
Affiliation(s)
- Priya R Pandey
- Rotman Research Institute, Baycrest Academy for Research and Education, Toronto, Ontario, Canada
- Department of Psychology, University of Toronto, Ontario, Canada
- Björn Herrmann
- Rotman Research Institute, Baycrest Academy for Research and Education, Toronto, Ontario, Canada
- Department of Psychology, University of Toronto, Ontario, Canada
2
Gao Z, Oxenham AJ. Adaptation to sentences and melodies when making judgments along a voice-nonvoice continuum. Atten Percept Psychophys. 2025;87:1022-1032. PMID: 40000570. DOI: 10.3758/s13414-025-03030-9.
Abstract
Adaptation to constant or repetitive sensory signals serves to improve detection of novel events in the environment and to encode incoming information more efficiently. Within the auditory modality, contrastive adaptation effects have been observed within a number of categories, including voice and musical instrument type. A recent study found contrastive perceptual shifts between voice and instrument categories following repetitive presentation of adaptors consisting of either vowels or instrument tones. The current study tested the generalizability of adaptation along a voice-instrument continuum, using more ecologically valid adaptors. Participants were presented with an adaptor followed by an ambiguous voice-instrument target, created by generating a 10-step morphed continuum between pairs of vowel and instrument sounds. Listeners' categorization of the target sounds was shifted contrastively by a spoken sentence or instrumental melody adaptor, regardless of whether the adaptor and the target shared the same speaker gender or similar pitch range (Experiment 1). However, no significant contrastive adaptation was observed when nonspeech vocalizations or nonpitched percussion sounds were used as the adaptors (Experiment 2). The results suggest that adaptation between voice and nonvoice categories does not rely on exact repetition of simple stimuli, nor does it solely reflect the result of a sound being categorized as being human or nonhuman sourced. The outcomes suggest future directions for determining the precise spectro-temporal properties of sounds that induce these voice-instrument contrastive adaptation effects.
Affiliation(s)
- Zi Gao
- Department of Psychology, University of Minnesota-Twin Cities, 75 E River Rd, Minneapolis, MN, 55455, USA.
- Andrew J Oxenham
- Department of Psychology, University of Minnesota-Twin Cities, 75 E River Rd, Minneapolis, MN, 55455, USA
3
Borrie SA, Tetzloff KA, Barrett TS, Lansford KL. Increasing Motivation Increases Intelligibility Benefits of Perceptual Training in Dysarthria. Am J Speech Lang Pathol. 2025;34:85-96. PMID: 39504442. PMCID: PMC11745309. DOI: 10.1044/2024_ajslp-24-00196.
Abstract
PURPOSE: Perceptual training offers a promising, listener-targeted option for improving intelligibility of dysarthric speech. Cognitive resources are required for learning, and theoretical models of listening effort and engagement account for a role of listener motivation in allocation of such resources. Here, we manipulate training instructions to enhance motivation to test the hypothesis that increased motivation increases the intelligibility benefits of perceptual training.
METHOD: Across two data collection sites, which differed with respect to many elements of study design including age of speaker with dysarthria, dysarthria type and severity, type of testing and training stimuli, and participant compensation, 84 neurotypical adults were randomly assigned to one of two training instruction conditions: enhanced instructions or standard instructions. Intelligibility, quantified as percent words correct, was measured before and after training.
RESULTS: Listeners who received the enhanced instructions achieved greater intelligibility improvements from training relative to listeners who received the standard instructions. This result was robust across data collection sites and the many differences in methodology.
CONCLUSIONS: This study provides evidence for the role of motivation in improved understanding of dysarthric speech: increasing motivation increases allocation of cognitive resources to the learning process, resulting in improved mapping of the degraded speech signal. This provides empirical support for theoretical models of listening effort and engagement. Clinically, the results show that a simple addition to the training instructions can elevate learning outcomes.
Affiliation(s)
- Stephanie A. Borrie
- Department of Communicative Disorders and Deaf Education, Utah State University, Logan
- Katerina A. Tetzloff
- Department of Communicative Disorders and Deaf Education, Utah State University, Logan
- Tyson S. Barrett
- Department of Communicative Disorders and Deaf Education, Utah State University, Logan
- Kaitlin L. Lansford
- Department of Communication Science and Disorders, Florida State University, Tallahassee
4
Smith ML, Winn MB. Repairing Misperceptions of Words Early in a Sentence Is More Effortful Than Repairing Later Words, Especially for Listeners With Cochlear Implants. Trends Hear. 2025;29:23312165251320789. PMID: 39995109. PMCID: PMC11851752. DOI: 10.1177/23312165251320789.
Abstract
The process of repairing misperceptions has been identified as a contributor to effortful listening in people who use cochlear implants (CIs). The current study was designed to examine the relative cost of repairing misperceptions at earlier or later parts of a sentence that contained contextual information that could be used to infer words both predictively and retroactively. Misperceptions were enforced at specific times by replacing single words with noise. Changes in pupil dilation were analyzed to track differences in the timing and duration of effort, comparing listeners with typical hearing (TH) or with CIs. Increases in pupil dilation were time-locked to the moment of the missing word, with longer-lasting increases when the missing word was earlier in the sentence. Compared to listeners with TH, CI listeners showed elevated pupil dilation for longer periods of time after listening, suggesting a lingering effect of effort after sentence offset. When needing to mentally repair missing words, CI listeners also made more mistakes on words elsewhere in the sentence, even though these words were not masked. Changes in effort based on the position of the missing word were not evident in basic measures like peak pupil dilation and only emerged when the full time course was analyzed, suggesting the timing analysis adds new information to our understanding of listening effort. These results demonstrate that some mistakes are more costly than others and incur different levels of mental effort to resolve the mistake, underscoring the information lost when characterizing speech perception with simple measures like percent-correct scores.
Affiliation(s)
- Michael L. Smith
- Department of Speech-Language-Hearing Sciences, University of Minnesota, Minneapolis, MN, USA
- Matthew B. Winn
- Department of Speech-Language-Hearing Sciences, University of Minnesota, Minneapolis, MN, USA
5
Lee J, Oxenham AJ. Testing the role of temporal coherence on speech intelligibility with noise and single-talker maskers. J Acoust Soc Am. 2024;156:3285-3297. PMID: 39545746. PMCID: PMC11575144. DOI: 10.1121/10.0034420.
Abstract
Temporal coherence, where sounds with aligned timing patterns are perceived as a single source, is considered an essential cue in auditory scene analysis. However, its effects have been studied primarily with simple repeating tones, rather than speech. This study investigated the role of temporal coherence in speech by introducing across-frequency asynchronies. The effect of asynchrony on the intelligibility of target sentences was tested in the presence of background speech-shaped noise or a single-talker interferer. Our hypothesis was that disrupting temporal coherence should not only reduce intelligibility but also impair listeners' ability to segregate the target speech from an interfering talker, leading to greater degradation for speech-in-speech than speech-in-noise tasks. Stimuli were filtered into eight frequency bands, which were then desynchronized with delays of 0-120 ms. As expected, intelligibility declined as asynchrony increased. However, the decline was similar for both noise and single-talker maskers. Performance was affected primarily by asynchrony in the target, rather than in the masker, for both natural (forward) and reversed-speech maskers, and for target sentences with low and high semantic context. The results suggest that temporal coherence may not be as critical a cue for speech segregation as it is for the non-speech stimuli traditionally used in studies of auditory scene analysis.
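For readers who want to see the band-desynchronization manipulation concretely, the sketch below (not from the paper) splits a waveform into eight frequency bands and delays each band by a different amount before summing them back together. The Butterworth filter design, band edges, and delay assignments are illustrative assumptions rather than the authors' actual stimulus-processing parameters.

```python
# Hypothetical illustration of across-frequency desynchronization (not the authors' code).
import numpy as np
from scipy.signal import butter, sosfiltfilt

def desynchronize_bands(signal, fs, band_edges_hz, delays_ms):
    """Split `signal` into band-pass filtered bands, delay each band, and sum."""
    max_shift = int(round(max(delays_ms) / 1000.0 * fs))
    out = np.zeros(len(signal) + max_shift)
    for (lo, hi), delay_ms in zip(band_edges_hz, delays_ms):
        sos = butter(4, [lo, hi], btype="bandpass", fs=fs, output="sos")  # assumed filter design
        band = sosfiltfilt(sos, signal)             # zero-phase band-pass filtering
        shift = int(round(delay_ms / 1000.0 * fs))  # delay in samples for this band
        out[shift:shift + len(band)] += band
    return out

# Example: eight log-spaced bands between 100 Hz and 7 kHz, delays spanning 0-120 ms.
fs = 22050
edges = np.geomspace(100, 7000, 9)
bands = list(zip(edges[:-1], edges[1:]))
delays = np.linspace(0, 120, 8)
x = np.random.randn(fs)  # stand-in for a one-second speech waveform
y = desynchronize_bands(x, fs, bands, delays)
```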
Affiliation(s)
- Jaeeun Lee
- Department of Psychology, University of Minnesota, Minneapolis, Minnesota 55455, USA
- Andrew J Oxenham
- Department of Psychology, University of Minnesota, Minneapolis, Minnesota 55455, USA
6
Borrie SA, Hepworth TJ, Wynn CJ, Hustad KC, Barrett TS, Lansford KL. Perceptual Learning of Dysarthria in Adolescence. J Speech Lang Hear Res. 2023;66:3791-3803. PMID: 37616225. PMCID: PMC10713018. DOI: 10.1044/2023_jslhr-23-00231.
Abstract
PURPOSE: As evidenced by perceptual learning studies involving adult listeners and speakers with dysarthria, adaptation to dysarthric speech is driven by signal predictability (speaker property) and a flexible speech perception system (listener property). Here, we extend adaptation investigations to adolescent populations and examine whether adult and adolescent listeners can learn to better understand an adolescent speaker with dysarthria.
METHOD: Classified by developmental stage, adult (n = 42) and adolescent (n = 40) listeners completed a three-phase perceptual learning protocol (pretest, familiarization, and posttest). During pretest and posttest, all listeners transcribed speech produced by a 13-year-old adolescent with spastic dysarthria associated with cerebral palsy. During familiarization, half of the adult and adolescent listeners engaged in structured familiarization (audio and lexical feedback) with the speech of the adolescent speaker with dysarthria; and the other half, with the speech of a neurotypical adolescent speaker (control).
RESULTS: Intelligibility scores increased from pretest to posttest for all listeners. However, listeners who received dysarthria familiarization achieved greater intelligibility improvements than those who received control familiarization. Furthermore, there was a significant effect of developmental stage, where the adults achieved greater intelligibility improvements relative to the adolescents.
CONCLUSIONS: This study provides the first tranche of evidence that adolescent dysarthric speech is learnable, a finding that holds even for adolescent listeners whose speech perception systems are not yet fully developed. Given the formative role that social interactions play during adolescence, these findings of improved intelligibility afford important clinical implications.
Affiliation(s)
- Stephanie A. Borrie
- Department of Communicative Disorders and Deaf Education, Utah State University, Logan
- Taylor J. Hepworth
- Department of Communicative Disorders and Deaf Education, Utah State University, Logan
- Camille J. Wynn
- Department of Communication Science and Disorders, University of Houston
- Katherine C. Hustad
- Waisman Center, University of Wisconsin–Madison
- Department of Communication Sciences and Disorders, University of Wisconsin–Madison
- Kaitlin L. Lansford
- Department of Communication Science and Disorders, Florida State University, Tallahassee
7
Wasiuk PA, Buss E, Oleson JJ, Calandruccio L. Predicting speech-in-speech recognition: Short-term audibility, talker sex, and listener factors. J Acoust Soc Am. 2022;152:3010. PMID: 36456289. DOI: 10.1121/10.0015228.
Abstract
Speech-in-speech recognition can be challenging, and listeners vary considerably in their ability to accomplish this complex auditory-cognitive task. Variability in performance can be related to intrinsic listener factors as well as stimulus factors associated with energetic and informational masking. The current experiments characterized the effects of short-term audibility of the target, differences in target and masker talker sex, and intrinsic listener variables on sentence recognition in two-talker speech and speech-shaped noise. Participants were young adults with normal hearing. Each condition included the adaptive measurement of speech reception thresholds, followed by testing at a fixed signal-to-noise ratio (SNR). Short-term audibility for each keyword was quantified using a computational glimpsing model for target+masker mixtures. Scores on a psychophysical task of auditory stream segregation predicted speech recognition, with stronger effects for speech-in-speech than speech-in-noise. Both speech-in-speech and speech-in-noise recognition depended on the proportion of audible glimpses available in the target+masker mixture, even across stimuli presented at the same global SNR. Short-term audibility requirements varied systematically across stimuli, providing an estimate of the greater informational masking for speech-in-speech than speech-in-noise recognition and quantifying informational masking for matched and mismatched talker sex.
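As a rough illustration of the glimpsing idea invoked above, the sketch below computes a simple glimpse proportion: the fraction of time-frequency cells in which the local target-to-masker ratio exceeds a criterion. The STFT front end and the 0 dB criterion are simplifying assumptions; the study's actual computational glimpsing model and its parameters may differ.

```python
# Hypothetical glimpse-proportion sketch (not the model used in the study).
import numpy as np
from scipy.signal import stft

def glimpse_proportion(target, masker, fs, criterion_db=0.0):
    """Fraction of time-frequency cells where target power exceeds masker power by `criterion_db`."""
    _, _, T = stft(target, fs=fs, nperseg=512)
    _, _, M = stft(masker, fs=fs, nperseg=512)
    eps = 1e-12  # guard against log of zero
    local_snr_db = 10.0 * np.log10((np.abs(T) ** 2 + eps) / (np.abs(M) ** 2 + eps))
    return float(np.mean(local_snr_db > criterion_db))

# Example with synthetic signals: a 500 Hz tone "target" and a white-noise "masker".
fs = 16000
t = np.arange(fs) / fs
target = 0.1 * np.sin(2 * np.pi * 500.0 * t)
masker = 0.05 * np.random.randn(fs)
print(glimpse_proportion(target, masker, fs))
```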
Affiliation(s)
- Peter A Wasiuk
- Department of Psychological Sciences, 11635 Euclid Avenue, Case Western Reserve University, Cleveland, Ohio 44106, USA
- Emily Buss
- Department of Otolaryngology/Head and Neck Surgery, 170 Manning Drive, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, USA
- Jacob J Oleson
- Department of Biostatistics, 145 North Riverside Drive, University of Iowa, Iowa City, Iowa 52242, USA
- Lauren Calandruccio
- Department of Psychological Sciences, 11635 Euclid Avenue, Case Western Reserve University, Cleveland, Ohio 44106, USA
8
Cowan T, Paroby C, Leibold LJ, Buss E, Rodriguez B, Calandruccio L. Masked-Speech Recognition for Linguistically Diverse Populations: A Focused Review and Suggestions for the Future. J Speech Lang Hear Res. 2022;65:3195-3216. PMID: 35917458. PMCID: PMC9911100. DOI: 10.1044/2022_jslhr-22-00011.
Abstract
PURPOSE: Twenty years ago, von Hapsburg and Peña (2002) wrote a tutorial that reviewed the literature on speech audiometry and bilingualism and outlined valuable recommendations to increase the rigor of the evidence base. This review article returns to that seminal tutorial to reflect on how that advice was applied over the last 20 years and to provide updated recommendations for future inquiry.
METHOD: We conducted a focused review of the literature on masked-speech recognition for bilingual children and adults. First, we evaluated how studies published since 2002 described bilingual participants. Second, we reviewed the literature on native language masked-speech recognition. Third, we discussed theoretically motivated experimental work. Fourth, we outlined how recent research in bilingual speech recognition can be used to improve clinical practice.
RESULTS: Research conducted since 2002 commonly describes bilingual samples in terms of their language status, competency, and history. Bilingualism was not consistently associated with poor masked-speech recognition. For example, bilinguals who were exposed to English prior to age 7 years and who were dominant in English performed comparably to monolinguals for masked-sentence recognition tasks. To the best of our knowledge, there are no data to document the masked-speech recognition ability of these bilinguals in their other language compared to a second monolingual group, which is an important next step. Nonetheless, individual factors that commonly vary within bilingual populations were associated with masked-speech recognition and included language dominance, competency, and age of acquisition. We identified methodological issues in sampling strategies that could, in part, be responsible for inconsistent findings between studies. For instance, disparities in socioeconomic status (SES) between recruited bilingual and monolingual groups could cause confounding bias within the research design.
CONCLUSIONS: Dimensions of the bilingual linguistic profile should be considered in clinical practice to inform counseling and (re)habilitation strategies since susceptibility to masking is elevated in at least one language for most bilinguals. Future research should continue to report language status, competency, and history but should also report language stability and demand for use data. In addition, potential confounds (e.g., SES, educational attainment) when making group comparisons between monolinguals and bilinguals must be considered.
Affiliation(s)
- Tiana Cowan
- Center for Hearing Research, Boys Town National Research Hospital, Omaha, NE
- Caroline Paroby
- Department of Psychological Sciences, Case Western Reserve University, Cleveland, OH
- Lori J. Leibold
- Center for Hearing Research, Boys Town National Research Hospital, Omaha, NE
- Emily Buss
- Department of Otolaryngology/Head and Neck Surgery, The University of North Carolina at Chapel Hill
- Barbara Rodriguez
- Department of Speech and Hearing Sciences, The University of New Mexico, Albuquerque
- Lauren Calandruccio
- Department of Psychological Sciences, Case Western Reserve University, Cleveland, OH
9
Pragt L, van Hengel P, Grob D, Wasmann JWA. Preliminary Evaluation of Automated Speech Recognition Apps for the Hearing Impaired and Deaf. Front Digit Health. 2022;4:806076. PMID: 35252959. PMCID: PMC8889114. DOI: 10.3389/fdgth.2022.806076.
Abstract
Objective: Automated speech recognition (ASR) systems have become increasingly sophisticated, accurate, and deployable on many digital devices, including smartphones. This pilot study aims to examine the speech recognition performance of ASR apps using audiological speech tests. In addition, we compare ASR speech recognition performance to that of normal-hearing and hearing-impaired listeners and evaluate whether standard clinical audiological tests are a meaningful and quick measure of the performance of ASR apps.
Methods: Four apps were tested on a smartphone: AVA, Earfy, Live Transcribe, and Speechy. The Dutch audiological speech tests performed were speech audiometry in quiet (Dutch CNC test), the Digits-in-Noise (DIN) test with steady-state speech-shaped noise, and sentences in quiet and in averaged long-term speech-shaped spectrum noise (Plomp test). For comparison, each app's ability to transcribe a spoken dialogue (Dutch and English) was tested.
Results: All apps scored at least 50% phonemes correct on the Dutch CNC test at a conversational speech intensity level (65 dB SPL) and achieved 90–100% phoneme recognition at higher intensity levels. On the DIN test, AVA and Live Transcribe had the lowest (best) signal-to-noise ratio of +8 dB. The lowest signal-to-noise ratio measured with the Plomp test was +8 to 9 dB, for Earfy (Android) and Live Transcribe (Android). Overall, the word error rate for the dialogue in English (19–34%) was lower (better) than for the Dutch dialogue (25–66%).
Conclusion: The performance of the apps was limited on audiological tests that provide little linguistic context or use low signal-to-noise ratios. For Dutch audiological speech tests in quiet, ASR apps performed similarly to a person with a moderate hearing loss. In noise, the ASR apps performed more poorly than most profoundly deaf people using a hearing aid or cochlear implant. Adding new performance metrics, including the semantic difference as a function of SNR and reverberation time, could help to monitor and further improve ASR performance.
Affiliation(s)
- Leontien Pragt
- Department of Otorhinolaryngology, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center Nijmegen, Nijmegen, Netherlands
- Correspondence: Leontien Pragt
- Peter van Hengel
- Department of Otorhinolaryngology, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center Nijmegen, Nijmegen, Netherlands
- Pento Audiological Center Twente, Hengelo, Netherlands
- Dagmar Grob
- Department of Medical Imaging, Radboud University Medical Center, Nijmegen, Netherlands
- Jan-Willem A. Wasmann
- Department of Otorhinolaryngology, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center Nijmegen, Nijmegen, Netherlands
10
Ratnanather JT, Wang LC, Bae SH, O'Neill ER, Sagi E, Tward DJ. Visualization of Speech Perception Analysis via Phoneme Alignment: A Pilot Study. Front Neurol. 2022;12:724800. PMID: 35087462. PMCID: PMC8787339. DOI: 10.3389/fneur.2021.724800.
Abstract
Objective: Speech tests assess the ability of people with hearing loss to comprehend speech with a hearing aid or cochlear implant. The tests are usually at the word or sentence level. However, few tests analyze errors at the phoneme level, so there is a need for an automated program to visualize, in real time, the accuracy of phonemes in these tests.
Method: The program reads in stimulus-response pairs and obtains their phonemic representations from an open-source digital pronouncing dictionary. The stimulus phonemes are aligned with the response phonemes via a modification of the Levenshtein minimum edit distance algorithm. Alignment is achieved via dynamic programming with costs for insertions, deletions, and substitutions modified on the basis of phonological features. The accuracy for each phoneme is based on the F1-score. Accuracy is visualized with respect to place and manner (consonants) or height (vowels). Confusion matrices for the phonemes are used in an information transfer analysis of ten phonological features. A histogram of the information transfer for the features over a frequency-like range is presented as a phonemegram.
Results: The program was applied to two datasets. One consisted of test data at the sentence and word levels. Stimulus-response sentence pairs from six volunteers with different degrees of hearing loss and modes of amplification were analyzed. Four volunteers listened to sentences from a mobile auditory training app, while two listened to sentences from a clinical speech test. Stimulus-response word pairs from three lists were also analyzed. The other dataset consisted of published stimulus-response pairs from experiments in which 31 participants with cochlear implants listened to 400 Basic English Lexicon sentences spoken by different talkers at four different SNR levels. In all cases, visualization was obtained in real time. Analysis of 12,400 actual and random pairs showed that the program was robust to the nature of the pairs.
Conclusion: It is possible to automate the alignment of phonemes extracted from stimulus-response pairs from speech tests in real time. The alignment then makes it possible to visualize the accuracy of responses via phonological features in two ways. Such visualization of phoneme alignment and accuracy could aid clinicians and scientists.
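To make the alignment step concrete, here is a minimal sketch of a Levenshtein-style dynamic program over phoneme sequences. The unit insertion/deletion costs and the all-or-nothing substitution cost are placeholders: the program described above weights these costs by phonological-feature similarity and obtains the phoneme strings from a pronouncing dictionary.

```python
# Hypothetical sketch of phoneme alignment by dynamic programming (placeholder costs).
def align_phonemes(stimulus, response, ins_cost=1.0, del_cost=1.0):
    """Align two phoneme sequences; return (total cost, aligned pairs)."""
    def sub_cost(a, b):
        # Placeholder: 0 for a match, 1 otherwise; the published program uses
        # phonological-feature-based costs instead.
        return 0.0 if a == b else 1.0

    n, m = len(stimulus), len(response)
    d = [[0.0] * (m + 1) for _ in range(n + 1)]  # cumulative-cost table
    for i in range(1, n + 1):
        d[i][0] = d[i - 1][0] + del_cost
    for j in range(1, m + 1):
        d[0][j] = d[0][j - 1] + ins_cost
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d[i][j] = min(
                d[i - 1][j] + del_cost,                                        # deletion
                d[i][j - 1] + ins_cost,                                        # insertion
                d[i - 1][j - 1] + sub_cost(stimulus[i - 1], response[j - 1]),  # match/substitution
            )

    # Trace back through the table to recover the alignment.
    pairs, i, j = [], n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and d[i][j] == d[i - 1][j - 1] + sub_cost(stimulus[i - 1], response[j - 1]):
            pairs.append((stimulus[i - 1], response[j - 1])); i -= 1; j -= 1
        elif i > 0 and d[i][j] == d[i - 1][j] + del_cost:
            pairs.append((stimulus[i - 1], None)); i -= 1   # phoneme deleted in the response
        else:
            pairs.append((None, response[j - 1])); j -= 1   # phoneme inserted in the response
    return d[n][m], list(reversed(pairs))

# Example: stimulus "cat" /K AE T/ reported as "hat" /HH AE T/.
print(align_phonemes(["K", "AE", "T"], ["HH", "AE", "T"]))
```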
Affiliation(s)
- J Tilak Ratnanather
- Center for Imaging Science and Institute for Computational Medicine, Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, United States
- Lydia C Wang
- Center for Imaging Science and Institute for Computational Medicine, Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, United States
- Seung-Ho Bae
- Center for Imaging Science and Institute for Computational Medicine, Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, United States
- Erin R O'Neill
- Center for Applied and Translational Sensory Sciences, University of Minnesota, Minneapolis, MN, United States
- Elad Sagi
- Department of Otolaryngology, New York University School of Medicine, New York, NY, United States
- Daniel J Tward
- Center for Imaging Science and Institute for Computational Medicine, Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, United States
- Departments of Computational Medicine and Neurology, University of California, Los Angeles, Los Angeles, CA, United States
11
O'Neill ER, Parke MN, Kreft HA, Oxenham AJ. Role of semantic context and talker variability in speech perception of cochlear-implant users and normal-hearing listeners. J Acoust Soc Am. 2021;149:1224. PMID: 33639827. PMCID: PMC7895533. DOI: 10.1121/10.0003532.
Abstract
This study assessed the impact of semantic context and talker variability on speech perception by cochlear-implant (CI) users and compared their overall performance and between-subjects variance with that of normal-hearing (NH) listeners under vocoded conditions. Thirty post-lingually deafened adult CI users were tested, along with 30 age-matched and 30 younger NH listeners, on sentences with and without semantic context, presented in quiet and noise, spoken by four different talkers. Additional measures included working memory, non-verbal intelligence, and spectral-ripple detection and discrimination. Semantic context and between-talker differences influenced speech perception to similar degrees for both CI users and NH listeners. Between-subjects variance for speech perception was greatest in the CI group but remained substantial in both NH groups, despite the uniformly degraded stimuli in these two groups. Spectral-ripple detection and discrimination thresholds in CI users were significantly correlated with speech perception, but a single set of vocoder parameters for NH listeners was not able to capture average CI performance in both speech and spectral-ripple tasks. The lack of difference in the use of semantic context between CI users and NH listeners suggests no overall differences in listening strategy between the groups, when the stimuli are similarly degraded.
Affiliation(s)
- Erin R O'Neill
- Department of Psychology, University of Minnesota, Elliott Hall, 75 East River Parkway, Minneapolis, Minnesota 55455, USA
- Morgan N Parke
- Department of Psychology, University of Minnesota, Elliott Hall, 75 East River Parkway, Minneapolis, Minnesota 55455, USA
- Heather A Kreft
- Department of Psychology, University of Minnesota, Elliott Hall, 75 East River Parkway, Minneapolis, Minnesota 55455, USA
- Andrew J Oxenham
- Department of Psychology, University of Minnesota, Elliott Hall, 75 East River Parkway, Minneapolis, Minnesota 55455, USA