1
|
Wohlbauer DM, Dillier N. A Hundred Ways to Encode Sound Signals for Cochlear Implants. Annu Rev Biomed Eng 2025; 27:335-369. [PMID: 40310887 DOI: 10.1146/annurev-bioeng-102623-121249] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/03/2025]
Abstract
Cochlear implants are the most successful neural prostheses used to restore hearing in severe-to-profound hearing-impaired individuals. The field of cochlear implant coding investigates interdisciplinary approaches to translate acoustic signals into electrical pulses transmitted at the electrode-neuron interface, ranging from signal preprocessing algorithms, enhancement, and feature extraction methodologies to electric signal generation. In the last five decades, numerous coding strategies have been proposed clinically and experimentally. Initially developed to restore speech perception, increasing computational possibilities now allow coding of more complex signals, and new techniques to optimize the transmission of electrical signals are constantly gaining attention. This review provides insights into the history of multichannel coding and presents an extensive list of implemented strategies. The article briefly addresses each method and considers promising future directions of neural prostheses and possible signal processing, with the ultimate goal of providing a current big picture of the large field of cochlear implant coding.
Collapse
Affiliation(s)
- Dietmar M Wohlbauer
- Department of Otolaryngology, Head and Neck Surgery, Massachusetts Eye and Ear, Harvard Medical School, Boston, Massachusetts, USA;
| | - Norbert Dillier
- Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Zürich, University of Zürich, Zürich, Switzerland
| |
Collapse
|
2
|
Luo J, Wang R, Xu K, Chao X, Zheng Y, Hu F, Liu X, Vandali AE, Wang H, Xu L. Outcomes Using the Optimized Pitch and Language Strategy Versus the Advanced Combination Encoder Strategy in Mandarin-Speaking Cochlear Implant Recipients. Ear Hear 2025; 46:210-222. [PMID: 39104002 PMCID: PMC11637569 DOI: 10.1097/aud.0000000000001572] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Accepted: 07/02/2024] [Indexed: 08/07/2024]
Abstract
OBJECTIVES The experimental Optimized Pitch and Language (OPAL) strategy enhances coding of fundamental frequency (F0) information in the temporal envelope of electrical signals delivered to channels of a cochlear implant (CI). Previous studies with OPAL have explored performance on speech and lexical tone perception in Mandarin- and English-speaking CI recipients. However, it was not clear which cues to lexical tone (primary and/or secondary) were used by the Mandarin CI listeners. The primary aim of the present study was to investigate whether OPAL provides improved recognition of Mandarin lexical tones in both quiet and noisy environments compared with the Advanced Combination Encoder (ACE) strategy. A secondary aim was to investigate whether, and to what extent, removal of secondary (duration and intensity envelope) cues to lexical tone affected Mandarin tone perception. DESIGN Thirty-two CI recipients with an average age of 24 (range 7 to 57) years were enrolled in the study. All recipients had at least 1 year of experience using ACE. Each subject attended two testing sessions, the first to measure baseline performance, and the second to evaluate the effect of strategy after provision of some take-home experience using OPAL. A minimum take-home duration of approximately 4 weeks was prescribed in which subjects were requested to use OPAL as much as possible but were allowed to also use ACE when needed. The evaluation tests included recognition of Mandarin lexical tones in quiet and in noise (signal to noise ratio [SNR] +5 dB) using naturally produced tones and duration/intensity envelope normalized versions of the tones; Mandarin sentence in adaptive noise; Mandarin monosyllabic and disyllabic word in quiet; a subset of Speech, Spatial, and Qualities of hearing questionnaire (SSQ, speech hearing scale); and subjective preference for strategy in quiet and noise. RESULTS For both the natural and normalized lexical tone tests, mean scores for OPAL were significantly higher than ACE in quiet by 2.7 and 2.9%-points, respectively, and in noise by 7.4 and 7.2%-points, respectively. Monosyllabic word recognition in quiet using OPAL was significantly higher than ACE by approximately 7.5% points. Average SSQ ratings for OPAL were significantly higher than ACE by approximately 0.5 points on a 10-point scale. In quiet conditions, 14 subjects preferred OPAL, 7 expressed a preference for ACE, and 9 reported no preference. Compared with quiet, in noisy situations, there was a stronger preference for OPAL (19 recipients), a similar preference for ACE (7 recipients), while fewer expressed no preference. Average daily take-home use of ACE and OPAL was 4.9 and 7.1 hr, respectively. CONCLUSIONS For Mandarin-speaking CI recipients, OPAL provided significant improvements to lexical tone perception for natural and normalized tones in quiet and noise, monosyllabic word recognition in quiet, and subjective ratings of speech intelligibility. Subjects accessed both primary and secondary cues to lexical tone for perception in quiet and noise conditions. The benefits of lexical tone recognition were attributed to enhanced F0 rate cues encoded by OPAL, especially in a noisy environment. The OPAL strategy was well accepted by many of the Mandarin-speaking CI recipients.
Collapse
Affiliation(s)
- Jianfen Luo
- Department of Otolaryngology-Head and Neck Surgery, Shandong Provincial ENT Hospital, Shandong University, Jinan, Shandong, People’s Republic of China
- These authors are co-first authors
| | - Ruijie Wang
- Department of Otolaryngology-Head and Neck Surgery, Shandong Provincial ENT Hospital, Shandong University, Jinan, Shandong, People’s Republic of China
- These authors are co-first authors
| | - Kaifan Xu
- Department of Otolaryngology-Head and Neck Surgery, Shandong Provincial ENT Hospital, Shandong University, Jinan, Shandong, People’s Republic of China
| | - Xiuhua Chao
- Department of Otolaryngology-Head and Neck Surgery, Shandong Provincial ENT Hospital, Shandong University, Jinan, Shandong, People’s Republic of China
| | - Yi Zheng
- Cochlear Medical Device (Beijing) Co., Ltd, Beijing, China
| | - Fangxia Hu
- Department of Otolaryngology-Head and Neck Surgery, Shandong Provincial ENT Hospital, Shandong University, Jinan, Shandong, People’s Republic of China
| | - Xianqi Liu
- Department of Otolaryngology-Head and Neck Surgery, Shandong Provincial ENT Hospital, Shandong University, Jinan, Shandong, People’s Republic of China
| | | | - Haibo Wang
- Department of Otolaryngology-Head and Neck Surgery, Shandong Provincial ENT Hospital, Shandong University, Jinan, Shandong, People’s Republic of China
- These authors are co-corresponding authors
| | - Lei Xu
- Department of Otolaryngology-Head and Neck Surgery, Shandong Provincial ENT Hospital, Shandong University, Jinan, Shandong, People’s Republic of China
- These authors are co-corresponding authors
| |
Collapse
|
3
|
Tang P, Xu Rattanasone N, Demuth K, Wang L, Yuen I. Mandarin-speaking Children With Cochlear Implants Face Challenges in Using F0 Expansion to Express Contrastive Focus. Ear Hear 2024; 45:1274-1283. [PMID: 38769615 DOI: 10.1097/aud.0000000000001518] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/22/2024]
Abstract
OBJECTIVES Children with cochlear implants (CIs) face challenges in perceiving fundamental frequency (F0) information because CIs do not transmit F0 effectively. In Mandarin, F0 can contrast meanings at the word level, that is, via lexical tones with distinct F0 contours, and signal contrastive relations between words at the utterance-level, that is, via contrastive focus with expanded F0 range and longer duration. Mandarin-speaking children with CIs have been reported to face challenges in producing distinct F0 contours across tones, but early implantation facilitates tonal acquisition. However, it is still unclear if utterance-level prosody, such as contrastive focus, is also challenging for these children, and if early implantation also offers benefits for focus production. Therefore, this study asked how accurately children with CIs can produce contrastive focus, and if early implantation leads to more accurate focus production, with acoustic patterns approaching that of children with typical hearing (TH). DESIGN Participants included 55 Mandarin-speaking children (3 to 7 years) with CIs and 55 age-matched children with TH. Children produced noun phrases with and without contrastive focus, such as RED-COLORED cat versus red-colored cat . Three adult native listeners perceptually scored the productions as correct or incorrect. The "correct" productions were then acoustically analyzed in terms of F0 range and duration. RESULTS Based on the perceptual scores, children with CIs produced focus with significantly lower accuracy (38%) than their TH peers (84%). The acoustic analysis on their "correct" productions showed that children with TH used both F0 and duration to mark focus, producing focal syllables with an expanded F0 range and long duration, and postfocal syllables with a reduced F0 range and short duration. However, children with CIs differed from children with TH in that they produced focal syllables with long duration but not an expanded F0 range, although they produced postfocal syllables with a reduced F0 range and short duration like their TH peers. In addition, early implantation correlated with the percept of more accurate focus productions and better use of F0 range in focal marking. CONCLUSIONS This study finds that Mandarin-speaking children with CIs are still learning to apply appropriate acoustic cues to contrastive focus. The challenge appears to lie in the use of an expanded F0 range to mark focus, probably related to the limited transmission of F0 information through the CI devices. These findings thus have implications for parents and those working with children with CIs, showing that utterance-level prosody also requires speech remediation, and underscores the critical role of identifying problems early in the acquisition of F0 functions in Mandarin, not only at the word level but also at the utterance-level.
Collapse
Affiliation(s)
- Ping Tang
- School of Foreign Studies, Nanjing University of Science and Technology, Nanjing, Jiangsu, China
| | - Nan Xu Rattanasone
- Department of Linguistics, ARC Centre of Excellence in Cognition and its Disorders Macquarie University, North Ryde, NSW, Australia
| | - Katherine Demuth
- Department of Linguistics, ARC Centre of Excellence in Cognition and its Disorders Macquarie University, North Ryde, NSW, Australia
| | - Liyan Wang
- China Rehabilitation Research Centre for Hearing and Speech Impairment, Chaoyang, Beijing, China
| | - Ivan Yuen
- Department of Language Science and Technology, Universität des Saarlandes, Saarbrücken, Germany
| |
Collapse
|
4
|
Yu Q, Li H, Li S, Tang P. Prosodic and Visual Cues Facilitate Irony Comprehension by Mandarin-Speaking Children With Cochlear Implants. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2024; 67:2172-2190. [PMID: 38820233 DOI: 10.1044/2024_jslhr-23-00701] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/02/2024]
Abstract
PURPOSE This study investigated irony comprehension by Mandarin-speaking children with cochlear implants, focusing on how prosodic and visual cues contribute to their comprehension, and whether second-order Theory of Mind is required for using these cues. METHOD We tested 52 Mandarin-speaking children with cochlear implants (aged 3-7 years) and 52 age- and gender-matched children with normal hearing. All children completed a Theory of Mind test and a story comprehension test. Ironic stories were presented in three conditions, each providing different cues: (a) context-only, (b) context and prosody, and (c) context, prosody, and visual cues. Comparisons were conducted on the accuracy of story understanding across the three conditions to examine the role of prosodic and visual cues. RESULTS The results showed that, compared to the context-only condition, the additional prosodic and visual cues both improved the accuracy of irony comprehension for children with cochlear implants, similar to their normal-hearing peers. Furthermore, such improvements were observed for all children, regardless of whether they passed the second-order Theory of Mind test or not. CONCLUSIONS This study is the first to demonstrate the benefits of prosodic and visual cues in irony comprehension, without reliance on second-order Theory of Mind, for Mandarin-speaking children with cochlear implants. It implies potential insights for utilizing prosodic and visual cues in intervention strategies to promote irony comprehension.
Collapse
Affiliation(s)
- Qianxi Yu
- School of Foreign Studies, Nanjing University of Science and Technology, China
| | - Honglan Li
- School of Foreign Studies, Nanjing University of Science and Technology, China
| | - Shanpeng Li
- School of Foreign Studies, Nanjing University of Science and Technology, China
| | - Ping Tang
- School of Foreign Studies, Nanjing University of Science and Technology, China
| |
Collapse
|
5
|
Camarena A, Goldsworthy RL. Characterizing the relationship between modulation sensitivity and pitch resolution in cochlear implant users. Hear Res 2024; 448:109026. [PMID: 38776706 PMCID: PMC11572715 DOI: 10.1016/j.heares.2024.109026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Revised: 03/28/2024] [Accepted: 04/30/2024] [Indexed: 05/25/2024]
Abstract
Cochlear implants are medical devices that have restored hearing to approximately one million people around the world. Outcomes are impressive and most recipients attain excellent speech comprehension in quiet without relying on lip-reading cues, but pitch resolution is poor compared to normal hearing. Amplitude modulation of electrical stimulation is a primary cue for pitch perception in cochlear implant users. The experiments described in this article focus on the relationship between sensitivity to amplitude modulations and pitch resolution based on changes in the frequency of amplitude modulations. In the first experiment, modulation sensitivity and pitch resolution were measured in adults with no known hearing loss and in cochlear implant users with sounds presented to and processed by their clinical devices. Stimuli were amplitude-modulated sinusoids and amplitude-modulated narrow-band noises. Modulation detection and modulation frequency discrimination were measured for modulation frequencies centered on 110, 220, and 440 Hz. Pitch resolution based on changes in modulation frequency was measured for modulation depths of 25 %, 50 %, 100 %, and for a half-waved rectified modulator. Results revealed a strong linear relationship between modulation sensitivity and pitch resolution for cochlear implant users and peers with no known hearing loss. In the second experiment, cochlear implant users took part in analogous procedures of modulation sensitivity and pitch resolution but bypassing clinical sound processing using single-electrode stimulation. Results indicated that modulation sensitivity and pitch resolution was better conveyed by single-electrode stimulation than by clinical processors. Results at 440 Hz were worse, but also not well conveyed by clinical sound processing, so it remains unclear whether the 300 Hz perceptual limit described in the literature is a technological or biological limitation. These results highlight modulation depth and sensitivity as critical factors for pitch resolution in cochlear implant users and characterize the relationship that should inform the design of modulation enhancement algorithms for cochlear implants.
Collapse
Affiliation(s)
- Andres Camarena
- Auditory Research Center, Caruso Department of Otolaryngology, Keck School of Medicine, University of Southern California, Los Angeles, CA, United States of America
| | - Raymond L Goldsworthy
- Auditory Research Center, Caruso Department of Otolaryngology, Keck School of Medicine, University of Southern California, Los Angeles, CA, United States of America.
| |
Collapse
|
6
|
Kovačić D, James CJ. Stimulation Rate and Voice Pitch Perception in Cochlear Implants. J Assoc Res Otolaryngol 2022; 23:665-680. [PMID: 35918501 PMCID: PMC9613839 DOI: 10.1007/s10162-022-00854-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2020] [Accepted: 06/05/2022] [Indexed: 06/15/2023] Open
Abstract
The stimulation rate in cochlear implant (CI) sound coding, or the "carrier" rate in pulses per second (pps), is known to influence pitch perception, as well as loudness perception and sound quality. Our main objective was to investigate the effects of reduced carrier rate on the loudness and pitch of coded speech samples. We describe two experiments with 16 Nucleus® CI users, where we controlled modulation characteristics and carrier rate using Spectral and Temporal Enhanced Processing (STEP), a novel experimental multichannel sound coder. We used a fixed set of threshold and comfortable stimulation levels for each subject, obtained from clinical MAPs. In the first experiment, we determined equivalence for voice pitch ranking and voice gender categorization between the Advanced Combination Encoder (ACE), a widely used clinical strategy in Nucleus® recipients, and STEP for fundamental frequencies (F0) 120-250 Hz. In the second experiment, loudness was determined as a function of the input amplitude of speech samples for carrier rates of 1000, 500, and 250 pps per channel. Then, using equally loud sound coder programs, we evaluated the effect of carrier rate on voice pitch perception. Although nearly all subjects could categorize voice gender significantly above chance, pitch ranking varied across subjects. Overall, carrier rate did not substantially affect voice pitch ranking or voice gender categorization: as long as the carrier rate was at least twice the fundamental frequency, or when stimulation pulses for the lowest, 250 pps carrier were aligned to F0 peaks. These results indicate that carrier rates as low as 250 pps per channel are sufficient to support functional voice pitch perception for those CI users sensitive to temporal pitch cues; at least when temporal modulations and pulse timings in the coder output are well controlled by novel strategies such as STEP.
Collapse
Affiliation(s)
- Damir Kovačić
- Department of Physics, Faculty of Science, University of Split, Ruđera Boškovića 33, 21000 Split, Croatia
| | - Chris J. James
- Cochlear France SAS, 135 Route de Saint Simon, 31100 Toulouse, France
| |
Collapse
|
7
|
Zhou H, Kan A, Yu G, Guo Z, Zheng N, Meng Q. Pitch perception with the temporal limits encoder for cochlear implants. IEEE Trans Neural Syst Rehabil Eng 2022; 30:2528-2539. [PMID: 36044501 DOI: 10.1109/tnsre.2022.3203079] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
The temporal-limits-encoder (TLE) strategy has been proposed to enhance the representation of temporal fine structure (TFS) in cochlear implants (CIs), which is vital for many aspects of sound perception but is typically discarded by most modern CI strategies. TLE works by computing an envelope modulator that is within the temporal pitch limits of CI electric hearing. This paper examines the TFS information encoded by TLE and evaluates the salience and usefulness of this information in CI users. Two experiments were conducted to compare pitch perception performance of TLE versus the widely-used Advanced Combinational Encoder (ACE) strategy. Experiment 1 investigated whether TLE processing improved pitch discrimination compared to ACE. Experiment 2 parametrically examined the effect of changing the lower frequency limit of the TLE modulator on pitch ranking. In both experiments, F0 difference limens were measured with synthetic harmonic complex tones using an adaptive procedure. Signal analysis of the outputs of TLE and ACE strategies showed that TLE introduces important temporal pitch cues that are not available with ACE. Results showed an improvement in pitch discrimination with TLE when the acoustic input had a lower F0 frequency. No significant effect of lower frequency limit was observed for pitch ranking, though a lower limit did tend to provide better outcomes. These results suggest that the envelope modulation introduced by TLE can improve pitch perception for CI listeners.
Collapse
|
8
|
Goldsworthy RL, Bissmeyer SRS, Camarena A. Advantages of Pulse Rate Compared to Modulation Frequency for Temporal Pitch Perception in Cochlear Implant Users. J Assoc Res Otolaryngol 2022; 23:137-150. [PMID: 34981263 PMCID: PMC8782986 DOI: 10.1007/s10162-021-00828-w] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2021] [Accepted: 12/01/2021] [Indexed: 02/03/2023] Open
Abstract
Most cochlear implants encode the fundamental frequency of periodic sounds by amplitude modulation of constant-rate pulsatile stimulation. Pitch perception provided by such stimulation strategies is markedly poor. Two experiments are reported here that consider potential advantages of pulse rate compared to modulation frequency for providing stimulation timing cues for pitch. The first experiment examines beat frequency distortion that occurs when modulating constant-rate pulsatile stimulation. This distortion has been reported on previously, but the results presented here indicate that distortion occurs for higher stimulation rates than previously reported. The second experiment examines pitch resolution as provided by pulse rate compared to modulation frequency. The results indicate that pitch discrimination is better with pulse rate than with modulation frequency. The advantage was large for rates near what has been suggested as the upper limit of temporal pitch perception conveyed by cochlear implants. The results are relevant to sound processing design for cochlear implants particularly for algorithms that encode fundamental frequency into deep envelope modulations or into precisely timed pulsatile stimulation.
Collapse
Affiliation(s)
- Raymond L Goldsworthy
- Auditory Research Center, Caruso Department of Otolaryngology, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA.
| | - Susan R S Bissmeyer
- Auditory Research Center, Caruso Department of Otolaryngology, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
- Department of Biomedical Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, CA, USA
| | - Andres Camarena
- Auditory Research Center, Caruso Department of Otolaryngology, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
- Neuroscience Graduate Program, University of Southern California, Los Angeles, CA, USA
| |
Collapse
|
9
|
Goldsworthy RL, Camarena A, Bissmeyer SRS. Pitch perception is more robust to interference and better resolved when provided by pulse rate than by modulation frequency of cochlear implant stimulation. Hear Res 2021; 409:108319. [PMID: 34340020 PMCID: PMC9343238 DOI: 10.1016/j.heares.2021.108319] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/17/2021] [Revised: 07/15/2021] [Accepted: 07/21/2021] [Indexed: 01/14/2023]
Abstract
Cochlear implants are medical devices that have been used to restore hearing to more than half a million people worldwide. Most recipients achieve high levels of speech comprehension through these devices, but speech comprehension in background noise and music appreciation in general are markedly poor compared to normal hearing. A key aspect of hearing that is notably diminished in cochlear implant outcomes is the sense of pitch provided by these devices. Pitch perception is an important factor affecting speech comprehension in background noise and is critical for music perception. The present article summarizes two experiments that examine the robustness and resolution of pitch perception as provided by cochlear implant stimulation timing. The driving hypothesis is that pitch conveyed by stimulation timing cues is more robust and better resolved when provided by variable pulse rates than by modulation frequency of constant-rate stimulation. Experiment 1 examines the robustness for hearing a large, one-octave, pitch difference in the presence of interfering electrical stimulation. With robustness to interference characterized for an otherwise easily discernible pitch difference, Experiment 2 examines the resolution of discrimination thresholds in the presence of interference as conveyed by modulation frequency or by pulse rate. These experiments test for an advantage of stimulation with precise temporal cues. The results indicate that pitch provided by pulse rate is both more robust to interference and is better resolved compared to when provided by modulation frequency. These results should inform the development of new sound processing strategies for cochlear implants designed to encode fundamental frequency of sounds into precise temporal stimulation.
Collapse
Affiliation(s)
- Raymond L Goldsworthy
- Auditory Research Center, Caruso Department of Otolaryngology, Keck School of Medicine, University of Southern California, Los Angeles, CA, United States.
| | - Andres Camarena
- Auditory Research Center, Caruso Department of Otolaryngology, Keck School of Medicine, University of Southern California, Los Angeles, CA, United States; Neuroscience Graduate Program, University of Southern California, Los Angeles, CA, United States
| | - Susan R S Bissmeyer
- Auditory Research Center, Caruso Department of Otolaryngology, Keck School of Medicine, University of Southern California, Los Angeles, CA, United States; Department of Biomedical Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, CA, United States
| |
Collapse
|
10
|
Kang H, Macherey O, Roman S, Pressnitzer D. Auditory memory for random time patterns in cochlear implant listeners. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021; 150:1934. [PMID: 34598651 DOI: 10.1121/10.0005728] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/25/2020] [Accepted: 07/01/2021] [Indexed: 06/13/2023]
Abstract
Learning about new sounds is essential for cochlear-implant and normal-hearing listeners alike, with the additional challenge for implant listeners that spectral resolution is severely degraded. Here, a task measuring the rapid learning of slow or fast stochastic temporal sequences [Kang, Agus, and Pressnitzer (2017). J. Acoust. Soc. Am. 142, 2219-2232] was performed by cochlear-implant (N = 10) and normal-hearing (N = 9) listeners, using electric or acoustic pulse sequences, respectively. Rapid perceptual learning was observed for both groups, with highly similar characteristics. Moreover, for cochlear-implant listeners, an additional condition tested ultra-fast electric pulse sequences that would be impossible to represent temporally when presented acoustically. This condition also demonstrated learning. Overall, the results suggest that cochlear-implant listeners have access to the neural plasticity mechanisms needed for the rapid perceptual learning of complex temporal sequences.
Collapse
Affiliation(s)
- HiJee Kang
- Laboratoire des Systèmes Perceptifs, Département d'études Cognitives, École Normale Supérieure, PSL University, CNRS, 29 Rue d'Ulm, 75005 Paris, France
| | - Olivier Macherey
- Aix-Marseille University, CNRS, Centrale Marseille, LMA, 4 impasse Nikola Tesla, CS40006, 13453 Marseille, Cedex 13, France
| | - Stéphane Roman
- Department of Pediatric Otolaryngology and Neck Surgery, Aix-Marseille University, 264 Rue Saint Pierre, 13005 Marseille, France
| | - Daniel Pressnitzer
- Laboratoire des Systèmes Perceptifs, Département d'études Cognitives, École Normale Supérieure, PSL University, CNRS, 29 Rue d'Ulm, 75005 Paris, France
| |
Collapse
|
11
|
Longer Cochlear Implant Experience Leads to Better Production of Mandarin Tones for Early Implanted Children. Ear Hear 2021; 42:1405-1411. [PMID: 33974784 DOI: 10.1097/aud.0000000000001036] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
OBJECTIVES Children with cochlear implants (CIs) face challenges in acquiring tones, since CIs do not transmit pitch information effectively. It has been suggested that longer CI experience provides additional benefits for children implanted early, enabling them to achieve language abilities similar to that of normal-hearing (NH) children (Colletti 2009). Mandarin is a tonal language with four lexical tones and a neutral tone (T0), characterized by distinct pitch and durational patterns. It has been suggested that early implantation (i.e., before 2 years) greatly benefits the acquisition of Mandarin tones by children with CIs (Tang et al. 2019c). In this study, we extend those findings to investigate the effect of CI experience on the acquisition of Mandarin tones for children implanted early. We asked the extent to which they were able to produce distinct pitch and durational patterns of both lexical tones and T0 as a function of CI experience, and the extent to which their tonal productions were acoustically like that of NH children. DESIGN Forty-four NH 3-year olds and 28 children implanted with CIs between 1 and 2 years, aged 3 to 7, were recruited. The children with CIs were grouped according to the length of CI experience: 3 to 6 years, 2 to 3 years, and 1 to 2 years. Lexical tone and T0 productions were elicited using a picture-naming task. Tonal productions from the children with CIs were acoustically analyzed and compared with those from the NH children. RESULTS Children with 3 to 6 years of CI experience were able to produce distinct pitch and durational patterns for both lexical tones and T0, with NH-like acoustic realizations. Children with 2 to 3 years of CI experience were also able to produce the expected tonal patterns, although their productions were not yet NH-like. Those with only 1 to 2 years of CI experience, however, were not yet able to produce the distinct acoustic patterns for either lexical tones or T0. CONCLUSIONS These results provide acoustic evidence demonstrating that, when Mandarin-speaking children are implanted before the age of 2, only those with 3 to 6 years of experience were able to produce NH-like tones, including both lexical tone and T0. Children with shorter CI experience (less than 3 years) were unable to produce distinct acoustic patterns for the different tones. This suggests that at least 3 years of CI experience is still needed for early implanted children to acquire tonal distinctions similar to those of NH 3-year olds.
Collapse
|
12
|
Bissmeyer SRS, Hossain S, Goldsworthy RL. Perceptual learning of pitch provided by cochlear implant stimulation rate. PLoS One 2020; 15:e0242842. [PMID: 33270735 PMCID: PMC7714175 DOI: 10.1371/journal.pone.0242842] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2019] [Accepted: 11/10/2020] [Indexed: 11/19/2022] Open
Abstract
Cochlear implant users hear pitch evoked by stimulation rate, but discrimination diminishes for rates above 300 Hz. This upper limit on rate pitch is surprising given the remarkable and specialized ability of the auditory nerve to respond synchronously to stimulation rates at least as high as 3 kHz and arguably as high as 10 kHz. Sensitivity to stimulation rate as a pitch cue varies widely across cochlear implant users and can be improved with training. The present study examines individual differences and perceptual learning of stimulation rate as a cue for pitch ranking. Adult cochlear implant users participated in electrode psychophysics that involved testing once per week for three weeks. Stimulation pulse rate discrimination was measured in bipolar and monopolar configurations for apical and basal electrodes. Base stimulation rates between 100 and 800 Hz were examined. Individual differences were quantified using psychophysically derived metrics of spatial tuning and temporal integration. This study examined distribution of measures across subjects, predictive power of psychophysically derived metrics of spatial tuning and temporal integration, and the effect of training on rate discrimination thresholds. Psychophysical metrics of spatial tuning and temporal integration were not predictive of stimulation rate discrimination, but discrimination thresholds improved at lower frequencies with training. Since most clinical devices do not use variable stimulation rates, it is unknown to what extent recipients may learn to use stimulation rate cues if provided in a clear and consistent manner.
Collapse
Affiliation(s)
- Susan R. S. Bissmeyer
- Department of Biomedical Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, California, United States of America
- Auditory Research Center, Caruso Department of Otolaryngology, Keck School of Medicine, University of Southern California, Los Angeles, California, United States of America
| | - Shaikat Hossain
- Auditory Research Center, Caruso Department of Otolaryngology, Keck School of Medicine, University of Southern California, Los Angeles, California, United States of America
| | - Raymond L. Goldsworthy
- Department of Biomedical Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, California, United States of America
- Auditory Research Center, Caruso Department of Otolaryngology, Keck School of Medicine, University of Southern California, Los Angeles, California, United States of America
| |
Collapse
|
13
|
Lamping W, Goehring T, Marozeau J, Carlyon RP. The effect of a coding strategy that removes temporally masked pulses on speech perception by cochlear implant users. Hear Res 2020; 391:107969. [PMID: 32320925 PMCID: PMC7116331 DOI: 10.1016/j.heares.2020.107969] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/16/2019] [Revised: 03/26/2020] [Accepted: 04/05/2020] [Indexed: 01/11/2023]
Abstract
Speech recognition in noisy environments remains a challenge for cochlear implant (CI) recipients. Unwanted charge interactions between current pulses, both within and between electrode channels, are likely to impair performance. Here we investigate the effect of reducing the number of current pulses on speech perception. This was achieved by implementing a psychoacoustic temporal-masking model where current pulses in each channel were passed through a temporal integrator to identify and remove pulses that were less likely to be perceived by the recipient. The decision criterion of the temporal integrator was varied to control the percentage of pulses removed in each condition. In experiment 1, speech in quiet was processed with a standard Continuous Interleaved Sampling (CIS) strategy and with 25, 50 and 75% of pulses removed. In experiment 2, performance was measured for speech in noise with the CIS reference and with 50 and 75% of pulses removed. Speech intelligibility in quiet revealed no significant difference between reference and test conditions. For speech in noise, results showed a significant improvement of 2.4 dB when removing 50% of pulses and performance was not significantly different between the reference and when 75% of pulses were removed. Further, by reducing the overall amount of current pulses by 25, 50, and 75% but accounting for the increase in charge necessary to compensate for the decrease in loudness, estimated average power savings of 21.15, 40.95, and 63.45%, respectively, could be possible for this set of listeners. In conclusion, removing temporally masked pulses may improve speech perception in noise and result in substantial power savings.
Collapse
Affiliation(s)
- Wiebke Lamping
- Hearing Systems Section, Department of Health Technology, Technical University of Denmark, DK-2800, Kgs. Lyngby, Denmark; Medical Research Council Cognition and Brain Sciences Unit, University of Cambridge, 15 Chaucer Road, Cambridge, CB2 7EF, United Kingdom.
| | - Tobias Goehring
- Medical Research Council Cognition and Brain Sciences Unit, University of Cambridge, 15 Chaucer Road, Cambridge, CB2 7EF, United Kingdom
| | - Jeremy Marozeau
- Hearing Systems Section, Department of Health Technology, Technical University of Denmark, DK-2800, Kgs. Lyngby, Denmark
| | - Robert P Carlyon
- Medical Research Council Cognition and Brain Sciences Unit, University of Cambridge, 15 Chaucer Road, Cambridge, CB2 7EF, United Kingdom
| |
Collapse
|
14
|
Lindenbeck MJ, Laback B, Majdak P, Srinivasan S. Temporal-pitch sensitivity in electric hearing with amplitude modulation and inserted pulses with short inter-pulse intervals. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020; 147:777. [PMID: 32113255 PMCID: PMC7002171 DOI: 10.1121/10.0000610] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/21/2018] [Revised: 12/26/2019] [Accepted: 01/02/2020] [Indexed: 06/10/2023]
Abstract
Listeners with cochlear implants (CIs) typically show poor sensitivity to the temporal-envelope pitch of high-rate pulse trains. Sensitivity to interaural time differences improves when adding pulses with short inter-pulse intervals (SIPIs) to high-rate pulse trains. In the current study, monaural temporal-pitch sensitivity with SIPI pulses was investigated for six CI listeners. Amplitude-modulated single-electrode stimuli, representing the coding of the fundamental frequency (F0) in the envelope of a high-rate carrier, were used. Two SIPI-insertion approaches, five modulation depths, two typical speech-F0s, and two carrier rates were tested. SIPI pulses were inserted either in every amplitude-modulation period (full-rate SIPI) to support the F0 cue or in every other amplitude-modulation period (half-rate SIPI) to circumvent a potential rate limitation at higher F0s. The results demonstrate that full-rate SIPI pulses improve temporal-pitch sensitivity across F0s and particularly at low modulation depths where envelope-pitch cues are weak. The half-rate SIPI pulses did not circumvent the limitation and further increased variability across listeners. Further, no effect of the carrier rate was found. Thus, the SIPI approach appears to be a promising approach to enhance CI listeners' access to temporal-envelope pitch cues at pulse rates used clinically.
Collapse
Affiliation(s)
- Martin J Lindenbeck
- Acoustics Research Institute, Austrian Academy of Sciences, Wohllebengasse 12-14, 1040 Vienna, Austria
| | - Bernhard Laback
- Acoustics Research Institute, Austrian Academy of Sciences, Wohllebengasse 12-14, 1040 Vienna, Austria
| | - Piotr Majdak
- Acoustics Research Institute, Austrian Academy of Sciences, Wohllebengasse 12-14, 1040 Vienna, Austria
| | - Sridhar Srinivasan
- Acoustics Research Institute, Austrian Academy of Sciences, Wohllebengasse 12-14, 1040 Vienna, Austria
| |
Collapse
|
15
|
Swanson BA, Marimuthu VMR, Mannell RH. Place and Temporal Cues in Cochlear Implant Pitch and Melody Perception. Front Neurosci 2019; 13:1266. [PMID: 31849583 PMCID: PMC6888014 DOI: 10.3389/fnins.2019.01266] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2019] [Accepted: 11/07/2019] [Indexed: 11/15/2022] Open
Abstract
The present study compared pitch and melody perception using cochlear place of excitation and temporal cues in six adult nucleus cochlear implant (CI) recipients. The stimuli were synthesized tones presented through a loudspeaker, and recipients used the Advanced Combinational Encoder (ACE) sound coding strategy on their own sound processors. Three types of tones were used, denoted H3, H4, and P5. H3 tones were harmonic tones with fundamental frequencies in the range C3-C4 (131-262 Hz), providing temporal pitch cues alone. H4 tones were harmonic tones with fundamental frequencies in the range C4-C5 (262-523 Hz), providing a mixture of temporal and place cues. P5 tones were pure tones with fundamental frequencies in the range C5-C6 (523-1046 Hz), providing place pitch cues alone. Four experimental procedures were used: pitch discrimination, pitch ranking, backward modified melodies, and warped modified melodies. In each trial of the modified melodies tests, subjects heard a familiar melody and a version with modified pitch (in randomized order), and had to select the unmodified melody. In all four procedures, many scores were much lower than would be expected for normal hearing listeners, implying that the strength of the perceived pitch was weak. Discrimination and ranking with H3 and P5 tones was poor for two-semitone intervals, but near perfect for intervals of five semitones and larger. H4 tones provided the lowest group mean scores in all four procedures, with some pitch reversals observed in pitch ranking. Group mean scores for P5 tones (place cues alone) were at least as high as those for H3 tones (temporal cues alone). The relatively good scores on the melody tasks with P5 tones were surprising, given the lack of temporal cues, raising the possibility of musical pitch using place cues alone. However, the alternative possibility that the CI recipients perceived the place cues as brightness, rather than musical pitch per se, cannot be excluded. These findings show that pitch perception models need to incorporate neural place representations alongside temporal cues if they are to predict pitch and melody perception in the absence of temporal cues.
Collapse
Affiliation(s)
| | - Vijay M. R. Marimuthu
- Department of Linguistics, Faculty of Human Sciences, Macquarie University, Sydney, NSW, Australia
| | - Robert H. Mannell
- Department of Linguistics, Faculty of Human Sciences, Macquarie University, Sydney, NSW, Australia
| |
Collapse
|
16
|
Tang P, Yuen I, Xu Rattanasone N, Gao L, Demuth K. The Acquisition of Mandarin Tonal Processes by Children With Cochlear Implants. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2019; 62:1309-1325. [PMID: 31063698 DOI: 10.1044/2018_jslhr-s-18-0304] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Purpose Children with cochlear implants (CIs) face challenges in acquiring tonal languages, as CIs do not efficiently code pitch information. Mandarin is a tonal language with lexical tones and tonal processes such as neutral tone and tone sandhi, exhibiting contextually conditioned tonal realizations. Previous studies suggest that early implantation and long CI experience facilitate the acquisition of lexical tones by children with CIs. However, there is lack of acoustic evidence on children's tonal productions demonstrating that this is the case, and it is unclear whether and how children with CIs are able to acquire contextual tones. This study therefore examined the acoustic realization of both lexical tones and contextual tones as produced by children fitted with CIs, exploring the potential effects of age at implantation and length of CI experience on their acquisition of the Mandarin tonal system. Method Seventy-two Mandarin-learning preschoolers with CIs, varying in age at implantation (13-42 months) and length of CI experience (2-49 months), and 44 normal hearing 3-year-old controls were recruited. Tonal productions were elicited from both groups using picture-naming tasks and acoustically compared. Results Only the early implanted group (i.e., implanted before the age of 2 years) produced normal-like lexical tones and generally had contextual tones approximating those of the normal-hearing children. The other children, including those with longer CI experience, did not have typical tonal productions; their pitch patterns for lexical tones tended to be flatter, and contextual tone productions were unchanged across tonal contexts. Conclusion Children with CIs face challenges in acquiring Mandarin tones, but early implantation may help them to develop normal-like lexical tone categories, which further facilitates their implementation of contextual tones. Supplemental Material https://doi.org/10.23641/asha.8038889.
Collapse
Affiliation(s)
- Ping Tang
- School of Foreign Studies, Nanjing University of Science and Technology, Jiangsu, China
- Department of Linguistics, ARC Centre of Excellence in Cognition and Its Disorders, Macquarie University, Sydney, New South Wales, Australia
- The HEARing Cooperative Research Centre, Melbourne, Victoria, Australia
| | - Ivan Yuen
- Department of Linguistics, ARC Centre of Excellence in Cognition and Its Disorders, Macquarie University, Sydney, New South Wales, Australia
| | - Nan Xu Rattanasone
- Department of Linguistics, ARC Centre of Excellence in Cognition and Its Disorders, Macquarie University, Sydney, New South Wales, Australia
- The HEARing Cooperative Research Centre, Melbourne, Victoria, Australia
| | - Liqun Gao
- School of Communication Science, Beijing Language and Culture University, China
| | - Katherine Demuth
- Department of Linguistics, ARC Centre of Excellence in Cognition and Its Disorders, Macquarie University, Sydney, New South Wales, Australia
- The HEARing Cooperative Research Centre, Melbourne, Victoria, Australia
| |
Collapse
|
17
|
Goldsworthy RL. Temporal envelope cues and simulations of cochlear implant signal processing. SPEECH COMMUNICATION 2019; 109:24-33. [PMID: 39104946 PMCID: PMC11299890 DOI: 10.1016/j.specom.2019.03.003] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/07/2024]
Abstract
Conventional signal processing implemented on clinical cochlear implant (CI) sound processors is based on envelope signals extracted from overlapping frequency regions. Conventional strategies do not encode temporal envelope or temporal fine-structure cues with high fidelity. In contrast, several research strategies have been developed recently to enhance the encoding of temporal envelope and fine-structure cues. The present study examines the salience of temporal envelope cues when encoded into vocoder representations of CI signal processing. Normal-hearing listeners were evaluated on measures of speech reception, speech quality ratings, and spatial hearing when listening to vocoder representations of CI signal processing. Conventional vocoder techniques using envelope signals with noise- or tone-excited reconstruction were evaluated in comparison to a novel approach based on impulse-response reconstruction. A variation of this impulse-response approach was based on a research strategy, the Fundamentally Asynchronous Stimulus Timing (FAST) algorithm, designed to improve temporal precision of envelope cues. The results indicate that the introduced impulse-response approach, combined with the FAST algorithm, produces similar results on speech reception measures as the conventional vocoder approaches, while providing significantly better sound quality and spatial hearing outcomes. This novel approach for stimulating how temporal envelope cues are encoded into CI stimulation has potential for examining diverse aspects of hearing, particularly in aspects of musical pitch perception and spatial hearing.
Collapse
|
18
|
Evaluation of the Optimized Pitch and Language Strategy in Cochlear Implant Recipients. Ear Hear 2019; 40:555-567. [DOI: 10.1097/aud.0000000000000627] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
19
|
Abstract
OBJECTIVES Vocoders offer an effective platform to simulate the effects of cochlear implant speech processing strategies in normal-hearing listeners. Several behavioral studies have examined the effects of varying spectral and temporal cues on vocoded speech perception; however, little is known about the neural indices of vocoded speech perception. Here, the scalp-recorded frequency following response (FFR) was used to study the effects of varying spectral and temporal cues on brainstem neural representation of specific acoustic cues, the temporal envelope periodicity related to fundamental frequency (F0) and temporal fine structure (TFS) related to formant and formant-related frequencies, as reflected in the phase-locked neural activity in response to vocoded speech. DESIGN In experiment 1, FFRs were measured in 12 normal-hearing, adult listeners in response to a steady state English back vowel /u/ presented in an unaltered, unprocessed condition and six sine-vocoder conditions with varying numbers of channels (1, 2, 4, 8, 16, and 32), while the temporal envelope cutoff frequency was fixed at 500 Hz. In experiment 2, FFRs were obtained from 14 normal-hearing, adult listeners in response to the same English vowel /u/, presented in an unprocessed condition and four vocoded conditions where both the temporal envelope cutoff frequency (50 versus 500 Hz) and carrier type (sine wave versus noise band) were varied separately with the number of channels fixed at 8. Fast Fourier Transform was applied to the time waveforms of FFR to analyze the strength of brainstem neural representation of temporal envelope periodicity (F0) and TFS-related peaks (formant structure). RESULTS Brainstem neural representation of both temporal envelope and TFS cues improved when the number of channels increased from 1 to 4, followed by a plateau with 8 and 16 channels, and a reduction in phase-locking strength with 32 channels. For the sine vocoders, peaks in the FFRTFS spectra corresponded with the low-frequency sine-wave carriers and side band frequencies in the stimulus spectra. When the temporal envelope cutoff frequency increased from 50 to 500 Hz, an improvement was observed in brainstem F0 representation with no change in brainstem representation of spectral peaks proximal to the first formant frequency (F1). There was no significant effect of carrier type (sine- versus noise-vocoder) on brainstem neural representation of F0 cues when the temporal envelope cutoff frequency was 500 Hz. CONCLUSIONS While the improvement in neural representation of temporal envelope and TFS cues with up to 4 vocoder channels is consistent with the behavioral literature, the reduced neural phase-locking strength noted with even more channels may be because of the narrow bandwidth of each channel as the number of channels increases. Stronger neural representation of temporal envelope cues with higher temporal envelope cutoff frequencies is likely a reflection of brainstem neural phase-locking to F0-related periodicity fluctuations preserved in the 500-Hz temporal envelopes, which are unavailable in the 50-Hz temporal envelopes. No effect of temporal envelope cutoff frequency was seen for neural representation of TFS cues, suggesting that spectral side band frequencies created by the 500-Hz temporal envelopes did not improve neural representation of F1 cues over the 50-Hz temporal envelopes. Finally, brainstem F0 representation was not significantly affected by carrier type with a temporal envelope cutoff frequency of 500 Hz, which is inconsistent with previous results of behavioral studies examining pitch perception of vocoded stimuli.
Collapse
|
20
|
Liu H, Peng X, Zhao Y, Ni X. The effectiveness of sound-processing strategies on tonal language cochlear implant users: A systematic review. Pediatr Investig 2017; 1:32-39. [PMID: 32851216 PMCID: PMC7331426 DOI: 10.1002/ped4.12011] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2017] [Accepted: 10/12/2017] [Indexed: 11/23/2022] Open
Abstract
IMPORTANCE Contemporary cochlear implants (CIs) are well established as a technology for people with severe-to-profound sensorineural hearing loss, with their effectiveness having been widely reported. However, for tonal language CI recipients, speech perception remains a challenge: Conventional signal processing strategies have been demonstrated to possibly provide insufficient information to encode tonal cues, and CI recipients have exhibited considerable deficits in tone perception. Thus, some tonal language-oriented sound-processing strategies have been introduced. The effects of available tonal language-oriented strategies on tone perception are reviewed and evaluated in this study. The results may aid in designing and improving tonal language-appropriate sound-processing strategies for CI recipients. OBJECTIVE The objective of this systematic review was to investigate the effects of tonal-language-oriented signal processing strategies on tone perception, music perception, word and sentence recognition. METHODS To evaluate the effects of tonal language-oriented strategies on tone perception, we conducted a systematic review. We searched for relevant reports dated from January 1979 to July 2017 using PubMed, Cochrane Library, EBSCO, Web of Science, EMBASE, and 4 Chinese periodical databases (CBMdisc, CNKI, VIP, and Wanfang Data). RESULTS According to our search strategy, 672 potentially eligible studies were retrieved from the databases, with 12 of these studies included in the final review after a 4-stage selection process. The majority of sound-processing strategies designed for tonal language were HiResolution® with Fidelity 120 (HiRes 120), fine structure processing, temporal fine structure (TFS), and C-tone. Generally, acute or short-term comparisons between the tonal language-oriented strategies and the conventional strategy did not reveal statistically significant differences in speech perception (or show a small improvement). However, a tendency toward improved tone perception and subjectively reported overall preferred sound quality was observed with the tonal language-oriented strategies. INTERPRETATION Conventional signal processing strategies typically provided very limited F0 information via temporal envelopes delivered to the stimulating electrodes. In contrast, tonal language-oriented coding strategies attempted to present more spectral information and TFS cues required for tone perception. Thus, a tendency of improved performance in tonal language perception in CI users was shown.
Collapse
Affiliation(s)
- Haihong Liu
- Beijing Key Laboratory for Pediatric Diseases of Otorhinolaryngology, Head and Neck SurgeryMinistry of Education (MOE) Key Laboratory of Major Diseases in ChildrenBeijing Pediatric Research InstituteBeijing Children's HospitalCapital Medical UniversityNational Center for Children's HealthBeijingChina
- Department of Otorhinolaryngology, Head and Neck SurgeryBeijing Children's HospitalCapital Medical UniversityNational Center for Children's HealthBeijingChina
| | - Xiaoxia Peng
- Center for Clinical Epidemiology and Evidence‐Based MedicineBeijing Children's HospitalCapital Medical UniversityNational Center for Children's HealthBeijingChina
| | - Yawen Zhao
- Beijing Key Laboratory for Pediatric Diseases of Otorhinolaryngology, Head and Neck SurgeryMinistry of Education (MOE) Key Laboratory of Major Diseases in ChildrenBeijing Pediatric Research InstituteBeijing Children's HospitalCapital Medical UniversityNational Center for Children's HealthBeijingChina
| | - Xin Ni
- Beijing Key Laboratory for Pediatric Diseases of Otorhinolaryngology, Head and Neck SurgeryMinistry of Education (MOE) Key Laboratory of Major Diseases in ChildrenBeijing Pediatric Research InstituteBeijing Children's HospitalCapital Medical UniversityNational Center for Children's HealthBeijingChina
- Department of Otorhinolaryngology, Head and Neck SurgeryBeijing Children's HospitalCapital Medical UniversityNational Center for Children's HealthBeijingChina
| |
Collapse
|
21
|
Ping L, Wang N, Tang G, Lu T, Yin L, Tu W, Fu QJ. Implementation and preliminary evaluation of ‘C-tone’: A novel algorithm to improve lexical tone recognition in Mandarin-speaking cochlear implant users. Cochlear Implants Int 2017. [PMID: 28629258 DOI: 10.1080/14670100.2017.1339492] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Affiliation(s)
| | - Ningyuan Wang
- Zhejiang Nurotron Biotechnology Co., Ltd, Zhejiang, PR China
| | - Guofang Tang
- Zhejiang Nurotron Biotechnology Co., Ltd, Zhejiang, PR China
| | - Thomas Lu
- Nurotron Biotechnology, Inc., Irvine, CA, USA
| | - Li Yin
- Zhejiang Nurotron Biotechnology Co., Ltd, Zhejiang, PR China
| | - Wenhe Tu
- Zhejiang Nurotron Biotechnology Co., Ltd, Zhejiang, PR China
| | - Qian-Jie Fu
- Department of Head and Neck Surgery, David Geffen School of Medicine, UCLA, Los Angeles, 2100 West Third Street, Suite 100, Los Angeles, CA 90057, USA
| |
Collapse
|
22
|
Speech onset enhancement improves intelligibility in adverse listening conditions for cochlear implant users. Hear Res 2016; 342:13-22. [DOI: 10.1016/j.heares.2016.09.002] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/02/2014] [Accepted: 09/07/2016] [Indexed: 11/17/2022]
|
23
|
Tan J, Dowell R, Vogel A. Mandarin Lexical Tone Acquisition in Cochlear Implant Users With Prelingual Deafness: A Review. Am J Audiol 2016; 25:246-56. [PMID: 27387047 DOI: 10.1044/2016_aja-15-0069] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2015] [Accepted: 02/20/2016] [Indexed: 11/09/2022] Open
Abstract
PURPOSE The purpose of this review article is to synthesize evidence from the fields of developmental linguistics and cochlear implant technology relevant to the production and perception of Mandarin lexical tone in cochlear implant users with prelingual deafness. The aim of this review was to identify potential factors that determine outcomes for tonal-language speaking cochlear implant users and possible directions for further research. METHOD A computerized database search of MEDLINE, CINAHL, Academic Search Premier, Web of Science, and Google Scholar was undertaken in June and July 2014. Search terms used were lexical tone AND tonal language, speech development AND/OR speech production AND/OR speech perception AND cochlear implants, and pitch perception AND cochlear implants, anywhere in the title or abstract. CONCLUSION Despite the demonstrated limitations of pitch perception in cochlear implant users, there is some evidence that typical production and perception of lexical tone is possible by cochlear implant users with prelingual deafness. Further studies are required to determine the factors that contribute to better outcomes to inform rehabilitation processes for cochlear implant users in tonal-language environments.
Collapse
Affiliation(s)
- Johanna Tan
- The University of Melbourne, Victoria, Australia
| | | | - Adam Vogel
- Center for Neuroscience of Speech, The University of Melbourne, Victoria, Australia
- Hertie Institute for Clinical Brain Research, Eberhard Karls Universität Tübingen, Germany
- Murdoch Childrens Research Institute, The Bruce Lefroy Centre for Genetic Health Research, Melbourne, Victoria, Australia
| |
Collapse
|
24
|
Vandali AE, Dawson PW, Arora K. Results using the OPAL strategy in Mandarin speaking cochlear implant recipients. Int J Audiol 2016; 56:S74-S85. [PMID: 27329178 DOI: 10.1080/14992027.2016.1190872] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Abstract
OBJECTIVE To evaluate the effectiveness of an experimental pitch-coding strategy for improving recognition of Mandarin lexical tone in cochlear implant (CI) recipients. DESIGN Adult CI recipients were tested on recognition of Mandarin tones in quiet and speech-shaped noise at a signal-to-noise ratio of +10 dB; Mandarin sentence speech-reception threshold (SRT) in speech-shaped noise; and pitch discrimination of synthetic complex-harmonic tones in quiet. Two versions of the experimental strategy were examined: (OPAL) linear (1:1) mapping of fundamental frequency (F0) to the coded modulation rate; and (OPAL+) transposed mapping of high F0s to a lower coded rate. Outcomes were compared to results using the clinical ACE™ strategy. STUDY SAMPLE Five Mandarin speaking users of Nucleus® cochlear implants. RESULTS A small but significant benefit in recognition of lexical tones was observed using OPAL compared to ACE in noise, but not in quiet, and not for OPAL+ compared to ACE or OPAL in quiet or noise. Sentence SRTs were significantly better using OPAL+ and comparable using OPAL to those using ACE. No differences in pitch discrimination thresholds were observed across strategies. CONCLUSIONS OPAL can provide benefits to Mandarin lexical tone recognition in moderately noisy conditions and preserve perception of Mandarin sentences in challenging noise conditions.
Collapse
Affiliation(s)
- Andrew E Vandali
- a The Hearing CRC , Carlton , Victoria , Australia.,b The Bionics Institute , East Melbourne , Victoria , Australia , and
| | - Pam W Dawson
- a The Hearing CRC , Carlton , Victoria , Australia.,c Cochlear Ltd , East Melbourne , Victoria , Australia
| | - Komal Arora
- a The Hearing CRC , Carlton , Victoria , Australia.,c Cochlear Ltd , East Melbourne , Victoria , Australia
| |
Collapse
|
25
|
Marimuthu V, Swanson BA, Mannell R. Cochlear Implant Rate Pitch and Melody Perception as a Function of Place and Number of Electrodes. Trends Hear 2016; 20:20/0/2331216516643085. [PMID: 27094028 PMCID: PMC4871214 DOI: 10.1177/2331216516643085] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
Six Nucleus cochlear implant recipients participated in a study investigating the effect of place of stimulation on melody perception using rate-pitch cues. Each stimulus was a pulse train delivered on either a single electrode or multiple electrodes sequentially. Four spatial stimulation patterns were used: a single apical electrode, a single mid electrode, a pair of electrodes (apical and mid), and 11 electrodes (from apical to mid). Within one block of trials, all stimuli had the same spatial stimulation pattern, with pulse rate varying from 131 to 262 pps. An additional pulse rate range of 262 to 523 pps was tested with the single-electrode stimuli. Two experimental procedures were used: note ranking; and a modified melodies test with backwards and warp modification. In each trial of the modified melodies test, a familiar melody and a version with modified pitch were presented (in random order), and the subject’s task was to select the unmodified melody. There were no significant differences in performance for stimulation on 1, 2, or 11 electrodes, implying that recipients were unable to combine temporal information from different places in the cochlea to give a stronger pitch cue. No advantage of apical electrodes was found: at the lower pulse rates, there were no significant differences between electrodes; and at the higher pulse rates, scores on the apical electrode dropped more than those on the mid electrode.
Collapse
Affiliation(s)
- Vijay Marimuthu
- Department of Linguistics, Macquarie University, Sydney, Australia
| | | | - Robert Mannell
- Department of Linguistics, Macquarie University, Sydney, Australia
| |
Collapse
|
26
|
Goldsworthy RL. Correlations Between Pitch and Phoneme Perception in Cochlear Implant Users and Their Normal Hearing Peers. J Assoc Res Otolaryngol 2015; 16:797-809. [PMID: 26373936 DOI: 10.1007/s10162-015-0541-9] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2014] [Accepted: 08/30/2015] [Indexed: 11/29/2022] Open
Abstract
This study examined correlations between pitch and phoneme perception for nine cochlear implant users and nine normal hearing listeners. Pure tone frequency discrimination thresholds were measured for frequencies of 500, 1000, and 2000 Hz. Complex tone fundamental frequency (F0) discrimination thresholds were measured for F0s of 110, 220, and 440 Hz. The effects of amplitude and frequency roving were measured under the rationale that individuals who are robust to such perturbations would perform better on phoneme perception measures. Phoneme identification was measured using consonant and vowel materials in quiet, in stationary speech-shaped noise (SSN), in spectrally notched SSN, and in temporally gated SSN. Cochlear implant pure tone frequency discrimination thresholds ranged between 1.5 and 9.9 %, while cochlear implant complex tone F0 discrimination thresholds ranged between 2.6 and 28.5 %. On average, cochlear implant users had 5.3 dB of masking release for consonants and 8.4 dB of masking release for vowels when measured in temporally gated SSN compared to stationary SSN. Correlations with phoneme identification measures were generally higher for complex tone discrimination measures than for pure tone discrimination measures. Correlations with phoneme identification measures were also generally higher for pitch perception measures that included amplitude and frequency roving. The strongest correlations were observed for measures of complex tone F0 discrimination with phoneme identification in temporally gated SSN. The results of this study suggest that musical training or signal processing strategies that improve F0 discrimination should improve consonant identification in fluctuating noise.
Collapse
|
27
|
Francart T, Osses A, Wouters J. Speech perception with F0mod, a cochlear implant pitch coding strategy. Int J Audiol 2015; 54:424-32. [DOI: 10.3109/14992027.2014.989455] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
|
28
|
Lee T, Yu S, Yuan M, Wong TKC, Kong YY. The effect of enhancing temporal periodicity cues on Cantonese tone recognition by cochlear implantees. Int J Audiol 2014; 53:546-57. [PMID: 24694089 DOI: 10.3109/14992027.2014.893374] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
OBJECTIVES This study investigates the efficacy of a cochlear implant (CI) processing method that enhances temporal periodicity cues of speech. DESIGN Subjects participated in word and tone identification tasks. Two processing conditions - the conventional advanced combination encoder (ACE) and tone-enhanced ACE were tested. Test materials were Cantonese disyllabic words recorded from one male and one female speaker. Speech-shaped noise was added to clean speech. The fundamental frequency information for periodicity enhancement was extracted from the clean speech. Electrical stimuli generated from the noisy speech with and without periodicity enhancement were presented via direct stimulation using a Laura 34 research processor. Subjects were asked to identify the presented word. STUDY SAMPLE Seven post-lingually deafened native Cantonese-speaking CI users. RESULTS Percent correct word, segmental structure, and tone identification scores were calculated. While word and segmental structure identification accuracy remained similar between the two processing conditions, tone identification in noise was better with tone-enhanced ACE than with conventional ACE. Significant improvement on tone perception was found only for the female voice. CONCLUSIONS Temporal periodicity cues are important to tone perception in noise. Pitch and tone perception by CI users could be improved when listeners received enhanced temporal periodicity cues.
Collapse
Affiliation(s)
- Tan Lee
- * Department of Electronic Engineering, The Chinese University of Hong Kong , China
| | | | | | | | | |
Collapse
|
29
|
Pitch and loudness matching of unmodulated and modulated stimuli in cochlear implantees. Hear Res 2013; 302:32-49. [DOI: 10.1016/j.heares.2013.05.004] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/03/2012] [Revised: 04/30/2013] [Accepted: 05/06/2013] [Indexed: 11/23/2022]
|