1
|
Wohlbauer DM, Dillier N. A Hundred Ways to Encode Sound Signals for Cochlear Implants. Annu Rev Biomed Eng 2025; 27:335-369. [PMID: 40310887 DOI: 10.1146/annurev-bioeng-102623-121249] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/03/2025]
Abstract
Cochlear implants are the most successful neural prostheses used to restore hearing in severe-to-profound hearing-impaired individuals. The field of cochlear implant coding investigates interdisciplinary approaches to translate acoustic signals into electrical pulses transmitted at the electrode-neuron interface, ranging from signal preprocessing algorithms, enhancement, and feature extraction methodologies to electric signal generation. In the last five decades, numerous coding strategies have been proposed clinically and experimentally. Initially developed to restore speech perception, increasing computational possibilities now allow coding of more complex signals, and new techniques to optimize the transmission of electrical signals are constantly gaining attention. This review provides insights into the history of multichannel coding and presents an extensive list of implemented strategies. The article briefly addresses each method and considers promising future directions of neural prostheses and possible signal processing, with the ultimate goal of providing a current big picture of the large field of cochlear implant coding.
Collapse
Affiliation(s)
- Dietmar M Wohlbauer
- Department of Otolaryngology, Head and Neck Surgery, Massachusetts Eye and Ear, Harvard Medical School, Boston, Massachusetts, USA;
| | - Norbert Dillier
- Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Zürich, University of Zürich, Zürich, Switzerland
| |
Collapse
|
2
|
Luo J, Wang R, Xu K, Chao X, Zheng Y, Hu F, Liu X, Vandali AE, Wang H, Xu L. Outcomes Using the Optimized Pitch and Language Strategy Versus the Advanced Combination Encoder Strategy in Mandarin-Speaking Cochlear Implant Recipients. Ear Hear 2025; 46:210-222. [PMID: 39104002 PMCID: PMC11637569 DOI: 10.1097/aud.0000000000001572] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Accepted: 07/02/2024] [Indexed: 08/07/2024]
Abstract
OBJECTIVES The experimental Optimized Pitch and Language (OPAL) strategy enhances coding of fundamental frequency (F0) information in the temporal envelope of electrical signals delivered to channels of a cochlear implant (CI). Previous studies with OPAL have explored performance on speech and lexical tone perception in Mandarin- and English-speaking CI recipients. However, it was not clear which cues to lexical tone (primary and/or secondary) were used by the Mandarin CI listeners. The primary aim of the present study was to investigate whether OPAL provides improved recognition of Mandarin lexical tones in both quiet and noisy environments compared with the Advanced Combination Encoder (ACE) strategy. A secondary aim was to investigate whether, and to what extent, removal of secondary (duration and intensity envelope) cues to lexical tone affected Mandarin tone perception. DESIGN Thirty-two CI recipients with an average age of 24 (range 7 to 57) years were enrolled in the study. All recipients had at least 1 year of experience using ACE. Each subject attended two testing sessions, the first to measure baseline performance, and the second to evaluate the effect of strategy after provision of some take-home experience using OPAL. A minimum take-home duration of approximately 4 weeks was prescribed in which subjects were requested to use OPAL as much as possible but were allowed to also use ACE when needed. The evaluation tests included recognition of Mandarin lexical tones in quiet and in noise (signal to noise ratio [SNR] +5 dB) using naturally produced tones and duration/intensity envelope normalized versions of the tones; Mandarin sentence in adaptive noise; Mandarin monosyllabic and disyllabic word in quiet; a subset of Speech, Spatial, and Qualities of hearing questionnaire (SSQ, speech hearing scale); and subjective preference for strategy in quiet and noise. RESULTS For both the natural and normalized lexical tone tests, mean scores for OPAL were significantly higher than ACE in quiet by 2.7 and 2.9%-points, respectively, and in noise by 7.4 and 7.2%-points, respectively. Monosyllabic word recognition in quiet using OPAL was significantly higher than ACE by approximately 7.5% points. Average SSQ ratings for OPAL were significantly higher than ACE by approximately 0.5 points on a 10-point scale. In quiet conditions, 14 subjects preferred OPAL, 7 expressed a preference for ACE, and 9 reported no preference. Compared with quiet, in noisy situations, there was a stronger preference for OPAL (19 recipients), a similar preference for ACE (7 recipients), while fewer expressed no preference. Average daily take-home use of ACE and OPAL was 4.9 and 7.1 hr, respectively. CONCLUSIONS For Mandarin-speaking CI recipients, OPAL provided significant improvements to lexical tone perception for natural and normalized tones in quiet and noise, monosyllabic word recognition in quiet, and subjective ratings of speech intelligibility. Subjects accessed both primary and secondary cues to lexical tone for perception in quiet and noise conditions. The benefits of lexical tone recognition were attributed to enhanced F0 rate cues encoded by OPAL, especially in a noisy environment. The OPAL strategy was well accepted by many of the Mandarin-speaking CI recipients.
Collapse
Affiliation(s)
- Jianfen Luo
- Department of Otolaryngology-Head and Neck Surgery, Shandong Provincial ENT Hospital, Shandong University, Jinan, Shandong, People’s Republic of China
- These authors are co-first authors
| | - Ruijie Wang
- Department of Otolaryngology-Head and Neck Surgery, Shandong Provincial ENT Hospital, Shandong University, Jinan, Shandong, People’s Republic of China
- These authors are co-first authors
| | - Kaifan Xu
- Department of Otolaryngology-Head and Neck Surgery, Shandong Provincial ENT Hospital, Shandong University, Jinan, Shandong, People’s Republic of China
| | - Xiuhua Chao
- Department of Otolaryngology-Head and Neck Surgery, Shandong Provincial ENT Hospital, Shandong University, Jinan, Shandong, People’s Republic of China
| | - Yi Zheng
- Cochlear Medical Device (Beijing) Co., Ltd, Beijing, China
| | - Fangxia Hu
- Department of Otolaryngology-Head and Neck Surgery, Shandong Provincial ENT Hospital, Shandong University, Jinan, Shandong, People’s Republic of China
| | - Xianqi Liu
- Department of Otolaryngology-Head and Neck Surgery, Shandong Provincial ENT Hospital, Shandong University, Jinan, Shandong, People’s Republic of China
| | | | - Haibo Wang
- Department of Otolaryngology-Head and Neck Surgery, Shandong Provincial ENT Hospital, Shandong University, Jinan, Shandong, People’s Republic of China
- These authors are co-corresponding authors
| | - Lei Xu
- Department of Otolaryngology-Head and Neck Surgery, Shandong Provincial ENT Hospital, Shandong University, Jinan, Shandong, People’s Republic of China
- These authors are co-corresponding authors
| |
Collapse
|
3
|
Šodan A, Meunier S, Péan V, Lavieille JP, Roman S, Macherey O. Asymmetry in the Perception of Electrical Chirps Presented to Cochlear Implant Listeners. J Assoc Res Otolaryngol 2024; 25:491-506. [PMID: 39090303 PMCID: PMC11527866 DOI: 10.1007/s10162-024-00952-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2023] [Accepted: 05/11/2024] [Indexed: 08/04/2024] Open
Abstract
INTRODUCTION Although a broadband acoustic click is physically the shortest duration sound we can hear, its peripheral neural representation is not as short because of cochlear filtering. The traveling wave imposes frequency-dependent delays to the sound waveform so that in response to a click, apical nerve fibers, coding for low frequencies, are excited several milliseconds after basal fibers, coding for high frequencies. Nevertheless, a click sounds like a click and these across-fiber delays are not perceived. This suggests that they may be compensated by the central auditory system, rendering our perception consistent with the external world. This explanation is difficult to evaluate in normal-hearing listeners because the contributions of peripheral and central auditory processing cannot easily be disentangled. Here, we test this hypothesis in cochlear implant listeners for whom cochlear mechanics is bypassed. METHOD Eight cochlear implant users ranked in perceived duration 12 electrical chirps of various physical durations and spanning the cochlea in the apex-to-base or base-to-apex direction (Exp. 1). Late-latency cortical potentials were also recorded in response to a subset of these chirps (Exp. 2). RESULTS We show that an electrical chirp spanning the cochlea from base-to-apex is perceived as shorter than the same chirp spanning the cochlea in the opposite direction despite having the same physical duration. Cortical potentials also provide neural correlates of this asymmetry in perception. CONCLUSION These results demonstrate that the central auditory system processes frequency sweeps differently depending on the direction of the frequency change and that this processing difference is not simply the result of peripheral filtering.
Collapse
Affiliation(s)
- Ana Šodan
- Aix Marseille Univ, CNRS, Centrale Marseille, LMA UMR 7031, 13013, Marseille, France.
| | - Sabine Meunier
- Aix Marseille Univ, CNRS, Centrale Marseille, LMA UMR 7031, 13013, Marseille, France
| | | | - Jean-Pierre Lavieille
- Department of ORL, Laboratory of Applied Biomechanics, LDV unit, 13002, Marseille, France
- University Hospital Nord, 13015, Marseille, France
| | - Stéphane Roman
- Institut de Neurosciences des systèmes, Inserm UMR1106, Aix-Marseille Univ., 13005, Marseille, France
- Dept. of Pediatric Otolaryngology and Neck Surgery, Aix-Marseille Univ., 13005, Marseille, France
| | - Olivier Macherey
- Aix Marseille Univ, CNRS, Centrale Marseille, LMA UMR 7031, 13013, Marseille, France
| |
Collapse
|
4
|
Camarena A, Goldsworthy RL. Characterizing the relationship between modulation sensitivity and pitch resolution in cochlear implant users. Hear Res 2024; 448:109026. [PMID: 38776706 PMCID: PMC11572715 DOI: 10.1016/j.heares.2024.109026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Revised: 03/28/2024] [Accepted: 04/30/2024] [Indexed: 05/25/2024]
Abstract
Cochlear implants are medical devices that have restored hearing to approximately one million people around the world. Outcomes are impressive and most recipients attain excellent speech comprehension in quiet without relying on lip-reading cues, but pitch resolution is poor compared to normal hearing. Amplitude modulation of electrical stimulation is a primary cue for pitch perception in cochlear implant users. The experiments described in this article focus on the relationship between sensitivity to amplitude modulations and pitch resolution based on changes in the frequency of amplitude modulations. In the first experiment, modulation sensitivity and pitch resolution were measured in adults with no known hearing loss and in cochlear implant users with sounds presented to and processed by their clinical devices. Stimuli were amplitude-modulated sinusoids and amplitude-modulated narrow-band noises. Modulation detection and modulation frequency discrimination were measured for modulation frequencies centered on 110, 220, and 440 Hz. Pitch resolution based on changes in modulation frequency was measured for modulation depths of 25 %, 50 %, 100 %, and for a half-waved rectified modulator. Results revealed a strong linear relationship between modulation sensitivity and pitch resolution for cochlear implant users and peers with no known hearing loss. In the second experiment, cochlear implant users took part in analogous procedures of modulation sensitivity and pitch resolution but bypassing clinical sound processing using single-electrode stimulation. Results indicated that modulation sensitivity and pitch resolution was better conveyed by single-electrode stimulation than by clinical processors. Results at 440 Hz were worse, but also not well conveyed by clinical sound processing, so it remains unclear whether the 300 Hz perceptual limit described in the literature is a technological or biological limitation. These results highlight modulation depth and sensitivity as critical factors for pitch resolution in cochlear implant users and characterize the relationship that should inform the design of modulation enhancement algorithms for cochlear implants.
Collapse
Affiliation(s)
- Andres Camarena
- Auditory Research Center, Caruso Department of Otolaryngology, Keck School of Medicine, University of Southern California, Los Angeles, CA, United States of America
| | - Raymond L Goldsworthy
- Auditory Research Center, Caruso Department of Otolaryngology, Keck School of Medicine, University of Southern California, Los Angeles, CA, United States of America.
| |
Collapse
|
5
|
Ashihara T, Furukawa S, Kashino M. Estimating Pitch Information From Simulated Cochlear Implant Signals With Deep Neural Networks. Trends Hear 2024; 28:23312165241298606. [PMID: 39569552 DOI: 10.1177/23312165241298606] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2024] Open
Abstract
Cochlear implant (CI) users, even with substantial speech comprehension, generally have poor sensitivity to pitch information (or fundamental frequency, F0). This insensitivity is often attributed to limited spectral and temporal resolution in the CI signals. However, the pitch sensitivity markedly varies among individuals, and some users exhibit fairly good sensitivity. This indicates that the CI signal contains sufficient information about F0, and users' sensitivity is predominantly limited by other physiological conditions such as neuroplasticity or neural health. We estimated the upper limit of F0 information that a CI signal can convey by decoding F0 from simulated CI signals (multi-channel pulsatile signals) with a deep neural network model (referred to as the CI model). We varied the number of electrode channels and the pulse rate, which should respectively affect spectral and temporal resolutions of stimulus representations. The F0-estimation performance generally improved with increasing number of channels and pulse rate. For the sounds presented under quiet conditions, the model performance was at best comparable to that of a control waveform model, which received raw-waveform inputs. Under conditions in which background noise was imposed, the performance of the CI model generally degraded by a greater degree than that of the waveform model. The pulse rate had a particularly large effect on predicted performance. These observations indicate that the CI signal contains some information for predicting F0, which is particularly sufficient for targets under quiet conditions. The temporal resolution (represented as pulse rate) plays a critical role in pitch representation under noisy conditions.
Collapse
Affiliation(s)
- Takanori Ashihara
- NTT Human Informatics Laboratories, NTT Corporation, Kanagawa, Japan
- Department of Information and Communications Engineering, Tokyo Institute of Technology, Yokohama-shi, Kanagawa-ken, Japan
| | - Shigeto Furukawa
- Graduate School of Public Health, Shizuoka Graduate University of Public Health, Shizuoka, Japan
- Speech-Language-Hearing Center, Shizuoka General Hospital, Shizuoka, Japan
- NTT Communication Science Laboratories, NTT Corporation, Kanagawa, Japan
| | - Makio Kashino
- NTT Communication Science Laboratories, NTT Corporation, Kanagawa, Japan
| |
Collapse
|
6
|
Kuo CY, Liu JW, Wang CH, Juan CH, Hsieh IH. The role of carrier spectral composition in the perception of musical pitch. Atten Percept Psychophys 2023; 85:2083-2099. [PMID: 37479873 DOI: 10.3758/s13414-023-02761-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/07/2023] [Indexed: 07/23/2023]
Abstract
Temporal envelope fluctuations of natural sounds convey critical information to speech and music processing. In particular, musical pitch perception is assumed to be primarily underlined by temporal envelope encoding. While increasing evidence demonstrates the importance of carrier fine structure to complex pitch perception, how carrier spectral information affects musical pitch perception is less clear. Here, transposed tones designed to convey identical envelope information across different carriers were used to assess the effects of carrier spectral composition to pitch discrimination and musical-interval and melody identifications. Results showed that pitch discrimination thresholds became lower (better) with increasing carrier frequencies from 1k to 10k Hz, with performance comparable to that of pure sinusoids. Musical interval and melody defined by the periodicity of sine- or harmonic complex envelopes across carriers were identified with greater than 85% accuracy even on a 10k-Hz carrier. Moreover, enhanced interval and melody identification performance was observed with increasing carrier frequency up to 6k Hz. Findings suggest a perceptual enhancement of temporal envelope information with increasing carrier spectral region in musical pitch processing, at least for frequencies up to 6k Hz. For carriers in the extended high-frequency region (8-20k Hz), the use of temporal envelope information to music pitch processing may vary depending on task requirement. Collectively, these results implicate the fidelity of temporal envelope information to musical pitch perception is more pronounced than previously considered, with ecological implications.
Collapse
Affiliation(s)
- Chao-Yin Kuo
- Institute of Cognitive Neuroscience, National Central University, No. 300, Zhongda Rd., Zhongli District, Taoyuan City, 320317, Taiwan
- Department of Otolaryngology-Head and Neck Surgery, Tri-Service General Hospital, National Defense Medical Center, Taipei City, Taiwan
| | - Jia-Wei Liu
- Institute of Cognitive Neuroscience, National Central University, No. 300, Zhongda Rd., Zhongli District, Taoyuan City, 320317, Taiwan
| | - Chih-Hung Wang
- Department of Otolaryngology-Head and Neck Surgery, Tri-Service General Hospital, National Defense Medical Center, Taipei City, Taiwan
| | - Chi-Hung Juan
- Institute of Cognitive Neuroscience, National Central University, No. 300, Zhongda Rd., Zhongli District, Taoyuan City, 320317, Taiwan
- Cognitive Intelligence and Precision Healthcare Center, National Central University, No. 300, Zhongda Rd., Zhongli District, Taoyuan City, 320317, Taiwan
| | - I-Hui Hsieh
- Institute of Cognitive Neuroscience, National Central University, No. 300, Zhongda Rd., Zhongli District, Taoyuan City, 320317, Taiwan.
- Cognitive Intelligence and Precision Healthcare Center, National Central University, No. 300, Zhongda Rd., Zhongli District, Taoyuan City, 320317, Taiwan.
| |
Collapse
|
7
|
Lindenbeck MJ, Majdak P, Srinivasan S, Laback B. Pitch discrimination in electric hearing with inconsistent and consistent amplitude-modulation and inter-pulse rate cues. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2023; 153:3268. [PMID: 37307025 PMCID: PMC10264086 DOI: 10.1121/10.0019452] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/28/2022] [Revised: 04/28/2023] [Accepted: 05/01/2023] [Indexed: 06/13/2023]
Abstract
Users of cochlear implants (CIs) struggle in situations that require selective hearing to focus on a target source while ignoring other sources. One major reason for that is the limited access to timing cues such as temporal pitch or interaural time differences (ITDs). Various approaches to improve timing-cue sensitivity while maintaining speech understanding have been proposed, among them inserting extra pulses with short inter-pulse intervals (SIPIs) into amplitude-modulated (AM) high-rate pulse trains. Indeed, SIPI rates matching the naturally occurring AM rates improve pitch discrimination. For ITD, however, low SIPI rates are required, potentially mismatching the naturally occurring AM rates and thus creating unknown pitch effects. In this study, we investigated the perceptual contribution of AM and SIPI rate to pitch discrimination in five CI listeners and with two AM depths (0.1 and 0.5). Our results show that the SIPI-rate cue generally dominated the percept for both consistent and inconsistent cues. When tested with inconsistent cues, also the AM rate contributed, however, at the large AM depth only. These findings have implications when aiming at jointly improving temporal-pitch and ITD sensitivity in a future mixed-rate stimulation approach.
Collapse
Affiliation(s)
- Martin J Lindenbeck
- Acoustics Research Institute, Austrian Academy of Sciences, Wohllebengasse 12-14, A-1040 Vienna, Austria
| | - Piotr Majdak
- Acoustics Research Institute, Austrian Academy of Sciences, Wohllebengasse 12-14, A-1040 Vienna, Austria
| | - Sridhar Srinivasan
- Acoustics Research Institute, Austrian Academy of Sciences, Wohllebengasse 12-14, A-1040 Vienna, Austria
| | - Bernhard Laback
- Acoustics Research Institute, Austrian Academy of Sciences, Wohllebengasse 12-14, A-1040 Vienna, Austria
| |
Collapse
|
8
|
Chen Y. Is Cantonese lexical tone information important for sentence recognition accuracy in quiet and in noise? PLoS One 2022; 17:e0276254. [PMID: 36282852 PMCID: PMC9595525 DOI: 10.1371/journal.pone.0276254] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2022] [Accepted: 10/03/2022] [Indexed: 11/06/2022] Open
Abstract
In Chinese languages, tones are used to express the lexical meaning of words. It is therefore important to analyze the role of lexical tone in Chinese sentence recognition accuracy. There is a lack of research on the role of Cantonese lexical tones in sentence recognition accuracy. Therefore, this study examined the contribution of lexical tone information to Cantonese sentence recognition accuracy and its cognitive correlates in adults with normal hearing (NH). A text-to-speech synthesis engine was used to synthesize Cantonese daily-use sentences with each word carrying an original or a flat lexical tone, which were then presented to 97 participants in quiet, in speech-shaped noise (SSN), and in two-talker babble (TTB) noise conditions. Both target sentences and noises were presented at 65 dB binaurally via insert headphones. It was found that listeners with NH can almost perfectly recognize a daily-use Cantonese sentence with mismatched lexical tone information in quiet, while their sentence recognition decreases substantially in noise. The same finding was reported for Mandarin, which has a relatively simple tonal system, suggesting that the current results may be applicable to other tonal languages. In addition, working memory (WM) was significantly related to decline in sentence recognition score in the TTB but not in the SSN, when the lexical tones were mismatched. This finding can be explained using the Ease of Language Understanding model and suggests that those with higher WM are less likely to be affected by the degraded lexical information for perceiving daily-use sentences in the TTB.
Collapse
Affiliation(s)
- Yuan Chen
- Department of Special Education and Counselling, Integrated Center for Wellbeing (I-WELL), The Education University of Hong Kong, Taipo, New Territories, Hong Kong SAR, China
- * E-mail:
| |
Collapse
|
9
|
Kovačić D, James CJ. Stimulation Rate and Voice Pitch Perception in Cochlear Implants. J Assoc Res Otolaryngol 2022; 23:665-680. [PMID: 35918501 PMCID: PMC9613839 DOI: 10.1007/s10162-022-00854-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2020] [Accepted: 06/05/2022] [Indexed: 06/15/2023] Open
Abstract
The stimulation rate in cochlear implant (CI) sound coding, or the "carrier" rate in pulses per second (pps), is known to influence pitch perception, as well as loudness perception and sound quality. Our main objective was to investigate the effects of reduced carrier rate on the loudness and pitch of coded speech samples. We describe two experiments with 16 Nucleus® CI users, where we controlled modulation characteristics and carrier rate using Spectral and Temporal Enhanced Processing (STEP), a novel experimental multichannel sound coder. We used a fixed set of threshold and comfortable stimulation levels for each subject, obtained from clinical MAPs. In the first experiment, we determined equivalence for voice pitch ranking and voice gender categorization between the Advanced Combination Encoder (ACE), a widely used clinical strategy in Nucleus® recipients, and STEP for fundamental frequencies (F0) 120-250 Hz. In the second experiment, loudness was determined as a function of the input amplitude of speech samples for carrier rates of 1000, 500, and 250 pps per channel. Then, using equally loud sound coder programs, we evaluated the effect of carrier rate on voice pitch perception. Although nearly all subjects could categorize voice gender significantly above chance, pitch ranking varied across subjects. Overall, carrier rate did not substantially affect voice pitch ranking or voice gender categorization: as long as the carrier rate was at least twice the fundamental frequency, or when stimulation pulses for the lowest, 250 pps carrier were aligned to F0 peaks. These results indicate that carrier rates as low as 250 pps per channel are sufficient to support functional voice pitch perception for those CI users sensitive to temporal pitch cues; at least when temporal modulations and pulse timings in the coder output are well controlled by novel strategies such as STEP.
Collapse
Affiliation(s)
- Damir Kovačić
- Department of Physics, Faculty of Science, University of Split, Ruđera Boškovića 33, 21000 Split, Croatia
| | - Chris J. James
- Cochlear France SAS, 135 Route de Saint Simon, 31100 Toulouse, France
| |
Collapse
|
10
|
Monaghan JJM, Carlyon RP, Deeks JM. Modulation Depth Discrimination by Cochlear Implant Users. J Assoc Res Otolaryngol 2022; 23:285-299. [PMID: 35080684 PMCID: PMC8964891 DOI: 10.1007/s10162-022-00834-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2021] [Accepted: 12/30/2021] [Indexed: 11/29/2022] Open
Abstract
Cochlear implants (CIs) convey the amplitude envelope of speech by modulating high-rate pulse trains. However, not all of the envelope may be necessary to perceive amplitude modulations (AMs); the effective envelope depth may be limited by forward and backward masking from the envelope peaks. Three experiments used modulated pulse trains to measure which portions of the envelope can be effectively processed by CI users as a function of AM frequency. Experiment 1 used a three-interval forced-choice task to test the ability of CI users to discriminate less-modulated pulse trains from a fully modulated standard, without controlling for loudness. The stimuli in experiment 2 were identical, but a two-interval task was used in which participants were required to choose the less-modulated interval, ignoring loudness. Catch trials, in which judgements based on level or modulation depth would give opposing answers, were included. Experiment 3 employed novel stimuli whose modulation envelope could be modified below a variable point in the dynamic range, without changing the loudness of the stimulus. Overall, results showed that substantial portions of the envelope are not accurately encoded by CI users. In experiment 1, where loudness cues were available, participants on average were insensitive to changes in the bottom 30% of their dynamic range. In experiment 2, where loudness was controlled, participants appeared insensitive to changes in the bottom 50% of the dynamic range. In experiment 3, participants were insensitive to changes in the bottom 80% of the dynamic range. We discuss potential reasons for this insensitivity and implications for CI speech-processing strategies.
Collapse
Affiliation(s)
- Jessica J M Monaghan
- Macquarie University, The Australian Hearing Hub, NSW, 2109, Sydney, Australia.
- National Acoustic Laboratories, The Australian Hearing Hub, Sydney, NSW, 2109, Australia.
| | - Robert P Carlyon
- Cambridge Hearing Group, Medical Research Council Cognition and Brain Sciences Unit, University of Cambridge, 15 Chaucer Road, Cambridge, CB2 7EF, UK
| | - John M Deeks
- Cambridge Hearing Group, Medical Research Council Cognition and Brain Sciences Unit, University of Cambridge, 15 Chaucer Road, Cambridge, CB2 7EF, UK
| |
Collapse
|
11
|
Goldsworthy RL, Bissmeyer SRS, Camarena A. Advantages of Pulse Rate Compared to Modulation Frequency for Temporal Pitch Perception in Cochlear Implant Users. J Assoc Res Otolaryngol 2022; 23:137-150. [PMID: 34981263 PMCID: PMC8782986 DOI: 10.1007/s10162-021-00828-w] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2021] [Accepted: 12/01/2021] [Indexed: 02/03/2023] Open
Abstract
Most cochlear implants encode the fundamental frequency of periodic sounds by amplitude modulation of constant-rate pulsatile stimulation. Pitch perception provided by such stimulation strategies is markedly poor. Two experiments are reported here that consider potential advantages of pulse rate compared to modulation frequency for providing stimulation timing cues for pitch. The first experiment examines beat frequency distortion that occurs when modulating constant-rate pulsatile stimulation. This distortion has been reported on previously, but the results presented here indicate that distortion occurs for higher stimulation rates than previously reported. The second experiment examines pitch resolution as provided by pulse rate compared to modulation frequency. The results indicate that pitch discrimination is better with pulse rate than with modulation frequency. The advantage was large for rates near what has been suggested as the upper limit of temporal pitch perception conveyed by cochlear implants. The results are relevant to sound processing design for cochlear implants particularly for algorithms that encode fundamental frequency into deep envelope modulations or into precisely timed pulsatile stimulation.
Collapse
Affiliation(s)
- Raymond L Goldsworthy
- Auditory Research Center, Caruso Department of Otolaryngology, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA.
| | - Susan R S Bissmeyer
- Auditory Research Center, Caruso Department of Otolaryngology, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
- Department of Biomedical Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, CA, USA
| | - Andres Camarena
- Auditory Research Center, Caruso Department of Otolaryngology, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
- Neuroscience Graduate Program, University of Southern California, Los Angeles, CA, USA
| |
Collapse
|
12
|
Huang EHH, Wu CM, Lin HC. Combination and Comparison of Sound Coding Strategies Using Cochlear Implant Simulation With Mandarin Speech. IEEE Trans Neural Syst Rehabil Eng 2021; 29:2407-2416. [PMID: 34767509 DOI: 10.1109/tnsre.2021.3128064] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Three cochlear implant (CI) sound coding strategies were combined in the same signal processing path and compared for speech intelligibility with vocoded Mandarin sentences. The three CI coding strategies, biologically-inspired hearing aid algorithm (BioAid), envelope enhancement (EE), and fundamental frequency modulation (F0mod), were combined with the advanced combination encoder (ACE) strategy. Hence, four singular coding strategies and four combinational coding strategies were derived. Mandarin sentences with speech-shape noise were processed using these coding strategies. Speech understanding of vocoded Mandarin sentences was evaluated using short-time objective intelligibility (STOI) and subjective sentence recognition tests with normal-hearing listeners. For signal-to-noise ratios at 5 dB or above, the EE strategy had slightly higher average scores in both STOI and listening tests compared to ACE. The addition of EE to BioAid slightly increased the mean scores for BioAid+EE, which was the combination strategy with the highest scores in both objective and subjective speech intelligibility. The benefits of BioAid, F0mod, and the four combinational coding strategies were not observed in CI simulation. The findings of this study may be useful for the future design of coding strategies and related studies with Mandarin.
Collapse
|
13
|
Goldsworthy RL, Camarena A, Bissmeyer SRS. Pitch perception is more robust to interference and better resolved when provided by pulse rate than by modulation frequency of cochlear implant stimulation. Hear Res 2021; 409:108319. [PMID: 34340020 PMCID: PMC9343238 DOI: 10.1016/j.heares.2021.108319] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/17/2021] [Revised: 07/15/2021] [Accepted: 07/21/2021] [Indexed: 01/14/2023]
Abstract
Cochlear implants are medical devices that have been used to restore hearing to more than half a million people worldwide. Most recipients achieve high levels of speech comprehension through these devices, but speech comprehension in background noise and music appreciation in general are markedly poor compared to normal hearing. A key aspect of hearing that is notably diminished in cochlear implant outcomes is the sense of pitch provided by these devices. Pitch perception is an important factor affecting speech comprehension in background noise and is critical for music perception. The present article summarizes two experiments that examine the robustness and resolution of pitch perception as provided by cochlear implant stimulation timing. The driving hypothesis is that pitch conveyed by stimulation timing cues is more robust and better resolved when provided by variable pulse rates than by modulation frequency of constant-rate stimulation. Experiment 1 examines the robustness for hearing a large, one-octave, pitch difference in the presence of interfering electrical stimulation. With robustness to interference characterized for an otherwise easily discernible pitch difference, Experiment 2 examines the resolution of discrimination thresholds in the presence of interference as conveyed by modulation frequency or by pulse rate. These experiments test for an advantage of stimulation with precise temporal cues. The results indicate that pitch provided by pulse rate is both more robust to interference and is better resolved compared to when provided by modulation frequency. These results should inform the development of new sound processing strategies for cochlear implants designed to encode fundamental frequency of sounds into precise temporal stimulation.
Collapse
Affiliation(s)
- Raymond L Goldsworthy
- Auditory Research Center, Caruso Department of Otolaryngology, Keck School of Medicine, University of Southern California, Los Angeles, CA, United States.
| | - Andres Camarena
- Auditory Research Center, Caruso Department of Otolaryngology, Keck School of Medicine, University of Southern California, Los Angeles, CA, United States; Neuroscience Graduate Program, University of Southern California, Los Angeles, CA, United States
| | - Susan R S Bissmeyer
- Auditory Research Center, Caruso Department of Otolaryngology, Keck School of Medicine, University of Southern California, Los Angeles, CA, United States; Department of Biomedical Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, CA, United States
| |
Collapse
|
14
|
Shekar RCMC, Hansen JHL. An evaluation framework for research platforms to advance cochlear implant/hearing aid technology: A case study with CCi-MOBILE. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021; 149:229. [PMID: 33514127 PMCID: PMC7803384 DOI: 10.1121/10.0002989] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/12/2020] [Revised: 12/06/2020] [Accepted: 12/09/2020] [Indexed: 06/12/2023]
Abstract
Cochlear implants (CIs) and hearing aids (HAs) are advanced assistive hearing devices that perform sound processing to achieve acoustic to acoustic/electrical stimulation, thus enabling the prospects for hearing restoration and rehabilitation. Since commercial CIs/HAs are typically constrained by manufacturer design/production constraints, it is necessary for researchers to use research platforms (RPs) to advance algorithms and conduct investigational studies with CI/HA subjects. While previous CI/HA research platforms exist, no study has explored establishing a formal evaluation protocol for the operational safety and reliability of RPs. This study proposes a two-phase analysis and evaluation paradigm for RPs. In the acoustic phase 1 step, a signal processing acoustic space is explored in order to present a sampled set of audio input content to explore the safety of the resulting output electric/acoustic stimulation. In the parameter phase 2 step, the configurable space for realizable electrical stimulation pulses is determined, and overall stimulation reliability and safety are evaluated. The proposed protocol is applied and demonstrated using Costakis Cochlear Implant Mobile. Assessment protocol observations, results, and additional best practices for subsampling of the acoustic and parameter test spaces are discussed. The proposed analysis-evaluation protocol establishes a viable framework for assessing RP operational safety and reliability. Guidelines for adapting the proposed protocol to address variability in RP configuration due to experimental factors such as custom algorithms, stimulation techniques, and/or individualization are also considered.
Collapse
Affiliation(s)
- Ram C M C Shekar
- Cochlear Implant Processing Laboratory, Center for Robust Speech Systems (CRSS-CILab), Department of Electrical and Computer Engineering, Jonsson School of Engineering and Computer Science, University of Texas at Dallas, Richardson, Texas 75080, USA
| | - John H L Hansen
- Cochlear Implant Processing Laboratory, Center for Robust Speech Systems (CRSS-CILab), Department of Electrical and Computer Engineering, Jonsson School of Engineering and Computer Science, University of Texas at Dallas, Richardson, Texas 75080, USA
| |
Collapse
|
15
|
Nogueira W, Boghdady NE, Langner F, Gaudrain E, Başkent D. Effect of Channel Interaction on Vocal Cue Perception in Cochlear Implant Users. Trends Hear 2021; 25:23312165211030166. [PMID: 34461780 PMCID: PMC8411629 DOI: 10.1177/23312165211030166] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2020] [Revised: 06/14/2021] [Accepted: 06/16/2021] [Indexed: 11/16/2022] Open
Abstract
Speech intelligibility in multitalker settings is challenging for most cochlear implant (CI) users. One possibility for this limitation is the suboptimal representation of vocal cues in implant processing, such as the fundamental frequency (F0), and the vocal tract length (VTL). Previous studies suggested that while F0 perception depends on spectrotemporal cues, VTL perception relies largely on spectral cues. To investigate how spectral smearing in CIs affects vocal cue perception in speech-on-speech (SoS) settings, adjacent electrodes were simultaneously stimulated using current steering in 12 Advanced Bionics users to simulate channel interaction. In current steering, two adjacent electrodes are simultaneously stimulated forming a channel of parallel stimulation. Three such stimulation patterns were used: Sequential (one current steering channel), Paired (two channels), and Triplet stimulation (three channels). F0 and VTL just-noticeable differences (JNDs; Task 1), in addition to SoS intelligibility (Task 2) and comprehension (Task 3), were measured for each stimulation strategy. In Tasks 2 and 3, four maskers were used: the same female talker, a male voice obtained by manipulating both F0 and VTL (F0+VTL) of the original female speaker, a voice where only F0 was manipulated, and a voice where only VTL was manipulated. JNDs were measured relative to the original voice for the F0, VTL, and F0+VTL manipulations. When spectral smearing was increased from Sequential to Triplet, a significant deterioration in performance was observed for Tasks 1 and 2, with no differences between Sequential and Paired stimulation. Data from Task 3 were inconclusive. These results imply that CI users may tolerate certain amounts of channel interaction without significant reduction in performance on tasks relying on voice perception. This points to possibilities for using parallel stimulation in CIs for reducing power consumption.
Collapse
Affiliation(s)
- Waldo Nogueira
- Department of Otolaryngology, Medical University
Hannover and Cluster of Excellence Hearing4all, Hanover, Germany
| | - Nawal El Boghdady
- Department of Otorhinolaryngology, University Medical
Center Groningen, University of Groningen, Groningen,
Netherlands
- Research School of Behavioral and Cognitive
Neurosciences, University of
Groningen, University of Groningen, Groningen,
Netherlands
| | - Florian Langner
- Department of Otolaryngology, Medical University
Hannover and Cluster of Excellence Hearing4all, Hanover, Germany
| | - Etienne Gaudrain
- Department of Otorhinolaryngology, University Medical
Center Groningen, University of Groningen, Groningen,
Netherlands
- Research School of Behavioral and Cognitive
Neurosciences, University of
Groningen, University of Groningen, Groningen,
Netherlands
- Lyon Neuroscience Research Center, CNRS UMR 5292,
INSERM U1028, University Lyon 1, Lyon, France
| | - Deniz Başkent
- Department of Otorhinolaryngology, University Medical
Center Groningen, University of Groningen, Groningen,
Netherlands
- Research School of Behavioral and Cognitive
Neurosciences, University of
Groningen, University of Groningen, Groningen,
Netherlands
| |
Collapse
|
16
|
Bissmeyer SRS, Hossain S, Goldsworthy RL. Perceptual learning of pitch provided by cochlear implant stimulation rate. PLoS One 2020; 15:e0242842. [PMID: 33270735 PMCID: PMC7714175 DOI: 10.1371/journal.pone.0242842] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2019] [Accepted: 11/10/2020] [Indexed: 11/19/2022] Open
Abstract
Cochlear implant users hear pitch evoked by stimulation rate, but discrimination diminishes for rates above 300 Hz. This upper limit on rate pitch is surprising given the remarkable and specialized ability of the auditory nerve to respond synchronously to stimulation rates at least as high as 3 kHz and arguably as high as 10 kHz. Sensitivity to stimulation rate as a pitch cue varies widely across cochlear implant users and can be improved with training. The present study examines individual differences and perceptual learning of stimulation rate as a cue for pitch ranking. Adult cochlear implant users participated in electrode psychophysics that involved testing once per week for three weeks. Stimulation pulse rate discrimination was measured in bipolar and monopolar configurations for apical and basal electrodes. Base stimulation rates between 100 and 800 Hz were examined. Individual differences were quantified using psychophysically derived metrics of spatial tuning and temporal integration. This study examined distribution of measures across subjects, predictive power of psychophysically derived metrics of spatial tuning and temporal integration, and the effect of training on rate discrimination thresholds. Psychophysical metrics of spatial tuning and temporal integration were not predictive of stimulation rate discrimination, but discrimination thresholds improved at lower frequencies with training. Since most clinical devices do not use variable stimulation rates, it is unknown to what extent recipients may learn to use stimulation rate cues if provided in a clear and consistent manner.
Collapse
Affiliation(s)
- Susan R. S. Bissmeyer
- Department of Biomedical Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, California, United States of America
- Auditory Research Center, Caruso Department of Otolaryngology, Keck School of Medicine, University of Southern California, Los Angeles, California, United States of America
| | - Shaikat Hossain
- Auditory Research Center, Caruso Department of Otolaryngology, Keck School of Medicine, University of Southern California, Los Angeles, California, United States of America
| | - Raymond L. Goldsworthy
- Department of Biomedical Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, California, United States of America
- Auditory Research Center, Caruso Department of Otolaryngology, Keck School of Medicine, University of Southern California, Los Angeles, California, United States of America
| |
Collapse
|
17
|
Huang W, Wong LLN, Chen F, Liu H, Liang W. Effects of Fundamental Frequency Contours on Sentence Recognition in Mandarin-Speaking Children With Cochlear Implants. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020; 63:3855-3864. [PMID: 33022190 DOI: 10.1044/2020_jslhr-20-00033] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Purpose Fundamental frequency (F0) is the primary acoustic cue for lexical tone perception in tonal languages but is processed in a limited way in cochlear implant (CI) systems. The aim of this study was to evaluate the importance of F0 contours in sentence recognition in Mandarin-speaking children with CIs and find out whether it is similar to/different from that in age-matched normal-hearing (NH) peers. Method Age-appropriate sentences, with F0 contours manipulated to be either natural or flattened, were randomly presented to preschool children with CIs and their age-matched peers with NH under three test conditions: in quiet, in white noise, and with competing sentences at 0 dB signal-to-noise ratio. Results The neutralization of F0 contours resulted in a significant reduction in sentence recognition. While this was seen only in noise conditions among NH children, it was observed throughout all test conditions among children with CIs. Moreover, the F0 contour-induced accuracy reduction ratios (i.e., the reduction in sentence recognition resulting from the neutralization of F0 contours compared to the normal F0 condition) were significantly greater in children with CIs than in NH children in all test conditions. Conclusions F0 contours play a major role in sentence recognition in both quiet and noise among pediatric implantees, and the contribution of the F0 contour is even more salient than that in age-matched NH children. These results also suggest that there may be differences between children with CIs and NH children in how F0 contours are processed.
Collapse
Affiliation(s)
- Wanting Huang
- Unit of Human Communication, Development, and Information Sciences, Faculty of Education, The University of Hong Kong, China
| | - Lena L N Wong
- Unit of Human Communication, Development, and Information Sciences, Faculty of Education, The University of Hong Kong, China
| | - Fei Chen
- Department of Electrical and Electronic Engineering, Southern University of Science and Technology, Shenzhen, China
| | - Haihong Liu
- Beijing Key Laboratory of Pediatric Diseases of Otolaryngology, Head and Neck Surgery, Beijing Children's Hospital, China
| | - Wei Liang
- China Rehabilitation Research Center for Hearing and Speech Impairment, Beijing, China
| |
Collapse
|
18
|
Lindenbeck MJ, Laback B, Majdak P, Srinivasan S. Temporal-pitch sensitivity in electric hearing with amplitude modulation and inserted pulses with short inter-pulse intervals. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020; 147:777. [PMID: 32113255 PMCID: PMC7002171 DOI: 10.1121/10.0000610] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/21/2018] [Revised: 12/26/2019] [Accepted: 01/02/2020] [Indexed: 06/10/2023]
Abstract
Listeners with cochlear implants (CIs) typically show poor sensitivity to the temporal-envelope pitch of high-rate pulse trains. Sensitivity to interaural time differences improves when adding pulses with short inter-pulse intervals (SIPIs) to high-rate pulse trains. In the current study, monaural temporal-pitch sensitivity with SIPI pulses was investigated for six CI listeners. Amplitude-modulated single-electrode stimuli, representing the coding of the fundamental frequency (F0) in the envelope of a high-rate carrier, were used. Two SIPI-insertion approaches, five modulation depths, two typical speech-F0s, and two carrier rates were tested. SIPI pulses were inserted either in every amplitude-modulation period (full-rate SIPI) to support the F0 cue or in every other amplitude-modulation period (half-rate SIPI) to circumvent a potential rate limitation at higher F0s. The results demonstrate that full-rate SIPI pulses improve temporal-pitch sensitivity across F0s and particularly at low modulation depths where envelope-pitch cues are weak. The half-rate SIPI pulses did not circumvent the limitation and further increased variability across listeners. Further, no effect of the carrier rate was found. Thus, the SIPI approach appears to be a promising approach to enhance CI listeners' access to temporal-envelope pitch cues at pulse rates used clinically.
Collapse
Affiliation(s)
- Martin J Lindenbeck
- Acoustics Research Institute, Austrian Academy of Sciences, Wohllebengasse 12-14, 1040 Vienna, Austria
| | - Bernhard Laback
- Acoustics Research Institute, Austrian Academy of Sciences, Wohllebengasse 12-14, 1040 Vienna, Austria
| | - Piotr Majdak
- Acoustics Research Institute, Austrian Academy of Sciences, Wohllebengasse 12-14, 1040 Vienna, Austria
| | - Sridhar Srinivasan
- Acoustics Research Institute, Austrian Academy of Sciences, Wohllebengasse 12-14, 1040 Vienna, Austria
| |
Collapse
|
19
|
Tran P, Richardson ML, Zeng FG. Input-Output Functions in Human Heads Obtained With Cochlear Implant and Transcranial Electric Stimulation. Neuromodulation 2019; 24:1402-1411. [PMID: 31710408 DOI: 10.1111/ner.13065] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2019] [Revised: 09/18/2019] [Accepted: 09/23/2019] [Indexed: 11/27/2022]
Abstract
OBJECTIVES Electric stimulation is used to treat a number of neurologic disorders such as epilepsy and depression. However, delivering the required current to far-field neural targets is often ineffective because of current spread through low-impedance pathways. Here, the specific aims are to develop an empirical measure for current passing through the human head and to optimize stimulation strategies for targeting deeper structures, including the auditory nerve, by utilizing the cochlear implant (CI). MATERIALS AND METHODS Outward input/output (I/O) functions were obtained by CI stimulation and recording scalp potentials in five CI subjects. Conversely, inward I/O functions were obtained by noninvasive transcranial electric stimulation (tES) and recording intracochlear potentials using the onboard recording capability of the CI. RESULTS I/O measures indicate substantial current spread, with a maximum of 2.2% gain recorded at the inner ear target during tES (mastoid-to-mastoid electrode configuration). Similarly, CI stimulation produced a maximum of 1.1% gain at the scalp electrode nearest the CI return electrode. Gain varied with electrode montage according to a point source model that accounted for distances between the stimulating and recording electrodes. Within the same electrode montages, current gain patterns varied across subjects suggesting the importance of tissue properties, geometry, and electrode positioning. CONCLUSION These results provide a novel objective measure of electric stimulation in the human head, which can help to optimize stimulation parameters that improve neural excitation of deep structures by reducing the influence of current spread.
Collapse
Affiliation(s)
- Phillip Tran
- Center for Hearing Research, University of California, Irvine, CA, USA
| | | | - Fan-Gang Zeng
- Center for Hearing Research, University of California, Irvine, CA, USA
| |
Collapse
|
20
|
Zhang F, Underwood G, McGuire K, Liang C, Moore DR, Fu QJ. Frequency change detection and speech perception in cochlear implant users. Hear Res 2019; 379:12-20. [PMID: 31035223 PMCID: PMC6571168 DOI: 10.1016/j.heares.2019.04.007] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/03/2018] [Revised: 03/21/2019] [Accepted: 04/15/2019] [Indexed: 10/27/2022]
Abstract
Dynamic frequency changes in sound provide critical cues for speech perception. Most previous studies examining frequency discrimination in cochlear implant (CI) users have employed behavioral tasks in which target and reference tones (differing in frequency) are presented statically in separate time intervals. Participants are required to identify the target frequency by comparing stimuli across these time intervals. However, perceiving dynamic frequency changes in speech requires detection of within-interval frequency change. This study explored the relationship between detection of within-interval frequency changes and speech perception performance of CI users. Frequency change detection thresholds (FCDTs) were measured in 20 adult CI users using a 3-alternative forced-choice (3AFC) procedure. Stimuli were 1-sec pure tones (base frequencies at 0.25, 1, 4 kHz) with frequency changes occurring 0.5 s after the tone onset. Speech tests were 1) Consonant-Nucleus-Consonant (CNC) monosyllabic word recognition, 2) Arizona Biomedical Sentence Recognition (AzBio) in Quiet, 3) AzBio in Noise (AzBio-N, +10 dB signal-to-noise/SNR ratio), and 4) Digits-in-noise (DIN). Participants' subjective satisfaction with the CI was obtained. Results showed that correlations between FCDTs and speech perception were all statistically significant. The satisfaction level of CI use was not related to FCDTs, after controlling for major demographic factors. DIN speech reception thresholds were significantly correlated to AzBio-N scores. The current findings suggest that the ability to detect within-interval frequency changes may play an important role in speech perception performance of CI users. FCDT and DIN can serve as simple and rapid tests that require no or minimal linguistic background for the prediction of CI speech outcomes.
Collapse
Affiliation(s)
- Fawen Zhang
- Department of Communication Sciences and Disorders, University of Cincinnati, Ohio, USA.
| | - Gabrielle Underwood
- Department of Communication Sciences and Disorders, University of Cincinnati, Ohio, USA
| | - Kelli McGuire
- Department of Communication Sciences and Disorders, University of Cincinnati, Ohio, USA
| | - Chun Liang
- Department of Communication Sciences and Disorders, University of Cincinnati, Ohio, USA; Shenzhen Maternity & Child Healthcare Hospital, Shenzhen, China
| | - David R Moore
- Communication Sciences Research Center, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, USA; Department of Otolaryngology, University of Cincinnati, Ohio, USA
| | - Qian-Jie Fu
- Department of Head and Neck Surgery, University of California, Los Angeles, Los Angeles, CA, USA
| |
Collapse
|
21
|
Gaudrain E, Başkent D. Discrimination of Voice Pitch and Vocal-Tract Length in Cochlear Implant Users. Ear Hear 2019; 39:226-237. [PMID: 28799983 PMCID: PMC5839701 DOI: 10.1097/aud.0000000000000480] [Citation(s) in RCA: 76] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2017] [Accepted: 06/29/2017] [Indexed: 12/02/2022]
Abstract
OBJECTIVES When listening to two competing speakers, normal-hearing (NH) listeners can take advantage of voice differences between the speakers. Users of cochlear implants (CIs) have difficulty in perceiving speech on speech. Previous literature has indicated sensitivity to voice pitch (related to fundamental frequency, F0) to be poor among implant users, while sensitivity to vocal-tract length (VTL; related to the height of the speaker and formant frequencies), the other principal voice characteristic, has not been directly investigated in CIs. A few recent studies evaluated F0 and VTL perception indirectly, through voice gender categorization, which relies on perception of both voice cues. These studies revealed that, contrary to prior literature, CI users seem to rely exclusively on F0 while not utilizing VTL to perform this task. The objective of the present study was to directly and systematically assess raw sensitivity to F0 and VTL differences in CI users to define the extent of the deficit in voice perception. DESIGN The just-noticeable differences (JNDs) for F0 and VTL were measured in 11 CI listeners using triplets of consonant-vowel syllables in an adaptive three-alternative forced choice method. RESULTS The results showed that while NH listeners had average JNDs of 1.95 and 1.73 semitones (st) for F0 and VTL, respectively, CI listeners showed JNDs of 9.19 and 7.19 st. These JNDs correspond to differences of 70% in F0 and 52% in VTL. For comparison to the natural range of voices in the population, the F0 JND in CIs remains smaller than the typical male-female F0 difference. However, the average VTL JND in CIs is about twice as large as the typical male-female VTL difference. CONCLUSIONS These findings, thus, directly confirm that CI listeners do not seem to have sufficient access to VTL cues, likely as a result of limited spectral resolution, and, hence, that CI listeners' voice perception deficit goes beyond poor perception of F0. These results provide a potential common explanation not only for a number of deficits observed in CI listeners, such as voice identification and gender categorization, but also for competing speech perception.
Collapse
Affiliation(s)
- Etienne Gaudrain
- University of Groningen, University Medical Center Groningen, Department of Otorhinolaryngology-Head and Neck Surgery, Groningen, The Netherlands; CNRS UMR 5292, Lyon Neuroscience Research Center, Auditory Cognition and Psychoacoustics, Université Lyon, Lyon, France; and Research School of Behavioral and Cognitive Neurosciences, Graduate School of Medical Sciences, University of Groningen, Groningen, The Netherlands
| | - Deniz Başkent
- University of Groningen, University Medical Center Groningen, Department of Otorhinolaryngology-Head and Neck Surgery, Groningen, The Netherlands; CNRS UMR 5292, Lyon Neuroscience Research Center, Auditory Cognition and Psychoacoustics, Université Lyon, Lyon, France; and Research School of Behavioral and Cognitive Neurosciences, Graduate School of Medical Sciences, University of Groningen, Groningen, The Netherlands
| |
Collapse
|
22
|
A Randomized Controlled Crossover Study of the Impact of Online Music Training on Pitch and Timbre Perception in Cochlear Implant Users. J Assoc Res Otolaryngol 2019; 20:247-262. [PMID: 30815761 DOI: 10.1007/s10162-018-00704-0] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2017] [Accepted: 10/17/2018] [Indexed: 10/27/2022] Open
Abstract
Cochlear implant (CI) biomechanical constraints result in impoverished spectral cues and poor frequency resolution, making it difficult for users to perceive pitch and timbre. There is emerging evidence that music training may improve CI-mediated music perception; however, much of the existing studies involve time-intensive and less readily accessible in-person music training paradigms, without rigorous experimental control paradigms. Online resources for auditory rehabilitation remain an untapped potential resource for CI users. Furthermore, establishing immediate value from an acute music training program may encourage CI users to adhere to post-implantation rehabilitation exercises. In this study, we evaluated the impact of an acute online music training program on pitch discrimination and timbre identification. Via a randomized controlled crossover study design, 20 CI users and 21 normal hearing (NH) adults were assigned to one of two arms. Arm-A underwent 1 month of online self-paced music training (intervention) followed by 1 month of audiobook listening (control). Arm-B underwent 1 month of audiobook listening followed by 1 month of music training. Pitch and timbre sensitivity scores were taken across three visits: (1) baseline, (2) after 1 month of intervention, and (3) after 1 month of control. We found that performance improved in pitch discrimination among CI users and NH listeners, with both online music training and audiobook listening. Music training, however, provided slightly greater benefit for instrument identification than audiobook listening. For both tasks, this improvement appears to be related to both fast stimulus learning as well as procedural learning. In conclusion, auditory training (with either acute participation in an online music training program or audiobook listening) may improve performance on untrained tasks of pitch discrimination and timbre identification. These findings demonstrate a potential role for music training in perceptual auditory appraisal of complex stimuli. Furthermore, this study highlights the importance and the need for more tightly controlled training studies in order to accurately evaluate the impact of rehabilitation training protocols on auditory processing.
Collapse
|
23
|
Zarei E, Sadjedi H, Farhadi M. Low variable rate stimulation strategy for cochlear implants using temporal cues and electrophysiological factors. Technol Health Care 2018; 26:879-888. [PMID: 29991148 DOI: 10.3233/thc-171154] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
BACKGROUND The performance of a cochlear implant (CI), especially in conveying pitch depends on its electrical stimulation strategy. OBJECTIVE The present study proposes a variable-rate stimulation algorithm which improves speech emotion perception by using temporal fine-structure cues and electrophysiological parameters of the patient. METHODS This method is based on the coding of the phase information at the peak time intervals of the band-passed signals. The stimulation pulse is generated at the time of peak occurrence, which is able to excite the number of fibers with a discharge probability above a threshold. Calculating the discharge probability is based on the excitable fiber model and taking into account the biological characteristics of the patient, such as the fiber threshold and the distribution of remaining intact fibers. RESULTS The results of the emotion detection test on selective reconstructed sentences from the Persian emotional speech database (Persian ESD) indicated that the listeners have been able to detect the emotion by an average of 83.82% using the proposed stimulation algorithm while it was 75% and 48.03% for the zero-crossing and the continuous interleaved sampling (CIS), respectively. Furthermore, the number of pulses compared to the zero-crossing and the CIS has decreased by 76.3% and 75.4%, respectively. CONCLUSIONS In this paper, a stimulation method was proposed for cochlear implants by considering the patient's biological parameters. It has been successful in transmitting speech emotion despite the reduction of stimulating pulses. This has some advantages such as reducing the interaction of current fields between electrodes during stimulation and reducing battery usage.
Collapse
Affiliation(s)
- Elham Zarei
- Department of Biomedical Engineering, Science and Research Branch, Islamic Azad University, Tehran, Iran
| | - Hamed Sadjedi
- Engineering Faculty, Shahed University, Tehran, Iran
| | - Mohammad Farhadi
- ENT and Head & Neck Research Center and Department, Hazrat Rasoul Akram Hospital, Iran University of Medical Sciences, Tehran, Iran
| |
Collapse
|
24
|
Factors Affecting Speech Reception in Background Noise with a Vocoder Implementation of the FAST Algorithm. J Assoc Res Otolaryngol 2018; 19:467-478. [PMID: 29744731 DOI: 10.1007/s10162-018-0672-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2017] [Accepted: 04/23/2018] [Indexed: 10/16/2022] Open
Abstract
Speech segregation in background noise remains a difficult task for individuals with hearing loss. Several signal processing strategies have been developed to improve the efficacy of hearing assistive technologies in complex listening environments. The present study measured speech reception thresholds in normal-hearing listeners attending to a vocoder based on the Fundamental Asynchronous Stimulus Timing algorithm (FAST: Smith et al. 2014), which triggers pulses based on the amplitudes of channel magnitudes in order to preserve envelope timing cues, with two different reconstruction bandwidths (narrowband and broadband) to control the degree of spectrotemporal resolution. Five types of background noise were used including same male talker, female talker, time-reversed male talker, time-reversed female talker, and speech-shaped noise to probe the contributions of different types of speech segregation cues and to elucidate how degradation affects speech reception across these conditions. Maskers were spatialized using head-related transfer functions in order to create co-located and spatially separated conditions. Results indicate that benefits arising from voicing and spatial cues can be preserved using the FAST algorithm but are reduced with a reduction in spectral resolution.
Collapse
|
25
|
Landsberger DM, Vermeire K, Claes A, Van Rompaey V, Van de Heyning P. Qualities of Single Electrode Stimulation as a Function of Rate and Place of Stimulation with a Cochlear Implant. Ear Hear 2018; 37:e149-59. [PMID: 26583480 DOI: 10.1097/aud.0000000000000250] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
OBJECTIVES Although it has been shown previously that changes in temporal coding produce changes in pitch in all cochlear regions, research has suggested that temporal coding might be best encoded in relatively apical locations. The authors hypothesized that although temporal coding may provide useable information at any cochlear location, low rates of stimulation might provide better sound quality in apical regions that are more likely to encode temporal information in the normal ear. In the present study, sound qualities of single electrode pulse trains were scaled to provide insight into the combined effects of cochlear location and stimulation rate on sound quality. DESIGN Ten long-term users of MED-EL cochlear implants with 31-mm electrode arrays (Standard or FLEX) were asked to scale the sound quality of single electrode pulse trains in terms of how "Clean," "Noisy," "High," and "Annoying" they sounded. Pulse trains were presented on most electrodes between 1 and 12 representing the entire range of the long electrode array at stimulation rates of 100, 150, 200, 400, or 1500 pulses per second. RESULTS Although high rates of stimulation are scaled as having a Clean sound quality across the entire array, only the most apical electrodes (typically 1 through 3) were considered Clean at low rates. Low rates on electrodes 6 through 12 were not rated as Clean, whereas the low-rate quality of electrodes 4 and 5 were typically in between. Scaling of Noisy responses provided an approximately inverse pattern as Clean responses. High responses show the trade-off between rate and place of stimulation on pitch. Because High responses did not correlate with Clean responses, subjects were not rating sound quality based on pitch. CONCLUSIONS If explicit temporal coding is to be provided in a cochlear implant, it is likely to sound better when provided apically. In addition, the finding that low rates sound clean only at apical places of stimulation is consistent with previous findings that a change in rate of stimulation corresponds to an equivalent change in perceived pitch at apical locations. Collectively, the data strongly suggest that temporal coding with a cochlear implant is optimally provided by electrodes placed well into the second cochlear turn.
Collapse
Affiliation(s)
- David M Landsberger
- 1Department of Otolaryngology, New York University School of Medicine, New York, New York, USA; 2Department of Otorhinolaryngology & Head and Neck Surgery, Antwerp University Hospital, Antwerp, Belgium; 3Hearing and Speech Center, Long Island Jewish Medical Center, New Hyde Park, New York, USA; and 4Faculty of Medicine and Health Sciences, University of Antwerp, Antwerp, Belgium
| | | | | | | | | |
Collapse
|
26
|
|
27
|
Lai WK, Dillier N, Killian M. A Neural Excitability Based Coding Strategy for Cochlear Implants. ACTA ACUST UNITED AC 2018. [DOI: 10.4236/jbise.2018.117014] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
28
|
Liu H, Peng X, Zhao Y, Ni X. The effectiveness of sound-processing strategies on tonal language cochlear implant users: A systematic review. Pediatr Investig 2017; 1:32-39. [PMID: 32851216 PMCID: PMC7331426 DOI: 10.1002/ped4.12011] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2017] [Accepted: 10/12/2017] [Indexed: 11/23/2022] Open
Abstract
IMPORTANCE Contemporary cochlear implants (CIs) are well established as a technology for people with severe-to-profound sensorineural hearing loss, with their effectiveness having been widely reported. However, for tonal language CI recipients, speech perception remains a challenge: Conventional signal processing strategies have been demonstrated to possibly provide insufficient information to encode tonal cues, and CI recipients have exhibited considerable deficits in tone perception. Thus, some tonal language-oriented sound-processing strategies have been introduced. The effects of available tonal language-oriented strategies on tone perception are reviewed and evaluated in this study. The results may aid in designing and improving tonal language-appropriate sound-processing strategies for CI recipients. OBJECTIVE The objective of this systematic review was to investigate the effects of tonal-language-oriented signal processing strategies on tone perception, music perception, word and sentence recognition. METHODS To evaluate the effects of tonal language-oriented strategies on tone perception, we conducted a systematic review. We searched for relevant reports dated from January 1979 to July 2017 using PubMed, Cochrane Library, EBSCO, Web of Science, EMBASE, and 4 Chinese periodical databases (CBMdisc, CNKI, VIP, and Wanfang Data). RESULTS According to our search strategy, 672 potentially eligible studies were retrieved from the databases, with 12 of these studies included in the final review after a 4-stage selection process. The majority of sound-processing strategies designed for tonal language were HiResolution® with Fidelity 120 (HiRes 120), fine structure processing, temporal fine structure (TFS), and C-tone. Generally, acute or short-term comparisons between the tonal language-oriented strategies and the conventional strategy did not reveal statistically significant differences in speech perception (or show a small improvement). However, a tendency toward improved tone perception and subjectively reported overall preferred sound quality was observed with the tonal language-oriented strategies. INTERPRETATION Conventional signal processing strategies typically provided very limited F0 information via temporal envelopes delivered to the stimulating electrodes. In contrast, tonal language-oriented coding strategies attempted to present more spectral information and TFS cues required for tone perception. Thus, a tendency of improved performance in tonal language perception in CI users was shown.
Collapse
Affiliation(s)
- Haihong Liu
- Beijing Key Laboratory for Pediatric Diseases of Otorhinolaryngology, Head and Neck SurgeryMinistry of Education (MOE) Key Laboratory of Major Diseases in ChildrenBeijing Pediatric Research InstituteBeijing Children's HospitalCapital Medical UniversityNational Center for Children's HealthBeijingChina
- Department of Otorhinolaryngology, Head and Neck SurgeryBeijing Children's HospitalCapital Medical UniversityNational Center for Children's HealthBeijingChina
| | - Xiaoxia Peng
- Center for Clinical Epidemiology and Evidence‐Based MedicineBeijing Children's HospitalCapital Medical UniversityNational Center for Children's HealthBeijingChina
| | - Yawen Zhao
- Beijing Key Laboratory for Pediatric Diseases of Otorhinolaryngology, Head and Neck SurgeryMinistry of Education (MOE) Key Laboratory of Major Diseases in ChildrenBeijing Pediatric Research InstituteBeijing Children's HospitalCapital Medical UniversityNational Center for Children's HealthBeijingChina
| | - Xin Ni
- Beijing Key Laboratory for Pediatric Diseases of Otorhinolaryngology, Head and Neck SurgeryMinistry of Education (MOE) Key Laboratory of Major Diseases in ChildrenBeijing Pediatric Research InstituteBeijing Children's HospitalCapital Medical UniversityNational Center for Children's HealthBeijingChina
- Department of Otorhinolaryngology, Head and Neck SurgeryBeijing Children's HospitalCapital Medical UniversityNational Center for Children's HealthBeijingChina
| |
Collapse
|
29
|
Speech onset enhancement improves intelligibility in adverse listening conditions for cochlear implant users. Hear Res 2016; 342:13-22. [DOI: 10.1016/j.heares.2016.09.002] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/02/2014] [Accepted: 09/07/2016] [Indexed: 11/17/2022]
|
30
|
Erfanian Saeedi N, Blamey PJ, Burkitt AN, Grayden DB. An integrated model of pitch perception incorporating place and temporal pitch codes with application to cochlear implant research. Hear Res 2016; 344:135-147. [PMID: 27845260 DOI: 10.1016/j.heares.2016.11.005] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/15/2016] [Revised: 11/04/2016] [Accepted: 11/08/2016] [Indexed: 11/19/2022]
Abstract
Although the neural mechanisms underlying pitch perception are not yet fully understood, there is general agreement that place and temporal representations of pitch are both used by the auditory system. This paper describes a neural network model of pitch perception that integrates both codes of pitch and explores the contributions of, and the interactions between, the two representations in simulated pitch ranking trials in normal and cochlear implant hearing. The model can replicate various psychophysical observations including the perception of the missing fundamental pitch and sensitivity to pitch interval sizes. As a case study, the model was used to investigate the efficiency of pitch perception cues in a novel sound processing scheme, Stimulation based on Auditory Modelling (SAM), that aims to improve pitch perception in cochlear implant hearing. Results showed that enhancement of the pitch perception cues would lead to better pitch ranking scores in the integrated model only if the place and temporal pitch cues were consistent.
Collapse
Affiliation(s)
- Nafise Erfanian Saeedi
- NeuroEngineering Laboratory, Dept. of Electrical & Electronic Engineering, University of Melbourne, Australia; Centre for Neural Engineering, University of Melbourne, Australia.
| | - Peter J Blamey
- The Bionics Institute, East Melbourne, Australia; Dept. of Medical Bionics, University of Melbourne, Australia
| | - Anthony N Burkitt
- NeuroEngineering Laboratory, Dept. of Electrical & Electronic Engineering, University of Melbourne, Australia; The Bionics Institute, East Melbourne, Australia
| | - David B Grayden
- NeuroEngineering Laboratory, Dept. of Electrical & Electronic Engineering, University of Melbourne, Australia; Centre for Neural Engineering, University of Melbourne, Australia; The Bionics Institute, East Melbourne, Australia
| |
Collapse
|
31
|
Zarei E, Sadjedi H. A new approach for speech synthesis in cochlear implant systems based on electrophysiological factors. Technol Health Care 2016; 25:221-235. [PMID: 27689564 DOI: 10.3233/thc-161265] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
BACKGROUND Speech synthesis models have been considered as viable tools for performance evaluation of cochlear stimulation algorithms, due to the difficulties of clinical tests. OBJECTIVE The present study has developed a tool that can be used before any audio signal reconstruction algorithm, which shows more conformity with the electrophysiological parameters of the patient in evaluation of the cochlear implant stimulation algorithms. METHODS In this method, excitable nerve fiber characteristics such as stimulation threshold and effective refractory period have been considered in the signal pre-reconstruction process. This algorithm subsumes the user's biological parameters (e.g., the manner of distribution of the remaining intact nerve fibers) as well as the stimulation signal parameters (e.g., stimulation rate, pulse width, amplitude of stimulation, the distance between stimulation electrode and fibers) in the signal pre-reconstruction. RESULTS Effect of changes in these parameters can be observed by the number of excited fibers, which is directly related to the signal intensity and pitch frequency perceived by the user. The obtained results from simulations are in accordance with previous clinical findings. Also, the ability of the proposed tool can be seen by the correspondence between the results obtained from the proposed model and the amplitude growth functions of the cochlear implant users. CONCLUSIONS This paper has introduced a tool for signal reconstruction from electrical stimulation so that a more comprehensive criterion for examination of the stimulating algorithms in cochlear implant can be achieved.
Collapse
Affiliation(s)
- Elham Zarei
- Department of Biomedical Engineering, Science and Research Branch, Islamic Azad University, Tehran, Iran
| | - Hamed Sadjedi
- Engineering Faculty, Shahed University, Tehran, Iran
| |
Collapse
|
32
|
Vandali AE, Dawson PW, Arora K. Results using the OPAL strategy in Mandarin speaking cochlear implant recipients. Int J Audiol 2016; 56:S74-S85. [PMID: 27329178 DOI: 10.1080/14992027.2016.1190872] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Abstract
OBJECTIVE To evaluate the effectiveness of an experimental pitch-coding strategy for improving recognition of Mandarin lexical tone in cochlear implant (CI) recipients. DESIGN Adult CI recipients were tested on recognition of Mandarin tones in quiet and speech-shaped noise at a signal-to-noise ratio of +10 dB; Mandarin sentence speech-reception threshold (SRT) in speech-shaped noise; and pitch discrimination of synthetic complex-harmonic tones in quiet. Two versions of the experimental strategy were examined: (OPAL) linear (1:1) mapping of fundamental frequency (F0) to the coded modulation rate; and (OPAL+) transposed mapping of high F0s to a lower coded rate. Outcomes were compared to results using the clinical ACE™ strategy. STUDY SAMPLE Five Mandarin speaking users of Nucleus® cochlear implants. RESULTS A small but significant benefit in recognition of lexical tones was observed using OPAL compared to ACE in noise, but not in quiet, and not for OPAL+ compared to ACE or OPAL in quiet or noise. Sentence SRTs were significantly better using OPAL+ and comparable using OPAL to those using ACE. No differences in pitch discrimination thresholds were observed across strategies. CONCLUSIONS OPAL can provide benefits to Mandarin lexical tone recognition in moderately noisy conditions and preserve perception of Mandarin sentences in challenging noise conditions.
Collapse
Affiliation(s)
- Andrew E Vandali
- a The Hearing CRC , Carlton , Victoria , Australia.,b The Bionics Institute , East Melbourne , Victoria , Australia , and
| | - Pam W Dawson
- a The Hearing CRC , Carlton , Victoria , Australia.,c Cochlear Ltd , East Melbourne , Victoria , Australia
| | - Komal Arora
- a The Hearing CRC , Carlton , Victoria , Australia.,c Cochlear Ltd , East Melbourne , Victoria , Australia
| |
Collapse
|
33
|
|
34
|
Carlyon RP, Deeks JM. Combined neural and behavioural measures of temporal pitch perception in cochlear implant users. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2015; 138:2885-2905. [PMID: 26627764 DOI: 10.1121/1.4934275] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Four experiments measured the perceptual and neural correlates of the temporal pattern of electrical stimulation applied to one cochlear-implant (CI) electrode, for several subjects. Neural effects were estimated from the electrically evoked compound action potential (ECAP) to each pulse. Experiment 1 attenuated every second pulse of a 200-pps pulse train. Increasing attenuation caused pitch to drop and the ECAP to become amplitude modulated, thereby providing an estimate of the relationship between neural modulation and pitch. Experiment 2 showed that the pitch of a 200-pps pulse train can be reduced by delaying every second pulse, so that the inter-pulse-intervals alternate between longer and shorter intervals. This caused the ECAP to become amplitude modulated, but not by enough to account for the change in pitch. Experiment 3 replicated the finding that rate discrimination deteriorates with increases in baseline rate. This was accompanied by an increase in ECAP modulation, but by an amount that produced only a small effect on pitch in experiment 1. Experiment 4 showed that preceding a pulse train with a carefully selected "pre-pulse" could reduce ECAP modulation, but did not improve rate discrimination. Implications for theories of pitch and for limitations of pitch perception in CI users are discussed.
Collapse
Affiliation(s)
- Robert P Carlyon
- MRC Cognition & Brain Sciences Unit, 15 Chaucer Road, Cambridge CB2 7EF, England
| | - John M Deeks
- MRC Cognition & Brain Sciences Unit, 15 Chaucer Road, Cambridge CB2 7EF, England
| |
Collapse
|
35
|
Fielden CA, Kluk K, Boyle PJ, McKay CM. The perception of complex pitch in cochlear implants: A comparison of monopolar and tripolar stimulation. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2015; 138:2524-36. [PMID: 26520335 DOI: 10.1121/1.4931910] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/17/2023]
Abstract
Cochlear implant listeners typically perform poorly in tasks of complex pitch perception (e.g., musical pitch and voice pitch). One explanation is that wide current spread during implant activation creates channel interactions that may interfere with perception of temporal fundamental frequency information contained in the amplitude modulations within channels. Current focusing using a tripolar mode of stimulation has been proposed as a way of reducing channel interactions, minimising spread of excitation and potentially improving place and temporal pitch cues. The present study evaluated the effect of mode in a group of cochlear implant listeners on a pitch ranking task using male and female singing voices separated by either a half or a quarter octave. Results were variable across participants, but on average, pitch ranking was at chance level when the pitches were a quarter octave apart and improved when the difference was a half octave. No advantage was observed for tripolar over monopolar mode at either pitch interval, suggesting that previously published psychophysical advantages for focused modes may not translate into improvements in complex pitch ranking. Evaluation of the spectral centroid of the stimulation pattern, plus a lack of significant difference between male and female voices, suggested that participants may have had difficulty in accessing temporal pitch cues in either mode.
Collapse
Affiliation(s)
- Claire A Fielden
- School of Psychological Sciences, University of Manchester, Ellen Wilkinson Building, Oxford Road, Manchester M13 9PL, United Kingdom
| | - Karolina Kluk
- School of Psychological Sciences, University of Manchester, Ellen Wilkinson Building, Oxford Road, Manchester M13 9PL, United Kingdom
| | - Patrick J Boyle
- Advanced Bionics UK Ltd., 2 Breaks House, Mill Court, Great Shelford, Cambridge, CB22 5LD, United Kingdom
| | - Colette M McKay
- School of Psychological Sciences, University of Manchester, Ellen Wilkinson Building, Oxford Road, Manchester M13 9PL, United Kingdom
| |
Collapse
|
36
|
Melodic pitch perception and lexical tone perception in Mandarin-speaking cochlear implant users. Ear Hear 2015; 36:102-10. [PMID: 25099401 DOI: 10.1097/aud.0000000000000086] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
OBJECTIVES To examine the relationship between lexical tone perception and melodic pitch perception in Mandarin-speaking cochlear implant (CI) users and to investigate the influence of previous acoustic hearing on CI users' speech and music perception. DESIGN Lexical tone perception and melodic contour identification (MCI) were measured in 21 prelingual and 11 postlingual young (aged 6-26 years) Mandarin-speaking CI users. Lexical tone recognition was measured for four tonal patterns: tone 1 (flat F0), tone 2 (rising F0), tone 3 (falling-rising F0), and tone 4 (falling F0). MCI was measured using nine five-note melodic patterns that contained changes in pitch contour, as well as different semitone spacing between notes. RESULTS Lexical tone recognition was generally good (overall mean = 81% correct), and there was no significant difference between subject groups. MCI performance was generally poor (mean = 23% correct). MCI performance was significantly better for postlingual (mean = 32% correct) than for prelingual CI participants (mean = 18% correct). After correcting for outliers, there was no significant correlation between lexical tone recognition and MCI performance for prelingual or postlingual CI participants. Age at deafness was significantly correlated with MCI performance only for postlingual participants. CI experience was significantly correlated with MCI performance for both prelingual and postlingual participants. Duration of deafness was significantly correlated with tone recognition only for prelingual participants. CONCLUSIONS Despite the prevalence of pitch cues in Mandarin, the present CI participants had great difficulty perceiving melodic pitch. The availability of amplitude and duration cues in lexical tones most likely compensated for the poor pitch perception observed with these CI listeners. Previous acoustic hearing experience seemed to benefit postlingual CI users' melodic pitch perception. Longer CI experience was associated with better MCI performance for both subject groups, suggesting that CI users' music perception may improve as they gain experience with their device.
Collapse
|
37
|
Zhu S, Wong LLN, Chen F, Chen Y. Consonant discrimination by Mandarin-speaking children with prelingual hearing impairment. Int J Pediatr Otorhinolaryngol 2015; 79:1354-61. [PMID: 26112665 DOI: 10.1016/j.ijporl.2015.06.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/10/2015] [Revised: 05/16/2015] [Accepted: 06/05/2015] [Indexed: 10/23/2022]
Abstract
OBJECTIVES Little is known about the consonant discrimination ability of Mandarin-speaking children with prelingual hearing impairment (HI) and fitted with hearing aids (HAs). The present study aimed to evaluate Mandarin consonant discrimination ability in children with HI, and explore the effects of unaided and aided hearing threshold, the age of first HA fitting and the duration of HA use on consonant discrimination ability. METHODS Subjects were Mandarin-speaking children aged 5;4-12;6 years with profound HI (n=41), children aged 6;1-12;4 years with severe HI (n=26), and children aged 5;0-11;9 years with moderate HI (n=9). The Mandarin Consonant Discrimination Test was administered in six test conditions: -10, -5, 0, 5 and 10dB signal to noise ratios (S/Ns) and quiet. HAs were in the usual user's settings, adjusted to match the manufacturer prescribed settings and individual preferences, and the volume was set to comfortable listening level. RESULTS The results revealed that /p(h)/-/t(h)/, /ts/-/tʂ/ and /ʐ/-/l/ were the most difficult and /p/-/p(h)/, /t/-/t(h)/, /tɕ/-/tɕ(h)/ and /k/-/k(h)/ were the easiest consonant minimal pairs to discriminate in quiet both for children with profound HI and those with moderate to severe HI. In noise, no significant difference in performance was found among all consonant minimal pairs. A backward elimination stepwise multiple linear regressions revealed that unaided hearing level accounted for 25.4% of the variance in consonant discrimination performance in noise at 10dB S/N and 30.4% in quiet. However, aided hearing threshold, the age of first HA fitting and the duration of HA use did not significantly predict consonant discrimination ability both in quiet and in noise. CONCLUSIONS Consonant discrimination performance of children with profound HI was poorer than those with moderate to severe HI. The ability to discriminate consonant pairs seems to depend on age of acquisition of the consonants. Although the age of first HA fitting and the duration of HA use were not correlated with consonant discrimination outcomes, this finding does not preclude the importance of early HA fitting.
Collapse
Affiliation(s)
- Shufeng Zhu
- Department of Electrical and Electronic Engineering, South University of Science and Technology of China, Shenzhen, China; Division of Speech and Hearing Sciences, The University of Hong Kong, Hong Kong.
| | - Lena L N Wong
- Division of Speech and Hearing Sciences, The University of Hong Kong, Hong Kong
| | - Fei Chen
- Department of Electrical and Electronic Engineering, South University of Science and Technology of China, Shenzhen, China; Division of Speech and Hearing Sciences, The University of Hong Kong, Hong Kong
| | - Yuan Chen
- Division of Speech and Hearing Sciences, The University of Hong Kong, Hong Kong
| |
Collapse
|
38
|
Francart T, Osses A, Wouters J. Speech perception with F0mod, a cochlear implant pitch coding strategy. Int J Audiol 2015; 54:424-32. [DOI: 10.3109/14992027.2014.989455] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
|
39
|
Coelho AC, Brasolotto AG, Bevilacqua MC. An initial study of voice characteristics of children using two different sound coding strategies in comparison to normal hearing children. Int J Audiol 2015; 54:417-23. [PMID: 25634776 DOI: 10.3109/14992027.2014.998784] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
OBJECTIVE To compare some perceptual and acoustic characteristics of the voices of children who use the advanced combination encoder (ACE) or fine structure processing (FSP) speech coding strategies, and to investigate whether these characteristics differ from children with normal hearing. DESIGN Acoustic analysis of the sustained vowel /a/ was performed using the multi-dimensional voice program (MDVP). Analyses of sequential and spontaneous speech were performed using the real time pitch. Perceptual analyses of these samples were performed using visual-analogic scales of pre-selected parameters. STUDY SAMPLE Seventy-six children from three years to five years and 11 months of age participated. Twenty-eight were users of ACE, 23 were users of FSP, and 25 were children with normal hearing. RESULTS Although both groups with CI presented with some deviated vocal features, the users of ACE presented with voice quality more like children with normal hearing than the users of FSP. CONCLUSIONS Sound processing of ACE appeared to provide better conditions for auditory monitoring of the voice, and consequently, for better control of the voice production. However, these findings need to be further investigated due to the lack of comparative studies published to understand exactly which attributes of sound processing are responsible for differences in performance.
Collapse
Affiliation(s)
- Ana Cristina Coelho
- * Speech-Language Pathology and Audiology Graduate Program - University of São Paulo , Bauru, SP , Brazil
| | | | | |
Collapse
|
40
|
Tone and sentence perception in young Mandarin-speaking children with cochlear implants. Int J Pediatr Otorhinolaryngol 2014; 78:1923-30. [PMID: 25213422 DOI: 10.1016/j.ijporl.2014.08.025] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/29/2014] [Revised: 08/16/2014] [Accepted: 08/18/2014] [Indexed: 11/24/2022]
Abstract
OBJECTIVES The purpose of this study was to examine the outcomes of cochlear implantation in young children in terms of (1) perception of lexical tones in quiet, (2) perception of sentences in quiet and in noise, (3) the effects of five demographic variables (i.e., preoperative hearing level, age at implantation, duration of cochlear implants use, maternal educational level, and whether a child underwent a hearing aid trial before implantation) on lexical tone perception and sentence perception, and (4) the relationship between lexical tone perception and sentence perception. METHODS 96 participants, aged from 2.41 years to 7.09 years, were recruited in mainland China. The children exhibited normal cognitive abilities and received unilateral implants at an average age of 2.72 years, with ages ranging from 0.69 to 5 years of age. RESULTS The mean score for tone identification was 77% (SD=13%; chance level=50%). Tone 2/tone 3 was the most difficult tone contrast to identify. Children with a longer duration of CI use and whose mothers had more years of education tended to perform better in sentence perception in quiet and in noise. Having undergone a hearing aid trial before implantation and more residual hearing were additional factors contributing to better sentence perception in noise. The only demographical variable that related to tone perception in quiet was duration of CI. In addition, while there was a modest correlation between tone perception and sentence perception in quiet (rs=0.47, p<0.001), the correlation between tone perception in quiet and sentence perception in noise was much weaker (rs=-0.28, p<0.05). CONCLUSIONS The findings suggested that most young children who had been implanted before 5 years of age and had 1-3 years of implant use did not catch up with their aged peers with normal hearing in tone perception and sentence perception. The weak to moderate correlation between tone perception in quiet and sentence perception might imply that the improvement of tone perception in quiet may not necessarily contribute to sentence perception, especially in noise condition.
Collapse
|
41
|
Erfanian Saeedi N, Blamey PJ, Burkitt AN, Grayden DB. Application of a pitch perception model to investigate the effect of stimulation field spread on the pitch ranking abilities of cochlear implant recipients. Hear Res 2014; 316:129-37. [DOI: 10.1016/j.heares.2014.08.006] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/05/2014] [Revised: 08/14/2014] [Accepted: 08/19/2014] [Indexed: 10/24/2022]
|
42
|
Francart T, Lenssen A, Wouters J. Modulation enhancement in the electrical signal improves perception of interaural time differences with bimodal stimulation. J Assoc Res Otolaryngol 2014; 15:633-47. [PMID: 24890714 DOI: 10.1007/s10162-014-0457-9] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2013] [Accepted: 04/21/2014] [Indexed: 10/25/2022] Open
Abstract
Interaural timing cues are important for sound source localization and for binaural unmasking of speech that is spatially separated from interfering sounds. Users of a cochlear implant (CI) with residual hearing in the non-implanted ear (bimodal listeners) can only make very limited use of interaural timing cues with their clinical devices. Previous studies showed that bimodal listeners can be sensitive to interaural time differences (ITDs) for simple single- and three-channel stimuli. The modulation enhancement strategy (MEnS) was developed to improve the ITD perception of bimodal listeners. It enhances temporal modulations on all stimulated electrodes, synchronously with modulations in the acoustic signal presented to the non-implanted ear, based on measurement of the amplitude peaks occurring at the rate of the fundamental frequency in voiced phonemes. In the first experiment, ITD detection thresholds were measured using the method of constant stimuli for five bimodal listeners for an artificial vowel, processed with either the advanced combination encoder (ACE) strategy or with MEnS. With MEnS, detection thresholds were significantly lower, and for four subjects well within the physically relevant range. In the second experiment, the extent of lateralization was measured in three subjects with both strategies, and ITD sensitivity was determined using an adaptive procedure. All subjects could lateralize sounds based on ITD and sensitivity was significantly better with MEnS than with ACE. The current results indicate that ITD cues can be provided to bimodal listeners with modified sound processing.
Collapse
Affiliation(s)
- Tom Francart
- KU Leuven, Department of Neurosciences, ExpORL, O&N2, Herestraat 49 Bus 721, 3000, Leuven, Belgium,
| | | | | |
Collapse
|
43
|
Francart T, Innes-Brown H, McDermott HJ, McKay CM. Loudness of time-varying stimuli with electric stimulation. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2014; 135:3513-3519. [PMID: 24907814 DOI: 10.1121/1.4874597] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
McKay, Henshall, Farrell, and McDermott [J. Acoust. Soc. Am. 113, 2054-2063 (2003)] developed a practical method to estimate the loudness of periodic electrical signals presented through a cochlear implant. In the present work, this method was extended to time-varying sounds based on two models of time-varying loudness for normal listeners. To fit the model parameters, loudness balancing data was collected with six cochlear implant listeners. The pulse rate of a modulated pulse train was adjusted to equalize its loudness to a reference stimulus. The stimuli were single-electrode time-limited pulse bursts, repeated at a rate of 50 Hz, with on-times varying between 2 and 20 ms. The parameters of two different models of time-varying loudness were fitted to the results. For each model, parameters defining the time windows over which the electrical pulses contribute to instantaneous loudness were optimized. In each case, a good fit was obtained with the loudness balancing results. Therefore, the practical method was successfully extended to time-varying sounds by combining it with existing models of time-varying loudness for acoustic stimulation.
Collapse
Affiliation(s)
- Tom Francart
- ExpORL, Department of Neurosciences, KU Leuven, Herestraat 49 bus 721, B-3000 Leuven, Belgium
| | - Hamish Innes-Brown
- The Bionics Institute, 384-388 Albert Street, East Melbourne, Victoria 3002, Australia
| | - Hugh J McDermott
- The Bionics Institute, 384-388 Albert Street, East Melbourne, Victoria 3002, Australia
| | - Colette M McKay
- The Bionics Institute, 384-388 Albert Street, East Melbourne, Victoria 3002, Australia
| |
Collapse
|
44
|
Fielden CA, Kluk K, McKay CM. Interpulse interval discrimination within and across channels: comparison of monopolar and tripolar mode of stimulation. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2014; 135:2913-2922. [PMID: 24815271 DOI: 10.1121/1.4869687] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
Perception of temporal patterns is crucial to speech understanding and music perception in normal hearing, and is fundamental in the design and implementation of processing strategies for cochlear implants. Two experiments described here investigated the effect of stimulation mode (monopolar versus tripolar) on interpulse interval discrimination using single-electrode stimulation (experiment 1) and dual-electrode stimulation (experiment 2). Experiment 1 required participants to discriminate stimuli containing different interpulse intervals and experiment 2 required listeners to discriminate between two dual-electrode stimuli that had the same temporal pattern on each electrode, but differed in inter-electrode timing. The hypotheses were that (i) stimulation mode would affect the ability to distinguish interpulse interval patterns on a single electrode and (ii) the electrode separation range in which subjects were sensitive to inter-electrode timing would be more restricted in tripolar than in monopolar stimulation. Results in nine cochlear implant users showed that mode did not have a significant mean effect on either the ability to discriminate interpulse intervals in single-electrode stimulation or the range of electrode separation in dual-electrode stimulation in which participants were sensitive to inter-electrode timing. In conclusion, tripolar stimulation did not show any advantage in delivering temporal information within or across channels in this group.
Collapse
Affiliation(s)
- Claire A Fielden
- School of Psychological Sciences, Ellen Wilkinson Building, University of Manchester, Oxford Road, Manchester M13 9PL, United Kingdom
| | - Karolina Kluk
- School of Psychological Sciences, Ellen Wilkinson Building, University of Manchester, Oxford Road, Manchester M13 9PL, United Kingdom
| | - Colette M McKay
- School of Psychological Sciences, Ellen Wilkinson Building, University of Manchester, Oxford Road, Manchester M13 9PL, United Kingdom
| |
Collapse
|
45
|
Lee T, Yu S, Yuan M, Wong TKC, Kong YY. The effect of enhancing temporal periodicity cues on Cantonese tone recognition by cochlear implantees. Int J Audiol 2014; 53:546-57. [PMID: 24694089 DOI: 10.3109/14992027.2014.893374] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
OBJECTIVES This study investigates the efficacy of a cochlear implant (CI) processing method that enhances temporal periodicity cues of speech. DESIGN Subjects participated in word and tone identification tasks. Two processing conditions - the conventional advanced combination encoder (ACE) and tone-enhanced ACE were tested. Test materials were Cantonese disyllabic words recorded from one male and one female speaker. Speech-shaped noise was added to clean speech. The fundamental frequency information for periodicity enhancement was extracted from the clean speech. Electrical stimuli generated from the noisy speech with and without periodicity enhancement were presented via direct stimulation using a Laura 34 research processor. Subjects were asked to identify the presented word. STUDY SAMPLE Seven post-lingually deafened native Cantonese-speaking CI users. RESULTS Percent correct word, segmental structure, and tone identification scores were calculated. While word and segmental structure identification accuracy remained similar between the two processing conditions, tone identification in noise was better with tone-enhanced ACE than with conventional ACE. Significant improvement on tone perception was found only for the female voice. CONCLUSIONS Temporal periodicity cues are important to tone perception in noise. Pitch and tone perception by CI users could be improved when listeners received enhanced temporal periodicity cues.
Collapse
Affiliation(s)
- Tan Lee
- * Department of Electronic Engineering, The Chinese University of Hong Kong , China
| | | | | | | | | |
Collapse
|
46
|
Chen F, Wong LLN, Hu Y. Effects of lexical tone contour on Mandarin sentence intelligibility. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2014; 57:338-345. [PMID: 24023383 DOI: 10.1044/1092-4388(2013/12-0324)] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]
Abstract
PURPOSE This study examined the effects of lexical tone contour on the intelligibility of Mandarin sentences in quiet and in noise. METHOD A text-to-speech synthesis engine was used to synthesize Mandarin sentences with each word carrying the original lexical tone, flat tone, or a tone randomly selected from the 4 Mandarin lexical tones. The synthesized speech signals were presented to 11 normal-hearing listeners for recognition in quiet and in speech-shaped noise at 0 dB signal-to-noise ratio. RESULTS Normal-hearing listeners nearly perfectly recognized the Mandarin sentences produced with modified tone contours in quiet; however, performance declined substantially in noise. CONCLUSIONS Consistent with previous findings to some extent, the present findings suggest that lexical tones are relatively redundant cues for Mandarin sentence intelligibility in quiet and that other cues could compensate for the distorted lexical tone contour. However, in noise, the results provide direct evidence that lexical tone contour is important for the recognition of Mandarin sentences.
Collapse
|
47
|
Yamaguchi CT, Goffi-Gomez MVS. Prevalence of contralateral hearing aid use in adults with cochlear implants. Int Arch Otorhinolaryngol 2013; 17:370-4. [PMID: 25992038 PMCID: PMC4399180 DOI: 10.1055/s-0033-1352501] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2012] [Accepted: 06/06/2013] [Indexed: 11/20/2022] Open
Abstract
Introduction The exclusive use of a cochlear implant (CI) in one ear allows patients to effectively hear speech in a quiet environment. However, in environments with competing noise, the processing of multiple sounds becomes complex. In an attempt to promote binaural hearing in a noninvasive manner, the use of a hearing aid in the nonimplanted ear is suggested for patients with a unilateral CI. Aims To identify the prevalence of hearing aid use in the contralateral ear in adults who already have a CI; to determine the reasons why some patients do not use contralateral hearing aids (CHAs); and to analyze the effects of residual hearing in CHA users. Materials and Methods This is a clinical study in 82 adult patients with CI implants who responded to a questionnaire designed to determine current use of CHA. Results In our patient sample, 70 CHA nonusers were identified. The prevalence of CHA users was determined to be 12% with a 95% confidence interval of 11 to 13%. About 58.2% of the CHA nonusers reported a lack of noticeable benefit even after wearing hearing aids, and 23.6% reported not having received the option to use a CHA. CHA users had a pure tone average of 107-dB hearing level, whereas CHA nonusers had a pure tone average of 117-dB hearing level. Conclusion The prevalence of the use of a CHA is low in our study. We attribute the low use of a CHA to either a lack of residual hearing or to a lack of benefit from the amplification.
Collapse
Affiliation(s)
- Cintia Tizue Yamaguchi
- Division of Cochlear Implants, Department of Audiology, Clinic Hospital of the Medicine High School of the University of São Paulo (HCFMUSP), São Paulo/SP, Brazil
| | - Maria Valéria Schmidt Goffi-Gomez
- Division of Cochlear Implants, Department of Audiology, Clinic Hospital of the Medicine High School of the University of São Paulo (HCFMUSP), São Paulo/SP, Brazil
| |
Collapse
|
48
|
Massida Z, Marx M, Belin P, James C, Fraysse B, Barone P, Deguine O. Gender categorization in cochlear implant users. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2013; 56:1389-1401. [PMID: 24023381 DOI: 10.1044/1092-4388(2013/12-0132)] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]
Abstract
PURPOSE In this study, the authors examined the ability of subjects with cochlear implants (CIs) to discriminate voice gender and how this ability evolved as a function of CI experience. METHOD The authors presented a continuum of voice samples created by voice morphing, with 9 intermediate acoustic parameter steps between a typical male and a typical female. This method allowed for the evaluation of gender categorization not only when acoustical features were specific to gender but also for more ambiguous cases, when fundamental frequency or formant distribution were located between typical values. RESULTS Results showed a global, though variable, deficit for voice gender categorization in CI recipients compared with subjects with normal hearing. This deficit was stronger for ambiguous stimuli in the voice continuum: Average performance scores for CI users were 58% lower than average scores for subjects with normal hearing in cases of ambiguous stimuli and 19% lower for typical male and female voices. The authors found no significant improvement in voice gender categorization with CI experience. CONCLUSIONS These results emphasize the dissociation between recovery of speech recognition and voice feature perception after cochlear implantation. This large and durable deficit may be related to spectral and temporal degradation induced by CI sound coding, or it may be related to central voice processing deficits.
Collapse
|
49
|
McCreery D, Han M, Pikov V, Yadav K, Pannu S. Encoding of the amplitude modulation of pulsatile electrical stimulation in the feline cochlear nucleus by neurons in the inferior colliculus; effects of stimulus pulse rate. J Neural Eng 2013; 10:056010. [PMID: 23928683 DOI: 10.1088/1741-2560/10/5/056010] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
OBJECTIVES Persons without a functional auditory nerve cannot benefit from cochlear implants, but some hearing can be restored by an auditory brainstem implant (ABI) with stimulating electrodes implanted on the surface of the cochlear nucleus (CN). Most users benefit from their ABI, but speech recognition tends to be poorer than for users of cochlear implants. Psychophysical studies suggest that poor modulation detection may contribute to the limited performance of ABI users. In a cat model, we determined how the pulse rate of the electrical stimulus applied within or on the CN affects temporal and rate encoding of amplitude modulation (AM) by neurons in the central nucleus of the inferior colliculus (ICC). APPROACH Stimulating microelectrodes were implanted chronically in and on the cats' CN, and multi-site recording microelectrodes were implanted chronically into the ICC. Encoding of AM pulse trains by neurons in the ICC was characterized as vector strength (VS), the synchrony of neural activity with the AM, and as the mean rate of neuronal action potentials (neuronal spike rate (NSR)). MAIN RESULTS For intranuclear microstimulation, encoding of AM as VS was up to 3 dB greater when stimulus pulse rate was increased from 250 to 500 pps, but only for neuronal units with low best acoustic frequencies, and when the electrical stimulation was modulated at low frequencies (10-20 Hz). For stimulation on the surface of the CN, VS was similar at 250 and 500 pps, and the dynamic range of the VS was reduced for pulse rates greater than 250 pps. Modulation depth was encoded strongly as VS when the maximum stimulus amplitude was held constant across a range of modulation depth. This 'constant maximum' protocol allows enhancement of modulation depth while preserving overall dynamic range. However, modulation depth was not encoded as strongly as NSR. SIGNIFICANCE The findings have implications for improved sound processors for present and future ABIs. The performance of ABIs may benefit from using pulse rates greater than those presently used in most ABIs, and by sound processing strategies that enhance the modulation depth of the electrical stimulus while preserving dynamic range.
Collapse
|
50
|
Desmond JM, Collins LM, Throckmorton CS. Using channel-specific statistical models to detect reverberation in cochlear implant stimuli. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2013; 134:1112-1120. [PMID: 23927111 PMCID: PMC3745505 DOI: 10.1121/1.4812273] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/06/2013] [Revised: 05/29/2013] [Accepted: 06/08/2013] [Indexed: 06/02/2023]
Abstract
Reverberation is especially detrimental for cochlear implant listeners; thus, mitigating its effects has the potential to provide significant improvements to cochlear implant communication. Efforts to model and correct for reverberation in acoustic listening scenarios can be quite complex, requiring estimation of the room transfer function and localization of the source and receiver. However, due to the limited resolution associated with cochlear implant stimulation, simpler processing for reverberation detection and mitigation may be possible for cochlear implants. This study models speech stimuli in a cochlear implant on a per-channel basis both in quiet and in reverberation, and assesses the efficacy of these models for detecting the presence of reverberation. This study was able to successfully detect reverberation in cochlear implant pulse trains, and the results appear to be robust to varying room conditions and cochlear implant stimulation parameters. Reverberant signals were detected 100% of the time for a long reverberation time of 1.2 s and 86% of the time for a shorter reverberation time of 0.5 s.
Collapse
Affiliation(s)
- Jill M Desmond
- Department of Electrical and Computer Engineering, Duke University, 130 Hudson Hall, P.O. Box 90291, Durham, North Carolina 27708-0291, USA
| | | | | |
Collapse
|