1
|
Wohlbauer DM, Dillier N. A Hundred Ways to Encode Sound Signals for Cochlear Implants. Annu Rev Biomed Eng 2025; 27:335-369. [PMID: 40310887 DOI: 10.1146/annurev-bioeng-102623-121249] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/03/2025]
Abstract
Cochlear implants are the most successful neural prostheses used to restore hearing in severe-to-profound hearing-impaired individuals. The field of cochlear implant coding investigates interdisciplinary approaches to translate acoustic signals into electrical pulses transmitted at the electrode-neuron interface, ranging from signal preprocessing algorithms, enhancement, and feature extraction methodologies to electric signal generation. In the last five decades, numerous coding strategies have been proposed clinically and experimentally. Initially developed to restore speech perception, increasing computational possibilities now allow coding of more complex signals, and new techniques to optimize the transmission of electrical signals are constantly gaining attention. This review provides insights into the history of multichannel coding and presents an extensive list of implemented strategies. The article briefly addresses each method and considers promising future directions of neural prostheses and possible signal processing, with the ultimate goal of providing a current big picture of the large field of cochlear implant coding.
Collapse
Affiliation(s)
- Dietmar M Wohlbauer
- Department of Otolaryngology, Head and Neck Surgery, Massachusetts Eye and Ear, Harvard Medical School, Boston, Massachusetts, USA;
| | - Norbert Dillier
- Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Zürich, University of Zürich, Zürich, Switzerland
| |
Collapse
|
2
|
Luo J, Wang R, Xu K, Chao X, Zheng Y, Hu F, Liu X, Vandali AE, Wang H, Xu L. Outcomes Using the Optimized Pitch and Language Strategy Versus the Advanced Combination Encoder Strategy in Mandarin-Speaking Cochlear Implant Recipients. Ear Hear 2025; 46:210-222. [PMID: 39104002 PMCID: PMC11637569 DOI: 10.1097/aud.0000000000001572] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Accepted: 07/02/2024] [Indexed: 08/07/2024]
Abstract
OBJECTIVES The experimental Optimized Pitch and Language (OPAL) strategy enhances coding of fundamental frequency (F0) information in the temporal envelope of electrical signals delivered to channels of a cochlear implant (CI). Previous studies with OPAL have explored performance on speech and lexical tone perception in Mandarin- and English-speaking CI recipients. However, it was not clear which cues to lexical tone (primary and/or secondary) were used by the Mandarin CI listeners. The primary aim of the present study was to investigate whether OPAL provides improved recognition of Mandarin lexical tones in both quiet and noisy environments compared with the Advanced Combination Encoder (ACE) strategy. A secondary aim was to investigate whether, and to what extent, removal of secondary (duration and intensity envelope) cues to lexical tone affected Mandarin tone perception. DESIGN Thirty-two CI recipients with an average age of 24 (range 7 to 57) years were enrolled in the study. All recipients had at least 1 year of experience using ACE. Each subject attended two testing sessions, the first to measure baseline performance, and the second to evaluate the effect of strategy after provision of some take-home experience using OPAL. A minimum take-home duration of approximately 4 weeks was prescribed in which subjects were requested to use OPAL as much as possible but were allowed to also use ACE when needed. The evaluation tests included recognition of Mandarin lexical tones in quiet and in noise (signal to noise ratio [SNR] +5 dB) using naturally produced tones and duration/intensity envelope normalized versions of the tones; Mandarin sentence in adaptive noise; Mandarin monosyllabic and disyllabic word in quiet; a subset of Speech, Spatial, and Qualities of hearing questionnaire (SSQ, speech hearing scale); and subjective preference for strategy in quiet and noise. RESULTS For both the natural and normalized lexical tone tests, mean scores for OPAL were significantly higher than ACE in quiet by 2.7 and 2.9%-points, respectively, and in noise by 7.4 and 7.2%-points, respectively. Monosyllabic word recognition in quiet using OPAL was significantly higher than ACE by approximately 7.5% points. Average SSQ ratings for OPAL were significantly higher than ACE by approximately 0.5 points on a 10-point scale. In quiet conditions, 14 subjects preferred OPAL, 7 expressed a preference for ACE, and 9 reported no preference. Compared with quiet, in noisy situations, there was a stronger preference for OPAL (19 recipients), a similar preference for ACE (7 recipients), while fewer expressed no preference. Average daily take-home use of ACE and OPAL was 4.9 and 7.1 hr, respectively. CONCLUSIONS For Mandarin-speaking CI recipients, OPAL provided significant improvements to lexical tone perception for natural and normalized tones in quiet and noise, monosyllabic word recognition in quiet, and subjective ratings of speech intelligibility. Subjects accessed both primary and secondary cues to lexical tone for perception in quiet and noise conditions. The benefits of lexical tone recognition were attributed to enhanced F0 rate cues encoded by OPAL, especially in a noisy environment. The OPAL strategy was well accepted by many of the Mandarin-speaking CI recipients.
Collapse
Affiliation(s)
- Jianfen Luo
- Department of Otolaryngology-Head and Neck Surgery, Shandong Provincial ENT Hospital, Shandong University, Jinan, Shandong, People’s Republic of China
- These authors are co-first authors
| | - Ruijie Wang
- Department of Otolaryngology-Head and Neck Surgery, Shandong Provincial ENT Hospital, Shandong University, Jinan, Shandong, People’s Republic of China
- These authors are co-first authors
| | - Kaifan Xu
- Department of Otolaryngology-Head and Neck Surgery, Shandong Provincial ENT Hospital, Shandong University, Jinan, Shandong, People’s Republic of China
| | - Xiuhua Chao
- Department of Otolaryngology-Head and Neck Surgery, Shandong Provincial ENT Hospital, Shandong University, Jinan, Shandong, People’s Republic of China
| | - Yi Zheng
- Cochlear Medical Device (Beijing) Co., Ltd, Beijing, China
| | - Fangxia Hu
- Department of Otolaryngology-Head and Neck Surgery, Shandong Provincial ENT Hospital, Shandong University, Jinan, Shandong, People’s Republic of China
| | - Xianqi Liu
- Department of Otolaryngology-Head and Neck Surgery, Shandong Provincial ENT Hospital, Shandong University, Jinan, Shandong, People’s Republic of China
| | | | - Haibo Wang
- Department of Otolaryngology-Head and Neck Surgery, Shandong Provincial ENT Hospital, Shandong University, Jinan, Shandong, People’s Republic of China
- These authors are co-corresponding authors
| | - Lei Xu
- Department of Otolaryngology-Head and Neck Surgery, Shandong Provincial ENT Hospital, Shandong University, Jinan, Shandong, People’s Republic of China
- These authors are co-corresponding authors
| |
Collapse
|
3
|
Wohlbauer DM, Lai WK, Dillier N. InterlACE Sound Coding for Unilateral and Bilateral Cochlear Implants. IEEE Trans Biomed Eng 2024; 71:904-915. [PMID: 37796675 DOI: 10.1109/tbme.2023.3322348] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/07/2023]
Abstract
OBJECTIVE Cochlear implant signal processing strategies define the rules of how acoustic signals are converted into electrical stimulation patterns. Technological and anatomical limitations, however, impose constraints on the signal transmission and the accurate excitation of the auditory nerve. Acoustic signals are degraded throughout cochlear implant processing, and electrical signal interactions at the electrode-neuron interface constrain spectral and temporal precision. In this work, we propose a novel InterlACE signal processing strategy to counteract the occurring limitations. METHODS By replacing the maxima selection of the Advanced Combination Encoder strategy with a method that defines spatially and temporally alternating channels, InterlACE can compensate for discarded signal content of the conventional processing. The strategy can be extended bilaterally by introducing synchronized timing and channel selection. InterlACE was explored unilaterally and bilaterally by assessing speech intelligibility and spectral resolution. Five experienced bilaterally implanted cochlear implant recipients participated in the Oldenburg Sentence Recognition Test in background noise and the spectral ripple discrimination task. RESULTS The introduced alternating channel selection methodology shows promising outcomes for speech intelligibility but could not indicate better spectral ripple discrimination. CONCLUSION InterlACE processing positively affects speech intelligibility, increases available unilateral and bilateral signal content, and may potentially counteract signal interactions at the electrode-neuron interface. SIGNIFICANCE This work shows how cochlear implant channel selection can be modified and extended bilaterally. The clinical impact of the modifications needs to be explored with a larger sample size.
Collapse
|
4
|
Donati E, Valle G. Neuromorphic hardware for somatosensory neuroprostheses. Nat Commun 2024; 15:556. [PMID: 38228580 PMCID: PMC10791662 DOI: 10.1038/s41467-024-44723-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2022] [Accepted: 01/03/2024] [Indexed: 01/18/2024] Open
Abstract
In individuals with sensory-motor impairments, missing limb functions can be restored using neuroprosthetic devices that directly interface with the nervous system. However, restoring the natural tactile experience through electrical neural stimulation requires complex encoding strategies. Indeed, they are presently limited in effectively conveying or restoring tactile sensations by bandwidth constraints. Neuromorphic technology, which mimics the natural behavior of neurons and synapses, holds promise for replicating the encoding of natural touch, potentially informing neurostimulation design. In this perspective, we propose that incorporating neuromorphic technologies into neuroprostheses could be an effective approach for developing more natural human-machine interfaces, potentially leading to advancements in device performance, acceptability, and embeddability. We also highlight ongoing challenges and the required actions to facilitate the future integration of these advanced technologies.
Collapse
Affiliation(s)
- Elisa Donati
- Institute of Neuroinformatics, University of Zurich and ETH Zurich, Zurich, Switzerland.
| | - Giacomo Valle
- Department of Organismal Biology and Anatomy, University of Chicago, Chicago, IL, USA.
| |
Collapse
|
5
|
Khurana L, Harczos T, Moser T, Jablonski L. En route to sound coding strategies for optical cochlear implants. iScience 2023; 26:107725. [PMID: 37720089 PMCID: PMC10502376 DOI: 10.1016/j.isci.2023.107725] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/19/2023] Open
Abstract
Hearing loss is the most common human sensory deficit. Severe-to-complete sensorineural hearing loss is often treated by electrical cochlear implants (eCIs) bypassing dysfunctional or lost hair cells by direct stimulation of the auditory nerve. The wide current spread from each intracochlear electrode array contact activates large sets of tonotopically organized neurons limiting spectral selectivity of sound coding. Despite many efforts, an increase in the number of independent eCI stimulation channels seems impossible to achieve. Light, which can be better confined in space than electric current may help optical cochlear implants (oCIs) to overcome eCI shortcomings. In this review, we present the current state of the optogenetic sound encoding. We highlight optical sound coding strategy development capitalizing on the optical stimulation that requires fine-grained, fast, and power-efficient real-time sound processing controlling dozens of microscale optical emitters as an emerging research area.
Collapse
Affiliation(s)
- Lakshay Khurana
- Institute for Auditory Neuroscience, University Medical Center Göttingen, Göttingen, Germany
- Auditory Neuroscience and Optogenetics Laboratory, German Primate Center, Göttingen, Germany
- Auditory Neuroscience and Synaptic Nanophysiology Group, Max-Planck-Institute for Multidisciplinary Sciences, Göttingen, Germany
- Junior Research Group “Computational Neuroscience and Neuroengineering”, Göttingen, Germany
- The Doctoral Program “Sensory and Motor Neuroscience”, Göttingen Graduate Center for Neurosciences, Biophysics, and Molecular Biosciences (GGNB), Göttingen, Germany
- InnerEarLab, University Medical Center Göttingen, Göttingen, Germany
| | - Tamas Harczos
- Institute for Auditory Neuroscience, University Medical Center Göttingen, Göttingen, Germany
- Auditory Neuroscience and Optogenetics Laboratory, German Primate Center, Göttingen, Germany
| | - Tobias Moser
- Institute for Auditory Neuroscience, University Medical Center Göttingen, Göttingen, Germany
- Auditory Neuroscience and Optogenetics Laboratory, German Primate Center, Göttingen, Germany
- Auditory Neuroscience and Synaptic Nanophysiology Group, Max-Planck-Institute for Multidisciplinary Sciences, Göttingen, Germany
- InnerEarLab, University Medical Center Göttingen, Göttingen, Germany
- Cluster of Excellence “Multiscale Bioimaging: from Molecular Machines to Networks of Excitable Cells” (MBExC), University of Göttingen, Göttingen, Germany
| | - Lukasz Jablonski
- Institute for Auditory Neuroscience, University Medical Center Göttingen, Göttingen, Germany
- Auditory Neuroscience and Optogenetics Laboratory, German Primate Center, Göttingen, Germany
- Junior Research Group “Computational Neuroscience and Neuroengineering”, Göttingen, Germany
- InnerEarLab, University Medical Center Göttingen, Göttingen, Germany
| |
Collapse
|
6
|
Dambon J, Mewes A, Beyer A, Dambon J, Ambrosch P, Hey M. Facilitation properties in electrically evoked compound action potentials depending on spatial location and on threshold. Hear Res 2023; 438:108858. [PMID: 37556897 DOI: 10.1016/j.heares.2023.108858] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/07/2023] [Revised: 07/12/2023] [Accepted: 07/28/2023] [Indexed: 08/11/2023]
Abstract
Spiral ganglion neurons (SGNs) facilitation properties can be recorded utilizing electrically evoked compound action potential (ECAP). While intracochlear variation of the ECAP threshold in relation to its electrode channel is reported, no study investigated its impact on facilitation. In this study, we quantified intracochlear variation of the facilitation properties in cochlear implants (CI) using ECAPs. We hypothesized that the facilitation effect is dependent on the electrode channel and its ECAP threshold. Therefore, ECAPs were recorded in 23 CI subjects. For each subject, five default (channel-derived) and up to two additional (threshold-derived) stimulation sites were defined. Facilitation was quantified by the paradigm introduced by (Hey et al., 2017) with optimized parameter settings. For each channel the maximum facilitated amplitude was determined by a series of ECAP measurements. A linear mixed-effects model was used to investigate the impact of the electrode channel and ECAP threshold on the maximum facilitated amplitude. The maximum facilitated amplitude was found to be dependent on the ECAP threshold and independent on the electrode channel. We conclude that the facilitation paradigm is a useful and feasible tool to gain local information on the SGNs temporal processing patterns.
Collapse
Affiliation(s)
- Jan Dambon
- Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Schleswig-Holstein, Christian-Albrechts-University Kiel, Germany.
| | - Alexander Mewes
- Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Schleswig-Holstein, Christian-Albrechts-University Kiel, Germany
| | - Annika Beyer
- Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Schleswig-Holstein, Christian-Albrechts-University Kiel, Germany
| | - Jakob Dambon
- Swiss Re, Zurich, Switzerland; Department of Mathematics, Swiss Federal Institute of Technology Zurich, Switzerland; School of Business, Lucerne University of Applied Sciences and Arts, Lucerne, Switzerland
| | - Petra Ambrosch
- Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Schleswig-Holstein, Christian-Albrechts-University Kiel, Germany
| | - Matthias Hey
- Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Schleswig-Holstein, Christian-Albrechts-University Kiel, Germany
| |
Collapse
|
7
|
Saba JN, Ali H, Hansen JHL. The effects of estimation accuracy, estimation approach, and number of selected channels using formant-priority channel selection for an "n-of-m" sound processing strategy for cochlear implants. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2023; 153:3100. [PMID: 37227411 PMCID: PMC10219683 DOI: 10.1121/10.0019416] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/22/2022] [Revised: 04/16/2023] [Accepted: 04/28/2023] [Indexed: 05/26/2023]
Abstract
Previously, selection of l channels was prioritized according to formant frequency locations in an l-of-n-of-m-based signal processing strategy to provide important voicing information independent of listening environments for cochlear implant (CI) users. In this study, ideal, or ground truth, formants were incorporated into the selection stage to determine the effect of accuracy on (1) subjective speech intelligibility, (2) objective channel selection patterns, and (3) objective stimulation patterns (current). An average +11% improvement (p < 0.05) was observed across six CI users in quiet, but not for noise or reverberation conditions. Analogous increases in channel selection and current for the upper range of F1 and a decrease across mid-frequencies with higher corresponding current, were both observed at the expense of noise-dominant channels. Objective channel selection patterns were analyzed a second time to determine the effects of estimation approach and number of selected channels (n). A significant effect of estimation approach was only observed in the noise and reverberation condition with minor differences in channel selection and significantly decreased stimulated current. Results suggest that estimation method, accuracy, and number of channels in the proposed strategy using ideal formants may improve intelligibility when corresponding stimulated current of formant channels are not masked by noise-dominant channels.
Collapse
Affiliation(s)
- Juliana N Saba
- University of Texas at Dallas, Center for Robust Speech Systems, Cochlear Implant Laboratory, 800 W. Campbell Rd, EC 33, Richardson, Texas 75080, USA
| | - Hussnain Ali
- University of Texas at Dallas, Center for Robust Speech Systems, Cochlear Implant Laboratory, 800 W. Campbell Rd, EC 33, Richardson, Texas 75080, USA
| | - John H L Hansen
- University of Texas at Dallas, Center for Robust Speech Systems, Cochlear Implant Laboratory, 800 W. Campbell Rd, EC 33, Richardson, Texas 75080, USA
| |
Collapse
|
8
|
Zoneff ER, Gao DX, Nisbet DR, Grayden DB, Clark GM. Restoration of the senses and human communication: Sustainable Development Goals 3 and 9. INTERNATIONAL JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2023; 25:9-14. [PMID: 36476000 DOI: 10.1080/17549507.2022.2142290] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]
Abstract
PURPOSE This invited commentary addresses the importance of the senses in human communication, outlines advances achieved with cochlear implants, and new research directions to improve neural prostheses. RESULT In severely deaf people, cochlear implants restore speech understanding and enable children to achieve spoken language. Research in neural prostheses is advancing the restoration of hearing, vision, tactile senses, movement and the management of epilepsy. Bio-inspired stimulation strategies incorporating temporal and spatial characteristics of neural responses may deliver improved speech, vision and tactile perception using prostheses. To achieve stable long-term stimulation, chronic inflammation at the brain-electrode interface may be reduced using ROCK/Rho signalling pathway inhibitors and materials with brain-mimicking properties. CONCLUSION This commentary paper addresses two Sustainable Development Goals: industry, innovation and infrastructure (SDG 9) and good health and well-being (SDG 3).
Collapse
Affiliation(s)
- Elizabeth R Zoneff
- The Graeme Clark Institute, The University of Melbourne, Parkville, Australia
- Department of Biomedical Engineering, The University of Melbourne, Parkville, Australia
| | - Demi X Gao
- The Graeme Clark Institute, The University of Melbourne, Parkville, Australia
- Department of Biomedical Engineering, The University of Melbourne, Parkville, Australia
| | - David R Nisbet
- The Graeme Clark Institute, The University of Melbourne, Parkville, Australia
- Department of Biomedical Engineering, The University of Melbourne, Parkville, Australia
- Melbourne Medical School, The University of Melbourne, Parkville, Australia
| | - David B Grayden
- The Graeme Clark Institute, The University of Melbourne, Parkville, Australia
- Department of Biomedical Engineering, The University of Melbourne, Parkville, Australia
- Melbourne Medical School, The University of Melbourne, Parkville, Australia
| | - Graeme M Clark
- The Graeme Clark Institute, The University of Melbourne, Parkville, Australia
- Department of Biomedical Engineering, The University of Melbourne, Parkville, Australia
- Melbourne Medical School, The University of Melbourne, Parkville, Australia
| |
Collapse
|
9
|
Zheng Z, Li K, Feng G, Guo Y, Li Y, Xiao L, Liu C, He S, Zhang Z, Qian D, Feng Y. Relative Weights of Temporal Envelope Cues in Different Frequency Regions for Mandarin Vowel, Consonant, and Lexical Tone Recognition. Front Neurosci 2021; 15:744959. [PMID: 34924928 PMCID: PMC8678109 DOI: 10.3389/fnins.2021.744959] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2021] [Accepted: 11/15/2021] [Indexed: 12/04/2022] Open
Abstract
Objectives: Mandarin-speaking users of cochlear implants (CI) perform poorer than their English counterpart. This may be because present CI speech coding schemes are largely based on English. This study aims to evaluate the relative contributions of temporal envelope (E) cues to Mandarin phoneme (including vowel, and consonant) and lexical tone recognition to provide information for speech coding schemes specific to Mandarin. Design: Eleven normal hearing subjects were studied using acoustic temporal E cues that were extracted from 30 continuous frequency bands between 80 and 7,562 Hz using the Hilbert transform and divided into five frequency regions. Percent-correct recognition scores were obtained with acoustic E cues presented in three, four, and five frequency regions and their relative weights calculated using the least-square approach. Results: For stimuli with three, four, and five frequency regions, percent-correct scores for vowel recognition using E cues were 50.43–84.82%, 76.27–95.24%, and 96.58%, respectively; for consonant recognition 35.49–63.77%, 67.75–78.87%, and 87.87%; for lexical tone recognition 60.80–97.15%, 73.16–96.87%, and 96.73%. For frequency region 1 to frequency region 5, the mean weights in vowel recognition were 0.17, 0.31, 0.22, 0.18, and 0.12, respectively; in consonant recognition 0.10, 0.16, 0.18, 0.23, and 0.33; in lexical tone recognition 0.38, 0.18, 0.14, 0.16, and 0.14. Conclusion: Regions that contributed most for vowel recognition was Region 2 (502–1,022 Hz) that contains first formant (F1) information; Region 5 (3,856–7,562 Hz) contributed most to consonant recognition; Region 1 (80–502 Hz) that contains fundamental frequency (F0) information contributed most to lexical tone recognition.
Collapse
Affiliation(s)
- Zhong Zheng
- Department of Otolaryngology-Head and Neck Surgery, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai, China.,Shanghai Key Laboratory of Sleep Disordered Breathing, Shanghai, China
| | - Keyi Li
- Sydney Institute of Language and Commerce, Shanghai University, Shanghai, China
| | - Gang Feng
- Department of Graduate, The First Affiliated Hospital of Jinzhou Medical University, Jinzhou, China
| | - Yang Guo
- Ear, Nose, and Throat Institute and Otorhinolaryngology Department, Eye and ENT Hospital of Fudan University, Shanghai, China
| | - Yinan Li
- Department of Otolaryngology-Head and Neck Surgery, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai, China.,Shanghai Key Laboratory of Sleep Disordered Breathing, Shanghai, China
| | - Lili Xiao
- Department of Otolaryngology-Head and Neck Surgery, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai, China.,Shanghai Key Laboratory of Sleep Disordered Breathing, Shanghai, China
| | - Chengqi Liu
- Department of Otolaryngology-Head and Neck Surgery, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai, China.,Shanghai Key Laboratory of Sleep Disordered Breathing, Shanghai, China
| | - Shouhuan He
- Department of Otolaryngology, Qingpu Branch of Zhongshan Hospital Affiliated to Fudan University, Shanghai, China
| | - Zhen Zhang
- Department of Otolaryngology-Head and Neck Surgery, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai, China.,Shanghai Key Laboratory of Sleep Disordered Breathing, Shanghai, China
| | - Di Qian
- Department of Otolaryngology, Shenzhen Longhua District People's Hospital, Shenzhen, China
| | - Yanmei Feng
- Department of Otolaryngology-Head and Neck Surgery, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai, China.,Shanghai Key Laboratory of Sleep Disordered Breathing, Shanghai, China
| |
Collapse
|
10
|
Brant JA, Adewole DO, Vitale F, Cullen DK. Bioengineering applications for hearing restoration: emerging biologically inspired and biointegrated designs. Curr Opin Biotechnol 2021; 72:131-138. [PMID: 34826683 DOI: 10.1016/j.copbio.2021.11.002] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2021] [Revised: 10/29/2021] [Accepted: 11/08/2021] [Indexed: 12/21/2022]
Abstract
Cochlear implantation has become the standard of care for hearing loss not amenable to amplification by bypassing the structures of the cochlea and stimulating the spiral ganglion neurons directly. Since the first single channel electrodes were implanted, significant advancements have been made: multi-channel arrays are now standard, they are softer to avoid damage to the cochlea and pre-curved to better position the electrode array adjacent to the nerve, and surgical and stimulation techniques have helped to conform to the anatomy and physiology of the cochlea. However, even with these advances the experience does not approach that of normal hearing. In order to make significant advances in performance, the next generation of implants will require novel interface technology. Advances in regenerative techniques, optogenetics, piezoelectric materials, and bioengineered living scaffolds hold the promise for the next generation of implantable hearing devices, and hope for the restoration of natural hearing.
Collapse
Affiliation(s)
- Jason A Brant
- Department of Otorhinolaryngology, Perelman School of Medicine, University of Pennsylvania, 3400 Spruce St., Philadelphia, PA 19104, USA; Center for Brain Injury & Repair, Department of Neurosurgery, University of Pennsylvania, 240 S. 33rd St., 301 Hayden Hall, Philadelphia, PA 19104, USA
| | - Dayo O Adewole
- Department of Otorhinolaryngology, Perelman School of Medicine, University of Pennsylvania, 3400 Spruce St., Philadelphia, PA 19104, USA; Center for Brain Injury & Repair, Department of Neurosurgery, University of Pennsylvania, 240 S. 33rd St., 301 Hayden Hall, Philadelphia, PA 19104, USA; Department of Bioengineering, School of Engineering and Applied, Science, University of Pennsylvania, 220 S 33rd St., Philadelphia, PA 19104, USA; Center for Brain Injury & Repair, Department of Neurosurgery, University of Pennsylvania, 3320 Smith Walk, 105 Hayden Hall, Philadelphia, PA 19104, USA; Center for Neuroengineering & Therapeutics, University of Pennsylvania, 3400 Spruce St., Philadelphia, PA 19104, USA
| | - Flavia Vitale
- Center for Brain Injury & Repair, Department of Neurosurgery, University of Pennsylvania, 240 S. 33rd St., 301 Hayden Hall, Philadelphia, PA 19104, USA; Department of Bioengineering, School of Engineering and Applied, Science, University of Pennsylvania, 220 S 33rd St., Philadelphia, PA 19104, USA; Department of Neurology, Perelman School of Medicine, University of Pennsylvania, 3400 Spruce St., Philadelphia, PA 19104, USA; Department of Physical Medicine & Rehabilitation, Perelman School of Medicine, University of Pennsylvania, 3400 Spruce St., Philadelphia, PA 19104, USA; Center for Neuroengineering & Therapeutics, University of Pennsylvania, 3400 Spruce St., Philadelphia, PA 19104, USA
| | - Daniel K Cullen
- Center for Brain Injury & Repair, Department of Neurosurgery, University of Pennsylvania, 240 S. 33rd St., 301 Hayden Hall, Philadelphia, PA 19104, USA; Department of Bioengineering, School of Engineering and Applied, Science, University of Pennsylvania, 220 S 33rd St., Philadelphia, PA 19104, USA; Center for Brain Injury & Repair, Department of Neurosurgery, University of Pennsylvania, 3320 Smith Walk, 105 Hayden Hall, Philadelphia, PA 19104, USA; Center for Neuroengineering & Therapeutics, University of Pennsylvania, 3400 Spruce St., Philadelphia, PA 19104, USA.
| |
Collapse
|
11
|
Huang EHH, Wu CM, Lin HC. Combination and Comparison of Sound Coding Strategies Using Cochlear Implant Simulation With Mandarin Speech. IEEE Trans Neural Syst Rehabil Eng 2021; 29:2407-2416. [PMID: 34767509 DOI: 10.1109/tnsre.2021.3128064] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Three cochlear implant (CI) sound coding strategies were combined in the same signal processing path and compared for speech intelligibility with vocoded Mandarin sentences. The three CI coding strategies, biologically-inspired hearing aid algorithm (BioAid), envelope enhancement (EE), and fundamental frequency modulation (F0mod), were combined with the advanced combination encoder (ACE) strategy. Hence, four singular coding strategies and four combinational coding strategies were derived. Mandarin sentences with speech-shape noise were processed using these coding strategies. Speech understanding of vocoded Mandarin sentences was evaluated using short-time objective intelligibility (STOI) and subjective sentence recognition tests with normal-hearing listeners. For signal-to-noise ratios at 5 dB or above, the EE strategy had slightly higher average scores in both STOI and listening tests compared to ACE. The addition of EE to BioAid slightly increased the mean scores for BioAid+EE, which was the combination strategy with the highest scores in both objective and subjective speech intelligibility. The benefits of BioAid, F0mod, and the four combinational coding strategies were not observed in CI simulation. The findings of this study may be useful for the future design of coding strategies and related studies with Mandarin.
Collapse
|
12
|
Phenomenological model of auditory nerve population responses to cochlear implant stimulation. J Neurosci Methods 2021; 358:109212. [PMID: 33957156 DOI: 10.1016/j.jneumeth.2021.109212] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2020] [Revised: 04/26/2021] [Accepted: 04/29/2021] [Indexed: 01/19/2023]
Abstract
BACKGROUND Models of auditory nerve fiber (ANF) responses to electrical stimulation are helpful to develop advanced coding for cochlear implants (CIs). A phenomenological model of ANF population responses to CI electrical stimulation with a lower computational complexity compared to a biophysical model would be beneficial to evaluate new CI coding strategies. NEW METHOD This study presents a phenomenological model which combines four temporal characteristics of ANFs (refractoriness, facilitation, accommodation and spike rate adaptation) in addition to a spatial spread of the electric field. RESULTS The model predicts the performances of CI subjects in the melodic contour identification (MCI) experiment. The simulations for the MCI experiment were consistent with CI recipients' experimental outcomes that were not predictable from the electrical stimulation patterns themselves. COMPARISON WITH EXISTING METHODS Previously, no phenomenological population model of ANFs has combined all four aforementioned temporal phenomena. CONCLUSIONS The proposed model would help the further investigations of ANFs responses to different electrical stimulation patterns and comparison of different sound coding strategies in CIs.
Collapse
|
13
|
Kludt E, Nogueira W, Lenarz T, Buechner A. A sound coding strategy based on a temporal masking model for cochlear implants. PLoS One 2021; 16:e0244433. [PMID: 33417608 PMCID: PMC7793249 DOI: 10.1371/journal.pone.0244433] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2020] [Accepted: 12/09/2020] [Indexed: 12/16/2022] Open
Abstract
Auditory masking occurs when one sound is perceptually altered by the presence of another sound. Auditory masking in the frequency domain is known as simultaneous masking and in the time domain is known as temporal masking or non-simultaneous masking. This works presents a sound coding strategy that incorporates a temporal masking model to select the most relevant channels for stimulation in a cochlear implant (CI). A previous version of the strategy, termed psychoacoustic advanced combination encoder (PACE), only used a simultaneous masking model for the same purpose, for this reason the new strategy has been termed temporal-PACE (TPACE). We hypothesized that a sound coding strategy that focuses on stimulating the auditory nerve with pulses that are as masked as possible can improve speech intelligibility for CI users. The temporal masking model used within TPACE attenuates the simultaneous masking thresholds estimated by PACE over time. The attenuation is designed to fall exponentially with a strength determined by a single parameter, the temporal masking half-life T½. This parameter gives the time interval at which the simultaneous masking threshold is halved. The study group consisted of 24 postlingually deaf subjects with a minimum of six months experience after CI activation. A crossover design was used to compare four variants of the new temporal masking strategy TPACE (T½ ranging between 0.4 and 1.1 ms) with respect to the clinical MP3000 strategy, a commercial implementation of the PACE strategy, in two prospective, within-subject, repeated-measure experiments. The outcome measure was speech intelligibility in noise at 15 to 5 dB SNR. In two consecutive experiments, the TPACE with T½ of 0.5 ms obtained a speech performance increase of 11% and 10% with respect to the MP3000 (T½ = 0 ms), respectively. The improved speech test scores correlated with the clinical performance of the subjects: CI users with above-average outcome in their routine speech tests showed higher benefit with TPACE. It seems that the consideration of short-acting temporal masking can improve speech intelligibility in CI users. The half-live with the highest average speech perception benefit (0.5 ms) corresponds to time scales that are typical for neuronal refractory behavior.
Collapse
Affiliation(s)
- Eugen Kludt
- Department of Otolaryngology, Medical University of Hannover, Hanover, Germany
| | - Waldo Nogueira
- Department of Otolaryngology, Medical University of Hannover, Hanover, Germany
- Hearing4all, Oldenburg, Germany
| | - Thomas Lenarz
- Department of Otolaryngology, Medical University of Hannover, Hanover, Germany
- Hearing4all, Oldenburg, Germany
| | - Andreas Buechner
- Department of Otolaryngology, Medical University of Hannover, Hanover, Germany
- Hearing4all, Oldenburg, Germany
| |
Collapse
|