1
Wohlbauer DM, Dillier N. A Hundred Ways to Encode Sound Signals for Cochlear Implants. Annu Rev Biomed Eng 2025; 27:335-369. [PMID: 40310887 DOI: 10.1146/annurev-bioeng-102623-121249] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/03/2025]
Abstract
Cochlear implants are the most successful neural prostheses used to restore hearing in severe-to-profound hearing-impaired individuals. The field of cochlear implant coding investigates interdisciplinary approaches to translate acoustic signals into electrical pulses transmitted at the electrode-neuron interface, ranging from signal preprocessing algorithms, enhancement, and feature extraction methodologies to electric signal generation. In the last five decades, numerous coding strategies have been proposed clinically and experimentally. Initially developed to restore speech perception, increasing computational possibilities now allow coding of more complex signals, and new techniques to optimize the transmission of electrical signals are constantly gaining attention. This review provides insights into the history of multichannel coding and presents an extensive list of implemented strategies. The article briefly addresses each method and considers promising future directions of neural prostheses and possible signal processing, with the ultimate goal of providing a current big picture of the large field of cochlear implant coding.
Affiliation(s)
- Dietmar M Wohlbauer
- Department of Otolaryngology, Head and Neck Surgery, Massachusetts Eye and Ear, Harvard Medical School, Boston, Massachusetts, USA
- Norbert Dillier
- Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Zürich, University of Zürich, Zürich, Switzerland
2
Khurana L, Harczos T, Moser T, Jablonski L. En route to sound coding strategies for optical cochlear implants. iScience 2023; 26:107725. [PMID: 37720089 PMCID: PMC10502376 DOI: 10.1016/j.isci.2023.107725] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 09/19/2023] Open
Abstract
Hearing loss is the most common human sensory deficit. Severe-to-complete sensorineural hearing loss is often treated by electrical cochlear implants (eCIs) bypassing dysfunctional or lost hair cells by direct stimulation of the auditory nerve. The wide current spread from each intracochlear electrode array contact activates large sets of tonotopically organized neurons, limiting spectral selectivity of sound coding. Despite many efforts, an increase in the number of independent eCI stimulation channels seems impossible to achieve. Light, which can be better confined in space than electric current, may help optical cochlear implants (oCIs) to overcome eCI shortcomings. In this review, we present the current state of optogenetic sound encoding. We highlight the development of optical sound coding strategies as an emerging research area: capitalizing on optical stimulation requires fine-grained, fast, and power-efficient real-time sound processing that controls dozens of microscale optical emitters.
Affiliation(s)
- Lakshay Khurana
- Institute for Auditory Neuroscience, University Medical Center Göttingen, Göttingen, Germany
- Auditory Neuroscience and Optogenetics Laboratory, German Primate Center, Göttingen, Germany
- Auditory Neuroscience and Synaptic Nanophysiology Group, Max-Planck-Institute for Multidisciplinary Sciences, Göttingen, Germany
- Junior Research Group “Computational Neuroscience and Neuroengineering”, Göttingen, Germany
- The Doctoral Program “Sensory and Motor Neuroscience”, Göttingen Graduate Center for Neurosciences, Biophysics, and Molecular Biosciences (GGNB), Göttingen, Germany
- InnerEarLab, University Medical Center Göttingen, Göttingen, Germany
- Tamas Harczos
- Institute for Auditory Neuroscience, University Medical Center Göttingen, Göttingen, Germany
- Auditory Neuroscience and Optogenetics Laboratory, German Primate Center, Göttingen, Germany
- Tobias Moser
- Institute for Auditory Neuroscience, University Medical Center Göttingen, Göttingen, Germany
- Auditory Neuroscience and Optogenetics Laboratory, German Primate Center, Göttingen, Germany
- Auditory Neuroscience and Synaptic Nanophysiology Group, Max-Planck-Institute for Multidisciplinary Sciences, Göttingen, Germany
- InnerEarLab, University Medical Center Göttingen, Göttingen, Germany
- Cluster of Excellence “Multiscale Bioimaging: from Molecular Machines to Networks of Excitable Cells” (MBExC), University of Göttingen, Göttingen, Germany
- Lukasz Jablonski
- Institute for Auditory Neuroscience, University Medical Center Göttingen, Göttingen, Germany
- Auditory Neuroscience and Optogenetics Laboratory, German Primate Center, Göttingen, Germany
- Junior Research Group “Computational Neuroscience and Neuroengineering”, Göttingen, Germany
- InnerEarLab, University Medical Center Göttingen, Göttingen, Germany
3
Saba JN, Ali H, Hansen JHL. The effects of estimation accuracy, estimation approach, and number of selected channels using formant-priority channel selection for an "n-of-m" sound processing strategy for cochlear implants. J Acoust Soc Am 2023; 153:3100. [PMID: 37227411 PMCID: PMC10219683 DOI: 10.1121/10.0019416] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/22/2022] [Revised: 04/16/2023] [Accepted: 04/28/2023] [Indexed: 05/26/2023]
Abstract
Previously, selection of l channels was prioritized according to formant frequency locations in an l-of-n-of-m-based signal processing strategy to provide important voicing information independent of listening environments for cochlear implant (CI) users. In this study, ideal, or ground truth, formants were incorporated into the selection stage to determine the effect of accuracy on (1) subjective speech intelligibility, (2) objective channel selection patterns, and (3) objective stimulation patterns (current). An average +11% improvement (p < 0.05) was observed across six CI users in quiet, but not for noise or reverberation conditions. Analogous increases in channel selection and current for the upper range of F1 and a decrease across mid-frequencies with higher corresponding current, were both observed at the expense of noise-dominant channels. Objective channel selection patterns were analyzed a second time to determine the effects of estimation approach and number of selected channels (n). A significant effect of estimation approach was only observed in the noise and reverberation condition with minor differences in channel selection and significantly decreased stimulated current. Results suggest that estimation method, accuracy, and number of channels in the proposed strategy using ideal formants may improve intelligibility when corresponding stimulated current of formant channels are not masked by noise-dominant channels.
Affiliation(s)
- Juliana N Saba
- University of Texas at Dallas, Center for Robust Speech Systems, Cochlear Implant Laboratory, 800 W. Campbell Rd, EC 33, Richardson, Texas 75080, USA
- Hussnain Ali
- University of Texas at Dallas, Center for Robust Speech Systems, Cochlear Implant Laboratory, 800 W. Campbell Rd, EC 33, Richardson, Texas 75080, USA
- John H L Hansen
- University of Texas at Dallas, Center for Robust Speech Systems, Cochlear Implant Laboratory, 800 W. Campbell Rd, EC 33, Richardson, Texas 75080, USA
4
Tahmasebi S, Segovia-Martinez M, Nogueira W. Optimization of Sound Coding Strategies to Make Singing Music More Accessible for Cochlear Implant Users. Trends Hear 2023; 27:23312165221148022. [PMID: 36628453 PMCID: PMC9837293 DOI: 10.1177/23312165221148022] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Indexed: 01/12/2023] Open
Abstract
Cochlear implants (CIs) are implantable medical devices that can partially restore hearing to people suffering from profound sensorineural hearing loss. While these devices provide good speech understanding in quiet, many CI users face difficulties when listening to music. Reasons include poor spatial specificity of electric stimulation, limited transmission of spectral and temporal fine structure of acoustic signals, and restrictions in the dynamic range that can be conveyed via electric stimulation of the auditory nerve. The coding strategies currently used in CIs are typically designed for speech rather than music. This work investigates the optimization of CI coding strategies to make singing music more accessible to CI users. The aim is to reduce the spectral complexity of music by selecting fewer bands for stimulation, attenuating the background instruments by strengthening a noise reduction algorithm, and optimizing the electric dynamic range through a back-end compressor. The optimizations were evaluated through both objective and perceptual measures of speech understanding and melody identification of singing voice with and without background instruments, as well as music appreciation questionnaires. Consistent with the objective measures, results gathered from the perceptual evaluations indicated that reducing the number of selected bands and optimizing the electric dynamic range significantly improved speech understanding in music. Moreover, results obtained from questionnaires show that the new music back-end compressor significantly improved music enjoyment. These results have potential as a new CI program for improved singing music perception.
Affiliation(s)
- Sina Tahmasebi
- Department of Otolaryngology, Hannover Medical School, Hannover, Germany
- Cluster of Excellence Hearing4all, Hannover, Germany
- Sina Tahmasebi, Karl-Wiechert-Allee 3, 30625 Hannover, Germany.
Waldo Nogueira, Karl-Wiechert-Allee 3, 30625 Hannover, Germany.
- Waldo Nogueira
- Department of Otolaryngology, Hannover Medical School, Hannover, Germany
- Cluster of Excellence Hearing4all, Hannover, Germany
5
Monaghan JJM, Carlyon RP, Deeks JM. Modulation Depth Discrimination by Cochlear Implant Users. J Assoc Res Otolaryngol 2022; 23:285-299. [PMID: 35080684 PMCID: PMC8964891 DOI: 10.1007/s10162-022-00834-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/23/2021] [Accepted: 12/30/2021] [Indexed: 11/29/2022] Open
Abstract
Cochlear implants (CIs) convey the amplitude envelope of speech by modulating high-rate pulse trains. However, not all of the envelope may be necessary to perceive amplitude modulations (AMs); the effective envelope depth may be limited by forward and backward masking from the envelope peaks. Three experiments used modulated pulse trains to measure which portions of the envelope can be effectively processed by CI users as a function of AM frequency. Experiment 1 used a three-interval forced-choice task to test the ability of CI users to discriminate less-modulated pulse trains from a fully modulated standard, without controlling for loudness. The stimuli in experiment 2 were identical, but a two-interval task was used in which participants were required to choose the less-modulated interval, ignoring loudness. Catch trials, in which judgements based on level or modulation depth would give opposing answers, were included. Experiment 3 employed novel stimuli whose modulation envelope could be modified below a variable point in the dynamic range, without changing the loudness of the stimulus. Overall, results showed that substantial portions of the envelope are not accurately encoded by CI users. In experiment 1, where loudness cues were available, participants on average were insensitive to changes in the bottom 30% of their dynamic range. In experiment 2, where loudness was controlled, participants appeared insensitive to changes in the bottom 50% of the dynamic range. In experiment 3, participants were insensitive to changes in the bottom 80% of the dynamic range. We discuss potential reasons for this insensitivity and implications for CI speech-processing strategies.
Affiliation(s)
- Jessica J M Monaghan
- Macquarie University, The Australian Hearing Hub, NSW, 2109, Sydney, Australia.
- National Acoustic Laboratories, The Australian Hearing Hub, Sydney, NSW, 2109, Australia.
- Robert P Carlyon
- Cambridge Hearing Group, Medical Research Council Cognition and Brain Sciences Unit, University of Cambridge, 15 Chaucer Road, Cambridge, CB2 7EF, UK
- John M Deeks
- Cambridge Hearing Group, Medical Research Council Cognition and Brain Sciences Unit, University of Cambridge, 15 Chaucer Road, Cambridge, CB2 7EF, UK
6
Carlyon RP, Goehring T. Cochlear Implant Research and Development in the Twenty-first Century: A Critical Update. J Assoc Res Otolaryngol 2021; 22:481-508. [PMID: 34432222 PMCID: PMC8476711 DOI: 10.1007/s10162-021-00811-5] [Citation(s) in RCA: 51] [Impact Index Per Article: 12.8] [Received: 02/19/2021] [Accepted: 08/02/2021] [Indexed: 12/22/2022] Open
Abstract
Cochlear implants (CIs) are the world's most successful sensory prosthesis and have been the subject of intense research and development in recent decades. We critically review the progress in CI research, and its success in improving patient outcomes, from the turn of the century to the present day. The review focuses on the processing, stimulation, and audiological methods that have been used to try to improve speech perception by human CI listeners, and on fundamental new insights in the response of the auditory system to electrical stimulation. The introduction of directional microphones and of new noise reduction and pre-processing algorithms has produced robust and sometimes substantial improvements. Novel speech-processing algorithms, the use of current-focusing methods, and individualised (patient-by-patient) deactivation of subsets of electrodes have produced more modest improvements. We argue that incremental advances have and will continue to be made, that collectively these may substantially improve patient outcomes, but that the modest size of each individual advance will require greater attention to experimental design and power. We also briefly discuss the potential and limitations of promising technologies that are currently being developed in animal models, and suggest strategies for researchers to collectively maximise the potential of CIs to improve hearing in a wide range of listening situations.
Affiliation(s)
- Robert P Carlyon
- Cambridge Hearing Group, MRC Cognition & Brain Sciences Unit, University of Cambridge, Cambridge, CB2 7EF, UK.
- Tobias Goehring
- Cambridge Hearing Group, MRC Cognition & Brain Sciences Unit, University of Cambridge, Cambridge, CB2 7EF, UK