Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Healy EW, Yoho SE, Wang Y, Wang D. An algorithm to improve speech recognition in noise for hearing-impaired listeners. J Acoust Soc Am 2013;134:3029-38. [PMID: 24116438 PMCID: PMC3799726 DOI: 10.1121/1.4820893] [Citation(s) in RCA: 82] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/20/2013] [Revised: 08/22/2013] [Accepted: 08/26/2013] [Indexed: 05/09/2023]

For:	Healy EW, Yoho SE, Wang Y, Wang D. An algorithm to improve speech recognition in noise for hearing-impaired listeners. J Acoust Soc Am 2013;134:3029-38. [PMID: 24116438 PMCID: PMC3799726 DOI: 10.1121/1.4820893] [Citation(s) in RCA: 82] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/20/2013] [Revised: 08/22/2013] [Accepted: 08/26/2013] [Indexed: 05/09/2023]

Number

Cited by Other Article(s)

Johnson EM, Healy EW. An ideal compressed mask for increasing speech intelligibility without sacrificing environmental sound recognitiona). THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2024;156:3958-3969. [PMID: 39666959 PMCID: PMC11646135 DOI: 10.1121/10.0034599] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/17/2024] [Revised: 10/23/2024] [Accepted: 11/18/2024] [Indexed: 12/14/2024]

Henry F, Glavin M, Jones E, Parsi A. Impact of Mask Type as Training Target for Speech Intelligibility and Quality in Cochlear-Implant Noise Reduction. SENSORS (BASEL, SWITZERLAND) 2024;24:6614. [PMID: 39460094 PMCID: PMC11511210 DOI: 10.3390/s24206614] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/23/2024] [Revised: 10/06/2024] [Accepted: 10/10/2024] [Indexed: 10/28/2024]

Brice S, Zakis J, Almond H. Changing Knowledge, Principles, and Technology in Contemporary Clinical Audiological Practice: A Narrative Review. J Clin Med 2024;13:4538. [PMID: 39124804 PMCID: PMC11313557 DOI: 10.3390/jcm13154538] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2024] [Revised: 07/23/2024] [Accepted: 07/31/2024] [Indexed: 08/12/2024] Open

Gaultier C, Goehring T. Recovering speech intelligibility with deep learning and multiple microphones in noisy-reverberant situations for people using cochlear implants. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2024;155:3833-3847. [PMID: 38884525 DOI: 10.1121/10.0026218] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/15/2024] [Accepted: 05/10/2024] [Indexed: 06/18/2024]

Fan J, Williamson DS. From the perspective of perceptual speech quality: The robustness of frequency bands to noise. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2024;155:1916-1927. [PMID: 38456734 DOI: 10.1121/10.0025272] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/30/2023] [Accepted: 02/22/2024] [Indexed: 03/09/2024]

Sabin AT, McElhone D, Gauger D, Rabinowitz B. Modeling the Intelligibility Benefit of Active Noise Cancelation in Hearing Devices That Improve Signal-to-Noise Ratio. Trends Hear 2024;28:23312165241260029. [PMID: 38831646 PMCID: PMC11149449 DOI: 10.1177/23312165241260029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2023] [Revised: 05/20/2024] [Accepted: 05/21/2024] [Indexed: 06/05/2024] Open

Gutz SE, Maffei MF, Green JR. Feedback From Automatic Speech Recognition to Elicit Clear Speech in Healthy Speakers. AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2023;32:2940-2959. [PMID: 37824377 PMCID: PMC10721250 DOI: 10.1044/2023_ajslp-23-00030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/25/2023] [Revised: 04/10/2023] [Accepted: 08/01/2023] [Indexed: 10/14/2023]

Henry F, Parsi A, Glavin M, Jones E. Experimental Investigation of Acoustic Features to Optimize Intelligibility in Cochlear Implants. SENSORS (BASEL, SWITZERLAND) 2023;23:7553. [PMID: 37688009 PMCID: PMC10490615 DOI: 10.3390/s23177553] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/09/2023] [Revised: 08/21/2023] [Accepted: 08/28/2023] [Indexed: 09/10/2023]

Borrie SA, Yoho SE, Healy EW, Barrett TS. The Application of Time-Frequency Masking To Improve Intelligibility of Dysarthric Speech in Background Noise. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2023;66:1853-1866. [PMID: 36944186 PMCID: PMC10457087 DOI: 10.1044/2023_jslhr-22-00558] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/24/2022] [Revised: 12/13/2022] [Accepted: 01/10/2023] [Indexed: 05/11/2023]

Healy EW, Johnson EM, Pandey A, Wang D. Progress made in the efficacy and viability of deep-learning-based noise reduction. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2023;153:2751. [PMID: 37133814 PMCID: PMC10159658 DOI: 10.1121/10.0019341] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/06/2022] [Revised: 04/17/2023] [Accepted: 04/17/2023] [Indexed: 05/04/2023]

Venkata Lakshmi S, Sujatha K, Janet J. A hybrid discriminant fuzzy DNN with enhanced modularity bat algorithm for speech recognition. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2022. [DOI: 10.3233/jifs-212945] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

Chou KF, Boyd AD, Best V, Colburn HS, Sen K. A biologically oriented algorithm for spatial sound segregation. Front Neurosci 2022;16:1004071. [PMID: 36312015 PMCID: PMC9614053 DOI: 10.3389/fnins.2022.1004071] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Accepted: 09/28/2022] [Indexed: 11/13/2022] Open

Carter BL, Apoux F, Healy EW. The Influence of Noise Type and Semantic Predictability on Word Recall in Older Listeners and Listeners With Hearing Impairment. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2022;65:3548-3565. [PMID: 35973100 PMCID: PMC9913215 DOI: 10.1044/2022_jslhr-22-00075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/01/2022] [Revised: 05/01/2022] [Accepted: 05/11/2022] [Indexed: 06/15/2023]

Abstract

PURPOSE

A dual-task paradigm was implemented to investigate how noise type and sentence context may interact with age and hearing loss to impact word recall during speech recognition.

METHOD

Three noise types with varying degrees of temporal/spectrotemporal modulation were used: speech-shaped noise, speech-modulated noise, and three-talker babble. Participant groups included younger listeners with normal hearing (NH), older listeners with near-normal hearing, and older listeners with sensorineural hearing loss. An adaptive measure was used to establish the signal-to-noise ratio approximating 70% sentence recognition for each participant in each noise type. A word-recall task was then implemented while matching speech-recognition performance across noise types and participant groups. Random-intercept linear mixed-effects models were used to determine the effects of and interactions between noise type, sentence context, and participant group on word recall.

RESULTS

The results suggest that noise type does not significantly impact word recall when word-recognition performance is controlled. When data from noise types were pooled and compared with quiet, and recall was assessed: older listeners with near-normal hearing performed well when either quiet backgrounds or high sentence context (or both) were present, but older listeners with hearing loss performed well only when both quiet backgrounds and high sentence context were present. Younger listeners with NH were robust to the detrimental effects of noise and low context.

CONCLUSIONS

The general presence of noise has the potential to decrease word recall, but type of noise does not appear to significantly impact this observation when overall task difficulty is controlled. The presence of noise as well as deficits related to age and/or hearing loss appear to limit the availability of cognitive processing resources available for working memory during conversation in difficult listening environments. The conversation environments that impact these resources appear to differ depending on age and/or hearing status.

Collapse

Speech recognition using Taylor-gradient Descent political optimization based Deep residual network. COMPUT SPEECH LANG 2022. [DOI: 10.1016/j.csl.2022.101442] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]

Gutz SE, Rowe HP, Tilton-Bolowsky VE, Green JR. Speaking with a KN95 face mask: a within-subjects study on speaker adaptation and strategies to improve intelligibility. Cogn Res Princ Implic 2022;7:73. [PMID: 35907167 PMCID: PMC9339031 DOI: 10.1186/s41235-022-00423-4] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2021] [Accepted: 07/18/2022] [Indexed: 11/15/2022] Open

Recent Trends in AI-Based Intelligent Sensing. ELECTRONICS 2022. [DOI: 10.3390/electronics11101661] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]

Creating Clarity in Noisy Environments by Using Deep Learning in Hearing Aids. Semin Hear 2021;42:260-281. [PMID: 34594089 PMCID: PMC8463126 DOI: 10.1055/s-0041-1735134] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open

Healy EW, Johnson EM, Delfarah M, Krishnagiri DS, Sevich VA, Taherian H, Wang D. Deep learning based speaker separation and dereverberation can generalize across different languages to improve intelligibility. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021;150:2526. [PMID: 34717521 PMCID: PMC8637753 DOI: 10.1121/10.0006565] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/19/2021] [Revised: 09/16/2021] [Accepted: 09/16/2021] [Indexed: 05/20/2023]

Tang Y. Glimpse-based estimation of speech intelligibility from speech-in-noise using artificial neural networks. COMPUT SPEECH LANG 2021. [DOI: 10.1016/j.csl.2021.101220] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]

Geravanchizadeh M, Zakeri S. Ear-EEG-based binaural speech enhancement (ee-BSE) using auditory attention detection and audiometric characteristics of hearing-impaired subjects. J Neural Eng 2021;18. [PMID: 34289464 DOI: 10.1088/1741-2552/ac16b4] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2020] [Accepted: 07/21/2021] [Indexed: 11/11/2022]

Abstract

Objective. Speech perception in cocktail party scenarios has been the concern of a group of researchers who are involved with the design of hearing-aid devices.Approach. In this paper, a new unified ear-EEG-based binaural speech enhancement system is introduced for hearing-impaired (HI) listeners. The proposed model, which is based on auditory attention detection (AAD) and individual hearing threshold (HT) characteristics, has four main processing stages. In the binaural processing stage, a system based on the deep neural network is trained to estimate auditory ratio masks for each of the speakers in the mixture signal. In the EEG processing stage, AAD is employed to select one ratio mask corresponding to the attended speech. Here, the same EEG data is also used to predict the HTs of listeners who participated in the EEG recordings. The third stage, called insertion gain computation, concerns the calculation of a special amplification gain based on individual HTs. Finally, in the selection-resynthesis-amplification stage, the attended speech signals of the target are resynthesized based on the selected auditory mask and then are amplified using the computed insertion gain.Main results. The detection of the attended speech and the HTs are achieved by classifiers that are trained with features extracted from the scalp EEG or the ear EEG signals. The results of evaluating AAD and HT detection show high detection accuracies. The systematic evaluations of the proposed system yield substantial intelligibility and quality improvements for the HI and normal-hearingaudiograms.Significance. The AAD method determines the direction of attention from single-trial EEG signals without access to audio signals of the speakers. The amplification procedure could be adjusted for each subject based on the individual HTs. The present model has the potential to be considered as an important processing tool to personalize the neuro-steered hearing aids.

Collapse

Behavioral Pattern Analysis between Bilingual and Monolingual Listeners’ Natural Speech Perception on Foreign-Accented English Language Using Different Machine Learning Approaches. TECHNOLOGIES 2021. [DOI: 10.3390/technologies9030051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

Healy EW, Tan K, Johnson EM, Wang D. An effectively causal deep learning algorithm to increase intelligibility in untrained noises for hearing-impaired listeners. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021;149:3943. [PMID: 34241481 PMCID: PMC8186949 DOI: 10.1121/10.0005089] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/20/2020] [Revised: 05/09/2021] [Accepted: 05/10/2021] [Indexed: 05/20/2023]

Defending Against Microphone-Based Attacks with Personalized Noise. PROCEEDINGS ON PRIVACY ENHANCING TECHNOLOGIES 2021. [DOI: 10.2478/popets-2021-0021] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]

Improving Speech Quality for Hearing Aid Applications Based on Wiener Filter and Composite of Deep Denoising Autoencoders. SIGNALS 2020. [DOI: 10.3390/signals1020008] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open

Wearable Hearing Device Spectral Enhancement Driven by Non-Negative Sparse Coding-Based Residual Noise Reduction. SENSORS 2020;20:s20205751. [PMID: 33050447 PMCID: PMC7600179 DOI: 10.3390/s20205751] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/14/2020] [Revised: 10/02/2020] [Accepted: 10/06/2020] [Indexed: 11/17/2022]

Rajesh Kumar T, Suresh GR, Kanaga Subaraja S, Karthikeyan C. Taylor‐AMS features and deep convolutional neural network for converting nonaudible murmur to normal speech. Comput Intell 2020. [DOI: 10.1111/coin.12281] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Auditory Device Voice Activity Detection Based on Statistical Likelihood-Ratio Order Statistics. APPLIED SCIENCES-BASEL 2020. [DOI: 10.3390/app10155026] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Mourão GL, Costa MH, Paul S. Speech Intelligibility for Cochlear Implant Users with the MMSE Noise-Reduction Time-Frequency Mask. Biomed Signal Process Control 2020. [DOI: 10.1016/j.bspc.2020.101982] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Healy EW, Johnson EM, Delfarah M, Wang D. A talker-independent deep learning algorithm to increase intelligibility for hearing-impaired listeners in reverberant competing talker conditions. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020;147:4106. [PMID: 32611178 PMCID: PMC7314568 DOI: 10.1121/10.0001441] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/16/2019] [Revised: 05/28/2020] [Accepted: 05/29/2020] [Indexed: 05/20/2023]

Liu F, Demosthenous A, Yasin I. Auditory filter-bank compression improves estimation of signal-to-noise ratio for speech in noise. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020;147:3197. [PMID: 32486788 DOI: 10.1121/10.0001168] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/13/2019] [Accepted: 04/09/2020] [Indexed: 06/11/2023]

Khaleelur Rahiman PF, Jayanthi VS, Jayanthi AN. RETRACTED: Speech enhancement method using deep learning approach for hearing-impaired listeners. Health Informatics J 2020;27:1460458219893850. [PMID: 31969042 DOI: 10.1177/1460458219893850] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

Gutiérrez-Muñoz M, González-Salazar A, Coto-Jiménez M. Evaluation of Mixed Deep Neural Networks for Reverberant Speech Enhancement. Biomimetics (Basel) 2019;5:biomimetics5010001. [PMID: 31861828 PMCID: PMC7148527 DOI: 10.3390/biomimetics5010001] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2019] [Revised: 12/06/2019] [Accepted: 12/16/2019] [Indexed: 11/16/2022] Open

Abstract

Speech signals are degraded in real-life environments, as a product of background noise or other factors. The processing of such signals for voice recognition and voice analysis systems presents important challenges. One of the conditions that make adverse quality difficult to handle in those systems is reverberation, produced by sound wave reflections that travel from the source to the microphone in multiple directions. To enhance signals in such adverse conditions, several deep learning-based methods have been proposed and proven to be effective. Recently, recurrent neural networks, especially those with long short-term memory (LSTM), have presented surprising results in tasks related to time-dependent processing of signals, such as speech. One of the most challenging aspects of LSTM networks is the high computational cost of the training procedure, which has limited extended experimentation in several cases. In this work, we present a proposal to evaluate the hybrid models of neural networks to learn different reverberation conditions without any previous information. The results show that some combinations of LSTM and perceptron layers produce good results in comparison to those from pure LSTM networks, given a fixed number of layers. The evaluation was made based on quality measurements of the signal’s spectrum, the training time of the networks, and statistical validation of results. In total, 120 artificial neural networks of eight different types were trained and compared. The results help to affirm the fact that hybrid networks represent an important solution for speech signal enhancement, given that reduction in training time is on the order of 30%, in processes that can normally take several days or weeks, depending on the amount of data. The results also present advantages in efficiency, but without a significant drop in quality.

Collapse

Delfarah M, Wang D. Deep Learning for Talker-dependent Reverberant Speaker Separation: An Empirical Study. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING 2019;27:1839-1848. [PMID: 33748321 PMCID: PMC7970708 DOI: 10.1109/taslp.2019.2934319] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]

Goehring T, Keshavarzi M, Carlyon RP, Moore BCJ. Using recurrent neural networks to improve the perception of speech in non-stationary noise by people with cochlear implants. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2019;146:705. [PMID: 31370586 PMCID: PMC6773603 DOI: 10.1121/1.5119226] [Citation(s) in RCA: 44] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/23/2019] [Accepted: 07/08/2019] [Indexed: 05/20/2023]

Bhat GS, Shankar N, Reddy CKA, Panahi IMS. A Real-Time Convolutional Neural Network Based Speech Enhancement for Hearing Impaired Listeners Using Smartphone. IEEE ACCESS : PRACTICAL INNOVATIONS, OPEN SOLUTIONS 2019;7:78421-78433. [PMID: 32661495 PMCID: PMC7357966 DOI: 10.1109/access.2019.2922370] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]

Healy EW, Vasko JL, Wang D. The optimal threshold for removing noise from speech is similar across normal and impaired hearing-a time-frequency masking study. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2019;145:EL581. [PMID: 31255108 PMCID: PMC6786891 DOI: 10.1121/1.5112828] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]

Keshavarzi M, Goehring T, Turner RE, Moore BCJ. Comparison of effects on subjective intelligibility and quality of speech in babble for two algorithms: A deep recurrent neural network and spectral subtraction. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2019;145:1493. [PMID: 31067946 DOI: 10.1121/1.5094765] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/15/2018] [Accepted: 03/01/2019] [Indexed: 06/09/2023]

Healy EW, Delfarah M, Johnson EM, Wang D. A deep learning algorithm to increase intelligibility for hearing-impaired listeners in the presence of a competing talker and reverberation. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2019;145:1378. [PMID: 31067936 PMCID: PMC6420339 DOI: 10.1121/1.5093547] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/06/2018] [Revised: 02/06/2019] [Accepted: 02/19/2019] [Indexed: 05/20/2023]

Multi-objective learning based speech enhancement method to increase speech quality and intelligibility for hearing aid device users. Biomed Signal Process Control 2019. [DOI: 10.1016/j.bspc.2018.09.010] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Borrie SA, Barrett TS, Yoho SE. Autoscore: An open-source automated tool for scoring listener perception of speech. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2019;145:392. [PMID: 30710955 PMCID: PMC6347573 DOI: 10.1121/1.5087276] [Citation(s) in RCA: 39] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/12/2018] [Revised: 11/26/2018] [Accepted: 12/10/2018] [Indexed: 05/19/2023]

RETRACTED ARTICLE: Deep convolutional neural network-based speech enhancement to improve speech intelligibility and quality for hearing-impaired listeners. Med Biol Eng Comput 2018;57:757. [DOI: 10.1007/s11517-018-1933-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]

Wang D, Chen J. Supervised Speech Separation Based on Deep Learning: An Overview. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING 2018;26:1702-1726. [PMID: 31223631 PMCID: PMC6586438 DOI: 10.1109/taslp.2018.2842159] [Citation(s) in RCA: 132] [Impact Index Per Article: 18.9] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/12/2023]

Healy EW, Vasko JL. An ideal quantized mask to increase intelligibility and quality of speech in noise. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018;144:1392. [PMID: 30424638 PMCID: PMC6136922 DOI: 10.1121/1.5053115] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/16/2018] [Revised: 08/06/2018] [Accepted: 08/20/2018] [Indexed: 05/25/2023]

Zhao Y, Wang D, Johnson EM, Healy EW. A deep learning based segregation algorithm to increase speech intelligibility for hearing-impaired listeners in reverberant-noisy conditions. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018;144:1627. [PMID: 30424625 PMCID: PMC6167229 DOI: 10.1121/1.5055562] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/07/2018] [Revised: 08/27/2018] [Accepted: 09/06/2018] [Indexed: 05/20/2023]

Bramsløw L, Naithani G, Hafez A, Barker T, Pontoppidan NH, Virtanen T. Improving competing voices segregation for hearing impaired listeners using a low-latency deep neural network algorithm. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018;144:172. [PMID: 30075667 DOI: 10.1121/1.5045322] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]

Montazeri V, Assmann PF. Constraints on ideal binary masking for the perception of spectrally-reduced speech. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018;144:EL59. [PMID: 30075663 DOI: 10.1121/1.5046442] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/12/2018] [Accepted: 06/26/2018] [Indexed: 06/08/2023]

Bentsen T, May T, Kressner AA, Dau T. The benefit of combining a deep neural network architecture with ideal ratio mask estimation in computational speech segregation to improve speech intelligibility. PLoS One 2018;13:e0196924. [PMID: 29763459 PMCID: PMC5953465 DOI: 10.1371/journal.pone.0196924] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2018] [Accepted: 04/23/2018] [Indexed: 11/19/2022] Open

Keshavarzi M, Goehring T, Zakis J, Turner RE, Moore BCJ. Use of a Deep Recurrent Neural Network to Reduce Wind Noise: Effects on Judged Speech Intelligibility and Sound Quality. Trends Hear 2018;22:2331216518770964. [PMID: 29708061 PMCID: PMC5949931 DOI: 10.1177/2331216518770964] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Koning R, Bruce IC, Denys S, Wouters J. Perceptual and Model-Based Evaluation of Ideal Time-Frequency Noise Reduction in Hearing-Impaired Listeners. IEEE Trans Neural Syst Rehabil Eng 2018. [PMID: 29522412 DOI: 10.1109/tnsre.2018.2794557] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]

Soleymani R, Selesnick IW, Landsberger DM. SEDA: A tunable Q-factor wavelet-based noise reduction algorithm for multi-talker babble. SPEECH COMMUNICATION 2018;96:102-115. [PMID: 29606781 PMCID: PMC5875444 DOI: 10.1016/j.specom.2017.11.004] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]