1. McMullin MA, Kumar R, Higgins NC, Gygi B, Elhilali M, Snyder JS. Preliminary Evidence for Global Properties in Human Listeners During Natural Auditory Scene Perception. Open Mind (Camb) 2024;8:333-365. PMID: 38571530; PMCID: PMC10990578; DOI: 10.1162/opmi_a_00131.
Abstract
Theories of auditory and visual scene analysis suggest the perception of scenes relies on the identification and segregation of objects within them, resembling a detail-oriented processing style. However, scene analysis may also involve a more global process, as has been evidenced in the visual domain. To our knowledge, a similar line of research has not been pursued in the auditory domain; therefore, we evaluated the contributions of high-level global and low-level acoustic information to auditory scene perception. An additional aim was to increase the field's ecological validity by using and making available a new collection of high-quality auditory scenes. Participants rated scenes on eight global properties (e.g., open vs. enclosed), and an acoustic analysis evaluated which low-level features predicted the ratings. We submitted the acoustic measures and average ratings of the global properties to separate exploratory factor analyses (EFAs). The EFA of the acoustic measures revealed a seven-factor structure explaining 57% of the variance in the data, while the EFA of the global property measures revealed a two-factor structure explaining 64% of the variance in the data. Regression analyses revealed that each global property was predicted by at least one acoustic variable (R² = 0.33-0.87). These findings were extended using deep neural network models, where we examined correlations between human ratings of global properties and deep embeddings of two computational models: an object-based model and a scene-based model. The results indicate that participants' ratings are more strongly explained by a global analysis of the scene setting, though the relationship between scene perception and auditory perception is multifaceted, with differing correlation patterns evident between the two models. Taken together, our results provide evidence for the ability to perceive auditory scenes from a global perspective. Some of the acoustic measures predicted ratings of global scene perception, suggesting representations of auditory objects may be transformed through many stages of processing in the ventral auditory stream, similar to what has been proposed in the ventral visual stream. These findings and the open availability of our scene collection will make future studies on perception, attention, and memory for natural auditory scenes possible.
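A minimal sketch of the analysis pipeline described above (factor analysis of the acoustic measures followed by per-property regressions), assuming scikit-learn and illustrative array shapes rather than the authors' actual code and data:

```python
import numpy as np
from sklearn.decomposition import FactorAnalysis
from sklearn.linear_model import LinearRegression

# Illustrative shapes: 200 scenes x 20 acoustic measures, 8 global-property ratings
rng = np.random.default_rng(0)
acoustic = rng.normal(size=(200, 20))   # stand-in for the real acoustic measures
ratings = rng.normal(size=(200, 8))     # stand-in for mean global-property ratings

# EFA over the acoustic measures (the paper reports a seven-factor solution)
efa = FactorAnalysis(n_components=7).fit(acoustic)
acoustic_factors = efa.transform(acoustic)

# Regress each global property on the acoustic factor scores, report R^2
for p in range(ratings.shape[1]):
    model = LinearRegression().fit(acoustic_factors, ratings[:, p])
    print(f"property {p}: R^2 = {model.score(acoustic_factors, ratings[:, p]):.2f}")
```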
Affiliation(s)
- Rohit Kumar: Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, MD, USA
- Nathan C. Higgins: Department of Communication Sciences & Disorders, University of South Florida, Tampa, FL, USA
- Brian Gygi: East Bay Institute for Research and Education, Martinez, CA, USA
- Mounya Elhilali: Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, MD, USA
- Joel S. Snyder: Department of Psychology, University of Nevada, Las Vegas, Las Vegas, NV, USA
2. Englitz B, Akram S, Elhilali M, Shamma S. Decoding contextual influences on auditory perception from primary auditory cortex. bioRxiv 2023:2023.12.24.573229. PMID: 38187523; PMCID: PMC10769425; DOI: 10.1101/2023.12.24.573229.
Abstract
Perception can be highly dependent on stimulus context, but whether and how sensory areas encode the context remains uncertain. We used an ambiguous auditory stimulus - a tritone pair - to investigate the neural activity associated with a preceding contextual stimulus that strongly influenced whether the tritone pair was perceived as an ascending or a descending step in pitch. We recorded single-unit responses from a population of auditory cortical cells in awake ferrets listening to the tritone pairs preceded by the contextual stimulus. We find that the responses adapt locally to the contextual stimulus, consistent with human MEG recordings from the auditory cortex under the same conditions. Decoding the population responses demonstrates that pitch-change-selective cells can reliably predict the context-sensitive percept of the tritone pairs. Conversely, decoding the distances between the pitch representations predicts the opposite of the percept. The various percepts can be readily captured and explained by a neural model of cortical activity based on populations of adapting, pitch- and pitch-direction-selective cells, aligned with the neurophysiological responses. Together, these decoding and model results suggest that contextual influences on perception may already be encoded at the level of the primary sensory cortices, reflecting basic neural response properties commonly found in these areas.
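The population-decoding step could look roughly like the following sketch (a linear classifier on trial-by-neuron firing rates; the data layout and use of scikit-learn are illustrative assumptions, not the authors' pipeline):

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(1)
rates = rng.poisson(5.0, size=(300, 80)).astype(float)  # trials x neurons (stand-in)
percept = rng.integers(0, 2, size=300)                   # 0 = descending, 1 = ascending

# Linear decoder: can the population response predict the percept on held-out trials?
decoder = LogisticRegression(max_iter=1000)
acc = cross_val_score(decoder, rates, percept, cv=5).mean()
print(f"cross-validated decoding accuracy: {acc:.2f}")
```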
Affiliation(s)
- B Englitz: Institute for Systems Research, University of Maryland, College Park, MD, USA; Computational Neuroscience Lab, Donders Institute for Brain, Cognition and Behavior, Radboud University, Nijmegen, The Netherlands
- S Akram: Research Data Science, Meta Platforms
- M Elhilali: Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, USA
- S Shamma: Institute for Systems Research, University of Maryland, College Park, MD, USA; Equipe Audition, Ecole Normale Supérieure, Paris, France
3. Kothinti SR, Elhilali M. Are acoustics enough? Semantic effects on auditory salience in natural scenes. Front Psychol 2023;14:1276237. PMID: 38098516; PMCID: PMC10720592; DOI: 10.3389/fpsyg.2023.1276237.
Abstract
Auditory salience is a fundamental property of a sound that allows it to grab a listener's attention regardless of their attentional state or behavioral goals. While previous research has shed light on acoustic factors influencing auditory salience, the semantic dimensions of this phenomenon have remained relatively unexplored, owing both to the complexity of measuring salience in audition and to the limited focus on complex natural scenes. In this study, we examine the relationship between acoustic, contextual, and semantic attributes and their impact on the auditory salience of natural audio scenes using a dichotic listening paradigm. The experiments present acoustic scenes in forward and backward directions; the backward presentation diminishes semantic effects, providing a counterpoint to the effects observed in forward scenes. The behavioral data collected from a crowd-sourced platform reveal a striking convergence in temporal salience maps for certain sound events, while marked disparities emerge in others. Our main hypothesis posits that differences in the perceptual salience of events are predominantly driven by semantic and contextual cues, particularly evident in those cases displaying substantial disparities between forward and backward presentations. Conversely, events exhibiting a high degree of alignment can largely be attributed to low-level acoustic attributes. To evaluate this hypothesis, we employ analytical techniques that combine rich low-level mappings from acoustic profiles with high-level embeddings extracted from a deep neural network. This integrated approach captures both acoustic and semantic attributes of acoustic scenes along with their temporal trajectories. The results demonstrate that perceptual salience reflects a careful interplay between low-level and high-level attributes that shapes which moments stand out in a natural soundscape. Furthermore, our findings underscore the important role of longer-term context as a critical component of auditory salience, enabling us to discern and adapt to temporal regularities within an acoustic scene. The experimental and model-based validation of semantic factors of salience paves the way for a more complete understanding of auditory salience. Ultimately, the empirical and computational analyses have implications for developing large-scale models for auditory salience and audio analytics.
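A hedged sketch of the combined low-level/high-level analysis, comparing how well acoustic features alone versus acoustic features plus deep embeddings predict behavioral salience (array contents and the ridge model are stand-ins, not the paper's pipeline):

```python
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(2)
acoustic = rng.normal(size=(500, 12))   # loudness, pitch, flatness, ... per frame (stand-in)
embed = rng.normal(size=(500, 128))     # deep-network embeddings per frame (stand-in)
salience = rng.normal(size=500)         # behavioral salience measure (stand-in)

for name, X in [("acoustic only", acoustic),
                ("acoustic + embeddings", np.hstack([acoustic, embed]))]:
    r2 = cross_val_score(Ridge(alpha=1.0), X, salience, cv=5, scoring="r2").mean()
    print(f"{name}: cross-validated R^2 = {r2:.2f}")
```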
Affiliation(s)
- Mounya Elhilali: Department of Electrical and Computer Engineering, Center for Language and Speech Processing, The Johns Hopkins University, Baltimore, MD, United States
4. Rennoll V, McLane I, Eisape A, Grant D, Hahn H, Elhilali M, West JE. Electrostatic Acoustic Sensor with an Impedance-Matched Diaphragm Characterized for Body Sound Monitoring. ACS Appl Bio Mater 2023;6:3241-3256. PMID: 37470762; PMCID: PMC10804910; DOI: 10.1021/acsabm.3c00359.
Abstract
Acoustic sensors capture more incident energy when their acoustic impedance closely matches the acoustic impedance of the medium being probed, such as skin or wood. The acoustic impedance of polymers can be controlled by selecting materials with appropriate densities and stiffnesses, as well as by adding ceramic nanoparticles. This study follows a statistical methodology to examine the impact of polymer type and nanoparticle addition on the fabrication of acoustic sensors with desired acoustic impedances in the range of 1-2.2 MRayls. Using a design-of-experiments approach, the proposed method measures sensors with diaphragms of varying impedances excited by acoustic vibrations traveling through wood, gelatin, and plastic. The sensor diaphragm is subsequently optimized for body sound monitoring, and the sensor's improved body sound coherence and airborne noise rejection are evaluated on an acoustic phantom in simulated noise environments and compared to electronic stethoscopes with onboard noise cancellation. The impedance-matched sensor demonstrates high sensitivity to body sounds, low sensitivity to airborne sound, a frequency response comparable to two state-of-the-art electronic stethoscopes, and the ability to capture lung and heart sounds from a real subject. Due to its small size, use of flexible materials, and rejection of airborne noise, the sensor provides an improved solution for wearable body sound monitoring, as well as for sensing from other media with acoustic impedances in the range of 1-2.2 MRayls, such as water and wood.
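The benefit of impedance matching can be made concrete with the standard normal-incidence power transmission coefficient T = 4 Z1 Z2 / (Z1 + Z2)^2; the sketch below compares a matched and a mismatched diaphragm against a skin-like medium (all impedance values are illustrative assumptions):

```python
def transmission(z1, z2):
    """Fraction of incident acoustic power transmitted across a planar boundary."""
    return 4 * z1 * z2 / (z1 + z2) ** 2

Z_SKIN = 1.5e6          # ~1.5 MRayl, a typical soft-tissue value (assumed)
Z_MATCHED = 1.6e6       # impedance-matched polymer diaphragm (assumed)
Z_MISMATCHED = 30e6     # stiff ceramic-like diaphragm (assumed)

print(f"matched:    {transmission(Z_SKIN, Z_MATCHED):.1%} of power transmitted")
print(f"mismatched: {transmission(Z_SKIN, Z_MISMATCHED):.1%} of power transmitted")
```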
Affiliation(s)
- Valerie Rennoll, Ian McLane, Adebayo Eisape, Drew Grant, Helena Hahn, Mounya Elhilali, and James E West: Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, Maryland 21218, United States
5. Kala A, McCollum ED, Elhilali M. Reference free auscultation quality metric and its trends. Biomed Signal Process Control 2023;85:104852. PMID: 38274002; PMCID: PMC10809975; DOI: 10.1016/j.bspc.2023.104852.
Abstract
Stethoscopes are used ubiquitously in clinical settings to 'listen' to lung sounds. The use of these systems in a variety of healthcare environments (hospitals, urgent care rooms, private offices, community sites, mobile clinics, etc.) presents a range of challenges in terms of ambient noise and distortions that prevent lung signals from being heard clearly or processed accurately by auscultation devices. With advances in technology, computerized techniques have been developed to automate analysis or access a digital rendering of lung sounds. However, most approaches are developed and tested in controlled environments and do not reflect the real-world conditions under which auscultation signals are typically acquired. Without a priori access to a recording of the ambient noise (for signal-to-noise estimation) or a reference signal that reflects the true undistorted lung sound, it is difficult to evaluate the quality of the lung signal and its potential clinical interpretability. The current study proposes an objective, reference-free Auscultation Quality Metric (AQM) which combines low-level signal attributes with high-level representational embeddings mapped to a nonlinear quality space to provide an independent evaluation of auscultation quality. This metric is carefully designed to judge the signal solely on its integrity relative to external distortions and masking effects, without confusing an adventitious breathing pattern for low-quality auscultation. The current study explores the robustness of the proposed AQM method across multiple clinical categorizations and different distortion types. It also evaluates the temporal sensitivity of this approach and its translational impact for deployment in digital auscultation devices.
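A loose sketch in the spirit of a reference-free quality score (low-level attributes combined with an embedding and a nonlinear regressor); the chosen attributes, model, and data are illustrative assumptions, not the paper's actual AQM:

```python
import numpy as np
from sklearn.neural_network import MLPRegressor

def low_level_attributes(x):
    """Two simple attributes: RMS energy and spectral flatness (assumed choices)."""
    spec = np.abs(np.fft.rfft(x)) + 1e-12
    flatness = np.exp(np.mean(np.log(spec))) / np.mean(spec)
    return np.array([np.sqrt(np.mean(x ** 2)), flatness])

rng = np.random.default_rng(3)
clips = rng.normal(size=(100, 4000))     # 1 s auscultation clips at 4 kHz (stand-in)
embeddings = rng.normal(size=(100, 32))  # high-level embeddings (stand-in)
quality = rng.uniform(0, 1, size=100)    # training targets (stand-in)

# Map low-level + high-level features through a small nonlinear regressor
X = np.hstack([np.array([low_level_attributes(c) for c in clips]), embeddings])
model = MLPRegressor(hidden_layer_sizes=(16,), max_iter=2000).fit(X, quality)
print("predicted quality of first clip:", model.predict(X[:1])[0])
```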
Affiliation(s)
- Annapurna Kala: Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, USA
- Eric D. McCollum: Global Program of Pediatric Respiratory Sciences, Eudowood Division of Pediatric Respiratory Sciences, Department of Pediatrics, Johns Hopkins School of Medicine, Baltimore, USA
- Mounya Elhilali: Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, USA
6. Kala A, Elhilali M. Constrained Synthetic Sampling for Augmentation of Crackle Lung Sounds. Annu Int Conf IEEE Eng Med Biol Soc 2023;2023:1-5. PMID: 38083624; PMCID: PMC10823588; DOI: 10.1109/embc40787.2023.10340579.
Abstract
Crackles are explosive breathing patterns caused by lung air sacs filling with fluid and act as an indicator of a plethora of pulmonary diseases. Clinical studies suggest a strong correlation between the presence of these adventitious auscultations and mortality rate, especially in pediatric patients, underscoring the importance of their pathological indication. While clinically important, crackles occur rarely in breathing signals relative to other phases and abnormalities of lung sounds, imposing a considerable class imbalance on the development of learning methodologies for automated tracking and diagnosis of lung pathologies. The scarcity and clinical relevance of crackle sounds compel a need for exploring data augmentation techniques to enrich the space of crackle signals. Given their unique nature, the current study proposes a crackle-specific constrained synthetic sampling (CSS) augmentation that captures the geometric properties of crackles across different projected object spaces. We also outline a task-agnostic validation methodology that evaluates different augmentation techniques based on their goodness of fit relative to the space of original crackles. This evaluation considers both the separability of the manifold space generated by augmented data samples and a statistical distance of the synthesized data relative to the original. Compared to a range of augmentation techniques, the proposed constrained synthetic sampling of crackle sounds is shown to generate the samples most analogous to original crackle sounds, highlighting the importance of carefully considering the statistical constraints of the class under study.
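A rough sketch of interpolation-based minority-class augmentation with a distributional constraint; the specific constraint shown (staying within per-dimension percentile bounds in a PCA space) is an illustrative stand-in for the paper's geometric constraints:

```python
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(4)
crackles = rng.normal(size=(40, 256))   # feature vectors of real crackles (stand-in)

pca = PCA(n_components=8).fit(crackles)
z = pca.transform(crackles)
lo, hi = np.percentile(z, [5, 95], axis=0)   # constraint region in projected space

synthetic = []
while len(synthetic) < 100:
    i, j = rng.choice(len(z), size=2, replace=False)
    cand = z[i] + rng.uniform() * (z[j] - z[i])   # interpolate between two real crackles
    if np.all((cand >= lo) & (cand <= hi)):       # keep only in-distribution candidates
        synthetic.append(cand)

augmented = pca.inverse_transform(np.array(synthetic))
print("synthesized", augmented.shape[0], "constrained crackle-like samples")
```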
7. Bellur A, Thakkar K, Elhilali M. Explicit-memory multiresolution adaptive framework for speech and music separation. EURASIP J Audio Speech Music Process 2023;2023:20. PMID: 37181589; PMCID: PMC10169896; DOI: 10.1186/s13636-023-00286-7.
Abstract
The human auditory system employs a number of principles to facilitate the selection of perceptually separated streams from a complex sound mixture. The brain leverages multi-scale redundant representations of the input and uses memory (or priors) to guide the selection of a target sound from the input mixture. Moreover, feedback mechanisms refine the memory constructs, further improving the selectivity for a particular sound object amidst dynamic backgrounds. The present study proposes a unified end-to-end computational framework that mimics these principles for sound source separation applied to both speech and music mixtures. While the problems of speech enhancement and music separation have often been tackled separately due to the constraints and specificities of each signal domain, the current work posits that common principles for sound source separation are domain-agnostic. In the proposed scheme, parallel and hierarchical convolutional paths map input mixtures onto redundant but distributed higher-dimensional subspaces and use the concept of temporal coherence to gate the selection of embeddings belonging to a target stream abstracted in memory. These explicit memories are further refined through self-feedback from incoming observations in order to improve the system's selectivity when faced with unknown backgrounds. The model yields stable source-separation outcomes for both speech and music mixtures and demonstrates the benefits of explicit memory as a powerful representation of priors that guide information selection from complex inputs.
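A simplified sketch of temporal-coherence gating, selecting embedding channels whose recent activity correlates with a memory trace of the target; the windowing, correlation measure, and threshold are illustrative assumptions, not the paper's architecture:

```python
import numpy as np

rng = np.random.default_rng(5)
T, C = 400, 64
embeddings = rng.normal(size=(T, C))   # time x channel activations of the mixture (stand-in)
memory = rng.normal(size=T)            # memory trace of the target stream (stand-in)

win = 50
gated = np.zeros_like(embeddings)
for t in range(win, T):
    seg = embeddings[t - win:t]        # recent activity per channel
    ref = memory[t - win:t]
    # correlation of each channel with the memory trace over the window
    coh = np.array([np.corrcoef(seg[:, c], ref)[0, 1] for c in range(C)])
    gated[t] = embeddings[t] * (coh > 0.2)   # pass only channels coherent with memory

print("fraction of channel-time points retained:", (gated != 0).mean())
```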
Affiliation(s)
- Ashwin Bellur: Electrical and Computer Engineering, Johns Hopkins University, Baltimore, USA
- Karan Thakkar: Electrical and Computer Engineering, Johns Hopkins University, Baltimore, USA
- Mounya Elhilali: Electrical and Computer Engineering, Johns Hopkins University, Baltimore, USA
8. Higgins NC, Scurry AN, Jiang F, Little DF, Alain C, Elhilali M, Snyder JS. Adaptation in the sensory cortex drives bistable switching during auditory stream segregation. Neurosci Conscious 2023;2023:niac019. PMID: 36751309; PMCID: PMC9899071; DOI: 10.1093/nc/niac019.
Abstract
Current theories of perception emphasize the role of neural adaptation, inhibitory competition, and noise as key components that lead to switches in perception. Supporting evidence comes from neurophysiological findings of specific neural signatures in modality-specific and supramodal brain areas that appear to be critical to switches in perception. We used functional magnetic resonance imaging to study brain activity around the time of switches in perception while participants listened to a bistable auditory stream segregation stimulus, which can be heard as one integrated stream of tones or two segregated streams of tones. The auditory thalamus showed more activity around the time of a switch from segregated to integrated compared to periods of stable perception of the integrated percept; in contrast, the rostral anterior cingulate cortex and the inferior parietal lobule showed more activity around the time of a switch from integrated to segregated compared to periods of stable perception of the segregated percept, consistent with prior findings of asymmetries in brain activity depending on the switch direction. In sound-responsive areas of the auditory cortex, neural activity increased in strength preceding switches in perception and declined in strength over time following them. Such dynamics in the auditory cortex are consistent with the role of adaptation proposed by computational models of visual and auditory bistable switching, whereby the strength of neural activity decreases following a switch in perception, eventually destabilizing the current percept enough to trigger a switch to an alternative percept.
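A compact sketch of the adaptation-driven switching dynamic referenced at the end of the abstract, in the style of standard mutual-inhibition models of bistable perception (all parameter values are illustrative, not fitted to the fMRI data):

```python
import numpy as np

def simulate(T=20000, dt=0.001, beta=3.0, g=2.0, tau=0.01, tau_a=2.0, noise=0.05):
    """Two percepts compete via mutual inhibition; slow adaptation destabilizes the winner."""
    rng = np.random.default_rng(6)
    r = np.array([0.6, 0.4])   # activity of 'integrated' and 'segregated' populations
    a = np.zeros(2)            # adaptation variables
    dominant = []
    for _ in range(T):
        inp = 1.0 - beta * r[::-1] - g * a + noise * rng.normal(size=2)
        drive = np.clip(inp, 0, None)        # threshold-linear activation
        r += dt / tau * (-r + drive)
        a += dt / tau_a * (-a + r)           # adaptation slowly tracks activity
        dominant.append(int(r[1] > r[0]))
    return np.array(dominant)

d = simulate()
print("number of perceptual switches:", int(np.abs(np.diff(d)).sum()))
```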
Affiliation(s)
- Nathan C Higgins: Department of Communication Sciences and Disorders, University of South Florida, 4202 E. Fowler Avenue, PCD1017, Tampa, FL 33620, USA
- Alexandra N Scurry: Department of Psychology, University of Nevada, 1664 N. Virginia Street, Mail Stop 0296, Reno, NV 89557, USA
- Fang Jiang: Department of Psychology, University of Nevada, 1664 N. Virginia Street, Mail Stop 0296, Reno, NV 89557, USA
- David F Little: Department of Electrical and Computer Engineering, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD 21218, USA
- Claude Alain: Rotman Research Institute, Baycrest Health Sciences, 3560 Bathurst Street, Toronto, ON M6A 2E1, Canada
- Mounya Elhilali: Department of Electrical and Computer Engineering, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD 21218, USA
- Joel S Snyder: Department of Psychology, University of Nevada, 4505 Maryland Parkway, Mail Stop 5030, Las Vegas, NV 89154, USA
9. Rennoll V, McLane I, Elhilali M, West JE. Optimized Acoustic Phantom Design for Characterizing Body Sound Sensors. Sensors (Basel) 2022;22:9086. PMID: 36501787; PMCID: PMC9735779; DOI: 10.3390/s22239086.
Abstract
Many commercial and prototype devices are available for capturing body sounds that provide important information on the health of the lungs and heart; however, there is no agreed-upon standardized method to characterize and compare these devices. Acoustic phantoms are commonly used because they generate repeatable sounds that couple to devices through a material layer that mimics the characteristics of skin. While multiple acoustic phantoms have been presented in the literature, it is unclear how design elements, such as the driver type and coupling layer, impact the acoustical characteristics of the phantom and, therefore, the device being measured. Here, a design-of-experiments approach is used to compare the frequency responses of various phantom constructions. An acoustic phantom that uses a loudspeaker to generate sound and excite a gelatin layer supported by a grid is found to have a flatter and more uniform frequency response than other candidate designs with a sound exciter and plate support. When measured on the optimal acoustic phantom, three devices show more consistent measurements with added weight and differing positions than on a non-optimal phantom. Overall, the statistical models developed here provide greater insight into acoustic phantom design for improved device characterization.
10. Kala A, McCollum ED, Elhilali M. Implications of clinical variability on computer-aided lung auscultation classification. Annu Int Conf IEEE Eng Med Biol Soc 2022;2022:4421-4425. PMID: 36086501; DOI: 10.1109/embc48229.2022.9871393.
Abstract
Thanks to recent advances in digital stethoscopes and the rapid adoption of deep learning techniques, there has been tremendous progress in the field of Computerized Auscultation Analysis (CAA). Despite these promising leaps, the deployment of these technologies in real-world applications remains limited due to inherent challenges in properly interpreting clinical data, particularly auscultations. One of the limiting factors is the inherent ambiguity that comes with variability in clinical opinion, even among highly trained experts. This lack of unanimity is often ignored when developing machine learning techniques to automatically screen normal from abnormal lung signals, with most algorithms being developed and tested on highly curated datasets. To better understand the potential pitfalls this selective analysis could cause in deployment, the current work explores the impact of clinical opinion variability on algorithms trained on gold-standard data to detect adventitious patterns in lung sounds. The study shows that uncertainty in clinical opinion introduces far more variability and performance drop than outright disagreement among experts. The study also explores the feasibility of automatically flagging auscultation signals based on their estimated uncertainty, thereby recommending further reassessment as well as improving computer-aided analysis.
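A small sketch of flagging recordings by estimated uncertainty, here using the predictive entropy of a classifier's output probabilities (the entropy criterion and threshold are illustrative assumptions):

```python
import numpy as np

def predictive_entropy(p):
    """Entropy of a class-probability vector; high values mean an uncertain prediction."""
    p = np.clip(p, 1e-12, 1.0)
    return -np.sum(p * np.log(p), axis=-1)

# Stand-in posterior probabilities (normal vs. adventitious) for a batch of recordings
probs = np.array([[0.97, 0.03],
                  [0.55, 0.45],
                  [0.50, 0.50]])

flag = predictive_entropy(probs) > 0.6   # flag near-chance predictions for reassessment
print("flag for expert reassessment:", flag)
```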
11. Park S, Han DK, Elhilali M. Cross-Referencing Self-Training Network for Sound Event Detection in Audio Mixtures. IEEE Trans Multimedia 2022;25:4573-4585. PMID: 37928617; PMCID: PMC10621403; DOI: 10.1109/tmm.2022.3178591.
Abstract
Sound event detection is an important facet of audio tagging that aims to identify sounds of interest and define both the sound category and the time boundaries of each sound event in a continuous recording. With advances in deep neural networks, there has been tremendous improvement in the performance of sound event detection systems, although at the expense of costly data collection and labeling efforts. In fact, current state-of-the-art methods employ supervised training that leverages large amounts of data samples and corresponding labels to facilitate identification of sound categories and time stamps of events. As an alternative, the current study proposes a semi-supervised method for generating pseudo-labels from unsupervised data using a student-teacher scheme that balances self-training and cross-training. Additionally, this paper explores post-processing that extracts sound-event intervals from the network's predictions, for further improvement in sound event detection performance. The proposed approach is evaluated on the sound event detection task of the DCASE2020 challenge. The results of these methods on both the "validation" and "public evaluation" sets of the DESED database show significant improvement compared to state-of-the-art systems in semi-supervised learning.
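A minimal sketch of the kind of post-processing mentioned: thresholding frame-level probabilities, smoothing with a median filter, and reading off contiguous intervals (parameter values are illustrative):

```python
import numpy as np
from scipy.signal import medfilt

def extract_events(frame_probs, thresh=0.5, kernel=7, hop_s=0.02):
    """Turn per-frame event probabilities into (onset, offset) times in seconds."""
    active = medfilt((frame_probs > thresh).astype(float), kernel).astype(bool)
    edges = np.diff(active.astype(int), prepend=0, append=0)
    onsets = np.where(edges == 1)[0]
    offsets = np.where(edges == -1)[0]
    return [(on * hop_s, off * hop_s) for on, off in zip(onsets, offsets)]

probs = np.concatenate([np.full(20, 0.1), np.full(30, 0.9), np.full(20, 0.1)])
print(extract_events(probs))   # -> one event spanning roughly 0.4 s to 1.0 s
```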
Affiliation(s)
- Sangwook Park: Department of Electronic Engineering, Gangneung-Wonju National University, Gangneung 25457, South Korea
- David K Han: Department of Electrical and Computer Engineering, Drexel University, Philadelphia, PA 19104, USA
- Mounya Elhilali: Department of Electrical and Computer Engineering (jointly with the Department of Psychology and Brain Sciences), Johns Hopkins University, Baltimore, MD 21218, USA
12. Park DE, Watson NL, Focht C, Feikin D, Hammitt LL, Brooks WA, Howie SRC, Kotloff KL, Levine OS, Madhi SA, Murdoch DR, O'Brien KL, Scott JAG, Thea DM, Amorninthapichet T, Awori J, Bunthi C, Ebruke B, Elhilali M, Higdon M, Hossain L, Jahan Y, Moore DP, Mulindwa J, Mwananyanda L, Naorat S, Prosperi C, Thamthitiwat S, Verwey C, Jablonski KA, Power MC, Young HA, Deloria Knoll M, McCollum ED. Digitally recorded and remotely classified lung auscultation compared with conventional stethoscope classifications among children aged 1-59 months enrolled in the Pneumonia Etiology Research for Child Health (PERCH) case-control study. BMJ Open Respir Res 2022;9(1):e001144. PMID: 35577452; PMCID: PMC9115042; DOI: 10.1136/bmjresp-2021-001144.
Abstract
BACKGROUND Diagnosis of pneumonia remains challenging. Digitally recorded and remote human classified lung sounds may offer benefits beyond conventional auscultation, but it is unclear whether classifications differ between the two approaches. We evaluated concordance between digital and conventional auscultation. METHODS We collected digitally recorded lung sounds, conventional auscultation classifications and clinical measures and samples from children with pneumonia (cases) in low-income and middle-income countries. Physicians remotely classified recordings as crackles, wheeze or uninterpretable. Conventional and digital auscultation concordance was evaluated among 383 pneumonia cases with concurrently (within 2 hours) collected conventional and digital auscultation classifications using prevalence-adjusted bias-adjusted kappa (PABAK). Using an expanded set of 737 cases that also incorporated the non-concurrently collected assessments, we evaluated whether associations between auscultation classifications and clinical or aetiological findings differed between conventional or digital auscultation using χ2 tests and logistic regression adjusted for age, sex and site. RESULTS Conventional and digital auscultation concordance was moderate for classifying crackles and/or wheeze versus neither crackles nor wheeze (PABAK=0.50), and fair for crackles-only versus not crackles-only (PABAK=0.30) and any wheeze versus no wheeze (PABAK=0.27). Crackles were more common on conventional auscultation, whereas wheeze was more frequent on digital auscultation. Compared with neither crackles nor wheeze, crackles-only on both conventional and digital auscultation was associated with abnormal chest radiographs (adjusted OR (aOR)=1.53, 95% CI 0.99 to 2.36; aOR=2.09, 95% CI 1.19 to 3.68, respectively); any wheeze was inversely associated with C-reactive protein >40 mg/L using conventional auscultation (aOR=0.50, 95% CI 0.27 to 0.92) and with very severe pneumonia using digital auscultation (aOR=0.67, 95% CI 0.46 to 0.97). Crackles-only on digital auscultation was associated with mortality compared with any wheeze (aOR=2.70, 95% CI 1.12 to 6.25). CONCLUSIONS Conventional auscultation and remotely-classified digital auscultation displayed moderate concordance for presence/absence of wheeze and crackles among cases. Conventional and digital auscultation may provide different classification patterns, but wheeze was associated with decreased clinical severity on both.
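For reference, the prevalence-adjusted bias-adjusted kappa reduces to PABAK = 2·Po - 1, where Po is the observed proportion of agreement; a tiny sketch with made-up classifications:

```python
import numpy as np

def pabak(rater_a, rater_b):
    """Prevalence-adjusted bias-adjusted kappa: 2 * observed agreement - 1."""
    observed_agreement = np.mean(np.asarray(rater_a) == np.asarray(rater_b))
    return 2 * observed_agreement - 1

# Stand-in binary classifications (1 = crackles/wheeze present) for 10 cases
conventional = [1, 0, 1, 1, 0, 0, 1, 0, 1, 0]
digital      = [1, 0, 0, 1, 0, 1, 1, 0, 1, 0]
print(f"PABAK = {pabak(conventional, digital):.2f}")   # 8/10 agreement -> 0.60
```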
Affiliation(s)
- Daniel E Park: Department of Environmental and Occupational Health, The George Washington University, Washington, District of Columbia, USA
- Daniel Feikin: Department of International Health, Johns Hopkins University International Vaccine Access Center, Baltimore, Maryland, USA
- Laura L Hammitt: Department of International Health, Johns Hopkins University International Vaccine Access Center, Baltimore, Maryland, USA; Kenya Medical Research Institute - Wellcome Trust Research Programme, Kilifi, Kenya
- W Abdullah Brooks: International Centre for Diarrhoeal Disease Research Bangladesh, Dhaka and Matlab, Bangladesh; Johns Hopkins University Bloomberg School of Public Health, Baltimore, Maryland, USA
- Stephen R C Howie: Medical Research Council Unit, Basse, Gambia; Department of Paediatrics, The University of Auckland, Auckland, New Zealand
- Karen L Kotloff: Department of Pediatrics, University of Maryland Center for Vaccine Development, Baltimore, Maryland, USA
- Orin S Levine: Department of International Health, Johns Hopkins University International Vaccine Access Center, Baltimore, Maryland, USA; Bill & Melinda Gates Foundation, Seattle, Washington, USA
- Shabir A Madhi: South African Medical Research Council Vaccines and Infectious Diseases Analytics Research Unit, University of the Witwatersrand, Johannesburg, Gauteng, South Africa; Department of Science and Innovation/National Research Foundation: Vaccine Preventable Diseases Unit, University of the Witwatersrand, Johannesburg, Gauteng, South Africa
- David R Murdoch: Department of Pathology and Biomedical Science, University of Otago, Christchurch, New Zealand; Microbiology Unit, Canterbury Health Laboratories, Christchurch, New Zealand
- Katherine L O'Brien: Department of International Health, Johns Hopkins University International Vaccine Access Center, Baltimore, Maryland, USA
- J Anthony G Scott: Kenya Medical Research Institute - Wellcome Trust Research Programme, Kilifi, Kenya; Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, UK
- Donald M Thea: Department of Global Health, Boston University School of Public Health, Boston, Massachusetts, USA
- Juliet Awori: Kenya Medical Research Institute - Wellcome Trust Research Programme, Kilifi, Kenya
- Charatdao Bunthi: Division of Global Health Protection, Thailand Ministry of Public Health – US CDC Collaboration, Royal Thai Government Ministry of Public Health, Bangkok, Thailand
- Bernard Ebruke: Medical Research Council Unit, Basse, Gambia; International Foundation Against Infectious Disease in Nigeria, Abuja, Nigeria
- Mounya Elhilali: Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, Maryland, USA
- Melissa Higdon: Department of International Health, Johns Hopkins University International Vaccine Access Center, Baltimore, Maryland, USA
- Lokman Hossain: International Centre for Diarrhoeal Disease Research Bangladesh, Dhaka and Matlab, Bangladesh
- Yasmin Jahan: International Centre for Diarrhoeal Disease Research Bangladesh, Dhaka and Matlab, Bangladesh
- David P Moore: South African Medical Research Council Vaccines and Infectious Diseases Analytics Research Unit, University of the Witwatersrand, Johannesburg, South Africa; Department of Paediatrics and Child Health, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
- Justin Mulindwa: Department of Paediatrics and Child Health, University Teaching Hospital, Lusaka, Zambia
- Lawrence Mwananyanda: Department of Global Health, Boston University School of Public Health, Boston, Massachusetts, USA; Right to Care - Zambia, Lusaka, Zambia
- Christine Prosperi: Department of International Health, Johns Hopkins University International Vaccine Access Center, Baltimore, Maryland, USA
- Somsak Thamthitiwat: Division of Global Health Protection, Thailand Ministry of Public Health – US CDC Collaboration, Royal Thai Government Ministry of Public Health, Nonthaburi, Thailand
- Charl Verwey: South African Medical Research Council Vaccines and Infectious Diseases Analytics Research Unit, University of the Witwatersrand, Johannesburg, Gauteng, South Africa; Department of Paediatrics and Child Health, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
- Melinda C Power: Department of Epidemiology, The George Washington University, Washington, District of Columbia, USA
- Heather A Young: Department of Epidemiology, The George Washington University, Washington, District of Columbia, USA
- Maria Deloria Knoll: Department of International Health, Johns Hopkins University International Vaccine Access Center, Baltimore, Maryland, USA
- Eric D McCollum: Global Program in Respiratory Sciences, Eudowood Division of Pediatric Respiratory Sciences, Johns Hopkins School of Medicine, Baltimore, Maryland, USA; Department of International Health, Johns Hopkins University Bloomberg School of Public Health, Baltimore, Maryland, USA
13. Allen KM, Salles A, Park S, Elhilali M, Moss CF. Effect of background clutter on neural discrimination in the bat auditory midbrain. J Neurophysiol 2021;126:1772-1782. PMID: 34669503; PMCID: PMC8794058; DOI: 10.1152/jn.00109.2021.
Abstract
The discrimination of complex sounds is a fundamental function of the auditory system. This operation must be robust in the presence of noise and acoustic clutter. Echolocating bats are auditory specialists that discriminate sonar objects in acoustically complex environments. Bats produce brief signals, interrupted by periods of silence, rendering echo snapshots of sonar objects. Sonar object discrimination requires that bats process spatially and temporally overlapping echoes to make split-second decisions. The mechanisms that enable this discrimination are not well understood, particularly in complex environments. We explored the neural underpinnings of sonar object discrimination in the presence of acoustic scattering caused by physical clutter. We performed electrophysiological recordings in the inferior colliculus (IC) of awake big brown bats in response to broadcasts of prerecorded echoes from physical objects. We acquired single-unit responses to echoes and discovered a subpopulation of IC neurons that encode acoustic features that can be used to discriminate between sonar objects. We further investigated the effects of environmental clutter on this population's encoding of acoustic features. We discovered that the effect of background clutter on sonar object discrimination is highly variable and depends on object properties and target-clutter spatiotemporal separation. In many conditions, clutter impaired discrimination of sonar objects. However, in some instances clutter enhanced acoustic features of echo returns, enabling higher levels of discrimination. This finding suggests that environmental clutter may augment acoustic cues used for sonar target discrimination and provides further evidence, in a growing body of literature, that noise is not universally detrimental to sensory encoding. NEW & NOTEWORTHY Bats are powerful animal models for investigating the encoding of auditory objects under acoustically challenging conditions. Although past work has considered the effect of acoustic clutter on sonar target detection, less is known about target discrimination in clutter. Our work shows that the neural encoding of auditory objects was affected by clutter in a distance-dependent manner. These findings advance knowledge on auditory object detection and discrimination and noise-dependent stimulus enhancement.
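A minimal sketch of quantifying single-unit discriminability between two sonar objects from spike counts using the standard d' sensitivity index (the stimulus conditions and counts are stand-ins):

```python
import numpy as np

def dprime(counts_a, counts_b):
    """d' between two spike-count distributions (higher = more discriminable)."""
    mu_a, mu_b = np.mean(counts_a), np.mean(counts_b)
    pooled_sd = np.sqrt(0.5 * (np.var(counts_a) + np.var(counts_b)))
    return abs(mu_a - mu_b) / pooled_sd

rng = np.random.default_rng(7)
obj1 = rng.poisson(8, size=50)            # spike counts to echoes of object 1 (stand-in)
obj2 = rng.poisson(12, size=50)           # spike counts to echoes of object 2 (stand-in)
obj2_clutter = rng.poisson(10, size=50)   # same object with background clutter (stand-in)

print(f"clean:   d' = {dprime(obj1, obj2):.2f}")
print(f"clutter: d' = {dprime(obj1, obj2_clutter):.2f}")   # clutter can reduce discriminability
```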
Affiliation(s)
- Kathryne M Allen: Department of Psychological and Brain Sciences, Johns Hopkins University, Baltimore, Maryland
- Angeles Salles: Department of Psychological and Brain Sciences, Johns Hopkins University, Baltimore, Maryland
- Sangwook Park: Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, Maryland
- Mounya Elhilali: Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, Maryland
- Cynthia F Moss: Departments of Psychological and Brain Sciences, Neuroscience, and Mechanical Engineering, Johns Hopkins University, Baltimore, Maryland
14. Higgins NC, Monjaras AG, Yerkes BD, Little DF, Nave-Blodgett JE, Elhilali M, Snyder JS. Resetting of Auditory and Visual Segregation Occurs After Transient Stimuli of the Same Modality. Front Psychol 2021;12:720131. PMID: 34621219; PMCID: PMC8490814; DOI: 10.3389/fpsyg.2021.720131.
Abstract
In the presence of a continually changing sensory environment, maintaining stable but flexible awareness is paramount and requires continual organization of information. Determining which stimulus features belong together and which are separate is therefore one of the primary tasks of the sensory systems. It is unknown whether a global or a sensory-specific mechanism regulates the final perceptual outcome of this streaming process. To test the extent of modality independence in perceptual control, an auditory streaming experiment and a visual moving-plaid experiment were performed. Both were designed to evoke alternating perception of an integrated or segregated percept. In both experiments, transient auditory and visual distractor stimuli were presented in separate blocks, such that the distractors did not overlap in frequency or space with the streaming or plaid stimuli, respectively, thus preventing peripheral interference. When a distractor was presented in the opposite modality as the bistable stimulus (visual distractors during auditory streaming or auditory distractors during visual streaming), the probability of percept switching was not significantly different than when no distractor was presented. Conversely, significant differences in switch probability were observed following within-modality distractors, but only when the pre-distractor percept was segregated. Given the modality specificity of the distractor-induced resetting, the results suggest that conscious perception is at least partially controlled by modality-specific processing. The fact that the distractors did not have peripheral overlap with the bistable stimuli indicates that the perceptual reset arises at a locus where stimuli of different frequencies and spatial locations are integrated.
Affiliation(s)
- Nathan C Higgins: Department of Psychology, University of Nevada Las Vegas, Las Vegas, NV, United States
- Ambar G Monjaras: Department of Psychology, University of Nevada Las Vegas, Las Vegas, NV, United States
- Breanne D Yerkes: Department of Psychology, University of Nevada Las Vegas, Las Vegas, NV, United States
- David F Little: Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, MD, United States
- Mounya Elhilali: Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, MD, United States
- Joel S Snyder: Department of Psychology, University of Nevada Las Vegas, Las Vegas, NV, United States
15.
Abstract
Salience is the quality of a sensory signal that attracts involuntary attention in humans. While it primarily reflects conspicuous physical attributes of a scene, our understanding of the processes underlying what makes a certain object or event salient remains limited. In the vision literature, experimental results, theoretical accounts, and large amounts of eye-tracking data using rich stimuli have shed light on some of the underpinnings of visual salience in the brain. In contrast, studies of auditory salience have lagged behind due to limitations in both the experimental designs and the stimulus datasets used to probe the question of salience in complex everyday soundscapes. In this work, we deploy an online platform to study salience using a dichotic listening paradigm with natural auditory stimuli. The study validates crowd-sourcing as a reliable platform for collecting behavioral responses to auditory salience by comparing experimental outcomes to findings acquired in a controlled laboratory setting. A model-based analysis demonstrates the benefits of extending behavioral measures of salience to a broader selection of auditory scenes and larger pools of subjects. Overall, this effort extends our current knowledge of auditory salience in everyday soundscapes and highlights the limitations of low-level acoustic attributes in capturing the richness of natural soundscapes.
Affiliation(s)
- Sandeep Reddy Kothinti: Department of Electrical and Computer Engineering, Center for Language and Speech Processing, The Johns Hopkins University, Baltimore, Maryland 21218, USA
- Nicholas Huang: Department of Biomedical Engineering, The Johns Hopkins University, Baltimore, Maryland 21218, USA
- Mounya Elhilali: Department of Electrical and Computer Engineering, Center for Language and Speech Processing, The Johns Hopkins University, Baltimore, Maryland 21218, USA
16.
Abstract
The human brain extracts statistical regularities embedded in real-world scenes to sift through the complexity stemming from changing dynamics and entwined uncertainty along multiple perceptual dimensions (e.g., pitch, timbre, location). While there is evidence that sensory dynamics along different auditory dimensions are tracked independently by separate cortical networks, how these statistics are integrated to give rise to unified objects remains unknown, particularly in dynamic scenes that lack conspicuous coupling between features. Using tone sequences with stochastic regularities along spectral and spatial dimensions, this study examines behavioral and electrophysiological responses from human listeners (male and female) to changing statistics in auditory sequences and uses a computational model of predictive Bayesian inference to formulate multiple hypotheses for statistical integration across features. Neural responses reveal multiplexed brain responses reflecting both local statistics along individual features in frontocentral networks and global (object-level) processing in centroparietal networks. Independent tracking of local surprisal along each acoustic feature reveals linear modulation of neural responses, while global melody-level statistics follow a nonlinear integration of statistical beliefs across features to guide perception. Near-identical results are obtained in separate experiments along spectral and spatial acoustic dimensions, suggesting a common mechanism for statistical inference in the brain. Potential variations in statistical integration strategies and memory deployment shed light on individual variability between listeners in terms of behavioral efficacy and fidelity of neural encoding of stochastic change in acoustic sequences. SIGNIFICANCE STATEMENT The world around us is complex and ever changing: in everyday listening, sound sources evolve along multiple dimensions, such as pitch, timbre, and spatial location, and they exhibit emergent statistical properties that change over time. In the face of this complexity, the brain builds an internal representation of the external world by collecting statistics from the sensory input along multiple dimensions. Using a Bayesian predictive inference model, this work considers alternative hypotheses for how statistics are combined across sensory dimensions. Behavioral and neural responses from human listeners show that the brain multiplexes two representations, where local statistics along each feature linearly affect neural responses, and global statistics nonlinearly combine statistical beliefs across dimensions to shape perception of stochastic auditory sequences.
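A small sketch of the surprisal quantity such predictive models track, here for a running Gaussian predictor along a single acoustic feature (the generative assumptions are illustrative):

```python
import numpy as np

def sequential_surprisal(x, sigma=1.0, lam=0.9):
    """Surprisal -log p(x_t | past) under a running, exponentially weighted Gaussian mean."""
    mu, out = 0.0, []
    for xt in x:
        out.append(0.5 * ((xt - mu) / sigma) ** 2 + 0.5 * np.log(2 * np.pi * sigma ** 2))
        mu = lam * mu + (1 - lam) * xt   # update the belief about the mean
    return np.array(out)

rng = np.random.default_rng(8)
# A pitch-like sequence whose statistics change midway
seq = np.concatenate([rng.normal(0, 1, 100), rng.normal(4, 1, 100)])
s = sequential_surprisal(seq)
print("mean surprisal before / right after the change:",
      s[:100].mean().round(2), s[100:110].mean().round(2))
```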
17. McLane I, Emmanouilidou D, West JE, Elhilali M. Design and Comparative Performance of a Robust Lung Auscultation System for Noisy Clinical Settings. IEEE J Biomed Health Inform 2021;25:2583-2594. PMID: 33534721; PMCID: PMC8374873; DOI: 10.1109/jbhi.2021.3056916.
Abstract
Chest auscultation is a widely used clinical tool for respiratory disease detection. The stethoscope has undergone a number of transformative enhancements since its invention, including the introduction of electronic systems in the last two decades. Nevertheless, stethoscopes remain riddled with a number of issues that limit their signal quality and diagnostic capability, rendering both traditional and electronic stethoscopes unusable in noisy or non-traditional environments (e.g., emergency rooms, rural clinics, ambulatory vehicles). This work outlines the design and validation of an advanced electronic stethoscope that dramatically reduces external noise contamination through hardware redesign and real-time, dynamic signal processing. The proposed system takes advantage of an acoustic sensor array, an external-facing microphone, and on-board processing to perform adaptive noise suppression. The proposed system is objectively compared to six commercially available acoustic and electronic devices in varying levels of simulated noisy clinical settings and quantified using two metrics that reflect perceptual audibility and statistical similarity: the normalized covariance measure (NCM) and the magnitude-squared coherence (MSC). The analyses highlight the major limitations of current stethoscopes and the significant improvements the proposed system makes in challenging settings by minimizing both distortion of lung sounds and contamination by ambient noise.
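One of the two metrics, the magnitude-squared coherence, can be computed with standard tools; a sketch with synthetic signals (the noise setup and band limits are illustrative):

```python
import numpy as np
from scipy.signal import coherence

rng = np.random.default_rng(9)
fs = 4000
t = np.arange(0, 5, 1 / fs)
reference = rng.normal(size=t.size)                    # clean lung-sound stand-in
recorded = reference + 0.5 * rng.normal(size=t.size)   # device output with ambient noise

# MSC near 1 means the recording faithfully tracks the reference at that frequency
f, msc = coherence(reference, recorded, fs=fs, nperseg=1024)
band = (f >= 100) & (f <= 1000)                        # band where lung sounds concentrate
print(f"mean MSC in 100-1000 Hz: {msc[band].mean():.2f}")
```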
18. Rennoll V, McLane I, Emmanouilidou D, West J, Elhilali M. Electronic Stethoscope Filtering Mimics the Perceived Sound Characteristics of Acoustic Stethoscope. IEEE J Biomed Health Inform 2021;25:1542-1549. PMID: 32870803; PMCID: PMC7917155; DOI: 10.1109/jbhi.2020.3020494.
Abstract
Electronic stethoscopes offer several advantages over conventional acoustic stethoscopes, including noise reduction, increased amplification, and the ability to store and transmit sounds. However, the acoustical characteristics of electronic and acoustic stethoscopes can differ significantly, introducing a barrier for clinicians transitioning to electronic stethoscopes. This work proposes a method to process lung sounds recorded by an electronic stethoscope such that the sounds are perceived to have been captured by an acoustic stethoscope. The proposed method calculates an electronic-to-acoustic stethoscope filter by measuring the difference between the average frequency responses of an acoustic and an electronic stethoscope to multiple lung sounds. To validate the method, a change detection experiment was conducted with 51 medical professionals to compare filtered electronic, unfiltered electronic, and acoustic stethoscope lung sounds. Participants were asked to detect when transitions occurred in sounds comprising several sections of the three types of recordings. Transitions between the filtered electronic and acoustic stethoscope sections were detected, on average, at chance (sensitivity index equal to zero) and were detected significantly less often than transitions between the unfiltered electronic and acoustic stethoscope sections, demonstrating the effectiveness of the method in filtering electronic stethoscope sounds to mimic an acoustic stethoscope. This processing could incentivize clinicians to adopt electronic stethoscopes by providing a means to shift between the sound characteristics of acoustic and electronic stethoscopes in a single device, allowing for a faster transition to new technology and greater appreciation of the electronic sound quality.
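A rough sketch of deriving such a filter from the two devices' average magnitude responses and realizing it as a linear-phase FIR (the averaging and filter-design choices are assumptions, not the paper's exact procedure):

```python
import numpy as np
from scipy.signal import welch, firwin2, lfilter

def average_magnitude(recordings, fs, nperseg=512):
    """Average Welch magnitude spectrum over a set of lung-sound recordings."""
    mags = []
    for r in recordings:
        f, pxx = welch(r, fs=fs, nperseg=nperseg)
        mags.append(np.sqrt(pxx))
    return f, np.mean(mags, axis=0)

rng = np.random.default_rng(10)
fs = 4000
acoustic_recs = [rng.normal(size=4 * fs) for _ in range(5)]     # stand-in recordings
electronic_recs = [rng.normal(size=4 * fs) for _ in range(5)]

f, mag_acoustic = average_magnitude(acoustic_recs, fs)
_, mag_electronic = average_magnitude(electronic_recs, fs)

# Desired gain: make the electronic response look like the acoustic one
gain = mag_acoustic / (mag_electronic + 1e-12)
fir = firwin2(255, f, gain, fs=fs)                  # linear-phase FIR realization

filtered = lfilter(fir, 1.0, electronic_recs[0])    # 'acoustified' electronic recording
print("designed FIR with", fir.size, "taps")
```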
19. Skerritt-Davis B, Elhilali M. Computational framework for investigating predictive processing in auditory perception. J Neurosci Methods 2021;360:109177. PMID: 33839191; DOI: 10.1016/j.jneumeth.2021.109177.
Abstract
BACKGROUND The brain tracks sound sources as they evolve in time, collecting contextual information to predict future sensory inputs. Previous work in predictive coding typically focuses on the perception of predictable stimuli, leaving open how these same neural processes operate in more complex, real-world environments containing randomness and uncertainty. NEW METHOD To facilitate investigation into the perception of less tightly controlled listening scenarios, we present a computational model as a tool to ask targeted questions about the underlying predictive processes that connect complex sensory inputs to listener behavior and neural responses. In the modeling framework, observed sound features (e.g., pitch) are tracked sequentially using Bayesian inference. Sufficient statistics are inferred from past observations at multiple time scales and used to make predictions about future observations while tracking the statistical structure of the sensory input. RESULTS Facets of the model are discussed in terms of their application to perceptual research, and examples taken from real-world audio demonstrate the model's flexibility in capturing a variety of statistical structures along various perceptual dimensions. COMPARISON WITH EXISTING METHODS Previous models are often targeted toward interpreting a particular experimental paradigm (e.g., the oddball paradigm), perceptual dimension (e.g., pitch processing), or task (e.g., speech segregation), thus limiting their ability to generalize to other domains. The presented model is designed as a flexible and practical tool for broad application. CONCLUSION The model is presented as a general framework for generating new hypotheses and guiding investigation into the neural processes underlying predictive coding of complex scenes.
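A compact sketch of the core loop such a framework implements: sequentially updating sufficient statistics of an observed feature with a forgetting factor and scoring each observation against the prediction (the conjugate-Gaussian choice is an illustrative simplification):

```python
import numpy as np

class GaussianTracker:
    """Sequentially tracks the mean/variance of a feature with exponential forgetting."""
    def __init__(self, lam=0.95):
        self.lam, self.n, self.s1, self.s2 = lam, 1e-6, 0.0, 1.0

    def step(self, x):
        # Predictive check of x against the current sufficient statistics
        mu = self.s1 / self.n if self.n > 1e-5 else 0.0
        var = max(self.s2 / self.n - mu ** 2, 1e-6)
        surprisal = 0.5 * ((x - mu) ** 2 / var + np.log(2 * np.pi * var))
        # Update sufficient statistics, discounting the past
        self.n = self.lam * self.n + 1
        self.s1 = self.lam * self.s1 + x
        self.s2 = self.lam * self.s2 + x ** 2
        return mu, var, surprisal

rng = np.random.default_rng(11)
tracker = GaussianTracker()
for x in np.concatenate([rng.normal(0, 1, 50), rng.normal(0, 3, 50)]):
    mu, var, s = tracker.step(x)
print(f"tracked variance after a variance increase: {var:.1f}")   # should approach ~9
```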
Affiliation(s)
- Mounya Elhilali: Johns Hopkins University, 3400 N Charles St, Baltimore, MD, USA
20. McCollum ED, Park DE, Watson NL, Fancourt NSS, Focht C, Baggett HC, Brooks WA, Howie SRC, Kotloff KL, Levine OS, Madhi SA, Murdoch DR, Scott JAG, Thea DM, Awori JO, Chipeta J, Chuananon S, DeLuca AN, Driscoll AJ, Ebruke BE, Elhilali M, Emmanouilidou D, Githua LP, Higdon MM, Hossain L, Jahan Y, Karron RA, Kyalo J, Moore DP, Mulindwa JM, Naorat S, Prosperi C, Verwey C, West JE, Knoll MD, O'Brien KL, Feikin DR, Hammitt LL. Digital auscultation in PERCH: Associations with chest radiography and pneumonia mortality in children. Pediatr Pulmonol 2020;55:3197-3208. PMID: 32852888; PMCID: PMC7692889; DOI: 10.1002/ppul.25046.
Abstract
BACKGROUND Whether digitally recorded lung sounds are associated with radiographic pneumonia or clinical outcomes among children in low-income and middle-income countries is unknown. We sought to address these knowledge gaps. METHODS We enrolled children aged 1-59 months hospitalized with pneumonia at eight African and Asian Pneumonia Etiology Research for Child Health sites in six countries, recorded digital stethoscope lung sounds, obtained chest radiographs, and collected clinical outcomes. Recordings were processed and classified into binary categories: positive or negative for adventitious lung sounds. Listening and reading panels classified recordings and radiographs. Associations between recording classifications and World Health Organization (WHO)-defined primary endpoint pneumonia on chest radiographs (radiographic pneumonia) or mortality were evaluated. We also examined case fatality among risk strata. RESULTS Among children without WHO danger signs, wheezing (without crackles) had a lower adjusted odds ratio (aOR) for radiographic pneumonia (0.35, 95% confidence interval (CI): 0.15, 0.82), compared to children with normal recordings. Neither crackle only (no wheeze) (aOR: 2.13, 95% CI: 0.91, 4.96) nor any wheeze (with or without crackle) (aOR: 0.63, 95% CI: 0.34, 1.15) was associated with radiographic pneumonia. Among children with WHO danger signs, no lung recording classification was independently associated with radiographic pneumonia, although trends toward greater odds of radiographic pneumonia were observed among children classified with crackle only (no wheeze) or any wheeze (with or without crackle). Among children without WHO danger signs, those with recorded wheezing had a lower case fatality than those without wheezing (3.8% vs. 9.1%, p = .03). CONCLUSIONS Among lower-risk children without WHO danger signs, digitally recorded wheezing is associated with lower odds of radiographic pneumonia and with lower mortality. Although further research is needed, these data indicate that, with further development, digital auscultation may eventually contribute to child pneumonia care.
Collapse
Affiliation(s)
- Eric D McCollum
- Global Program in Respiratory Sciences, Eudowood Division of Pediatric Respiratory Sciences, Johns Hopkins School of Medicine, Baltimore, Maryland, USA.,Department of International Health, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, USA
| | - Daniel E Park
- Department of International Health, International Vaccine Access Center, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, USA.,Department of Epidemiology and Biostatistics, Milken Institute School of Public Health, George Washington University, Washington, District of Columbia, USA
| | | | - Nicholas S S Fancourt
- Department of International Health, International Vaccine Access Center, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, USA
| | | | - Henry C Baggett
- Global Disease Detection Center, US Centers for Disease Control and Prevention Collaboration, Thailand Ministry of Public Health, Mueang Nonthaburi, Nonthaburi, Thailand.,Division of Global Health Protection, Center for Global Health, Centers for Disease Control and Prevention, Atlanta, Georgia, USA
| | - W Abdullah Brooks
- Department of International Health, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, USA.,International Centre for Diarrhoeal Disease Research, Bangladesh (icddr,b), Dhaka and Matlab, Bangladesh
| | - Stephen R C Howie
- Medical Research Council Unit, Basse, The Gambia.,Department of Paediatrics, University of Auckland, Auckland, New Zealand.,Centre for International Health, University of Otago, Dunedin, New Zealand
| | - Karen L Kotloff
- Division of Infectious Disease and Tropical Pediatrics, Department of Pediatrics, Center for Vaccine Development and Global Health, University of Maryland School of Medicine, Baltimore, Maryland
| | - Orin S Levine
- Department of International Health, International Vaccine Access Center, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, USA.,Bill & Melinda Gates Foundation, Seattle, Washington, USA
| | - Shabir A Madhi
- Medical Research Council: Respiratory and Meningeal Pathogens Research Unit, University of the Witwatersrand, Johannesburg, South Africa.,Department of Science and Technology/National Research Foundation: Vaccine Preventable Diseases Unite, University of the Witwatersrand, Johannesburg, South Africa
| | - David R Murdoch
- Department of Pathology and Biomedical Science, University of Otago, Christchurch, New Zealand.,Microbiology Unit, Canterbury Health Laboratories, Christchurch, New Zealand
| | - J Anthony G Scott
- Kenya Medical Research Institute-Wellcome Trust Research Programme, Kilifi, Kenya.,Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, UK
| | - Donald M Thea
- Department of Global Health, Boston University School of Public Health, Boston, Massachusetts, USA
| | - Juliet O Awori
- Kenya Medical Research Institute-Wellcome Trust Research Programme, Kilifi, Kenya
| | - James Chipeta
- Department of Paediatrics and Child Health, University Teaching Hospital, Lusaka, Zambia
| | - Somchai Chuananon
- Global Disease Detection Center, US Centers for Disease Control and Prevention Collaboration, Thailand Ministry of Public Health, Mueang Nonthaburi, Nonthaburi, Thailand
| | - Andrea N DeLuca
- Department of International Health, International Vaccine Access Center, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, USA.,Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, USA
| | - Amanda J Driscoll
- Department of International Health, International Vaccine Access Center, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, USA
| | - Bernard E Ebruke
- Medical Research Council Unit, Basse, The Gambia.,International Foundation Against Infectious Disease in Nigeria, Abuja, Nigeria
| | - Mounya Elhilali
- Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, Maryland, USA
| | - Dimitra Emmanouilidou
- Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, Maryland, USA
| | | | - Melissa M Higdon
- Department of International Health, International Vaccine Access Center, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, USA
| | - Lokman Hossain
- International Centre for Diarrhoeal Disease Research, Bangladesh (icddr,b), Dhaka and Matlab, Bangladesh
| | - Yasmin Jahan
- International Centre for Diarrhoeal Disease Research, Bangladesh (icddr,b), Dhaka and Matlab, Bangladesh
| | - Ruth A Karron
- Department of International Health, Center for Immunization Research, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, USA
| | - Joshua Kyalo
- Kenya Medical Research Institute-Wellcome Trust Research Programme, Kilifi, Kenya
| | - David P Moore
- Medical Research Council: Respiratory and Meningeal Pathogens Research Unit, University of the Witwatersrand, Johannesburg, South Africa.,Department of Paediatrics, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
| | - Justin M Mulindwa
- Department of Paediatrics and Child Health, University Teaching Hospital, Lusaka, Zambia
| | - Sathapana Naorat
- Global Disease Detection Center, US Centers for Disease Control and Prevention Collaboration, Thailand Ministry of Public Health, Mueang Nonthaburi, Nonthaburi, Thailand
| | - Christine Prosperi
- Department of International Health, International Vaccine Access Center, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, USA
| | - Charl Verwey
- Medical Research Council: Respiratory and Meningeal Pathogens Research Unit, University of the Witwatersrand, Johannesburg, South Africa.,Department of Paediatrics, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
| | - James E West
- Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, Maryland, USA
| | - Maria Deloria Knoll
- Department of International Health, International Vaccine Access Center, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, USA
| | - Katherine L O'Brien
- Department of International Health, International Vaccine Access Center, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, USA
| | - Daniel R Feikin
- Department of International Health, International Vaccine Access Center, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, USA
| | - Laura L Hammitt
- Department of International Health, International Vaccine Access Center, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, USA.,Kenya Medical Research Institute-Wellcome Trust Research Programme, Kilifi, Kenya
| |
Collapse
|
21
|
Graceffo S, Husain A, Ahmed S, McCollum ED, Elhilali M. Validation of Auscultation Technologies using Objective and Clinical Comparisons. Annu Int Conf IEEE Eng Med Biol Soc 2020; 2020:992-997. [PMID: 33018152 DOI: 10.1109/embc44109.2020.9176456] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Technology is rapidly changing the health care industry. As new systems and devices are developed, validating their effectiveness in practice is not trivial, yet it is essential for assessing their technical and clinical capabilities. Digital auscultation devices are new technologies that are changing the landscape of diagnosis of lung and heart sounds and revamping the centuries-old original design of the stethoscope. Here, we propose a methodology to validate a newly developed digital stethoscope and compare its effectiveness against a market-accepted device, using a combination of signal properties and clinical assessments. Data from 100 pediatric patients are collected using both devices side by side at two clinical sites. Using the proposed methodology, we objectively compare the technical performance of the two devices and identify clinical situations where their performance differs. The proposed methodology offers a general approach to verifying a new digital auscultation device as clinically viable, while highlighting the importance of considering clinical conditions when performing these evaluations.
Collapse
|
22
|
Kala A, Husain A, McCollum ED, Elhilali M. An objective measure of signal quality for pediatric lung auscultations. Annu Int Conf IEEE Eng Med Biol Soc 2020; 2020:772-775. [PMID: 33018100 DOI: 10.1109/embc44109.2020.9176539] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
A stethoscope is a ubiquitous tool used to 'listen' to sounds from the chest in order to assess lung and heart conditions. With advances in health technologies, including digital devices and new wearable sensors, access to these sounds is becoming easier and more abundant; yet proper measures of signal quality do not exist. In this work, we develop an objective quality metric of lung sounds based on low-level and high-level features, in order to independently assess the integrity of the signal in the presence of interference from ambient sounds and other distortions. The proposed metric maps auscultation signals onto rich low-level features extracted directly from the signal, capturing its spectral and temporal characteristics. Complementing these signal-derived attributes, we propose high-level learnt embedding features extracted from a generative auto-encoder trained to map auscultation signals onto a representative space that best captures the inherent statistics of lung sounds. Integrating both low-level (signal-derived) and high-level (embedding) features yields a robust correlation of 0.85 when inferring the signal-to-noise ratio of recordings with varying quality levels. The method is validated on a large dataset of lung auscultations recorded in various clinical settings with controlled, varying degrees of noise interference. The proposed metric is also validated against the opinions of expert physicians in a blind listening test, further corroborating the efficacy of this method for quality assessment.
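A minimal sketch of the two-stream design follows: handcrafted spectral/temporal descriptors are concatenated with a learnt embedding and regressed onto SNR labels. PCA over average log-spectra stands in for the paper's generative auto-encoder, and the variable names (clips, snr_labels) are assumptions; enough clips are needed for the PCA fit.

```python
import numpy as np
from scipy.signal import stft
from sklearn.decomposition import PCA
from sklearn.linear_model import Ridge

def low_level_features(x, fs):
    """Handcrafted spectral/temporal descriptors of one auscultation clip."""
    f, _, Z = stft(x, fs=fs, nperseg=512)
    mag = np.abs(Z) + 1e-12
    centroid = (f[:, None] * mag).sum(0) / mag.sum(0)      # spectral centroid
    flatness = np.exp(np.log(mag).mean(0)) / mag.mean(0)   # spectral flatness
    envelope = mag.sum(0)                                  # temporal envelope
    return np.array([centroid.mean(), centroid.std(), flatness.mean(),
                     envelope.std() / (envelope.mean() + 1e-12)])

def fit_quality_metric(clips, fs, snr_labels, n_embed=8):
    """Regress SNR from handcrafted features plus a learnt embedding.
    Requires at least n_embed clips for the PCA fit."""
    avg_logspec = np.array(
        [np.log(np.abs(stft(x, fs=fs, nperseg=512)[2]).mean(1) + 1e-12)
         for x in clips])
    pca = PCA(n_components=n_embed).fit(avg_logspec)       # embedding stand-in
    X = np.hstack([np.array([low_level_features(x, fs) for x in clips]),
                   pca.transform(avg_logspec)])
    return Ridge(alpha=1.0).fit(X, snr_labels), pca
```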
Collapse
|
23
|
Summers V, Grant KW, Walden BE, Cord MT, Surr RK, Elhilali M. Evaluation of A “Direct-Comparison” Approach to Automatic Switching In Omnidirectional/Directional Hearing Aids. J Am Acad Audiol 2020; 19:708-20. [DOI: 10.3766/jaaa.19.9.6] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Background: Hearing aids today often provide both directional (DIR) and omnidirectional (OMNI) processing options with the currently active mode selected automatically by the device. The most common approach to automatic switching involves “acoustic scene analysis” where estimates of various acoustic properties of the listening environment (e.g., signal-to-noise ratio [SNR], overall sound level) are used as a basis for switching decisions.
Purpose: The current study was carried out to evaluate an alternative, “direct-comparison” approach to automatic switching that does not involve assumptions about how the listening environment may relate to microphone preferences. Predictions of microphone preference were based on whether DIR- or OMNI-processing of a given listening environment produced a closer match to a reference template representing the spectral and temporal modulations present in clean speech.
Research Design: A descriptive and correlational study. Predictions of OMNI/DIR preferences were determined based on degree of similarity between spectral and temporal modulations contained in a reference, clean-speech template, and in OMNI- and DIR-processed recordings of various listening environments. These predictions were compared with actual preference judgments (both real-world judgments and laboratory responses to the recordings).
Data Collection And Analysis: Predictions of microphone preference were based on whether DIR- or OMNI-processing of a given listening environment produced a closer match to a reference template representing clean speech. The template is the output of an auditory processing model that characterizes the spectral and temporal modulations associated with a given input signal (clean speech in this case). A modified version of the spectro-temporal modulation index (mSTMI) was used to compare the template to both DIR- and OMNI-processed versions of a given listening environment, as processed through the same auditory model. These analyses were carried out on recordings (originally collected by Walden et al., 2007) of OMNI- and DIR-processed speech produced in a range of everyday listening situations. Walden et al. reported OMNI/DIR preference judgments made by raters at the same time the field recordings were made and judgments based on laboratory presentations of these recordings to hearing-impaired and normal-hearing listeners. Preference predictions based on the mSTMI analyses were compared with both sets of preference judgments.
Results: The mSTMI analyses showed better than 92% accuracy in predicting the field preferences and 82–85% accuracy in predicting the laboratory preference judgments. OMNI processing tended to be favored over DIR processing in cases where the analysis indicated fairly similar mSTMI scores across the two processing modes. This is consistent with the common clinical assignment of OMNI mode as the default setting, most likely to be preferred in cases where neither mode produces a substantial improvement in SNR. Listeners experienced with switchable OMNI/DIR hearing aids were more likely than other listeners to favor the DIR mode in instances where mSTMI scores only slightly favored DIR processing.
Conclusions: A direct-comparison approach to OMNI/DIR mode selection was generally successful in predicting user preferences in a range of listening environments. Modifications that could further improve the approach's predictive accuracy are discussed.
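The direct-comparison rule lends itself to a compact sketch: compute a spectro-temporal modulation profile for each processed recording and pick the mode whose profile best matches the clean-speech template. The profile below (2-D FFT of a log-spectrogram) is a simplified proxy for the published mSTMI, and inputs are assumed to be equal-length arrays at a common sample rate.

```python
import numpy as np
from scipy.signal import stft

def modulation_profile(x, fs):
    """Spectro-temporal modulation energy of a signal: magnitude of the
    2-D FFT of its mean-removed log-spectrogram, normalized to unit norm."""
    _, _, Z = stft(x, fs=fs, nperseg=256, noverlap=192)
    logspec = np.log(np.abs(Z) + 1e-12)
    mod = np.abs(np.fft.fft2(logspec - logspec.mean()))
    return mod / (np.linalg.norm(mod) + 1e-12)

def preferred_mode(clean_speech, omni_rec, dir_rec, fs):
    """Direct-comparison rule: pick the processing mode whose modulation
    profile best matches the clean-speech template. All three signals are
    assumed to have equal length so the profiles share one shape."""
    template = modulation_profile(clean_speech, fs)
    scores = {mode: float((modulation_profile(rec, fs) * template).sum())
              for mode, rec in (("OMNI", omni_rec), ("DIR", dir_rec))}
    return max(scores, key=scores.get), scores
```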
Collapse
|
24
|
Kaya EM, Huang N, Elhilali M. Pitch, Timbre and Intensity Interdependently Modulate Neural Responses to Salient Sounds. Neuroscience 2020; 440:1-14. [PMID: 32445938 DOI: 10.1016/j.neuroscience.2020.05.018] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2019] [Revised: 04/28/2020] [Accepted: 05/10/2020] [Indexed: 01/31/2023]
Abstract
As we listen to everyday sounds, auditory perception is heavily shaped by interactions between acoustic attributes such as pitch, timbre and intensity, though it is not clear how such interactions affect judgments of acoustic salience in dynamic soundscapes. Salience perception is believed to rely on an internal brain model that tracks the evolution of acoustic characteristics of a scene and flags events that do not fit this model as salient. The current study explores how the interdependency between attributes of dynamic scenes affects the neural representation of this internal model and shapes encoding of salient events. Specifically, the study examines how deviations along combinations of acoustic attributes interact to modulate brain responses, and subsequently guide perception of certain sound events as salient given their context. Human volunteers focus their attention on a visual task and ignore acoustic melodies playing in the background while their brain activity is recorded using electroencephalography. Ambient sounds consist of musical melodies with probabilistically varying acoustic attributes. Salient notes embedded in these scenes deviate from the melody's statistical distribution along pitch, timbre and/or intensity. Brain responses to salient notes reveal that neural power in response to the melodic rhythm, as well as cross-trial phase alignment in the theta band, is modulated by the degree of salience of the notes, estimated across all acoustic attributes given their probabilistic context. These nonlinear neural effects across attributes strongly parallel the nonlinear behavioral interactions observed in perceptual judgments of auditory salience using similar dynamic melodies, suggesting a neural underpinning of the nonlinear interactions that underlie salience perception.
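The notion of estimating a note's salience "across all acoustic attributes given their probabilistic context" can be illustrated with a simple running-statistics sketch; the per-attribute z-scores and Euclidean pooling below are first-order stand-ins, since the paper's data point to a genuinely nonlinear combination rule.

```python
import numpy as np

def note_salience(notes, context=20):
    """Deviation-based salience of each note in a melody.

    notes: (n_notes, 3) array of per-note pitch, timbre, and intensity values.
    Each note is z-scored against the preceding `context` notes and the
    per-attribute deviations are pooled; the first `context` notes score 0.
    """
    notes = np.asarray(notes, dtype=float)
    scores = np.zeros(len(notes))
    for i in range(context, len(notes)):
        window = notes[i - context:i]
        z = (notes[i] - window.mean(0)) / (window.std(0) + 1e-9)
        scores[i] = np.linalg.norm(z)   # first-order (Euclidean) pooling
    return scores
```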
Collapse
Affiliation(s)
- Emine Merve Kaya
- Laboratory for Computational Audio Perception, Department of Electrical and Computer Engineering Johns Hopkins University, Baltimore, MD, USA
| | - Nicolas Huang
- Laboratory for Computational Audio Perception, Department of Electrical and Computer Engineering Johns Hopkins University, Baltimore, MD, USA
| | - Mounya Elhilali
- Laboratory for Computational Audio Perception, Department of Electrical and Computer Engineering Johns Hopkins University, Baltimore, MD, USA.
| |
Collapse
|
25
|
Little DF, Snyder JS, Elhilali M. Ensemble modeling of auditory streaming reveals potential sources of bistability across the perceptual hierarchy. PLoS Comput Biol 2020; 16:e1007746. [PMID: 32275706 PMCID: PMC7185718 DOI: 10.1371/journal.pcbi.1007746] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2019] [Revised: 04/27/2020] [Accepted: 02/25/2020] [Indexed: 11/19/2022] Open
Abstract
Perceptual bistability, the spontaneous, irregular fluctuation of perception between two interpretations of a stimulus, occurs when observing a large variety of ambiguous stimulus configurations. This phenomenon has the potential to serve as a tool for, among other things, understanding how perceptual function varies across individuals, given the large individual differences that manifest during bistability. Yet it remains difficult to interpret the functional processes at work without knowing where bistability arises during perception. In this study we explore the hypothesis that bistability originates from multiple sources distributed across the perceptual hierarchy. We develop a hierarchical model of auditory processing comprising three distinct levels: a Peripheral, tonotopic analysis; a Central analysis computing features found more centrally in the auditory system; and an Object analysis, where sounds are segmented into different streams. We model bistable perception within this system by applying adaptation, inhibition and noise to one or all of the three levels of the hierarchy. We evaluate a large ensemble of variations of this hierarchical model, where each model has a different configuration of adaptation, inhibition and noise. This approach avoids the assumption that a single configuration must be invoked to explain the data. Each model is evaluated based on its ability to replicate two hallmarks of bistability during auditory streaming: the selectivity of bistability to specific stimulus configurations, and the characteristic log-normal pattern of perceptual switches. Consistent with a distributed origin, a broad range of model parameters across this hierarchy leads to a plausible form of perceptual bistability.
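The three ingredients the ensemble varies (adaptation, inhibition, noise) are enough to produce bistable switching even in a minimal two-population circuit, sketched below with assumed parameter values; the paper's model distributes these mechanisms across a three-level hierarchy, so this is only the canonical building block, not the published architecture.

```python
import numpy as np

def simulate_bistability(T=120.0, dt=1e-3, tau=0.01, tau_a=2.0,
                         beta=2.0, phi=1.0, noise=0.1, seed=0):
    """Two mutually inhibiting populations with slow adaptation and noise.
    The dominant population alternates irregularly, mimicking spontaneous
    perceptual switches; returns the sequence of percept durations (s)."""
    rng = np.random.default_rng(seed)
    n = int(T / dt)
    r = np.zeros((n, 2))
    r[0] = [0.6, 0.4]
    a = np.zeros(2)                                       # adaptation state
    f = lambda x: 1.0 / (1.0 + np.exp(-8.0 * (x - 0.5)))  # response gain
    for t in range(1, n):
        drive = 1.0 - beta * r[t - 1][::-1] - phi * a     # input minus inhibition
        r[t] = (r[t - 1] + dt / tau * (-r[t - 1] + f(drive))
                + noise * np.sqrt(dt) * rng.standard_normal(2))
        a += dt / tau_a * (-a + r[t])                     # slow adaptation
    dominant = (r[:, 0] > r[:, 1]).astype(int)
    switch_times = np.flatnonzero(np.diff(dominant)) * dt
    return np.diff(switch_times)

durations = simulate_bistability()
print(durations.mean() if len(durations) else "no switches")
```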
Collapse
Affiliation(s)
- David F. Little
- Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, Maryland, United States of America
| | - Joel S. Snyder
- Department of Psychology, University of Nevada, Las Vegas; Las Vegas, Nevada, United States of America
| | - Mounya Elhilali
- Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, Maryland, United States of America
| |
Collapse
|
26
|
Huang N, Elhilali M. Push-pull competition between bottom-up and top-down auditory attention to natural soundscapes. eLife 2020; 9:52984. [PMID: 32196457 PMCID: PMC7083598 DOI: 10.7554/elife.52984] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2019] [Accepted: 02/13/2020] [Indexed: 12/17/2022] Open
Abstract
In everyday social environments, demands on attentional resources dynamically shift to balance our attention to targets of interest while alerting us to important objects in our surroundings. The current study uses electroencephalography to explore how the push-pull interaction between top-down and bottom-up attention manifests itself in dynamic auditory scenes. Using natural soundscapes as distractors while subjects attend to a controlled rhythmic sound sequence, we find that salient events in background scenes significantly suppress phase-locking and gamma responses to the attended sequence, countering the enhancement effects observed for attended targets. In line with a hypothesis of limited attentional resources, the modulation of neural activity by bottom-up attention is graded by the degree of salience of ambient events. The study also provides insights into the interplay between endogenous and exogenous attention during natural soundscapes, with both forms of attention engaging a common fronto-parietal network at different time lags.
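Phase-locking to an attended rhythm is commonly quantified with inter-trial phase coherence (ITPC); the generic sketch below (band-pass filter, Hilbert phase, circular mean across epochs) illustrates the measure, though it is not necessarily the paper's exact analysis pipeline.

```python
import numpy as np
from scipy.signal import butter, filtfilt, hilbert

def itpc(trials, fs, band=(4.0, 8.0)):
    """Inter-trial phase coherence of EEG epochs in a frequency band.

    trials: (n_trials, n_samples) array time-locked to the attended rhythm.
    Returns ITPC(t) in [0, 1]; values near 1 mean the band-limited phase is
    aligned across trials at that latency.
    """
    b, a = butter(4, [band[0] / (fs / 2), band[1] / (fs / 2)], btype="band")
    phases = np.angle(hilbert(filtfilt(b, a, trials, axis=1), axis=1))
    return np.abs(np.exp(1j * phases).mean(axis=0))

# Synthetic check: epochs sharing a 6 Hz component yield high theta ITPC.
fs = 256
t = np.arange(fs) / fs
trials = np.sin(2 * np.pi * 6 * t) + 0.8 * np.random.randn(40, fs)
print(itpc(trials, fs).mean())  # well above the ~0.14 chance level for 40 trials
```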
Collapse
Affiliation(s)
- Nicholas Huang
- Laboratory for Computational Audio Perception, Department of Electrical Engineering, Johns Hopkins University, Baltimore, United States
| | - Mounya Elhilali
- Laboratory for Computational Audio Perception, Department of Electrical Engineering, Johns Hopkins University, Baltimore, United States
| |
Collapse
|
27
|
Abstract
One of the unique characteristics of human hearing is its ability to recognize acoustic objects even in the presence of severe noise and distortions. In this work, we explore two mechanisms underlying this ability: 1) redundant mapping of acoustic waveforms along distributed latent representations and 2) adaptive feedback based on prior knowledge to selectively attend to targets of interest. We propose a bio-mimetic account of acoustic object classification by developing a novel distributed deep belief network, validated on the task of robust acoustic object classification using the UrbanSound database. The proposed distributed belief network (DBN) encompasses an array of independent sub-networks trained generatively to capture different abstractions of natural sounds. A supervised classifier then performs a readout of this distributed mapping. The overall architecture not only matches the state-of-the-art system for acoustic object classification but also leads to significant improvement over the baseline in mismatched noisy conditions (31.4% relative improvement in 0 dB conditions). Furthermore, we incorporate mechanisms of attentional feedback that allow the DBN to deploy local memories of sound targets estimated at multiple views to bias network activation when attending to a particular object. This adaptive feedback results in further improvement of object classification in unseen noise conditions (relative improvement of 54% over the baseline in 0 dB conditions).
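The backbone of the architecture (independent generatively trained sub-networks whose activations are concatenated and read out by a supervised classifier) can be sketched with off-the-shelf components. BernoulliRBM here stands in for each generative sub-network and logistic regression for the readout; the attentional-feedback mechanism is not shown, and inputs are assumed scaled to [0, 1].

```python
import numpy as np
from sklearn.neural_network import BernoulliRBM
from sklearn.linear_model import LogisticRegression

def train_distributed_readout(views, labels, n_hidden=64):
    """Train one generative sub-network per 'view' of the sound, then a
    supervised readout over the concatenated hidden activations.

    views: list of (n_samples, n_features) arrays, one per abstraction
    (e.g., spectrograms at different resolutions), scaled to [0, 1].
    """
    subnets = [BernoulliRBM(n_components=n_hidden, learning_rate=0.05,
                            n_iter=20, random_state=0).fit(v) for v in views]
    distributed = np.hstack([net.transform(v) for net, v in zip(subnets, views)])
    readout = LogisticRegression(max_iter=1000).fit(distributed, labels)
    return subnets, readout

def predict(subnets, readout, views):
    distributed = np.hstack([net.transform(v) for net, v in zip(subnets, views)])
    return readout.predict(distributed)
```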
Collapse
Affiliation(s)
- Ashwin Bellur
- Department of Electrical and Computer Engineering, Laboratory for Computational Audio Perception, Johns Hopkins University
| | - Mounya Elhilali
- Department of Electrical and Computer Engineering, Laboratory for Computational Audio Perception, Johns Hopkins University
| |
Collapse
|
28
|
Salles A, Park S, Sundar H, Macías S, Elhilali M, Moss CF. Neural Response Selectivity to Natural Sounds in the Bat Midbrain. Neuroscience 2020; 434:200-211. [PMID: 31918008 DOI: 10.1016/j.neuroscience.2019.11.047] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2019] [Revised: 11/27/2019] [Accepted: 11/28/2019] [Indexed: 11/29/2022]
Abstract
Little is known about the neural mechanisms that mediate differential action-selection responses to communication and echolocation calls in bats. For example, in the big brown bat, frequency-modulated (FM) food-claiming communication calls closely resemble FM echolocation calls; these signals guide social and orienting behaviors, respectively. Using advanced signal processing methods, we identified fine differences in the temporal structure of these natural sounds that appear key to auditory discrimination and behavioral decisions. We recorded extracellular potentials from single neurons in the midbrain inferior colliculus (IC) of passively listening animals, and compared responses to playbacks of acoustic signals used by bats for social communication and echolocation. We combined information obtained from spike counts and spike-triggered averages (STAs) to reveal a robust classification of neuron selectivity for communication or echolocation calls. These data highlight the importance of temporal acoustic structure for differentiating echolocation and food-claiming social calls and point to general mechanisms of natural sound processing across species.
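Of the two response descriptors combined in the classification, the spike-triggered average is the more involved; the sketch below shows the standard 1-D computation (average the stimulus window preceding each spike). The study's stimuli are natural calls, so the published analysis presumably operates on richer representations; treat this as the textbook version.

```python
import numpy as np

def spike_triggered_average(stimulus, spike_times, fs, window=0.02):
    """Average the stimulus snippet preceding each spike.

    stimulus: 1-D array sampled at fs (Hz); spike_times: spike times (s);
    window: how far back (s) to average. Spikes earlier than one full
    window are skipped.
    """
    n_win = int(window * fs)
    snippets = [stimulus[int(t * fs) - n_win:int(t * fs)]
                for t in spike_times if int(t * fs) >= n_win]
    return np.mean(snippets, axis=0)
```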
Collapse
Affiliation(s)
- Angeles Salles
- Department of Psychological and Brain Sciences, Johns Hopkins University, United States.
| | - Sangwook Park
- Department of Electrical and Computer Engineering, Johns Hopkins University, United States
| | - Harshavardhan Sundar
- Department of Electrical and Computer Engineering, Johns Hopkins University, United States
| | - Silvio Macías
- Department of Psychological and Brain Sciences, Johns Hopkins University, United States
| | - Mounya Elhilali
- Department of Electrical and Computer Engineering, Johns Hopkins University, United States
| | - Cynthia F Moss
- Department of Psychological and Brain Sciences, Johns Hopkins University, United States
| |
Collapse
|
29
|
Liu SC, Harris JG, Elhilali M, Slaney M. Editorial: Bio-inspired Audio Processing, Models and Systems. Front Neurosci 2019; 13:978. [PMID: 31572122 PMCID: PMC6753195 DOI: 10.3389/fnins.2019.00978] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2019] [Accepted: 08/30/2019] [Indexed: 11/24/2022] Open
Affiliation(s)
- Shih-Chii Liu
- Institute of Neuroinformatics, University of Zurich and ETH Zurich, Zurich, Switzerland
| | - John G. Harris
- Department of Electrical & Computer Engineering, University of Florida, Gainesville, FL, United States
| | - Mounya Elhilali
- Department of Electrical and Computer Engineering, Whiting School of Engineering, Johns Hopkins University, Baltimore, MD, United States
| | - Malcolm Slaney
- Google, Mountain View, CA, United States
- *Correspondence: Malcolm Slaney
| |
Collapse
|
30
|
Elhilali M, West JE. The Stethoscope Gets Smart: Engineers from Johns Hopkins are giving the humble stethoscope an AI upgrade. IEEE Spectr 2019; 56:36-41. [PMID: 34588704 PMCID: PMC8478072 DOI: 10.1109/mspec.2019.8635815] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Affiliation(s)
- Mounya Elhilali
- Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - James E West
- Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, MD, USA
| |
Collapse
|
31
|
Abstract
Our current understanding of how the brain segregates auditory scenes into meaningful objects is in line with a Gestalt framework. These Gestalt principles suggest a theory of how different attributes of the soundscape are extracted and then bound together into separate groups that reflect different objects or streams present in the scene. These cues are thought to reflect the underlying statistical structure of natural sounds, much as the statistics of natural images are closely linked to the principles that guide figure-ground segregation and object segmentation in vision. In the present study, we leverage inference in stochastic neural networks to learn emergent grouping cues directly from natural soundscapes including speech, music and sounds in nature. The model learns a hierarchy of local and global spectro-temporal attributes reminiscent of the simultaneous and sequential Gestalt cues that underlie the organization of auditory scenes. These mappings operate at multiple time scales to analyze an incoming complex scene and are then fused using a Hebbian network that binds together coherent features into perceptually segregated auditory objects. The proposed architecture successfully emulates a wide range of well-established auditory scene segregation phenomena and quantifies the complementary role of segregation and binding cues in driving auditory scene segregation.
Collapse
Affiliation(s)
- Debmalya Chakrabarty
- Laboratory for Computational Audio Processing, Center for Speech and Language Processing, Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Mounya Elhilali
- Laboratory for Computational Audio Processing, Center for Speech and Language Processing, Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, MD, USA
| |
Collapse
|
32
|
Abstract
To understand our surroundings, we effortlessly parse our sound environment into sound sources, extracting invariant information, or regularities, over time to build an internal representation of the world around us. Previous experimental work has shown the brain is sensitive to many types of regularities in sound, but theoretical models that capture the underlying principles of regularity tracking across diverse sequence structures have been few and far between. Existing efforts often focus on sound patterns rather than the stochastic nature of sequences. In the current study, we employ a perceptual model for regularity extraction based on a Bayesian framework that posits the brain collects statistical information over time. We show this model can be used to simulate various results from the literature with stimuli exhibiting a wide range of predictability. This model can provide a useful tool both for interpreting existing experimental results under a unified model and for providing predictions for new ones using more complex stimuli.
Collapse
Affiliation(s)
| | - Mounya Elhilali
- Johns Hopkins University, Baltimore, Maryland, United States.
| |
Collapse
|
33
|
Huang N, Slaney M, Elhilali M. Connecting Deep Neural Networks to Physical, Perceptual, and Electrophysiological Auditory Signals. Front Neurosci 2018; 12:532. [PMID: 30154688 PMCID: PMC6102345 DOI: 10.3389/fnins.2018.00532] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2018] [Accepted: 07/16/2018] [Indexed: 11/13/2022] Open
Abstract
Deep neural networks have recently been shown to capture the intricate transformation of signals from sensory profiles to semantic representations that facilitate recognition or discrimination of complex stimuli. In this vein, convolutional neural networks (CNNs) have been used very successfully in image and audio classification. Designed to imitate the hierarchical structure of the nervous system, CNNs contain layers of increasing complexity that transform the incoming signal into object-level representations. In this work, we employ a CNN trained for large-scale audio object classification to gain insights about the contribution of various audio representations that guide sound perception. The analysis contrasts activations of different layers of the CNN with acoustic features extracted directly from the scenes, perceptual salience obtained from behavioral responses of human listeners, and neural oscillations recorded by electroencephalography (EEG) in response to the same natural scenes. All three measures are tightly linked quantities believed to guide percepts of salience and object formation when listening to complex scenes. The results paint a picture of the intricate interplay between low-level and object-level representations in guiding auditory salience that is very much dependent on context and sound category.
Collapse
Affiliation(s)
- Nicholas Huang
- Laboratory for Computational Audio Perception, Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, MD, United States
| | - Malcolm Slaney
- Machine Hearing, Google AI, Google (United States), Mountain View, CA, United States
| | - Mounya Elhilali
- Laboratory for Computational Audio Perception, Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, MD, United States
| |
Collapse
|
34
|
Abstract
Our ability to parse our acoustic environment relies on the brain's capacity to extract statistical regularities from surrounding sounds. Previous work in regularity extraction has predominantly focused on the brain's sensitivity to predictable patterns in sound sequences. However, natural sound environments are rarely completely predictable, often containing some level of randomness, yet the brain is able to effectively interpret its surroundings by extracting useful information from stochastic sounds. It has been previously shown that the brain is sensitive to the marginal lower-order statistics of sound sequences (i.e., mean and variance). In this work, we investigate the brain's sensitivity to higher-order statistics describing temporal dependencies between sound events through a series of change detection experiments, where listeners are asked to detect changes in randomness in the pitch of tone sequences. Behavioral data indicate listeners collect statistical estimates to process incoming sounds, and a perceptual model based on Bayesian inference shows a capacity in the brain to track higher-order statistics. Further analysis of individual subjects' behavior indicates an important role of perceptual constraints in listeners' ability to track these sensory statistics with high fidelity. In addition, the inference model facilitates analysis of neural electroencephalography (EEG) responses, anchoring the analysis relative to the statistics of each stochastic stimulus. This reveals both a deviance response and a change-related disruption in phase of the stimulus-locked response that follow the higher-order statistics. These results shed light on the brain's ability to process stochastic sound sequences.
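One concrete instance of a higher-order statistic describing temporal dependencies is the lag-1 correlation of a tone sequence; the sliding-window sketch below flags changes in this dependency even when the marginal mean and variance stay fixed. It is a heuristic illustration, not the Bayesian inference model used in the paper.

```python
import numpy as np

def randomness_change_score(pitch, half=25):
    """Score changes in the lag-1 correlation of a tone sequence by comparing
    adjacent windows; peaks mark candidate changes in temporal dependency
    even when the marginal mean and variance are unchanged."""
    def lag1(x):
        x = x - x.mean()
        return (x[:-1] * x[1:]).sum() / ((x[:-1] ** 2).sum() + 1e-12)

    scores = np.zeros(len(pitch))
    for i in range(half, len(pitch) - half):
        scores[i] = abs(lag1(pitch[i - half:i]) - lag1(pitch[i:i + half]))
    return scores

# Example: a correlated random walk switches to independent samples at n=100.
rng = np.random.default_rng(1)
walk = np.cumsum(rng.standard_normal(100)) * 0.5
iid = rng.standard_normal(100) * walk.std()
seq = np.concatenate([walk, iid])
print(randomness_change_score(seq).argmax())   # near the true change point
```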
Collapse
Affiliation(s)
- Benjamin Skerritt-Davis
- Electrical & Computer Engineering, Johns Hopkins University, Baltimore, Maryland, United States of America
| | - Mounya Elhilali
- Electrical & Computer Engineering, Johns Hopkins University, Baltimore, Maryland, United States of America
| |
Collapse
|
36
|
McCollum ED, Park DE, Watson NL, Buck WC, Bunthi C, Devendra A, Ebruke BE, Elhilali M, Emmanouilidou D, Garcia-Prats AJ, Githinji L, Hossain L, Madhi SA, Moore DP, Mulindwa J, Olson D, Awori JO, Vandepitte WP, Verwey C, West JE, Knoll MD, O'Brien KL, Feikin DR, Hammit LL. Listening panel agreement and characteristics of lung sounds digitally recorded from children aged 1-59 months enrolled in the Pneumonia Etiology Research for Child Health (PERCH) case-control study. BMJ Open Respir Res 2017; 4:e000193. [PMID: 28883927 PMCID: PMC5531306 DOI: 10.1136/bmjresp-2017-000193] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2017] [Revised: 05/25/2017] [Accepted: 05/25/2017] [Indexed: 01/14/2023] Open
Abstract
INTRODUCTION Paediatric lung sound recordings can be systematically assessed, but methodological feasibility and validity are unknown, especially in developing countries. We examined the performance of acoustically interpreting recorded paediatric lung sounds and compared sound characteristics between cases and controls. METHODS Pneumonia Etiology Research for Child Health staff in six African and Asian sites recorded lung sounds with a digital stethoscope in cases and controls. Cases aged 1-59 months had WHO severe or very severe pneumonia; age-matched community controls did not. A listening panel assigned examination results of normal, crackle, wheeze, crackle and wheeze, or uninterpretable, with adjudication of discordant interpretations. Classifications were recategorised into any crackle, any wheeze or abnormal (any crackle or wheeze), and primary listener agreement (first two listeners) was analysed among interpretable examinations using the prevalence-adjusted, bias-adjusted kappa (PABAK). We examined predictors of disagreement with logistic regression and compared case and control lung sounds with descriptive statistics. RESULTS Primary listeners considered 89.5% of 792 case and 92.4% of 301 control recordings interpretable. Among interpretable recordings, listeners agreed on the presence or absence of any abnormality in 74.9% (PABAK 0.50) of cases and 69.8% (PABAK 0.40) of controls, presence/absence of crackles in 70.6% (PABAK 0.41) of cases and 82.4% (PABAK 0.65) of controls, and presence/absence of wheeze in 72.6% (PABAK 0.45) of cases and 73.8% (PABAK 0.48) of controls. Control status, tachypnoea, >3 uninterpretable chest positions, crying, upper airway noises and study site predicted listener disagreement. Among all interpretable examinations, 38.0% of cases and 84.9% of controls were normal (p<0.0001); wheezing was the most common sound (49.9%) in cases. CONCLUSIONS The listening panel and case-control data suggest our methodology is feasible and likely valid, and that small airway inflammation is common in WHO pneumonia. Digital auscultation may be an important future pneumonia diagnostic in developing countries.
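For two raters and the binary recategorisation used here, PABAK reduces to a linear rescaling of raw agreement, PABAK = 2*p_o - 1, which is easy to verify against the figures above:

```python
def pabak(observed_agreement):
    """Prevalence-adjusted, bias-adjusted kappa for two raters and two
    categories: a linear rescaling of the observed agreement p_o."""
    return 2 * observed_agreement - 1

# Checks against the values reported in the abstract:
print(round(pabak(0.749), 2))  # 0.50 (any abnormality, cases)
print(round(pabak(0.698), 2))  # 0.40 (any abnormality, controls)
print(round(pabak(0.824), 2))  # 0.65 (crackles, controls)
```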
Collapse
Affiliation(s)
- Eric D McCollum
- Eudowood Division of Pediatric Respiratory Sciences, Johns Hopkins School of Medicine, Baltimore, Maryland, USA,Department of International Health, Johns Hopkins Bloomberg School of Public Health, Dhaka, Bangladesh,Department of International Health, International Vaccine Access Center, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, USA
| | - Daniel E Park
- Department of International Health, International Vaccine Access Center, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, USA
| | | | - W Chris Buck
- Department of Pediatrics, University of California Los Angeles, Maputo, Mozambique
| | - Charatdao Bunthi
- International Emerging Infections Program, Global Disease Detection Center, Thailand Ministry of Public Health – US Centers for Disease Control and Prevention Collaboration, Nonthaburi, Thailand
| | | | | | - Mounya Elhilali
- Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, USA
| | - Dimitra Emmanouilidou
- Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, USA
| | - Anthony J Garcia-Prats
- Department of Paediatrics and Child Health, Stellenbosch University, Tygerberg, South Africa
| | - Leah Githinji
- Division of Paediatric Pulmonology, University of Cape Town, Cape Town, South Africa
| | - Lokman Hossain
- Respiratory Vaccines, Center for Vaccine Sciences, icddr,b, Dhaka, Bangladesh
| | - Shabir A Madhi
- Medical Research Council, Respiratory and Meningeal Pathogens Research Unit, University of the Witwatersrand, Johannesburg, South Africa,Department of Science and Technology/National Research Foundation, South African Research Chair: Vaccine Preventable Diseases, University of the Witwatersrand, Johannesburg, South Africa
| | - David P Moore
- Medical Research Council, Respiratory and Meningeal Pathogens Research Unit, University of the Witwatersrand, Johannesburg, South Africa,Department of Paediatrics, University of the Witwatersrand, Chris Hani Baragwanath Academic Hospital, Johannesburg, South Africa
| | - Justin Mulindwa
- Department of Paediatrics and Child Health, University Teaching Hospital, Lusaka, Zambia
| | - Dan Olson
- Department of Pediatrics, Section of Infectious Disease, Center for Global Health, University of Colorado, Colorado, USA
| | - Juliet O Awori
- Kenya Medical Research Institute Wellcome Trust Research Programme, Kilifi, Kenya
| | - Warunee P Vandepitte
- Queen Sirikit National Institute of Child Health, Rangsit University, Bangkok, Thailand
| | - Charl Verwey
- Medical Research Council, Respiratory and Meningeal Pathogens Research Unit, University of the Witwatersrand, Johannesburg, South Africa,Department of Paediatrics, University of the Witwatersrand, Chris Hani Baragwanath Academic Hospital, Johannesburg, South Africa
| | - James E West
- Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, USA
| | - Maria D Knoll
- Department of International Health, International Vaccine Access Center, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, USA
| | - Katherine L O'Brien
- Department of International Health, International Vaccine Access Center, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, USA
| | - Daniel R Feikin
- Department of International Health, International Vaccine Access Center, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, USA,Division of Viral Diseases, Centers for Disease Control and Prevention, Atlanta, Georgia, USA
| | - Laura L Hammit
- Kenya Medical Research Institute Wellcome Trust Research Programme, Kilifi, Kenya
| |
Collapse
|
37
|
Abstract
GOAL Chest auscultation offers a non-invasive and low-cost tool for monitoring lung disease. However, it presents many shortcomings, including inter-listener variability, subjectivity, and vulnerability to noise and distortions. This work proposes a computer-aided approach to process lung signals acquired in the field under adverse noisy conditions, by improving the signal quality and offering automated identification of abnormal auscultations indicative of respiratory pathologies. METHODS The developed noise-suppression scheme eliminates ambient sounds, heart sounds, sensor artifacts, and crying contamination. The improved high-quality signal is then mapped onto a rich spectrotemporal feature space before being classified using a trained support-vector machine classifier. Individual signal frame decisions are then combined using an evaluation scheme, providing an overall patient-level decision for unseen patient records. RESULTS All methods are evaluated on a large dataset of 1000 enrolled children aged 1-59 months. The noise suppression scheme is shown to significantly improve signal quality, and the classification system achieves an accuracy of 86.7% in distinguishing normal from pathological sounds, far surpassing other state-of-the-art methods. CONCLUSION Computerized lung sound processing can benefit from the enforcement of advanced noise suppression. A fairly short processing window combined with detailed spectrotemporal features is recommended, in order to capture transient adventitious events without highlighting sharp noise occurrences. SIGNIFICANCE Unlike existing methodologies in the literature, the proposed work is not limited in scope or confined to laboratory settings: it validates a practical method for fully automated chest sound processing applicable to realistic and noisy auscultation settings.
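The frame-then-patient decision flow reads naturally as a two-stage pipeline; the sketch below uses crude spectrotemporal frame statistics, an SVM, and a fraction-of-frames vote. The feature choices, the 1 s frame length, and the aggregation threshold are illustrative assumptions, not the published configuration.

```python
import numpy as np
from scipy.signal import stft
from sklearn.svm import SVC

def frame_features(x, fs, frame_len=1.0):
    """Cut a (noise-suppressed) recording into frames and summarize each
    with coarse spectrotemporal statistics."""
    hop = int(frame_len * fs)
    feats = []
    for start in range(0, len(x) - hop + 1, hop):
        _, _, Z = stft(x[start:start + hop], fs=fs, nperseg=256)
        logmag = np.log(np.abs(Z) + 1e-12)
        feats.append([logmag.mean(), logmag.std(),
                      float(logmag.mean(axis=1).argmax()),             # dominant band
                      float(np.abs(np.diff(logmag, axis=1)).mean())])  # temporal flux
    return np.array(feats)

def patient_decision(clf, recordings, fs, threshold=0.5):
    """Aggregate frame-level SVM decisions into one patient-level label."""
    frames = np.vstack([frame_features(x, fs) for x in recordings])
    return int(clf.predict(frames).mean() >= threshold)

# clf = SVC(kernel="rbf").fit(training_frames, training_frame_labels)
```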
Collapse
|
38
|
Snyder JS, Elhilali M. Recent advances in exploring the neural underpinnings of auditory scene perception. Ann N Y Acad Sci 2017; 1396:39-55. [PMID: 28199022 PMCID: PMC5446279 DOI: 10.1111/nyas.13317] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2016] [Revised: 12/21/2016] [Accepted: 01/08/2017] [Indexed: 11/29/2022]
Abstract
Studies of auditory scene analysis have traditionally relied on paradigms using artificial sounds and conventional behavioral techniques to elucidate how we perceptually segregate auditory objects or streams from each other. In the past few decades, however, there has been growing interest in uncovering the neural underpinnings of auditory segregation using human and animal neuroscience techniques, as well as computational modeling. This largely reflects the growth of the fields of cognitive neuroscience and computational neuroscience and has led to new theories of how the auditory system segregates sounds in complex arrays. The current review focuses on neural and computational studies of auditory scene perception published in the last few years. Following the progress that has been made in these studies, we describe (1) theoretical advances in our understanding of the most well-studied aspects of auditory scene perception, namely segregation of sequential patterns of sounds and of concurrently presented sounds; (2) the diversification of topics and paradigms that have been investigated; and (3) how new neuroscience techniques (including invasive neurophysiology in awake humans, genotyping, and brain stimulation) have been used in this field.
Collapse
Affiliation(s)
- Joel S. Snyder
- Department of Psychology, University of Nevada, Las Vegas, Las Vegas, Nevada
| | - Mounya Elhilali
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, Maryland
| |
Collapse
|
39
|
Abstract
Parsing natural acoustic scenes using computational methodologies poses many challenges. Given the rich and complex nature of the acoustic environment, data mismatch between training and test conditions is a major hurdle in data-driven audio processing systems. In contrast, the brain exhibits a remarkable ability to segment acoustic scenes with relative ease. When tackling challenging listening conditions that are often faced in everyday life, the biological system relies on a number of principles that allow it to effortlessly parse its rich soundscape. In the current study, we leverage a key principle employed by the auditory system: its ability to adapt the neural representation of its sensory input in a high-dimensional space. We propose a framework that mimics this process in a computational model for robust speech activity detection. The system employs a 2-D Gabor filter bank whose parameters are retuned offline to improve the separability between the feature representations of speech and nonspeech sounds. This retuning process, driven by feedback from statistical models of speech and nonspeech classes, attempts to minimize the misclassification risk of mismatched data with respect to the original statistical models. We hypothesize that this risk minimization procedure results in an emphasis of unique speech and nonspeech modulations in the high-dimensional space. We show that such an adapted system is indeed robust to other novel conditions, with a marked reduction in equal error rates for a variety of databases with additive and convolutive noise distortions. We discuss the lessons learned from biology with regard to adapting to an ever-changing acoustic environment and the impact on building truly intelligent audio processing systems.
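A 2-D Gabor filter bank over a log-spectrogram is straightforward to realize; the sketch below builds zero-mean Gabor kernels tuned to a few spectro-temporal modulation rates and returns per-filter energies. The offline retuning step driven by speech/nonspeech statistics is the paper's contribution and is not reproduced here; the kernel size and modulation rates are illustrative.

```python
import numpy as np
from scipy.signal import convolve2d

def gabor_2d(omega_t, omega_f, sigma=4.0, size=15):
    """Zero-mean 2-D Gabor kernel tuned to a temporal modulation rate
    (omega_t, cycles/frame) and spectral scale (omega_f, cycles/channel)."""
    t = np.arange(size) - size // 2
    T, F = np.meshgrid(t, t)
    kernel = (np.exp(-(T**2 + F**2) / (2 * sigma**2))
              * np.cos(2 * np.pi * (omega_t * T + omega_f * F)))
    return kernel - kernel.mean()

def gabor_features(logspec, rates=(0.05, 0.1, 0.2), scales=(0.05, 0.1, 0.2)):
    """Per-filter energies of a log-spectrogram (freq x time, larger than
    the 15 x 15 kernels) under a small 2-D Gabor bank. Retuning these
    parameters offline is the adaptation step described above (not shown)."""
    return np.array([
        np.mean(convolve2d(logspec, gabor_2d(r, s), mode="valid") ** 2)
        for r in rates for s in scales])
```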
Collapse
Affiliation(s)
- Ashwin Bellur
- Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, MD 21218 USA
| | - Mounya Elhilali
- Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, MD 21218 USA
| |
Collapse
|
40
|
Huang N, Elhilali M. Auditory salience using natural soundscapes. J Acoust Soc Am 2017; 141:2163. [PMID: 28372080 PMCID: PMC6909985 DOI: 10.1121/1.4979055] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/07/2016] [Revised: 03/09/2017] [Accepted: 03/10/2017] [Indexed: 05/26/2023]
Abstract
Salience describes the phenomenon by which an object stands out from a scene. While its underlying processes are extensively studied in vision, mechanisms of auditory salience remain largely unknown. Previous studies have used well-controlled auditory scenes to shed light on some of the acoustic attributes that drive the salience of sound events. Unfortunately, the use of constrained stimuli in addition to a lack of well-established benchmarks of salience judgments hampers the development of comprehensive theories of sensory-driven auditory attention. The present study explores auditory salience in a set of dynamic natural scenes. A behavioral measure of salience is collected by having human volunteers listen to two concurrent scenes and indicate continuously which one attracts their attention. By using natural scenes, the study takes a data-driven rather than experimenter-driven approach to exploring the parameters of auditory salience. The findings indicate that the space of auditory salience is multidimensional (spanning loudness, pitch, spectral shape, as well as other acoustic attributes), nonlinear and highly context-dependent. Importantly, the results indicate that contextual information about the entire scene over both short and long scales needs to be considered in order to properly account for perceptual judgments of salience.
Collapse
Affiliation(s)
- Nicholas Huang
- Department of Biomedical Engineering, The Johns Hopkins University, Baltimore, Maryland 21218, USA
| | - Mounya Elhilali
- Department of Electrical and Computer Engineering, Center for Language and Speech Processing, The Johns Hopkins University, Baltimore, Maryland 21218, USA
| |
Collapse
|
41
|
Abstract
Sounds in everyday life seldom appear in isolation. Both humans and machines are constantly flooded with a cacophony of sounds that need to be sorted through and scoured for relevant information, a phenomenon referred to as the 'cocktail party problem'. A key component in parsing acoustic scenes is the role of attention, which mediates perception and behaviour by focusing both sensory and cognitive resources on pertinent information in the stimulus space. The current article provides a review of modelling studies of auditory attention. The review highlights how the term attention refers to a multitude of behavioural and cognitive processes that can shape sensory processing. Attention can be modulated by 'bottom-up' sensory-driven factors, as well as 'top-down' task-specific goals, expectations and learned schemas. Essentially, it acts as a selection process, or processes, that focuses both sensory and cognitive resources on the most relevant events in the soundscape, with relevance being dictated by the stimulus itself (e.g. a loud explosion) or by a task at hand (e.g. listening for announcements in a busy airport). Recent computational models of auditory attention provide key insights into its role in facilitating perception in cluttered auditory scenes. This article is part of the themed issue 'Auditory and visual scene analysis'.
Collapse
Affiliation(s)
- Emine Merve Kaya
- Laboratory for Computational Audio Perception, Department of Electrical and Computer Engineering, The Johns Hopkins University, 3400 N Charles Street, Barton Hall, Baltimore, MD 21218, USA
| | - Mounya Elhilali
- Laboratory for Computational Audio Perception, Department of Electrical and Computer Engineering, The Johns Hopkins University, 3400 N Charles Street, Barton Hall, Baltimore, MD 21218, USA
| |
Collapse
|
42
|
Carlin MA, Elhilali M. A Framework for Speech Activity Detection Using Adaptive Auditory Receptive Fields. IEEE/ACM Trans Audio Speech Lang Process 2015; 23:2422-2433. [PMID: 29904642 PMCID: PMC5997283 DOI: 10.1109/taslp.2015.2481179] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
One of the hallmarks of sound processing in the brain is the ability of the nervous system to adapt to changing behavioral demands and surrounding soundscapes: it can dynamically shift sensory and cognitive resources to focus on relevant sounds. Neurophysiological studies indicate that this ability is supported by adaptively retuning the shapes of cortical spectro-temporal receptive fields (STRFs) to enhance features of target sounds while suppressing those of task-irrelevant distractors. Because an important component of human communication is the ability of a listener to dynamically track speech in noisy environments, the solution obtained by auditory neurophysiology implies a useful adaptation strategy for speech activity detection (SAD). SAD is an important first step in a number of automated speech processing systems, and performance is often reduced in highly noisy environments. In this paper, we describe how task-driven adaptation is induced in an ensemble of neurophysiological STRFs, and show how speech-adapted STRFs reorient themselves to enhance the spectro-temporal modulations of speech while suppressing those associated with a variety of nonspeech sounds. We then show how an adapted ensemble of STRFs can better detect speech in unseen noisy environments compared to an unadapted ensemble and a noise-robust baseline. Finally, we use a stimulus reconstruction task to demonstrate how the adapted STRF ensemble better captures the spectro-temporal modulations of attended speech in clean and noisy conditions. Our results suggest that a biologically plausible adaptation framework can be applied to speech processing systems to dynamically adapt feature representations for improving noise robustness.
Collapse
Affiliation(s)
- Michael A Carlin
- Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, MD 21218 USA
| | - Mounya Elhilali
- Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, MD 21218 USA
| |
Collapse
|
43
|
Abstract
Gonadotropin-releasing hormone receptors (GnRHR) have been found in extrapituitary tissues, including the prostate, where they might exert a local effect on tissue growth. Degarelix is a GnRHR antagonist approved for use in patients with prostate cancer (PCa) who need androgen deprivation therapy. Slowing prostate cell growth is a goal shared by PCa and benign prostate hyperplasia (BPH) patients, yet the effect of degarelix on BPH cells had not previously been investigated. We therefore evaluated the direct effect of degarelix on human BPH primary cell growth. Gene expression studies performed with BPH (n=11), stage 0 (n=15), and PCa (n=65) human specimens demonstrated the presence of GNRHR1 and GNRHR2 and their respective endogenous peptide ligands. BPH-isolated epithelial and stromal cells were either cultured alone or co-cultured (1:4 or 4:1 ratio of epithelial to stromal cells) and subsequently treated with increasing concentrations of degarelix. Degarelix treatment induced a decrease in cell viability and cell proliferation rates, in parallel with an increase in apoptosis. Both epithelial and stromal BPH cells are sensitive to degarelix treatment and, interestingly, degarelix remains effective when the cells are grown in a co-culture microenvironment. In contrast to degarelix, the GnRHR agonists leuprolide and goserelin exerted no effect on the viability of BPH epithelial or stromal cells. In conclusion, (i) prostate tissues express GNRHR and are a potential target for degarelix; and (ii) degarelix directly inhibits BPH cell growth through a decrease in cell proliferation and an increase in apoptosis. Supporting information for this article is available online at http://www.thieme-connect.de/products.
Collapse
Affiliation(s)
- M Sakai
- The Research Institute of the McGill University Health Center, McGill University, Montréal, Québec, Canada
| | - M Elhilali
- The Research Institute of the McGill University Health Center, McGill University, Montréal, Québec, Canada
| | - V Papadopoulos
- The Research Institute of the McGill University Health Center, McGill University, Montréal, Québec, Canada
| |
Collapse
|
44
|
Abstract
To navigate complex acoustic environments, listeners adapt neural processes to focus on behaviorally relevant sounds in the acoustic foreground while minimizing the impact of distractors in the background, an ability referred to as top-down selective attention. Particularly striking examples of attention-driven plasticity have been reported in primary auditory cortex via dynamic reshaping of spectro-temporal receptive fields (STRFs). By enhancing the neural response to features of the foreground while suppressing responses to the background, STRFs can act as adaptive contrast matched filters that directly contribute to an improved cognitive segregation between behaviorally relevant and irrelevant sounds. In this study, we propose a novel discriminative framework for modeling attention-driven plasticity of STRFs in primary auditory cortex. The model describes a general strategy for cortical plasticity via an optimization that maximizes discriminability between the foreground and distractors while maintaining a degree of stability in the cortical representation. The first instantiation of the model describes a form of feature-based attention and yields STRF adaptation patterns consistent with a contrast matched filter previously reported in neurophysiological studies. An extension of the model captures a form of object-based attention, where top-down signals act on an abstracted representation of the sensory input characterized in the modulation domain. The object-based model makes explicit predictions in line with the limited neurophysiological data currently available and can be readily evaluated experimentally. Finally, we draw parallels between the model and anatomical circuits reported to be engaged during active attention. The proposed model strongly suggests an interpretation of attention-driven plasticity as a discriminative adaptation operating at the level of sensory cortex, in line with similar strategies previously described across different sensory modalities.
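The optimization can be pictured as gradient ascent on a foreground/background separation term plus a stability penalty that anchors the filter to its pre-task shape. The Python sketch below uses a deliberately simplified linear-separation objective, not the paper's actual discriminative criterion; the learning rate and penalty weight are assumptions.

    # Sketch of the general strategy (not the paper's exact objective): adapt
    # a filter w to increase the separation between mean foreground and mean
    # background responses, while a quadratic stability penalty keeps w near
    # the pre-task filter w0. Converges to w0 + dm / (2 * lam).
    import numpy as np

    def adapt_filter(w0, fg, bg, lam=1.0, lr=0.01, steps=500):
        """Maximize w.(mean_fg - mean_bg) - lam * ||w - w0||^2 by gradient ascent."""
        dm = fg.mean(axis=0) - bg.mean(axis=0)   # class separation direction
        w = w0.copy()
        for _ in range(steps):
            w += lr * (dm - 2 * lam * (w - w0))  # ascent step with stability term
        return w

    rng = np.random.default_rng(0)
    fg = rng.normal(1.0, 1.0, (500, 8))    # foreground feature responses
    bg = rng.normal(-1.0, 1.0, (500, 8))   # distractor feature responses
    w0 = np.ones(8) / 8
    w = adapt_filter(w0, fg, bg)
    print("separation before/after:",
          w0 @ (fg.mean(0) - bg.mean(0)), w @ (fg.mean(0) - bg.mean(0)))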
Collapse
Affiliation(s)
- Michael A Carlin
- Laboratory for Computational Audio Perception, Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Mounya Elhilali
- Laboratory for Computational Audio Perception, Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, MD, USA
| |
Collapse
|
45
|
Abstract
Although great strides have been achieved in computer-aided diagnosis (CAD) research, a major remaining problem is the ability to perform well in the presence of significant noise. In this work, we propose a mechanism for finding instances of potential interest in time series for further analysis. Adaptive Kalman filters are employed in parallel across different feature axes. Lung sounds recorded in noisy conditions serve as an example application, with spectro-temporal feature extraction used to capture the complex variabilities in the sound. We demonstrate that both disease indicators and distortion events can be detected, reducing long time-series signals to a sparse set of relevant events.
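A scalar random-walk Kalman filter with innovation gating captures the flavor of this screening step: frames whose prediction error is large relative to its expected variance are flagged as candidate events, one filter per feature axis. The state model, noise parameters, and threshold below are illustrative assumptions, not the paper's settings.

    # Minimal sketch of the screening idea: track one feature trajectory with
    # a scalar Kalman filter and flag frames whose normalized innovation
    # (prediction error over its predicted standard deviation) is large.
    import numpy as np

    def kalman_events(y, q=1e-3, r=1e-1, thresh=3.0):
        """Random-walk state model: x_t = x_{t-1} + w, observation y_t = x_t + v."""
        x, p = y[0], 1.0
        flags = np.zeros(len(y), dtype=bool)
        for t in range(1, len(y)):
            p += q                          # predict state variance
            s = p + r                       # innovation variance
            innov = y[t] - x
            flags[t] = abs(innov) / np.sqrt(s) > thresh
            k = p / s                       # Kalman gain
            x += k * innov                  # update state estimate
            p *= (1 - k)                    # update state variance
        return flags

    y = np.sin(np.linspace(0, 20, 1000)) * 0.1 + np.random.randn(1000) * 0.05
    y[600:610] += 2.0                       # injected transient "event"
    print("flagged frames:", np.flatnonzero(kalman_events(y))[:5])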
Collapse
|
46
|
Emmanouilidou D, McCollum ED, Park DE, Elhilali M. Adaptive Noise Suppression of Pediatric Lung Auscultations With Real Applications to Noisy Clinical Settings in Developing Countries. IEEE Trans Biomed Eng 2015; 62:2279-88. [PMID: 25879837 DOI: 10.1109/tbme.2015.2422698] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
GOAL: Chest auscultation constitutes a portable, low-cost tool widely used for respiratory disease detection. Though it offers a powerful means of pulmonary examination, it remains riddled with a number of issues that limit its diagnostic capability. In particular, patient agitation (especially in children), background chatter, and other environmental noises often contaminate the auscultation, affecting the clarity of the lung sound itself. This paper proposes an automated multiband denoising scheme for improving the quality of auscultation signals against heavy background contamination. METHODS: The algorithm works on a simple two-microphone setup, dynamically adapts to the background noise, and suppresses contaminations while preserving the lung sound content. The proposed scheme is tuned to trade off maximal noise suppression against maintaining the integrity of the lung signal, particularly its unknown adventitious components, which provide the most informative diagnostic value in lung pathology. RESULTS: The algorithm is applied to digital recordings obtained in the field in a busy clinic in West Africa and evaluated using objective signal fidelity measures and perceptual listening tests performed by a panel of licensed physicians. A strong preference for the enhanced sounds is revealed. SIGNIFICANCE: The strengths and benefits of the proposed method lie in its simple automated setup and its adaptive nature, both fundamental conditions for everyday clinical applicability. It can readily be extended to a real-time implementation and integrated with lung sound acquisition protocols.
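The two-microphone setup suggests a gain-based formulation: track the noise spectrum from an ambient reference microphone and attenuate time-frequency bins of the primary (chest-piece) signal accordingly. The Wiener-like gain rule, smoothing constant, and gain floor in the Python sketch below are assumptions for illustration, not the paper's algorithm.

    # Sketch of the two-microphone idea: estimate the noise power spectrum
    # from the reference channel with recursive smoothing, then apply a
    # Wiener-like gain (with a floor to limit artifacts) to the primary
    # channel's STFT before resynthesis.
    import numpy as np
    from scipy.signal import stft, istft

    def two_mic_suppress(primary, reference, fs, floor=0.1, alpha=0.9):
        f, t, P = stft(primary, fs=fs, nperseg=512)
        _, _, R = stft(reference, fs=fs, nperseg=512)
        noise = np.zeros(P.shape[0])
        out = np.empty_like(P)
        for i in range(P.shape[1]):
            noise = alpha * noise + (1 - alpha) * np.abs(R[:, i]) ** 2
            snr = np.maximum(np.abs(P[:, i]) ** 2 / (noise + 1e-12) - 1.0, 0.0)
            gain = np.maximum(snr / (snr + 1.0), floor)   # Wiener-like gain
            out[:, i] = gain * P[:, i]
        _, clean = istft(out, fs=fs, nperseg=512)
        return clean

    # Usage: clean = two_mic_suppress(chest_mic, ambient_mic, fs=8000)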
Collapse
|
47
|
Sell G, Suied C, Elhilali M, Shamma S. Perceptual susceptibility to acoustic manipulations in speaker discrimination. J Acoust Soc Am 2015; 137:911-922. [PMID: 25698023 PMCID: PMC5392054 DOI: 10.1121/1.4906826] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/14/2013] [Revised: 09/11/2014] [Accepted: 12/08/2014] [Indexed: 06/04/2023]
Abstract
Listeners' ability to discriminate unfamiliar voices is often susceptible to manipulations of the acoustic characteristics of the utterances. This vulnerability was quantified within a task in which participants determined whether two utterances were spoken by the same or different speakers. Results of this task were analyzed in relation to a set of established and novel parameters in order to form hypotheses about the role of those parameters in the decision process. Listener performance was first measured in a baseline task with unmodified stimuli, and then compared to responses with resynthesized stimuli under three conditions: (1) normalized mean pitch; (2) normalized duration; and (3) normalized linear predictive coefficients (LPCs). The results of these experiments suggest that perceptual speaker discrimination is robust to acoustic changes overall, though mean-pitch and LPC modifications are the most detrimental to a listener's ability to successfully identify same- or different-speaker pairings. This susceptibility was also found to depend in part on the specific speaker and utterances.
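The LPC condition replaces the spectral envelope of each utterance. As a reminder of what an LPC fit captures, here is a standard autocorrelation-method (Yule-Walker) estimate for a single frame in Python; the order, windowing, and test signal are assumptions, and the study's resynthesis pipeline is considerably more involved.

    # Autocorrelation-method LPC for one frame: solve the Yule-Walker
    # (symmetric Toeplitz) normal equations for the predictor coefficients.
    import numpy as np
    from scipy.linalg import solve_toeplitz

    def lpc(frame, order=12):
        """LPC coefficients a[0..order-1]; predictor x[n] ~ sum_k a[k]*x[n-1-k]."""
        x = frame * np.hanning(len(frame))
        r = np.correlate(x, x, mode="full")[len(x) - 1:len(x) + order]  # lags 0..order
        a = solve_toeplitz((r[:order], r[:order]), r[1:order + 1])
        return a

    fs = 16000
    t = np.arange(1024) / fs
    frame = np.sin(2 * np.pi * 200 * t) + 0.5 * np.sin(2 * np.pi * 1200 * t)
    print(lpc(frame)[:4])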
Collapse
Affiliation(s)
- Gregory Sell
- Institute for Systems Research, Electrical and Computer Engineering Department, University of Maryland, College Park, Maryland 20742
| | - Clara Suied
- Institut de Recherche Biomédicale des Armées, Département Action et Cognition en Situation Opérationnelle, 91223 Brétigny sur Orge, France
| | - Mounya Elhilali
- Electrical Engineering Department, Johns Hopkins University, Baltimore, Maryland 21218
| | - Shihab Shamma
- Institute for Systems Research, Electrical and Computer Engineering Department, University of Maryland, College Park, Maryland 20742
| |
Collapse
|
48
|
Patil K, Elhilali M. Biomimetic spectro-temporal features for music instrument recognition in isolated notes and solo phrases. EURASIP J Audio Speech Music Process 2015; 2015:27. [PMID: 30555520 PMCID: PMC6290678 DOI: 10.1186/s13636-015-0070-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Abstract
The identity of musical instruments is reflected in the acoustic attributes of musical notes played with them. Recently, it has been argued that these characteristics of musical identity (or timbre) are best captured through an analysis that encompasses both time and frequency domains, with a focus on the modulations, or changes, in the signal in the spectro-temporal space. This representation mimics the spectro-temporal receptive field (STRF) analysis believed to underlie processing in the central mammalian auditory system, particularly at the level of primary auditory cortex. How well this STRF representation captures the timbral identity of musical instruments in continuous solo recordings remains unclear. The current work investigates the applicability of the STRF feature space for instrument recognition in solo musical phrases and explores approaches for leveraging knowledge from isolated musical notes for instrument recognition in solo recordings. The study presents an approach for parsing solo performances into their individual note constituents and adapting back-end support vector machine classifiers to generalize instrument recognition to off-the-shelf, commercially available solo recordings.
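The pairing of spectro-temporal features with a support vector machine back end can be sketched as follows; the crude 2-D modulation featurization and the random stand-in data are assumptions, not the paper's full STRF analysis or its note-parsing stage.

    # Sketch of the back-end idea: summarize each note or phrase by the
    # low-order magnitudes of its 2-D (spectro-temporal) modulation spectrum
    # and train an SVM on those feature vectors.
    import numpy as np
    from sklearn.svm import SVC

    def modulation_features(logspec, keep=8):
        """Low-order 2-D modulation magnitudes of a log-spectrogram."""
        M = np.abs(np.fft.rfft2(logspec))
        return M[:keep, :keep].ravel()

    # Usage with hypothetical data: rows of X would be modulation_features()
    # of training excerpts, y their instrument labels.
    rng = np.random.default_rng(0)
    X = rng.normal(size=(40, 64))
    y = np.repeat([0, 1], 20)
    clf = SVC(kernel="rbf", C=1.0).fit(X, y)
    print("train accuracy:", clf.score(X, y))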
Collapse
|
49
|
Abstract
A new approach for the segregation of monaural sound mixtures is presented, based on the principle of temporal coherence and using auditory cortical representations. Temporal coherence is the notion that perceived sources emit coherently modulated features that evoke highly coincident neural response patterns. By clustering the feature channels with coincident responses and reconstructing their input, one may segregate the underlying source from simultaneously interfering signals that are uncorrelated with it. The proposed algorithm requires no prior information or training on the sources. It can, however, gracefully incorporate cognitive functions and influences, such as memories of a target source or attention to a specific set of its attributes, so as to segregate it from its background. Aside from its unusual structure and computational innovations, the proposed model provides testable hypotheses of the physiological mechanisms of this ubiquitous and remarkable perceptual ability, and of its psychophysical manifestations in navigating complex sensory environments.

Humans and many animals can effortlessly navigate complex sensory environments, segregating and attending to one desired target source while suppressing distracting and interfering sources. In this paper, we present an algorithmic model that accomplishes this task with no prior information or training on complex signals such as speech mixtures, and speech in noise and music. The model accounts for this ability relying solely on the temporal coherence principle, the notion that perceived sources emit coherently modulated features that evoke coincident cortical response patterns. It further demonstrates how basic cortical mechanisms common to all sensory systems can implement the necessary representations, as well as the adaptive computations needed to maintain continuity by tracking the slowly changing characteristics of different sources in a scene.
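The grouping step can be caricatured in a few lines of Python: normalize channel envelopes, form their coincidence (correlation) matrix, and keep the channels coherent with an attended anchor channel. The anchor-plus-threshold rule below is a simplification standing in for the model's clustering and cortical dynamics; channel counts, noise levels, and the threshold are assumptions.

    # Minimal sketch of the temporal-coherence principle: channels whose
    # envelopes are coherently modulated group together; here, channels
    # sufficiently correlated with an "attended" anchor channel form the mask.
    import numpy as np

    def coherence_mask(env, anchor=0, thresh=0.5):
        """env: channels-by-frames envelopes -> boolean channel grouping."""
        Z = (env - env.mean(axis=1, keepdims=True)) \
            / (env.std(axis=1, keepdims=True) + 1e-9)
        C = Z @ Z.T / Z.shape[1]            # channel coincidence matrix
        return C[anchor] > thresh

    # Two "sources": channels 0-3 share one envelope, channels 4-7 another.
    rng = np.random.default_rng(1)
    a, b = rng.random(200), rng.random(200)
    env = np.vstack([a + 0.1 * rng.random(200) for _ in range(4)] +
                    [b + 0.1 * rng.random(200) for _ in range(4)])
    print(coherence_mask(env))   # expect channels 0-3 grouped apart from 4-7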
Collapse
Affiliation(s)
- Lakshmi Krishnan
- Department of Electrical and Computer Engineering, University of Maryland, College Park, Maryland, United States of America
| | - Mounya Elhilali
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, Maryland, United States of America
| | - Shihab Shamma
- Department of Electrical and Computer Engineering, University of Maryland, College Park, Maryland, United States of America
- Département d'Etudes Cognitives, Ecole normale supérieure, Paris, France
| |
Collapse
|
50
|
Akram S, Englitz B, Elhilali M, Simon JZ, Shamma SA. Investigating the neural correlates of a streaming percept in an informational-masking paradigm. PLoS One 2014; 9:e114427. [PMID: 25490720 PMCID: PMC4260833 DOI: 10.1371/journal.pone.0114427] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2014] [Accepted: 11/10/2014] [Indexed: 11/19/2022] Open
Abstract
Humans routinely segregate a complex acoustic scene into different auditory streams, through the extraction of bottom-up perceptual cues and the use of top-down selective attention. To determine the neural mechanisms underlying this process, neural responses obtained through magnetoencephalography (MEG) were correlated with behavioral performance in the context of an informational masking paradigm. In half the trials, subjects were asked to detect frequency deviants in a target stream, consisting of a rhythmic tone sequence, embedded in a separate masker stream composed of a random cloud of tones. In the other half of the trials, subjects were exposed to identical stimuli but asked to perform a different task: to detect tone-length changes in the random cloud of tones. In order to verify that the normalized neural response to the target sequence served as an indicator of streaming, we correlated neural responses with behavioral performance under a variety of stimulus parameters (target tone rate, target tone frequency, and the “protection zone”, that is, the spectral area with no tones around the target frequency) and attentional states (changing the task objective while maintaining the same stimuli). In all conditions that facilitated target/masker streaming behaviorally, MEG normalized neural responses also changed in a manner consistent with the behavior. Thus, attending to the target stream caused a significant increase in the power and phase coherence of the responses in recording channels, which correlated with an increase in the behavioral performance of the listeners. Normalized neural target responses also increased as the protection zone widened and as the frequency of the target tones increased. Finally, when the target sequence rate increased, the buildup of the normalized neural responses was significantly faster, mirroring the accelerated buildup of the streaming percepts. Our data thus support close links between the perceptual and neural consequences of auditory stream segregation.
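One conventional way to quantify the phase coherence such analyses rely on is inter-trial phase coherence at the target repetition rate, sketched below in Python with synthetic data; the filtering, channel selection, and normalization used in the actual study differ.

    # Inter-trial phase coherence: each trial's response is reduced to its
    # Fourier phase at the target frequency; coherent phases across trials
    # give a value near 1, random phases a value near 0.
    import numpy as np

    def phase_coherence(trials, fs, freq):
        """trials: n_trials-by-n_samples; returns |mean unit phasor| at freq."""
        n = trials.shape[1]
        k = int(round(freq * n / fs))            # FFT bin of the target rate
        spectra = np.fft.rfft(trials, axis=1)[:, k]
        return np.abs(np.mean(spectra / (np.abs(spectra) + 1e-12)))

    fs, n = 500, 1000                            # 2 s of 500-Hz MEG-like data
    t = np.arange(n) / fs
    rng = np.random.default_rng(2)
    locked = np.sin(2 * np.pi * 4 * t) + rng.normal(0, 1, (30, n))  # 4-Hz locked
    unlocked = rng.normal(0, 1, (30, n))
    print(phase_coherence(locked, fs, 4.0), phase_coherence(unlocked, fs, 4.0))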
Collapse
Affiliation(s)
- Sahar Akram
- The Institute for Systems Research, University of Maryland, College Park, Maryland, United States of America
- Department of Electrical and Computer Engineering, University of Maryland, College Park, Maryland, United States of America
| | - Bernhard Englitz
- The Institute for Systems Research, University of Maryland, College Park, Maryland, United States of America
- Department of Electrical and Computer Engineering, University of Maryland, College Park, Maryland, United States of America
- Département d'Etudes Cognitives, Ecole normale supérieure, Paris, France
- Department of Neurophysiology, Donders Institute for Brain, Cognition and Behaviour, Nijmegen, The Netherlands
| | - Mounya Elhilali
- Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, Maryland, United States of America
| | - Jonathan Z. Simon
- Department of Electrical and Computer Engineering, University of Maryland, College Park, Maryland, United States of America
- Department of Biology, University of Maryland, College Park, Maryland, United States of America
| | - Shihab A. Shamma
- The Institute for Systems Research, University of Maryland, College Park, Maryland, United States of America
- Department of Electrical and Computer Engineering, University of Maryland, College Park, Maryland, United States of America
- Département d'Etudes Cognitives, Ecole normale supérieure, Paris, France
| |
Collapse
|