1
|
Wong DWC, Wang J, Cheung SMY, Lai DKH, Chiu ATS, Pu D, Cheung JCW, Kwok TCY. Current Technological Advances in Dysphagia Screening: Systematic Scoping Review. J Med Internet Res 2025; 27:e65551. [PMID: 40324167 DOI: 10.2196/65551] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2024] [Revised: 12/30/2024] [Accepted: 03/25/2025] [Indexed: 05/07/2025] Open
Abstract
BACKGROUND Dysphagia affects more than half of older adults with dementia and is associated with a 10-fold increase in mortality. The development of accessible, objective, and reliable screening tools is crucial for early detection and management. OBJECTIVE This systematic scoping review aimed to (1) examine the current state of the art in artificial intelligence (AI) and sensor-based technologies for dysphagia screening, (2) evaluate the performance of these AI-based screening tools, and (3) assess the methodological quality and rigor of studies on AI-based dysphagia screening tools. METHODS We conducted a systematic literature search across CINAHL, Embase, PubMed, and Web of Science from inception to July 4, 2024, following the PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews) framework. In total, 2 independent researchers conducted the search, screening, and data extraction. Eligibility criteria included original studies using sensor-based instruments with AI to identify individuals with dysphagia or unsafe swallow events. We excluded studies on pediatric, infant, or postextubation dysphagia, as well as those using non-sensor-based assessments or diagnostic tools. We used a modified Quality Assessment of Diagnostic Accuracy Studies-2 tool to assess methodological quality, adding a "model" domain for AI-specific evaluation. Data were synthesized narratively. RESULTS This review included 24 studies involving 2979 participants (1717 with dysphagia and 1262 controls). In total, 75% (18/24) of the studies focused solely on per-individual classification rather than per-swallow event classification. Acoustic (13/24, 54%) and vibratory (9/24, 38%) signals were the primary modality sources. In total, 25% (6/24) of the studies used multimodal approaches, whereas 75% (18/24) used a single modality. Support vector machine was the most common AI model (15/24, 62%), with deep learning approaches emerging in recent years (3/24, 12%). Performance varied widely-accuracy ranged from 71.2% to 99%, area under the receiver operating characteristic curve ranged from 0.77 to 0.977, and sensitivity ranged from 63.6% to 100%. Multimodal systems generally outperformed unimodal systems. The methodological quality assessment revealed a risk of bias, particularly in patient selection (unclear in 18/24, 75% of the studies), index test (unclear in 23/24, 96% of the studies), and modeling (high risk in 13/24, 54% of the studies). Notably, no studies conducted external validation or domain adaptation testing, raising concerns about real-world applicability. CONCLUSIONS This review provides a comprehensive overview of technological advancements in AI and sensor-based dysphagia screening. While these developments show promise for continuous long-term tele-swallowing assessments, significant methodological limitations were identified. Future studies can explore how each modality can target specific anatomical regions and manifestations of dysphagia. This detailed understanding of how different modalities address various aspects of dysphagia can significantly benefit multimodal systems, enabling them to better handle the multifaceted nature of dysphagia conditions.
Collapse
Affiliation(s)
- Duo Wai-Chi Wong
- Department of Biomedical Engineering, Faculty of Engineering, Hong Kong Polytechnic University, Hong Kong, China (Hong Kong)
| | - Jiao Wang
- Department of Biomedical Engineering, Faculty of Engineering, Hong Kong Polytechnic University, Hong Kong, China (Hong Kong)
- Department of Clinical Laboratory, Hubei Provincial Hospital of Traditional Chinese Medicine, Wuhan, China
| | - Sophia Ming-Yan Cheung
- Department of Mathematics, School of Science, Hong Kong University of Science and Technology, Hong Kong, China (Hong Kong)
| | - Derek Ka-Hei Lai
- Department of Biomedical Engineering, Faculty of Engineering, Hong Kong Polytechnic University, Hong Kong, China (Hong Kong)
| | - Armstrong Tat-San Chiu
- Kowloon Home for the Aged Blind, Hong Kong Society for the Blind, Hong Kong, China (Hong Kong)
| | - Dai Pu
- School of Primary and Allied Health Care, Monash University, Melbourne, Australia
| | - James Chung-Wai Cheung
- Department of Biomedical Engineering, Faculty of Engineering, Hong Kong Polytechnic University, Hong Kong, China (Hong Kong)
- Research Institute for Smart Ageing, Hong Kong Polytechnic University, Hong Kong, China (Hong Kong)
| | - Timothy Chi-Yui Kwok
- Department of Medicine and Therapeutics, Faculty of Medicine, Chinese University of Hong Kong, Hong Kong, China (Hong Kong)
- Jockey Club Centre for Positive Ageing, Chinese University of Hong Kong, Hong Kong, China (Hong Kong)
| |
Collapse
|
2
|
Khodami F, Mahoney AS, Coyle JL, Sejdić E. Elevating Patient Care With Deep Learning: High-Resolution Cervical Auscultation Signals for Swallowing Kinematic Analysis in Nasogastric Tube Patients. IEEE JOURNAL OF TRANSLATIONAL ENGINEERING IN HEALTH AND MEDICINE 2024; 12:711-720. [PMID: 39698476 PMCID: PMC11655099 DOI: 10.1109/jtehm.2024.3497895] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/28/2024] [Accepted: 11/04/2024] [Indexed: 12/20/2024]
Abstract
Patients with nasogastric (NG) tubes require careful monitoring due to the potential impact of the tube on their ability to swallow safely. This study aimed to investigate the utility of high-resolution cervical auscultation (HRCA) signals in assessing swallowing functionality of patients using feeding tubes. HRCA, capturing swallowing vibratory and acoustic signals, has been explored as a surrogate for videofluoroscopy image analysis in previous research. In this study, we analyzed HRCA signals recorded from patients with NG tubes to identify swallowing kinematic events within this group of subjects. Machine learning architectures from prior research endeavors, originally designed for participants without NG tubes, were fine-tuned to accomplish three tasks in the target population: estimating the duration of upper esophageal sphincter opening, estimating the duration of laryngeal vestibule closure, and tracking the hyoid bone. The convolutional recurrent neural network proposed for the first task predicted the onset of upper esophageal sphincter opening and closure for 67.61% and 82.95% of patients, respectively, with an error margin of fewer than three frames. The hybrid model employed for the second task successfully predicted the onset of laryngeal vestibule closure and reopening for 79.62% and 75.80% of patients, respectively, with the same error margin. The stacked recurrent neural network identified hyoid bone position in test frames, achieving a 41.27% overlap with ground-truth outputs. By applying established algorithms to an unseen population, we demonstrated the utility of HRCA signals for swallowing assessment in individuals with NG tubes and showcased the generalizability of algorithms developed in our previous studies. Clinical impact: This study highlights the promise of HRCA signals for assessing swallowing in patients with NG tubes, potentially improving diagnosis, management, and care integration in both clinical and home healthcare settings.
Collapse
Affiliation(s)
- Farnaz Khodami
- Department of Electrical and Computer EngineeringFaculty of Applied Science and EngineeringUniversity of TorontoTorontoONM5S 1A4Canada
| | - Amanda S. Mahoney
- Department of the Communication Science and DisordersSchool of Health and Rehabilitation SciencesUniversity of PittsburghPittsburghPA15213USA
| | - James L. Coyle
- Department of the Communication Science and DisordersSchool of Health and Rehabilitation SciencesUniversity of PittsburghPittsburghPA15213USA
| | - Ervin Sejdić
- Department of Electrical and Computer EngineeringFaculty of Applied Science and EngineeringUniversity of TorontoTorontoONM5S 1A4Canada
- North York General HospitalTorontoONM2K 1E1Canada
| |
Collapse
|
3
|
Allen J, Clunie G, Newman H, Slinger C. Comment on Chatwin et al. Waves of Precision: A Practical Guide for Reviewing New Tools to Evaluate Mechanical In-Exsufflation Efficacy in Neuromuscular Disorders. J. Clin. Med. 2024, 13, 2643. J Clin Med 2024; 13:4991. [PMID: 39274202 PMCID: PMC11396265 DOI: 10.3390/jcm13174991] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2024] [Revised: 08/09/2024] [Accepted: 08/14/2024] [Indexed: 09/16/2024] Open
Abstract
We read with interest the paper published by Chatwin et al [...].
Collapse
Affiliation(s)
- Jodi Allen
- The National Hospital for Neurology & Neurosurgery, University College London Hospitals NHS Foundation Trust, London WC1N 3BG, UK
- University College London Centre for Medical Imaging, London W1W 7TS, UK
| | - Gemma Clunie
- Surgery and Cancer, Imperial College London, London W12 0BZ, UK
- Charing Cross Hospital, Imperial College NHS Healthcare Trust, London W6 8RF, UK
| | - Helen Newman
- Barnet Hospital, Royal Free London NHS Foundation Trust, London EN5 3DJ, UK
- University College London Division of Surgery and Interventional Science, London WC1E 6BT, UK
| | - Claire Slinger
- Royal Preston Hospital, Lancashire Teaching Hospitals NHS Trust, Preston PR2 9HT, UK
| |
Collapse
|
4
|
Srinivasan Y, Liu A, Rameau A. Machine learning in the evaluation of voice and swallowing in the head and neck cancer patient. Curr Opin Otolaryngol Head Neck Surg 2024; 32:105-112. [PMID: 38116798 DOI: 10.1097/moo.0000000000000948] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2023]
Abstract
PURPOSE OF REVIEW The purpose of this review is to present recent advances and limitations in machine learning applied to the evaluation of speech, voice, and swallowing in head and neck cancer. RECENT FINDINGS Novel machine learning models incorporating diverse data modalities with improved discriminatory capabilities have been developed for predicting toxicities following head and neck cancer therapy, including dysphagia, dysphonia, xerostomia, and weight loss as well as guiding treatment planning. Machine learning has been applied to the care of posttreatment voice and swallowing dysfunction by offering objective and standardized assessments and aiding innovative technologies for functional restoration. Voice and speech are also being utilized in machine learning algorithms to screen laryngeal cancer. SUMMARY Machine learning has the potential to help optimize, assess, predict, and rehabilitate voice and swallowing function in head and neck cancer patients as well as aid in cancer screening. However, existing studies are limited by the lack of sufficient external validation and generalizability, insufficient transparency and reproducibility, and no clear superior predictive modeling strategies. Algorithms and applications will need to be trained on large multiinstitutional data sets, incorporate sociodemographic data to reduce bias, and achieve validation through clinical trials for optimal performance and utility.
Collapse
Affiliation(s)
- Yashes Srinivasan
- Sean Parker Institute for the Voice, Department of Otolaryngology-Head and Neck Surgery, Weill Cornell Medicine, New York, New York
| | - Amy Liu
- University of California, San Diego, School of Medicine, San Diego, California, USA
| | - Anaïs Rameau
- Sean Parker Institute for the Voice, Department of Otolaryngology-Head and Neck Surgery, Weill Cornell Medicine, New York, New York
| |
Collapse
|
5
|
Khalifa Y, Mahoney AS, Lucatorto E, Coyle JL, Sejdić E. Non-Invasive Sensor-Based Estimation of Anterior-Posterior Upper Esophageal Sphincter Opening Maximal Distension. IEEE JOURNAL OF TRANSLATIONAL ENGINEERING IN HEALTH AND MEDICINE 2023; 11:182-190. [PMID: 36873304 PMCID: PMC9976940 DOI: 10.1109/jtehm.2023.3246919] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/25/2022] [Revised: 01/25/2023] [Accepted: 02/15/2023] [Indexed: 02/22/2023]
Abstract
OBJECTIVE Dysphagia management relies on the evaluation of the temporospatial kinematic events of swallowing performed in videofluoroscopy (VF) by trained clinicians. The upper esophageal sphincter (UES) opening distension represents one of the important kinematic events that contribute to healthy swallowing. Insufficient distension of UES opening can lead to an accumulation of pharyngeal residue and subsequent aspiration which in turn can lead to adverse outcomes such as pneumonia. VF is usually used for the temporal and spatial evaluation of the UES opening; however, VF is not available in all clinical settings and may be inappropriate or undesirable for some patients. High resolution cervical auscultation (HRCA) is a noninvasive technology that uses neck-attached sensors and machine learning to characterize swallowing physiology by analyzing the swallow-induced vibrations/sounds in the anterior neck region. We investigated the ability of HRCA to noninvasively estimate the maximal distension of anterior-posterior (A-P) UES opening as accurately as the measurements performed by human judges from VF images. METHODS AND PROCEDURES Trained judges performed the kinematic measurement of UES opening duration and A-P UES opening maximal distension on 434 swallows collected from 133 patients. We used a hybrid convolutional recurrent neural network supported by attention mechanisms which takes HRCA raw signals as input and estimates the value of the A-P UES opening maximal distension as output. RESULTS The proposed network estimated the A-P UES opening maximal distension with an absolute percentage error of 30% or less for more than 64.14% of the swallows in the dataset. CONCLUSION This study provides substantial evidence for the feasibility of using HRCA to estimate one of the key spatial kinematic measurements used for dysphagia characterization and management. Clinical and Translational Impact Statement: The findings in this study have a direct impact on dysphagia diagnosis and management through providing a non-invasive and cheap way to estimate one of the most important swallowing kinematics, the UES opening distension, that contributes to safe swallowing. This study, along with other studies that utilize HRCA for swallowing kinematic analysis, paves the way for developing a widely available and easy-to-use tool for dysphagia diagnosis and management.
Collapse
Affiliation(s)
- Yassin Khalifa
- Department of Biomedical EngineeringCairo UniversityGiza12613Egypt
- Department of Electrical and Computer EngineeringSwanson School of EngineeringUniversity of PittsburghPittsburghPA15260USA
- Case Western Reserve University School of MedicineClevelandOH44106USA
- University Hospitals Harrington Heart and Vascular InstituteClevelandOH44106USA
| | - Amanda S. Mahoney
- Department of Communication Science and DisordersUniversity of PittsburghPittsburghPA15260USA
| | - Erin Lucatorto
- Department of Communication Science and DisordersUniversity of PittsburghPittsburghPA15260USA
| | - James L. Coyle
- Department of Communication Science and DisordersUniversity of PittsburghPittsburghPA15260USA
- Department of OtolaryngologyUniversity of PittsburghPittsburghPA15260USA
| | - Ervin Sejdić
- The Edward S. Rogers Sr. Department of Electrical and Computer EngineeringFaculty of Applied Science and EngineeringUniversity of TorontoTorontoONM5S 1A1Canada
- North York General HospitalTorontoONM2K 1E1Canada
| |
Collapse
|
6
|
Mialland A, Atallah I, Bonvilain A. Toward a robust swallowing detection for an implantable active artificial larynx: a survey. Med Biol Eng Comput 2023; 61:1299-1327. [PMID: 36792845 DOI: 10.1007/s11517-023-02772-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2022] [Accepted: 01/04/2023] [Indexed: 02/17/2023]
Abstract
Total laryngectomy consists in the removal of the larynx and is intended as a curative treatment for laryngeal cancer, but it leaves the patient with no possibility to breathe, talk, and swallow normally anymore. A tracheostomy is created to restore breathing through the throat, but the aero-digestive tracts are permanently separated and the air no longer passes through the nasal tracts, which allowed filtration, warming, humidification, olfaction, and acceleration of the air for better tissue oxygenation. As for phonation restoration, various techniques allow the patient to talk again. The main one consists of a tracheo-esophageal valve prosthesis that makes the air passes from the esophagus to the pharynx, and makes the air vibrate to allow speech through articulation. Finally, swallowing is possible through the original tract as it is now isolated from the trachea. Yet, many methods exist to detect and assess a swallowing, but none is intended as a definitive restoration technique of the natural airway, which would permanently close the tracheostomy and avoid its adverse effects. In addition, these methods are non-invasive and lack detection accuracy. The feasibility of an effective early detection of swallowing would allow to further develop an implantable active artificial larynx and therefore restore the aero-digestive tracts. A previous attempt has been made on an artificial larynx implanted in 2012, but no active detection was included and the system was completely mechanic. This led to residues in the airway because of the imperfect sealing of the mechanism. An active swallowing detection coupled with indwelling measurements would thus likely add a significant reliability on such a system as it would allow to actively close an artificial larynx. So, after a brief explanation of the swallowing mechanism, this survey intends to first provide a detailed consideration of the anatomical region involved in swallowing, with a detection perspective. Second, the swallowing mechanism following total laryngectomy surgery is detailed. Third, the current non-invasive swallowing detection technique and their limitations are discussed. Finally, the previous points are explored with regard to the inherent requirements for the feasibility of an effective swallowing detection for an artificial larynx. Graphical Abstract.
Collapse
Affiliation(s)
- Adrien Mialland
- Institute of Engineering and Management Univ. Grenoble Alpes, Univ. Grenoble Alpes, CNRS, Grenoble INP, Gipsa-lab, 38000, Grenoble, France.
| | - Ihab Atallah
- Institute of Engineering and Management Univ. Grenoble Alpes, Otorhinolaryngology, CHU Grenoble Alpes, 38700, La Tronche, France
| | - Agnès Bonvilain
- Institute of Engineering and Management Univ. Grenoble Alpes, Univ. Grenoble Alpes, CNRS, Grenoble INP, Gipsa-lab, 38000, Grenoble, France
| |
Collapse
|
7
|
Khalifa Y, Donohue C, Coyle JL, Sejdic E. Autonomous Swallow Segment Extraction Using Deep Learning in Neck-Sensor Vibratory Signals From Patients With Dysphagia. IEEE J Biomed Health Inform 2023; 27:956-967. [PMID: 36417738 PMCID: PMC10079637 DOI: 10.1109/jbhi.2022.3224323] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
Dysphagia occurs secondary to a variety of underlying etiologies and can contribute to increased risk of adverse events such as aspiration pneumonia and premature mortality. Dysphagia is primarily diagnosed and characterized by instrumental swallowing exams such as videofluoroscopic swallowing studies. videofluoroscopic swallowing studies involve the inspection of a series of radiographic images for signs of swallowing dysfunction. Though effective, videofluoroscopic swallowing studies are only available in certain clinical settings and are not always desirable or feasible for certain patients. Because of the limitations of current instrumental swallow exams, research studies have explored the use of acceleration signals collected from neck sensors and demonstrated their potential in providing comparable radiation-free diagnostic value as videofluoroscopic swallowing studies. In this study, we used a hybrid deep convolutional recurrent neural network that can perform multi-level feature extraction (localized and across time) to annotate swallow segments automatically via multi-channel swallowing acceleration signals. In total, we used signals and videofluoroscopic swallowing study images of 3144 swallows from 248 patients with suspected dysphagia. Compared to other deep network variants, our network was superior at detecting swallow segments with an average area under the receiver operating characteristic curve value of 0.82 (95% confidence interval: 0.807-0.841), and was in agreement with up to 90% of the gold standard-labeled segments.
Collapse
|
8
|
Donohue C, Khalifa Y, Mao S, Perera S, Sejdić E, Coyle JL. Characterizing Swallows From People With Neurodegenerative Diseases Using High-Resolution Cervical Auscultation Signals and Temporal and Spatial Swallow Kinematic Measurements. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:3416-3431. [PMID: 34428093 PMCID: PMC8642099 DOI: 10.1044/2021_jslhr-21-00134] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/09/2021] [Revised: 04/21/2021] [Accepted: 05/21/2021] [Indexed: 06/13/2023]
Abstract
Purpose The prevalence of dysphagia in patients with neurodegenerative diseases (ND) is alarmingly high and frequently results in morbidity and accelerated mortality due to subsequent adverse events (e.g., aspiration pneumonia). Swallowing in patients with ND should be continuously monitored due to the progressive disease nature. Access to instrumental swallow evaluations can be challenging, and limited studies have quantified changes in temporal/spatial swallow kinematic measures in patients with ND. High-resolution cervical auscultation (HRCA), a dysphagia screening method, has accurately differentiated between safe and unsafe swallows, identified swallow kinematic events (e.g., laryngeal vestibule closure [LVC]), and classified swallows between healthy adults and patients with ND. This study aimed to (a) compare temporal/spatial swallow kinematic measures between patients with ND and healthy adults and (b) investigate HRCA's ability to annotate swallow kinematic events in patients with ND. We hypothesized there would be significant differences in temporal/spatial swallow measurements between groups and that HRCA would accurately annotate swallow kinematic events in patients with ND. Method Participants underwent videofluoroscopic swallowing studies with concurrent HRCA. We used linear mixed models to compare temporal/spatial swallow measurements (n = 170 ND patient swallows, n = 171 healthy adult swallows) and deep learning machine-learning algorithms to annotate specific temporal and spatial kinematic events in swallows from patients with ND. Results Differences (p < .05) were found between groups for several temporal and spatial swallow kinematic measures. HRCA signal features were used as input to machine-learning algorithms and annotated upper esophageal sphincter (UES) opening, UES closure, LVC, laryngeal vestibule reopening, and hyoid bone displacement with 66.25%, 85%, 68.18%, 70.45%, and 44.6% accuracy, respectively, compared to human judges' measurements. Conclusion This study demonstrates HRCA's potential in characterizing swallow function in patients with ND and other patient populations.
Collapse
Affiliation(s)
- Cara Donohue
- Department of Communication Science and Disorders, School of Health and Rehabilitation Sciences, University of Pittsburgh, PA
| | - Yassin Khalifa
- Department of Electrical and Computer Engineering, Swanson School of Engineering, University of Pittsburgh, PA
| | - Shitong Mao
- Department of Electrical and Computer Engineering, Swanson School of Engineering, University of Pittsburgh, PA
| | - Subashan Perera
- Division of Geriatric Medicine, Department of Medicine, University of Pittsburgh, PA
| | - Ervin Sejdić
- Department of Electrical and Computer Engineering, Swanson School of Engineering, University of Pittsburgh, PA
- Department of Bioengineering, Swanson School of Engineering, University of Pittsburgh, PA
- Department of Biomedical Informatics, University of Pittsburgh School of Medicine, PA
- Intelligent Systems Program, School of Computing and Information, University of Pittsburgh, PA
| | - James L. Coyle
- Department of Communication Science and Disorders, School of Health and Rehabilitation Sciences, University of Pittsburgh, PA
- Department of Otolaryngology, School of Medicine, University of Pittsburgh Medical Center, PA
| |
Collapse
|
9
|
Donohue C, Khalifa Y, Mao S, Perera S, Sejdić E, Coyle JL. Establishing Reference Values for Temporal Kinematic Swallow Events Across the Lifespan in Healthy Community Dwelling Adults Using High-Resolution Cervical Auscultation. Dysphagia 2021; 37:664-675. [PMID: 34018024 DOI: 10.1007/s00455-021-10317-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2020] [Accepted: 05/13/2021] [Indexed: 11/29/2022]
Abstract
Few research studies have investigated temporal kinematic swallow events in healthy adults to establish normative reference values. Determining cutoffs for normal and disordered swallowing is vital for differentially diagnosing presbyphagia, variants of normal swallowing, and dysphagia; and for ensuring that different swallowing research laboratories produce consistent results in common measurements from different samples within the same population. High-resolution cervical auscultation (HRCA), a sensor-based dysphagia screening method, has accurately annotated temporal kinematic swallow events in patients with dysphagia, but hasn't been used to annotate temporal kinematic swallow events in healthy adults to establish dysphagia screening cutoffs. This study aimed to determine: (1) Reference values for temporal kinematic swallow events, (2) Whether HRCA can annotate temporal kinematic swallow events in healthy adults. We hypothesized (1) Our reference values would align with a prior study; (2) HRCA would detect temporal kinematic swallow events as accurately as human judges. Trained judges completed temporal kinematic measurements on 659 swallows (N = 70 adults). Swallow reaction time and LVC duration weren't different (p > 0.05) from a previously published historical cohort (114 swallows, N = 38 adults), while other temporal kinematic measurements were different (p < 0.05), suggesting a need for further standardization to feasibly pool data analyses across laboratories. HRCA signal features were used as input to machine learning algorithms and annotated UES opening (69.96% accuracy), UES closure (64.52% accuracy), LVC (52.56% accuracy), and LV re-opening (69.97% accuracy); providing preliminary evidence that HRCA can noninvasively and accurately annotate temporal kinematic measurements in healthy adults to determine dysphagia screening cutoffs.
Collapse
Affiliation(s)
- Cara Donohue
- Department of Communication Science and Disorders, School of Health and Rehabilitation Sciences, University of Pittsburgh, 6035 Forbes Tower, Pittsburgh, PA, 15260, USA
| | - Yassin Khalifa
- Department of Electrical and Computer Engineering, Swanson School of Engineering, University of Pittsburgh, Pittsburgh, PA, 15260, USA
| | - Shitong Mao
- Department of Electrical and Computer Engineering, Swanson School of Engineering, University of Pittsburgh, Pittsburgh, PA, 15260, USA
| | - Subashan Perera
- Division of Geriatrics, Department of Medicine, University of Pittsburgh, Pittsburgh, PA, 15261, USA
| | - Ervin Sejdić
- Department of Electrical and Computer Engineering, Swanson School of Engineering, University of Pittsburgh, Pittsburgh, PA, 15260, USA.,Department of Bioengineering, Swanson School of Engineering, University of Pittsburgh, Pittsburgh, PA, 15260, USA.,Department of Biomedical Informatics, School of Medicine Intelligent Systems Program, School of Computing and Information, University of Pittsburgh, Pittsburgh, PA, 15260, USA
| | - James L Coyle
- Department of Communication Science and Disorders, School of Health and Rehabilitation Sciences, University of Pittsburgh, 6035 Forbes Tower, Pittsburgh, PA, 15260, USA. .,Department of Otolaryngology, School of Medicine, University of Pittsburgh Medical Center, Pittsburgh, PA, 15260, USA.
| |
Collapse
|
10
|
Donohue C, Khalifa Y, Perera S, Sejdić E, Coyle JL. How Closely do Machine Ratings of Duration of UES Opening During Videofluoroscopy Approximate Clinician Ratings Using Temporal Kinematic Analyses and the MBSImP? Dysphagia 2020; 36:707-718. [PMID: 32955619 DOI: 10.1007/s00455-020-10191-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2020] [Accepted: 09/14/2020] [Indexed: 10/23/2022]
Abstract
Clinicians evaluate swallow kinematic events by analyzing videofluoroscopy (VF) images for dysphagia management. The duration of upper esophageal sphincter opening (DUESO) is one important temporal swallow event, because reduced DUESO can result in pharyngeal residue and penetration/aspiration. VF is frequently used for evaluating swallowing but exposes patients to radiation and is not always feasible/readily available. High resolution cervical auscultation (HRCA) is a non-invasive, sensor-based dysphagia screening method that uses signal processing and machine learning to characterize swallowing. We investigated HRCA's ability to annotate DUESO and predict Modified Barium Swallow Impairment Profile (MBSImP) scores (component #14). We hypothesized that HRCA and machine learning techniques would detect DUESO with similar accuracy as human judges. Trained judges completed temporal kinematic measurements of DUESO on 719 swallows (116 patients) and 50 swallows (15 age-matched healthy adults). An MBSImP certified clinician completed MBSImP ratings on 100 swallows. A multi-layer convolutional recurrent neural network (CRNN) using HRCA signal features for input was used to detect DUESO. Generalized estimating equations models were used to determine statistically significant HRCA signal features for predicting DUESO MBSImP scores. A support vector machine (SVM) classifier and a leave-one-out procedure was used to predict DUESO MBSImP scores. The CRNN detected UES opening within a 3-frame tolerance for 82.6% of patient and 86% of healthy swallows and UES closure for 72.3% of patient and 64% of healthy swallows. The SVM classifier predicted DUESO MBSImP scores with 85.7% accuracy. This study provides evidence of HRCA's feasibility in detecting DUESO without VF images.
Collapse
Affiliation(s)
- Cara Donohue
- Department of Communication Science and Disorders, School of Health and Rehabilitation Sciences, University of Pittsburgh, 6035 Forbes Tower, Pittsburgh, PA, 15260, USA
| | - Yassin Khalifa
- Department of Electrical and Computer Engineering, Swanson School of Engineering, University of Pittsburgh, Pittsburgh, PA, 15260, USA
| | - Subashan Perera
- Division of Geriatrics, Department of Medicine, University of Pittsburgh, Pittsburgh, PA, 15261, USA
| | - Ervin Sejdić
- Department of Electrical and Computer Engineering, Swanson School of Engineering, University of Pittsburgh, Pittsburgh, PA, 15260, USA.,Department of Bioengineering, Swanson School of Engineering, University of Pittsburgh, Pittsburgh, PA, 15260, USA.,Department of Biomedical Informatics, School of Medicine, University of Pittsburgh, Pittsburgh, PA, 15260, USA.,Intelligent Systems Program, School of Computing and Information, University of Pittsburgh, Pittsburgh, PA, 15260, USA
| | - James L Coyle
- Department of Communication Science and Disorders, School of Health and Rehabilitation Sciences, University of Pittsburgh, 6035 Forbes Tower, Pittsburgh, PA, 15260, USA.
| |
Collapse
|
11
|
Donohue C, Mao S, Sejdić E, Coyle JL. Tracking Hyoid Bone Displacement During Swallowing Without Videofluoroscopy Using Machine Learning of Vibratory Signals. Dysphagia 2020; 36:259-269. [PMID: 32419103 DOI: 10.1007/s00455-020-10124-z] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2019] [Accepted: 04/29/2020] [Indexed: 10/24/2022]
Abstract
Identifying physiological impairments of swallowing is essential for determining accurate diagnosis and appropriate treatment for patients with dysphagia. The hyoid bone is an anatomical landmark commonly monitored during analysis of videofluoroscopic swallow studies (VFSSs). Its displacement is predictive of penetration/aspiration and is associated with other swallow kinematic events. However, VFSSs are not always readily available/feasible and expose patients to radiation. High-resolution cervical auscultation (HRCA), which uses acoustic and vibratory signals from a microphone and tri-axial accelerometer, is under investigation as a non-invasive dysphagia screening method and potential adjunct to VFSS when it is unavailable or not feasible. We investigated the ability of HRCA to independently track hyoid bone displacement during swallowing with similar accuracy to VFSS, by analyzing vibratory signals from a tri-axial accelerometer using machine learning techniques. We hypothesized HRCA would track hyoid bone displacement with a high degree of accuracy compared to humans. Trained judges completed frame-by-frame analysis of hyoid bone displacement on 400 swallows from 114 patients and 48 swallows from 16 age-matched healthy adults. Extracted features from vibratory signals were used to train the predictive algorithm to generate a bounding box surrounding the hyoid body on each frame. A metric of relative overlapped percentage (ROP) compared human and machine ratings. The mean ROP for all swallows analyzed was 50.75%, indicating > 50% of the bounding box containing the hyoid bone was accurately predicted in every frame. This provides evidence of the feasibility of accurate, automated hyoid bone displacement tracking using HRCA signals without use of VFSS images.
Collapse
Affiliation(s)
- Cara Donohue
- Department of Communication Science and Disorders, School of Health and Rehabilitation Sciences, University of Pittsburgh, 6035 Forbes Tower, Pittsburgh, PA, 15260, USA
| | - Shitong Mao
- Department of Electrical and Computer Engineering, Swanson School of Engineering, Department of Bioengineering, Swanson School of Engineering, Department of Biomedical Informatics, School of Medicine Intelligent Systems Program, School of Computing and Information, University of Pittsburgh, Pittsburgh, PA, 15260, USA
| | - Ervin Sejdić
- Department of Electrical and Computer Engineering, Swanson School of Engineering, Department of Bioengineering, Swanson School of Engineering, Department of Biomedical Informatics, School of Medicine Intelligent Systems Program, School of Computing and Information, University of Pittsburgh, Pittsburgh, PA, 15260, USA
| | - James L Coyle
- Department of Communication Science and Disorders, School of Health and Rehabilitation Sciences, University of Pittsburgh, 6035 Forbes Tower, Pittsburgh, PA, 15260, USA.
| |
Collapse
|