1. Hawley JL, Hancock AB. Incorporating Mobile App Technology in Voice Modification Protocol for Transgender Women. J Voice 2024;38:337-345. [PMID: 34706847] [DOI: 10.1016/j.jvoice.2021.09.001]
Abstract
PURPOSE Motivated by the practice and feedback principles of motor learning, a hybrid clinic-home protocol for voice feminization was developed to minimize the role of speech-language pathologists (SLPs) to one of supervision and professional guidance and to maximize learning during independent practice apart from intervention sessions. The purpose was to explore the effectiveness and acceptability of this innovative service delivery. METHOD This single-subject changing-criterion design included four transgender women who completed a 10-week hybrid clinic-home voice intervention program delivered via 30-minute weekly in-clinic sessions and a technology-supported home program. The program was client-centered and capitalized on principles of motor learning in that it incorporated frequent practice with intermittent, knowledge-of-results feedback. Participants' desired outcomes were measured using acoustics, self- and listener ratings of audio samples, and a program evaluation questionnaire. RESULTS Average speaking fundamental frequency of phrases and picture descriptions gradually increased into the 170-220 Hz range for all but one participant. All four transgender women were perceived to sound more feminine following treatment compared to baseline. Participants found the in-clinic sessions useful and the app easy to use, and noted limited fatigue or discomfort. CONCLUSION All four transgender women met their goals using this hybrid clinic-home service delivery format. Further investigations with comparison delivery models and other populations may elucidate the key factors behind the success achieved in the current study.
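The mean speaking fundamental frequency tracked in this protocol can be illustrated with a basic autocorrelation pitch estimator. The sketch below runs on a synthetic tone and is a simplified stand-in, not the study's measurement pipeline (clinical work typically delegates this to tools such as Praat); all parameter choices are illustrative.

```python
import numpy as np

def estimate_f0(signal, sr, fmin=75.0, fmax=500.0):
    """Estimate fundamental frequency (Hz) of a voiced frame via autocorrelation."""
    x = signal - np.mean(signal)
    ac = np.correlate(x, x, mode="full")[len(x) - 1:]  # autocorrelation, lags >= 0
    lo, hi = int(sr / fmax), int(sr / fmin)            # plausible pitch-period lags
    lag = lo + np.argmax(ac[lo:hi + 1])                # strongest periodicity
    return sr / lag

# Synthetic 200 Hz vowel-like tone (fundamental + one harmonic) at 44.1 kHz
sr = 44100
t = np.arange(0, 0.05, 1 / sr)
tone = np.sin(2 * np.pi * 200 * t) + 0.3 * np.sin(2 * np.pi * 400 * t)
f0 = estimate_f0(tone, sr)  # lands inside the 170-220 Hz target range
```

Real speech needs framing, voicing detection, and averaging across frames; this shows only the core periodicity search.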
Affiliation(s)
- Janet L Hawley, Speech, Language, and Hearing Sciences Department, University of Arizona, Tucson, Arizona
- Adrienne B Hancock, Department of Speech, Language, and Hearing Sciences, George Washington University, Washington, District of Columbia
2. Evangelista E, Kale R, McCutcheon D, Rameau A, Gelbard A, Powell M, Johns M, Law A, Song P, Naunheim M, Watts S, Bryson PC, Crowson MG, Pinto J, Bensoussan Y. Current Practices in Voice Data Collection and Limitations to Voice AI Research: A National Survey. Laryngoscope 2024;134:1333-1339. [PMID: 38087983] [DOI: 10.1002/lary.31052]
Abstract
INTRODUCTION The accuracy and validity of voice AI algorithms rely on substantial amounts of high-quality voice data. Although considerable amounts of voice data are captured daily in voice centers across North America, there is no standardized protocol for acoustic data management, which limits the usability of these datasets for voice artificial intelligence (AI) research. OBJECTIVE The aim was to capture current practices of voice data collection, storage, and analysis, and perceived limitations to collaborative voice research. METHODS A 30-question online survey was developed with expert guidance from members of voicecollab.ai, an international collaborative of voice AI researchers. The survey was disseminated via REDCap to an estimated 200 practitioners at North American voice centers. Survey questions assessed respondents' current practices in terms of acoustic data collection, storage, and retrieval, as well as limitations to collaborative voice research. RESULTS Seventy-two respondents completed the survey, of whom 81.7% were laryngologists and 18.3% were speech-language pathologists (SLPs). Eighteen percent of respondents reported seeing 40-60 patients with voice disorders weekly, and 55% reported seeing more than 60 (a conservative estimate of over 4,000 patients/week in total). Only 28% of respondents reported utilizing standardized protocols for collection and storage of acoustic data. Although 87% of respondents conduct voice research, only 38% report doing so on a multi-institutional level. Perceived limitations to conducting collaborative voice research include lack of standardized methodology for collection (30%) and lack of human resources to prepare and label voice data adequately (55%). CONCLUSION To conduct large-scale multi-institutional voice research with AI, there is a pressing need for standardization of acoustic data management, as well as an infrastructure for secure and efficient data sharing. LEVEL OF EVIDENCE 5.
Affiliation(s)
- Emily Evangelista, University of South Florida Morsani College of Medicine, Tampa, Florida, U.S.A.
- Rohan Kale, Department of Biology, University of South Florida, Tampa, Florida, U.S.A.
- Anais Rameau, Department of Otolaryngology-Head and Neck Surgery, Weill Cornell Medical College, Ithaca, New York, U.S.A.
- Alexander Gelbard, Department of Otolaryngology-Head and Neck Surgery, Vanderbilt University Medical Center, Nashville, Tennessee, U.S.A.
- Maria Powell, Department of Otolaryngology-Head and Neck Surgery, Vanderbilt University Medical Center, Nashville, Tennessee, U.S.A.
- Michael Johns, Department of Otolaryngology-Head and Neck Surgery, Keck School of Medicine, University of Southern California, Los Angeles, California, U.S.A.
- Anthony Law, Department of Otolaryngology, Emory University School of Medicine, Atlanta, Georgia, U.S.A.
- Phillip Song, Division of Laryngology, Massachusetts Eye and Ear, Department of Otolaryngology-Head and Neck Surgery, Harvard Medical School, Boston, Massachusetts, U.S.A.
- Matthew Naunheim, Division of Laryngology, Massachusetts Eye and Ear, Department of Otolaryngology-Head and Neck Surgery, Harvard Medical School, Boston, Massachusetts, U.S.A.
- Stephanie Watts, Department of Otolaryngology-Head and Neck Surgery, University of South Florida Morsani College of Medicine, Tampa, Florida, U.S.A.
- Paul C Bryson, Department of Otolaryngology-Head and Neck Surgery, Cleveland Clinic, Cleveland, Ohio, U.S.A.
- Matthew G Crowson, Department of Otolaryngology-Head and Neck Surgery, Massachusetts Eye and Ear, Harvard Medical School, Boston, Massachusetts, U.S.A.
- Jeremy Pinto, Mila Quebec Artificial Intelligence Institute, Montreal, Quebec, Canada
- Yael Bensoussan, Division of Laryngology, Department of Otolaryngology-Head and Neck Surgery, University of South Florida Morsani College of Medicine, Tampa, Florida, U.S.A.
3. Busquet F, Efthymiou F, Hildebrand C. Voice analytics in the wild: Validity and predictive accuracy of common audio-recording devices. Behav Res Methods 2024;56:2114-2134. [PMID: 37253958] [PMCID: PMC10228884] [DOI: 10.3758/s13428-023-02139-9]
Abstract
The use of voice recordings in both research and industry practice has increased dramatically in recent years, from diagnosing a COVID-19 infection based on patients' self-recorded voice samples to predicting customer emotions during a service-center call. Crowdsourced audio data collection in participants' natural environment using their own recording devices has opened up new avenues for researchers and practitioners to conduct research at scale across a broad range of disciplines. The current research examines whether fundamental properties of the human voice are reliably and validly captured through the common consumer-grade audio-recording devices used in current medical, behavioral science, business, and computer science research. Specifically, this work provides evidence, from a tightly controlled laboratory experiment analyzing 1,800 voice samples and from subsequent simulations, that recording devices with high proximity to the speaker (such as a headset or a lavalier microphone) inflate measures of amplitude compared to a benchmark studio-quality microphone, while recording devices with lower proximity to the speaker (such as a laptop or a smartphone in front of the speaker) systematically reduce measures of amplitude and can bias measures of the speaker's true fundamental frequency. We further demonstrate through simulation studies that these differences can lead to biased and ultimately invalid conclusions in, for example, an emotion-detection task. Finally, we outline a set of recording guidelines to ensure reliable and valid voice recordings and offer initial evidence for a machine-learning approach to bias correction in the case of distorted speech signals.
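The proximity effect on measured amplitude described here can be mimicked with a toy simulation. The inverse-distance gain model, device names, and distances below are illustrative assumptions, not the paper's actual recording setup:

```python
import numpy as np

rng = np.random.default_rng(0)
sr = 16000
t = np.arange(0, 0.2, 1 / sr)
speech = np.sin(2 * np.pi * 180 * t)           # stand-in for a voiced signal

def rms_db(x):
    """Root-mean-square level in dB (arbitrary reference)."""
    return 20 * np.log10(np.sqrt(np.mean(x ** 2)))

ref_dist = 0.30                                # hypothetical studio mic at 30 cm
offsets = {}
for name, dist in [("headset", 0.05), ("studio", 0.30), ("laptop", 0.80)]:
    gain = ref_dist / dist                     # idealized inverse-distance law
    captured = gain * speech + 0.001 * rng.standard_normal(len(t))
    offsets[name] = rms_db(captured) - rms_db(speech)
# Close devices read louder than the reference; distant devices read quieter.
```

Under this crude model the headset inflates level by roughly +15 dB and the laptop attenuates it by roughly -8 dB, echoing the direction (not the magnitude) of the paper's finding.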
Affiliation(s)
- Francesc Busquet, Institute of Behavioral Science and Technology, University of St. Gallen, Torstrasse 25, 9000 St. Gallen, Switzerland
- Fotis Efthymiou, Institute of Behavioral Science and Technology, University of St. Gallen, Torstrasse 25, 9000 St. Gallen, Switzerland
- Christian Hildebrand, Institute of Behavioral Science and Technology, University of St. Gallen, Torstrasse 25, 9000 St. Gallen, Switzerland
4. Ceylan ME, Cangi ME, Yılmaz G, Peru BS, Yiğit Ö. Are smartphones and low-cost external microphones comparable for measuring time-domain acoustic parameters? Eur Arch Otorhinolaryngol 2023;280:5433-5444. [PMID: 37584753] [DOI: 10.1007/s00405-023-08179-3]
Abstract
PURPOSE This study examined and compared the diagnostic accuracy and correlation levels of acoustic parameters from audio recordings obtained with smartphones on two operating systems and with dynamic and condenser external microphones. METHOD The study included 87 adults: 57 with voice disorders and 30 with healthy voices. Each participant was asked to produce a sustained vowel phonation (/a/). The recordings were taken simultaneously using five microphones (AKG P220, Shure SM58, Samson Go Mic, Apple iPhone 6, and Samsung Galaxy J7 Pro) in an acoustically insulated booth. Acoustic analyses were performed using Praat version 6.2.09. The data were examined using Pearson correlation and receiver-operating characteristic (ROC) analyses. RESULTS Across all microphone recordings, the parameters with the highest area under the curve (AUC) values in the time-domain analyses were the frequency perturbation parameters. Additionally, when the between-microphone correlation coefficients and the AUC values were considered together, the parameter with the highest correlation and diagnostic accuracy was jitter (local). CONCLUSION Period-to-period perturbation parameters obtained from audio recordings made with smartphones show levels of diagnostic accuracy similar to the external microphones used in clinical conditions.
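Jitter (local), the best-performing parameter in this comparison, is simple to state: the mean absolute difference between consecutive glottal periods divided by the mean period, expressed as a percentage. A minimal sketch on synthetic period tracks (the data and thresholds are illustrative, not the study's):

```python
import numpy as np

def jitter_local(periods_ms):
    """Jitter (local), %: mean absolute difference between consecutive
    periods divided by the mean period (Praat's definition)."""
    p = np.asarray(periods_ms, dtype=float)
    return 100 * np.mean(np.abs(np.diff(p))) / np.mean(p)

rng = np.random.default_rng(1)
# Steady ~5 ms periods (200 Hz) vs. a more perturbed (dysphonic-like) track
steady = 5.0 + 0.01 * rng.standard_normal(200)
perturbed = 5.0 + 0.15 * rng.standard_normal(200)
jitter_steady = jitter_local(steady)        # small cycle-to-cycle variation
jitter_perturbed = jitter_local(perturbed)  # markedly higher jitter
```

The diagnostic use in the study amounts to thresholding such values and summarizing discrimination with ROC/AUC.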
Affiliation(s)
- M Enes Ceylan, Speech and Language Therapy, Üsküdar University, Istanbul, Türkiye
- M Emrah Cangi, Speech and Language Therapy, University of Health Sciences, Selimiye, Tıbbiye Cd No: 38, 34668 Üsküdar, Istanbul, Türkiye
- Göksu Yılmaz, Speech and Language Therapy, Üsküdar University, Istanbul, Türkiye
- Beyza Sena Peru, Speech and Language Therapy, Üsküdar University, Istanbul, Türkiye
- Özgür Yiğit, Istanbul Şişli Hamidiye Etfal Training and Research Hospital, Istanbul, Türkiye
5. Dhawan K, Varghese A, Kumar N, Varghese SS. Utility of Smart Phones as a Voice Acquisition Device for Assessing Pre and Post Treatment Voice Using PRAAT. Indian J Otolaryngol Head Neck Surg 2023;75:2901-2906. [PMID: 37974690] [PMCID: PMC10645755] [DOI: 10.1007/s12070-023-03884-1]
Abstract
Voice assessment before and after treatment helps the clinician gauge the effectiveness of the treatment given and facilitates comparison between different treatment modalities. The Voice Handicap Index-10 (VHI-10) questionnaire is a tool that allows the voice to be evaluated subjectively from the patient's perspective. PRAAT is a freely available software program that acoustically analyzes voice signals. Smartphones are widely used, and the high quality of their embedded microphones makes them suitable and easily available voice-recording devices. This study aimed to use PRAAT and the VHI-10 questionnaire to evaluate voice before and after treatment; the utility of smartphones as a voice acquisition device was also explored. This prospective, observational study was carried out from 1st November 2019 to 30th September 2021 in the ENT outpatient department at a tertiary hospital in Punjab. Fifty-eight patients complaining of dysphonia were enrolled consecutively. All patients underwent detailed history-taking and examination of the larynx using a 70-degree rigid laryngoscope. Voice handicap was scored with the VHI-10 questionnaire, and acoustic evaluation of voice was done using the PRAAT software. Patients' voices were evaluated again 3 months post-therapy with the VHI-10 questionnaire and acoustic analysis. The parameters measured in PRAAT were mean pitch, jitter (local), shimmer (local), and mean harmonics-to-noise ratio (HNR). The voice was recorded using a smartphone and later transferred to a laptop for analysis. The pre- and post-treatment acoustic parameters and VHI-10 scores were compared and correlated. There was a significant difference (p < 0.001) between the pre- and post-treatment VHI-10 scores and all acoustic parameters measured except median pitch (p = 0.995). A poor positive correlation was found between pre-treatment VHI-10 scores and jitter (r = 0.188, p = 0.157) and shimmer (r = 0.288, p = 0.028) values. A negative correlation was observed between pre-treatment VHI-10 scores and pitch (r = -0.151, p = 0.259) and HNR (r = -0.424, p = 0.001). Post-treatment VHI-10 scores showed positive correlations with jitter (r = 0.302, p = 0.021) and shimmer (r = 0.162, p = 0.225) values and negative correlations with pitch (r = -0.10, p = 0.457) and HNR (r = -0.356, p = 0.006) values. We found significant differences in VHI-10 scores and PRAAT voice analysis results before and after treatment in patients presenting with voice change (dysphonia). The VHI-10 questionnaire and PRAAT are good and convenient tools for assessing voice subjectively and objectively, though only a poor to fair correlation was found between VHI-10 scores and PRAAT analysis results. More studies are needed to confirm the utility of smartphones as a voice acquisition device and of the PRAAT software in voice analysis.
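The VHI-10-versus-acoustics relationships reported here are plain Pearson correlations. A sketch with hypothetical data (the coupling strengths and noise levels below are invented for illustration; only the cohort size of 58 mirrors the study):

```python
import numpy as np
from scipy.stats import pearsonr

rng = np.random.default_rng(2)
n = 58                                   # same cohort size as the study
vhi10 = rng.integers(0, 41, size=n)      # VHI-10 totals range 0-40

# Hypothetical acoustic measures loosely coupled to the VHI-10 score:
# shimmer rises with handicap, HNR falls with it (directions match the abstract)
shimmer = 3.0 + 0.05 * vhi10 + rng.standard_normal(n)
hnr = 20.0 - 0.15 * vhi10 + rng.standard_normal(n)

r_shimmer, p_shimmer = pearsonr(vhi10, shimmer)  # positive correlation
r_hnr, p_hnr = pearsonr(vhi10, hnr)              # negative correlation
```

With real clinical data the coupling is far weaker, which is exactly the "poor to fair correlation" the authors report.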
Affiliation(s)
- Kaffy Dhawan, Department of ENT, Christian Medical College, Ludhiana, Punjab 141008, India
- Ashish Varghese, Department of ENT, Christian Medical College, Ludhiana, Punjab 141008, India
- Navneet Kumar, Department of ENT, Christian Medical College, Ludhiana, Punjab 141008, India
- Sunil Sam Varghese, Department of ENT, Christian Medical College, Ludhiana, Punjab 141008, India
6. Calà F, Frassineti L, Sforza E, Onesimo R, D'Alatri L, Manfredi C, Lanata A, Zampino G. Artificial Intelligence Procedure for the Screening of Genetic Syndromes Based on Voice Characteristics. Bioengineering (Basel) 2023;10:1375. [PMID: 38135966] [PMCID: PMC10741055] [DOI: 10.3390/bioengineering10121375]
Abstract
Perceptual and statistical evidence has highlighted voice characteristics of individuals affected by genetic syndromes that differ from those of normophonic subjects. In this paper, we propose a procedure for systematically collecting such pathological voices and developing AI-based automated tools to support differential diagnosis. Guidelines on the most appropriate recording devices, vocal tasks, and acoustical parameters are provided to simplify and speed up the whole procedure and make it homogeneous and reproducible. The proposed procedure was applied to a group of 56 subjects affected by Costello syndrome (CS), Down syndrome (DS), Noonan syndrome (NS), and Smith-Magenis syndrome (SMS). The entire database was divided into three groups: pediatric subjects (PS; individuals < 12 years of age), female adults (FA), and male adults (MA). In line with the literature, the Kruskal-Wallis test and post hoc analysis with the Dunn-Bonferroni test revealed several significant differences in the acoustical features, not only between healthy subjects and patients but also between syndromes within the PS, FA, and MA groups. Machine learning provided a k-nearest-neighbor classifier with 86% accuracy for the PS group, a support vector machine (SVM) model with 77% accuracy for the FA group, and an SVM model with 84% accuracy for the MA group. These preliminary results suggest that the proposed method based on acoustical analysis and AI could be useful for effective, non-invasive automatic characterization of genetic syndromes. In addition, clinicians could benefit in the case of genetic syndromes that are extremely rare or that present multiple variants and facial phenotypes.
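The k-nearest-neighbor recipe quoted for the pediatric group can be sketched in scikit-learn. Everything below (the two-feature acoustic space, group means, sample sizes) is synthetic and illustrative; it demonstrates the classification pipeline, not the study's data:

```python
import numpy as np
from sklearn.neighbors import KNeighborsClassifier
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(3)
# Hypothetical two-feature space (e.g., mean f0 in Hz and jitter %)
# for two groups: healthy controls vs. one syndrome group
healthy = rng.normal([220.0, 0.5], [15.0, 0.1], size=(30, 2))
syndrome = rng.normal([260.0, 1.5], [15.0, 0.3], size=(30, 2))
X = np.vstack([healthy, syndrome])
y = np.array([0] * 30 + [1] * 30)

knn = KNeighborsClassifier(n_neighbors=5)
acc = cross_val_score(knn, X, y, cv=5).mean()  # cross-validated accuracy
```

In practice the features would be standardized first (k-NN is scale-sensitive) and the problem would be multiclass across syndromes, as in the paper.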
Affiliation(s)
- Federico Calà, Department of Information Engineering, University of Florence, 50139 Florence, Italy
- Lorenzo Frassineti, Department of Information Engineering, University of Florence, 50139 Florence, Italy; Department of Information Engineering, Università degli Studi di Pisa, 56122 Pisa, Italy
- Elisabetta Sforza, Department of Life Sciences and Public Health, Faculty of Medicine and Surgery, Catholic University of Sacred Heart, 00168 Rome, Italy
- Roberta Onesimo, Centre for Rare Diseases and Transition, Department of Woman and Child Health and Public Health, Fondazione Policlinico Universitario A. Gemelli IRCCS, 00168 Rome, Italy
- Lucia D'Alatri, Unit for Ear, Nose and Throat Medicine, Department of Neuroscience, Sensory Organs and Chest, Fondazione Policlinico Universitario A. Gemelli IRCCS, 00168 Rome, Italy
- Claudia Manfredi, Department of Information Engineering, University of Florence, 50139 Florence, Italy
- Antonio Lanata, Department of Information Engineering, University of Florence, 50139 Florence, Italy
- Giuseppe Zampino, Department of Life Sciences and Public Health, Faculty of Medicine and Surgery, Catholic University of Sacred Heart, 00168 Rome, Italy; Centre for Rare Diseases and Transition, Department of Woman and Child Health and Public Health, Fondazione Policlinico Universitario A. Gemelli IRCCS, 00168 Rome, Italy; European Reference Network for Rare Malformation Syndromes, Intellectual and Other Neurodevelopmental Disorders (ERN ITHACA)
7. Llico AF, Shanley SN, Friedman AD, Bamford LM, Roberts RM, McKenna VS. Comparison Between Custom Smartphone Acoustic Processing Algorithms and Praat in Healthy and Disordered Voices. J Voice 2023:S0892-1997(23)00241-2. [PMID: 37690854] [DOI: 10.1016/j.jvoice.2023.07.032]
Abstract
OBJECTIVE The aim of this study was to understand the relationship between temporal and spectral acoustic measures derived using Praat and custom smartphone algorithms across patients with a wide range of vocal pathologies. METHODS Voice samples were collected from 56 adults (11 vocally healthy, 45 dysphonic, aged 18-80 years) performing three speech tasks: (a) a sustained vowel, (b) maximum phonation, and (c) the second and third sentences of the Rainbow Passage. Data were analyzed to extract mean fundamental frequency (fo), maximum phonation time (MPT), and cepstral peak prominence (CPP) using Praat and our custom smartphone algorithms. Linear regression models were calculated with and without outliers to determine relationships. RESULTS Statistically significant relationships were found between the smartphone algorithms and Praat for all three measures (r2 = 0.68-0.95 with outliers; r2 = 0.80-0.98 without outliers). An offset was found between CPP measures, with Praat values consistently lower than those computed by the smartphone app. Outlying data were identified and described; samples from speakers with high levels of clinician-perceived dysphonia produced smartphone algorithm errors. CONCLUSIONS These results suggest that the proposed algorithms can provide measurements comparable to clinically derived values. However, clinicians should take caution when analyzing severely dysphonic voices, as the current algorithms show reduced accuracy for measures of mean fo and MPT for these voice types.
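Method-agreement results like the r2 values and the constant CPP offset reported here reduce to a linear regression between paired measurements. A sketch on synthetic paired values (the ~1 dB offset and noise level are invented; only the sample size of 56 mirrors the study):

```python
import numpy as np
from scipy.stats import linregress

rng = np.random.default_rng(4)
# Hypothetical CPP values (dB) from a reference analysis...
cpp_reference = rng.uniform(5.0, 15.0, size=56)
# ...and from a second pipeline that tracks them closely but reads ~1 dB higher
cpp_app = cpp_reference + 1.0 + 0.2 * rng.standard_normal(56)

fit = linregress(cpp_reference, cpp_app)
r_squared = fit.rvalue ** 2                    # strength of method-to-method agreement
mean_offset = np.mean(cpp_app - cpp_reference)  # systematic bias between pipelines
```

Note that a high r2 with a nonzero mean offset, as simulated here, is exactly the pattern the study reports for CPP: the two pipelines rank voices the same way but disagree on absolute level.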
Affiliation(s)
- Andres F Llico, Department of Biomedical Engineering, University of Cincinnati, Cincinnati, Ohio
- Savannah N Shanley, Department of Communication Sciences and Disorders, University of Cincinnati, Cincinnati, Ohio
- Aaron D Friedman, Department of Otolaryngology-Head and Neck Surgery, University of Cincinnati, Cincinnati, Ohio
- Leigh M Bamford, Department of Electrical and Computer Engineering, University of Cincinnati, Cincinnati, Ohio
- Rachel M Roberts, Department of Communication Sciences and Disorders, University of Cincinnati, Cincinnati, Ohio
- Victoria S McKenna, Department of Biomedical Engineering; Department of Communication Sciences and Disorders; Department of Otolaryngology-Head and Neck Surgery, University of Cincinnati, Cincinnati, Ohio
8. Frassineti L, Calà F, Sforza E, Onesimo R, Leoni C, Lanatà A, Zampino G, Manfredi C. Quantitative acoustical analysis of genetic syndromes in the number listing task. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2023.104887]
9. Uloza V, Ulozaitė-Stanienė N, Petrauskas T, Pribuišis K, Blažauskas T, Damaševičius R, Maskeliūnas R. Reliability of Universal-Platform-Based Voice Screen Application in AVQI Measurements Captured with Different Smartphones. J Clin Med 2023;12:4119. [PMID: 37373811] [DOI: 10.3390/jcm12124119]
Abstract
The aim of the study was to develop a universal-platform-based (UPB) application suitable for different smartphones for estimation of the Acoustic Voice Quality Index (AVQI) and to evaluate its reliability in AVQI measurements and in differentiating normal and pathological voices. Our study group consisted of 135 adults, including 49 with normal voices and 86 patients with pathological voices. The developed UPB "Voice Screen" application, installed on five iOS and Android smartphones, was used for AVQI estimation. The AVQI measures calculated from voice recordings obtained with a reference studio microphone were compared with AVQI results obtained using the smartphones. The diagnostic accuracy of differentiating normal and pathological voices was evaluated by applying receiver-operating characteristic (ROC) analysis. One-way ANOVA did not detect statistically significant differences between mean AVQI scores from the studio microphone and the different smartphones (F = 0.759; p = 0.58). Almost perfect direct linear correlations (r = 0.987-0.991) were observed between the AVQI results obtained with the studio microphone and the different smartphones. The AVQI showed an acceptable level of precision in discriminating between normal and pathological voices, with areas under the curve (AUC) of 0.834-0.862. There were no statistically significant differences between the AUCs (p > 0.05) obtained from the studio and smartphone microphones; the largest difference between the AUCs was only 0.028. The UPB "Voice Screen" application represented an accurate and robust tool for voice quality measurement and for normal vs. pathological voice screening, demonstrating the potential to be used by patients and clinicians for voice assessment on both iOS and Android smartphones.
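The screening accuracy quoted here is an ROC analysis over AVQI scores. A sketch with synthetic scores (the score distributions are invented; the group sizes of 49 and 86 mirror the study):

```python
import numpy as np
from sklearn.metrics import roc_auc_score

rng = np.random.default_rng(5)
# Hypothetical AVQI-like scores: higher values indicate worse voice quality
normal = rng.normal(2.5, 1.0, size=49)        # 49 normal voices, as in the study
pathological = rng.normal(5.5, 1.5, size=86)  # 86 pathological voices
scores = np.concatenate([normal, pathological])
labels = np.concatenate([np.zeros(49), np.ones(86)])

auc = roc_auc_score(labels, scores)  # area under the ROC curve
```

An AUC near 0.5 would mean the score cannot separate the groups; the study's reported 0.834-0.862 sits in the "acceptable discrimination" band.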
Affiliation(s)
- Virgilijus Uloza, Department of Otorhinolaryngology, Lithuanian University of Health Sciences, 50061 Kaunas, Lithuania
- Nora Ulozaitė-Stanienė, Department of Otorhinolaryngology, Lithuanian University of Health Sciences, 50061 Kaunas, Lithuania
- Tadas Petrauskas, Department of Otorhinolaryngology, Lithuanian University of Health Sciences, 50061 Kaunas, Lithuania
- Kipras Pribuišis, Department of Otorhinolaryngology, Lithuanian University of Health Sciences, 50061 Kaunas, Lithuania
- Tomas Blažauskas, Faculty of Informatics, Kaunas University of Technology, 51368 Kaunas, Lithuania
- Rytis Maskeliūnas, Faculty of Informatics, Kaunas University of Technology, 51368 Kaunas, Lithuania
10. Vinney LA, Tripp R, Shelly S, Gillespie A. Indexing Cognitive Resource Usage for Acquisition of Initial Voice Therapy Targets. Am J Speech Lang Pathol 2023;32:717-732. [PMID: 36701805] [DOI: 10.1044/2022_ajslp-22-00197]
Abstract
PURPOSE The purpose of this study was to index cognitive resource usage for acquisition of initial targets of two common voice therapy techniques (resonant voice therapy [RVT] and conversation training therapy [CTT]), based on the theorized depletion effect (i.e., when an initial task requiring high cognitive load leads to poorer performance on a subsequent task). METHOD Eleven vocally healthy participants, ages 23-41 years, read aloud the Rainbow Passage and produced consonant-vowel resonant targets (/mi, ma, mu/), followed by a baseline computerized Stroop task and a 15-min washout. Following this baseline period, participants watched and interacted with two videos instructing them in RVT or CTT initial targets. After viewing each video and practicing the associated vocal skills, participants rated the degree of mental effort required to engage in the target vocal technique on a modified Borg scale. Participants recorded their attempts at RVT on /mi, ma, mu/ and CTT on the Rainbow Passage, which were later rated by three voice-specialized speech-language pathologists as to how representative they were of each respective target technique. Changes in fundamental frequency and average auditory-perceptual ratings from baseline were examined to determine whether participants adjusted their technique from RVT and CTT baseline to acquisition. RESULTS Performance on the Stroop task was, on average, worse post CTT than post RVT, but both post-CTT and post-RVT Stroop scores were poorer than baseline. These results suggest that both treatment techniques taxed cognitive resources but that CTT was more cognitively taxing than RVT. However, despite differences in raw averages, no statistically significant differences were found between the baseline, post-CTT, and post-RVT Stroop scores, likely due to the small sample size. Participant ratings of mental effort for CTT and RVT were statistically similar. Likewise, poorer post-RVT Stroop scores were associated with participants' greater perceived mental effort with RVT acquisition, but there was no significant association between mental effort ratings for CTT acquisition and post-CTT Stroop scores. Significantly higher fundamental frequency and perceived ratings of the accuracy of technique from baseline to acquisition for both CTT and RVT were found, providing evidence of vocal behavior changes as a result of each technique. CONCLUSIONS Brief exposure to initial treatment tasks in CTT is more cognitively depleting than initial RVT tasks. Results also indicate that vocally healthy participants are able to make a voice change in response to a brief therapy prompt. Finally, participant-rated measures of mental effort and secondary measures of cognitive depletion do not always correlate.
Affiliation(s)
- Raquel Tripp, Department of Communicative Sciences and Disorders, New York University, New York, NY
- Sandeep Shelly, Emory Voice Center, Department of Otolaryngology-Head and Neck Surgery, Emory University, Atlanta, GA
- Amanda Gillespie, Emory Voice Center, Department of Otolaryngology-Head and Neck Surgery, Emory University, Atlanta, GA
11. Cavalcanti JC, Englert M, Oliveira M, Constantini AC. Microphone and Audio Compression Effects on Acoustic Voice Analysis: A Pilot Study. J Voice 2023;37:162-172. [PMID: 33451892] [DOI: 10.1016/j.jvoice.2020.12.005]
Abstract
OBJECTIVE This study aimed to analyze the effects of microphone and audio compression variables on the acquisition of voice and speech parameters. METHOD Acoustic measures were recorded and compared using a high-quality reference microphone and three testing microphones, which differed in specifications and acoustic properties. Furthermore, the impact of audio compression was assessed by resampling the original uncompressed audio files into the MPEG-1/2 Audio Layer 3 (mp3) format at three compression rates (128 kbps, 64 kbps, and 32 kbps). Eight speakers were recruited in each recording session and asked to produce four sustained vowels: two [a] segments and two [ɛ] segments. The audio was captured simultaneously by the reference and tested microphones. The recordings were synchronized and analyzed using the Praat software. RESULTS Of the eight acoustic parameters assessed (f0, F1, F2, jitter%, shimmer%, HNR, H1-H2, and CPP), three (f0, F2, and jitter%) proved resistant to both the microphone and audio compression variables. In contrast, HNR, H1-H2, and CPP were significantly affected by both factors, while shimmer% was sensitive only to audio compression. Moreover, higher compression rates yielded more frequent acoustic distortions than lower rates. CONCLUSION Overall, the outcomes suggest that acoustic parameters are influenced by both microphone selection and audio compression, underscoring the practical implications of these factors for the reliability of acoustic analysis.
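Shimmer%, the parameter this study found sensitive to compression, is the amplitude analogue of jitter: the mean absolute difference between consecutive cycle peak amplitudes over the mean amplitude. A sketch on synthetic amplitude tracks, where "compression damage" is crudely modeled as added amplitude noise purely for illustration (real mp3 artifacts are far more structured):

```python
import numpy as np

def shimmer_local(amplitudes):
    """Shimmer (local), %: mean absolute difference between consecutive
    cycle peak amplitudes divided by the mean amplitude."""
    a = np.asarray(amplitudes, dtype=float)
    return 100 * np.mean(np.abs(np.diff(a))) / np.mean(a)

rng = np.random.default_rng(6)
clean = 1.0 + 0.005 * rng.standard_normal(300)    # steady cycle amplitudes
degraded = clean + 0.05 * rng.standard_normal(300)  # extra perturbation
s_clean = shimmer_local(clean)
s_degraded = shimmer_local(degraded)  # inflated by the added perturbation
```

The point of the toy model matches the paper's caution: any processing that perturbs cycle-to-cycle amplitude inflates shimmer% even when the voice itself is unchanged.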
Affiliation(s)
- Julio Cesar Cavalcanti
- Universidade Estadual de Campinas (UNICAMP), Institute of Language Studies, Campinas - SP, Brazil
- Marina Englert
- Universidade Federal de São Paulo (UNIFESP), Department of Communication Disorders, São Paulo - SP, Brazil; Centro de Estudos da Voz (CEV), São Paulo - SP, Brazil
- Miguel Oliveira
- Universidade Federal de Alagoas (UFAL), Department of Letters, Maceió - AL, Brazil
12
Calà F, Manfredi C, Battilocchi L, Frassineti L, Cantarella G. Speaking with mask in the COVID-19 era: Multiclass machine learning classification of acoustic and perceptual parameters. J Acoust Soc Am 2023; 153:1204. [PMID: 36859154 DOI: 10.1121/10.0017244] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/09/2022] [Accepted: 01/26/2023] [Indexed: 06/18/2023]
Abstract
The intensive use of personal protective equipment often requires increased voice intensity, with possible development of voice disorders. This paper exploits machine learning approaches to investigate the impact of different types of masks on the sustained vowels /a/, /i/, and /u/ and the sequence /a'jw/ inside a standardized sentence. Both objective acoustical parameters and subjective ratings were used for statistical analysis, multiple comparisons, and multivariate machine learning classification experiments. Significant differences were found between the mask+shield configuration and the no-mask condition, and between the mask and mask+shield conditions. Power spectral density decreases with statistical significance above 1.5 kHz when wearing masks. Subjective ratings confirmed increasing discomfort from the no-mask condition to protective masks and shield. Machine learning techniques showed that masks alter voice production: in a multiclass experiment, random forest (RF) models were able to distinguish amongst seven mask conditions with up to 94% validation accuracy, to separate masked from unmasked conditions with up to 100% validation accuracy, and to detect the presence of a shield with up to 86% validation accuracy. Moreover, an RF classifier allowed distinguishing male from female subjects in masked conditions with 100% validation accuracy. Combining acoustic and perceptual analysis represents a robust approach to characterizing mask configurations and quantifying the corresponding level of discomfort.
Affiliation(s)
- F Calà
- Department of Information Engineering, Università degli Studi di Firenze, Firenze, Italy
- C Manfredi
- Department of Information Engineering, Università degli Studi di Firenze, Firenze, Italy
- L Battilocchi
- Department of Clinical Sciences and Community Health, University of Milan, Milan, Italy
- L Frassineti
- Department of Information Engineering, Università degli Studi di Firenze, Firenze, Italy
- G Cantarella
- Department of Clinical Sciences and Community Health, University of Milan, Milan, Italy
13
Uloza V, Ulozaite-Staniene N, Petrauskas T. An iOS-based VoiceScreen application: feasibility for use in clinical settings-a pilot study. Eur Arch Otorhinolaryngol 2023; 280:277-284. [PMID: 35906420 DOI: 10.1007/s00405-022-07546-w] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2022] [Accepted: 07/06/2022] [Indexed: 01/07/2023]
Abstract
OBJECTIVES To develop a smartphone application for estimation of the Acoustic Voice Quality Index (AVQI) and evaluate its usability in the clinical setting. METHODS Automated AVQI calculation and background noise monitoring functions were implemented in the mobile "VoiceScreen" application running on the iOS operating system. The study group consisted of 103 adult individuals: 30 with normal voices and 73 patients with pathological voices. Voice recordings were performed in the clinical setting with the "VoiceScreen" app using iPhone 8 microphones. Voices of 30 patients were recorded before and 1 month after phonosurgical intervention. To evaluate the diagnostic accuracy in differentiating normal and pathological voice, receiver-operating characteristic statistics, i.e., area under the curve (AUC), sensitivity, specificity, and correct classification rate (CCR), were used. RESULTS The AVQI discriminated between normal and dysphonic voices with a high level of precision (AUC = 0.937). An AVQI cutoff score of 3.4 demonstrated a sensitivity of 86.3% and specificity of 95.6%, with a CCR of 89.2%. In the post-phonosurgical follow-up group, the mean AVQI decreased from a preoperative value of 6.01 (SD 2.39) to 2.00 (SD 1.08). No statistically significant difference (p = 0.216) was found between AVQI measurements in the normal voice group and the 1-month post-phonosurgery follow-up group. CONCLUSIONS The "VoiceScreen" app represents an accurate and robust tool for voice quality measurement and demonstrates the potential to be used in clinical settings as a sensitive measure of voice changes across phonosurgical treatment.
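The cutoff statistics reported above (sensitivity, specificity, CCR) all follow from a 2×2 classification table once a threshold is applied to the scores. A minimal sketch with hypothetical AVQI scores and diagnoses (not the study's data):

```python
def screening_stats(scores, labels, cutoff):
    """Sensitivity, specificity, and correct classification rate (CCR)
    for a score-based screen: score >= cutoff is called pathological.
    labels: 1 = pathological voice, 0 = normal voice."""
    tp = sum(1 for s, y in zip(scores, labels) if s >= cutoff and y == 1)
    fn = sum(1 for s, y in zip(scores, labels) if s < cutoff and y == 1)
    tn = sum(1 for s, y in zip(scores, labels) if s < cutoff and y == 0)
    fp = sum(1 for s, y in zip(scores, labels) if s >= cutoff and y == 0)
    return tp / (tp + fn), tn / (tn + fp), (tp + tn) / len(labels)

# Hypothetical AVQI scores and diagnoses, using the paper's 3.4 cutoff
scores = [1.8, 2.5, 3.9, 5.2, 6.1, 2.9, 4.4, 3.1]
labels = [0, 0, 1, 1, 1, 0, 1, 1]
sens, spec, ccr = screening_stats(scores, labels, cutoff=3.4)
print(sens, spec, ccr)
```

Sweeping the cutoff over the observed score range and plotting sensitivity against 1 - specificity yields the ROC curve whose area is the reported AUC.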
14
Fahed VS, Doheny EP, Busse M, Hoblyn J, Lowery MM. Comparison of Acoustic Voice Features Derived From Mobile Devices and Studio Microphone Recordings. J Voice 2022:S0892-1997(22)00312-5. [PMID: 36379826 DOI: 10.1016/j.jvoice.2022.10.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Revised: 10/10/2022] [Accepted: 10/10/2022] [Indexed: 11/14/2022]
Abstract
OBJECTIVES/HYPOTHESIS Improvements in mobile device technology offer new opportunities for remote monitoring of voice for home and clinical assessment. However, there is a need to establish equivalence between features derived from signals recorded with mobile devices and with gold-standard microphone-preamplifiers. In this study, acoustic voice features from Android smartphone, tablet, and microphone-preamplifier recordings were compared. METHODS Data were recorded from 37 volunteers (20 female) with no history of speech disorder and six volunteers with Huntington's disease (HD) during sustained vowel (SV) phonation, reading passage (RP), and five syllable repetition (SR) tasks. The following features were estimated: fundamental frequency median and standard deviation (F0 and SD F0), harmonics-to-noise ratio (HNR), local jitter, relative average perturbation of jitter (RAP), five-point period perturbation quotient (PPQ5), difference of differences of amplitudes and periods (DDA and DDP), shimmer, and amplitude perturbation quotients (APQ3, APQ5, and APQ11). RESULTS Bland-Altman analysis revealed good agreement between microphone and mobile devices for fundamental frequency, jitter, RAP, PPQ5, and DDP during all tasks, and a bias for HNR, shimmer, and its variants (APQ3, APQ5, APQ11, and DDA). Significant differences were observed between devices for HNR, shimmer, and its variants for all tasks. High correlation was observed between devices for all features, except SD F0 for RP. Similar results were observed in the HD group for the SV and SR tasks. Biological sex had a significant effect on F0 and HNR during all tasks, and on jitter, RAP, PPQ5, DDP, and shimmer for RP and SR. No significant effect of age was observed. CONCLUSIONS Mobile devices provided good agreement with state-of-the-art, high-quality microphones during structured speech tasks for features derived from frequency components of the audio recordings. Caution should be taken when estimating HNR, shimmer, and its variants from recordings made with mobile devices.
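The Bland-Altman analysis used above summarizes device agreement as a bias (the mean paired difference) and 95% limits of agreement (bias ± 1.96 × SD of the differences). A minimal sketch with hypothetical paired F0 estimates, invented for illustration:

```python
import math

def bland_altman(a, b):
    """Bland-Altman agreement between two measurement devices:
    returns the bias (mean paired difference) and the 95% limits
    of agreement (bias +/- 1.96 * SD of the differences)."""
    diffs = [x - y for x, y in zip(a, b)]
    n = len(diffs)
    bias = sum(diffs) / n
    sd = math.sqrt(sum((d - bias) ** 2 for d in diffs) / (n - 1))
    return bias, bias - 1.96 * sd, bias + 1.96 * sd

# Hypothetical paired F0 estimates (Hz): studio microphone vs. smartphone
mic = [198.0, 210.5, 185.2, 202.3]
phone = [198.4, 210.1, 185.8, 202.5]
bias, lo, hi = bland_altman(mic, phone)
print(bias, lo, hi)
```

A bias near zero with narrow limits, as in this toy example, corresponds to the "good agreement" reported for F0 and jitter; a systematic offset between devices shows up as a non-zero bias, as reported for HNR and shimmer.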
Affiliation(s)
- Vitória S Fahed
- School of Electrical and Electronic Engineering, University College Dublin, Dublin, Ireland; Insight Centre for Data Analytics, University College Dublin, Dublin, Ireland
- Emer P Doheny
- School of Electrical and Electronic Engineering, University College Dublin, Dublin, Ireland; Insight Centre for Data Analytics, University College Dublin, Dublin, Ireland
- Monica Busse
- Centre for Trials Research, Cardiff University, Cardiff, UK
- Jennifer Hoblyn
- School of Medicine, Trinity College Dublin, Dublin, Ireland; Bloomfield Health Services, Dublin, Ireland
- Madeleine M Lowery
- School of Electrical and Electronic Engineering, University College Dublin, Dublin, Ireland; Insight Centre for Data Analytics, University College Dublin, Dublin, Ireland
15
Di Pietro DA, Olivares A, Comini L, Vezzadini G, Luisa A, Petrolati A, Boccola S, Boccali E, Pasotti M, Danna L, Vitacca M. Voice Alterations, Dysarthria, and Respiratory Derangements in Patients With Parkinson's Disease. J Speech Lang Hear Res 2022; 65:3749-3757. [PMID: 36194769 DOI: 10.1044/2022_jslhr-21-00539] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
PURPOSE Almost 90% of people with Parkinson's disease (PD) develop voice and speech disorders during the course of the disease. Ventilatory dysfunction is one of the main causes. We aimed to evaluate relationships between respiratory impairments and speech/voice changes in PD. METHOD At Day 15 from admission, in consecutive clinically stable PD patients in a neurorehabilitation unit, we collected the following clinical data: comorbidities, PD severity, motor function and balance, respiratory function at rest (including muscle strength and cough ability), during exercise, and at night, voice function (Voice Handicap Index [VHI] and acoustic analysis [Praat]), speech disorders (Robertson Dysarthria Profile [RDP]), and postural abnormalities. Based on an arbitrary RDP cutoff, two groups with different dysarthria degrees were identified (moderate-severe versus no-mild dysarthria) and compared. RESULTS Of 55 patients analyzed (median Unified Parkinson's Disease Rating Scale scores: Part II, 9; Part III, 17), we found significant impairments in inspiratory and expiratory muscle pressure (>90%, both), exercise tolerance at the 6-min walking distance (96%), nocturnal (12.7%) and exercise-induced (21.8%) desaturation, VHI (34%), and Praat shimmer% (89%). Patients with moderate-severe dysarthria (16% of the total sample) had more comorbidities/disabilities and worse respiratory pattern and postural abnormalities (camptocormia) than those with no-mild dysarthria. Moreover, the risk of nocturnal desaturation, reduced peak expiratory flow, and reduced cough ability was about 11, 13, and 8 times higher, respectively, in the moderate-severe group. CONCLUSIONS Dysarthria and respiratory dysfunction are closely associated in PD patients, particularly nocturnal desaturation and reduced cough ability. In addition, postural condition could underlie both respiratory and voice impairments. SUPPLEMENTAL MATERIAL https://doi.org/10.23641/asha.21210944.
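Group-risk figures like the reported "about 11, 13, and 8 times higher" are typically computed as odds ratios from 2×2 contingency tables. A minimal sketch of the calculation, with invented counts (not the study's actual table):

```python
def odds_ratio(a, b, c, d):
    """Odds ratio from a 2x2 table:
        a = exposed with outcome,   b = exposed without outcome,
        c = unexposed with outcome, d = unexposed without outcome.
    OR = (a/b) / (c/d) = (a*d) / (b*c)."""
    return (a * d) / (b * c)

# Invented counts: nocturnal desaturation in the moderate-severe vs.
# no-mild dysarthria groups (illustrative only)
print(odds_ratio(6, 3, 7, 39))
```

With these made-up counts the odds of nocturnal desaturation come out roughly eleven times higher in the exposed group, mirroring the magnitude of the association the abstract reports.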
Affiliation(s)
- Davide Antonio Di Pietro
- Neurorehabilitation Unit of the Institute of Lumezzane, Istituti Clinici Scientifici Maugeri IRCCS, Brescia, Italy
- Adriana Olivares
- Scientific Direction of the Institute of Lumezzane, Istituti Clinici Scientifici Maugeri IRCCS, Brescia, Italy
- Laura Comini
- Scientific Direction of the Institute of Lumezzane, Istituti Clinici Scientifici Maugeri IRCCS, Brescia, Italy
- Giuliana Vezzadini
- Neurorehabilitation Unit of the Institute of Castel Goffredo, Istituti Clinici Scientifici Maugeri IRCCS, Mantova, Italy
- Alberto Luisa
- Neurorehabilitation Unit of the Institute of Lumezzane, Istituti Clinici Scientifici Maugeri IRCCS, Brescia, Italy
- Anna Petrolati
- Neurorehabilitation Unit of the Institute of Castel Goffredo, Istituti Clinici Scientifici Maugeri IRCCS, Mantova, Italy
- Sara Boccola
- Neurorehabilitation Unit of the Institute of Castel Goffredo, Istituti Clinici Scientifici Maugeri IRCCS, Mantova, Italy
- Elisa Boccali
- Neurorehabilitation Unit of the Institute of Lumezzane, Istituti Clinici Scientifici Maugeri IRCCS, Brescia, Italy
- Monica Pasotti
- Neurorehabilitation Unit of the Institute of Castel Goffredo, Istituti Clinici Scientifici Maugeri IRCCS, Mantova, Italy
- Laura Danna
- Neurorehabilitation Unit of the Institute of Lumezzane, Istituti Clinici Scientifici Maugeri IRCCS, Brescia, Italy
- Michele Vitacca
- Respiratory Rehabilitation of the Institute of Lumezzane, Istituti Clinici Scientifici Maugeri IRCCS, Brescia, Italy
16
Gerosa M, Kenny C. The Effects of Vocal Loading and Steam Inhalation on Acoustic, Aerodynamic and Vocal Tract Discomfort Measures in Adults. J Voice 2022. [DOI: 10.1016/j.jvoice.2022.09.027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
17
Pommée T, Morsomme D. Voice Quality in Telephone Interviews: A Preliminary Acoustic Investigation. J Voice 2022:S0892-1997(22)00268-5. [PMID: 36192289 DOI: 10.1016/j.jvoice.2022.08.027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Revised: 08/24/2022] [Accepted: 08/25/2022] [Indexed: 10/07/2022]
Abstract
OBJECTIVES To investigate the impact of standardized mobile phone recordings passed through a telecom channel on acoustic markers of voice quality and on its perception by voice experts in normophonic speakers. METHODS Continuous speech and a sustained vowel were recorded for fourteen female and ten male normophonic speakers. The recordings were made simultaneously with a head-mounted high-quality microphone and through the telephone network on a receiving smartphone. Twenty-two acoustic voice quality, breathiness, and pitch-related measures were extracted from the recordings. Nine vocologists perceptually rated the G, R, and B parameters of the GRBAS scale on each voice sample. Reproducibility, recording type, stimulus type, and gender effects, as well as the correlation between acoustic and perceptual measures, were investigated. RESULTS The sustained vowel samples are damped after one second. Only frequencies between 100 and 3700 Hz are passed through the telecom channel, and the frequency response is characterized by peaks and troughs. The acoustic measures show good reproducibility over the three repetitions. All measures differ significantly between the recording types, except for local jitter, the harmonics-to-noise ratio by Dejonckere and Lebacq, the period standard deviation, and all six pitch measures. The AVQI score is higher in telephone recordings, while the ABI score is lower. Significant differences between genders are also found for most of the measures; while the AVQI is similar in men and women, the ABI is higher in women in both recording types. For the perceptual assessment, the interrater agreement is rather low, while the reproducibility over the three repetitions is good. Few significant differences between recording types are observed, except for lower breathiness ratings on telephone recordings. G ratings are significantly more severe on the sustained vowel in both recording types, R ratings only on telephone recordings. While roughness is rated higher in men on telephone recordings by most experts, no gender effect is observed for breathiness in either recording type. Finally, neither the AVQI nor the ABI yields strong correlations with any of the perceptual parameters. CONCLUSIONS Our results show that passing a voice signal through a telecom channel induces filter and noise effects that limit the use of common acoustic voice quality measures and indexes. The AVQI and ABI are both significantly impacted by the recording type. The most reliable acoustic measures seem to be pitch perturbation measures (local jitter and period standard deviation) as well as the harmonics-to-noise ratio from Dejonckere and Lebacq. Our results also underline that raters are not equally sensitive to the various factors, including recording type, stimulus type, and gender. None of the three perceptual parameters G, R, and B seems to be reliably measurable on telephone recordings using the two investigated acoustic indexes. Future studies investigating the impact of voice quality in telephone conversations should thus focus on acoustic measures on continuous speech samples that are limited to the frequency response of the telecom channel and that are not too sensitive to environmental and additive noise.
Affiliation(s)
- Timothy Pommée
- Research Unit for a life-Course perspective on Health and Education, Voice Unit, University of Liège, Belgium
- Dominique Morsomme
- Research Unit for a life-Course perspective on Health and Education, Voice Unit, University of Liège, Belgium
18
Rodríguez Marconi D, Morales C, Araya P, Ferrada R, Ibarra M, Catrifol MT. Uso del smartphone en telepráctica para trastornos de la voz. Una revisión desde el concepto de Mhealth. Rev investig logop 2022. [DOI: 10.5209/rlog.78550] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The use of smartphones and the concept of mobile health (mHealth) are recent in vocology, as are their potential benefits for voice treatment and vocal training in the context of telepractice. A narrative review was conducted with the objective of describing the benefits of mHealth via smartphone in the context of speech-language telepractice for voice disorders. Scientific articles were searched in PubMed, ScienceDirect, and Google Scholar relating to smartphone use in vocology, considering normal, pathological, and synthetic human voices, and relating to intervention, evaluation, assessment, monitoring, prevention, supervision, education, consultation, and vocal training. Forty-two studies were reviewed, of which 15 were selected according to the inclusion criteria. The analyzed studies concern voice recording for acoustic analysis with smartphones, teletherapy with smartphones, and peripheral devices for vocal analysis and follow-up. The potential of mobile devices to increase accessibility, reduce costs, and support therapeutic follow-up with objective measures across diverse vocal health contexts is highlighted.
19
Yamada Y, Shinkawa K, Nemoto M, Arai T. Automatic Assessment of Loneliness in Older Adults Using Speech Analysis on Responses to Daily Life Questions. Front Psychiatry 2021; 12:712251. [PMID: 34966297 PMCID: PMC8710612 DOI: 10.3389/fpsyt.2021.712251] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/20/2021] [Accepted: 11/19/2021] [Indexed: 11/13/2022] Open
Abstract
Loneliness is a perceived state of social and emotional isolation that has been associated with a wide range of adverse health effects in older adults. Automatically assessing loneliness by passively monitoring daily behaviors could potentially contribute to early detection and intervention for mitigating loneliness. Speech data has been successfully used for inferring changes in emotional states and mental health conditions, but its association with loneliness in older adults remains unexplored. In this study, we developed a tablet-based application and collected speech responses of 57 older adults to daily life questions regarding, for example, one's feelings and future travel plans. From audio data of these speech responses, we automatically extracted speech features characterizing acoustic, prosodic, and linguistic aspects, and investigated their associations with self-rated scores of the UCLA Loneliness Scale. Consequently, we found that with increasing loneliness scores, speech responses tended to have fewer inflections, longer pauses, reduced second formant frequencies, reduced variances of the speech spectrum, more filler words, and fewer positive words. The cross-validation results showed that regression and binary-classification models using speech features could estimate loneliness scores with an R² of 0.57 and detect individuals with high loneliness scores with 95.6% accuracy, respectively. Our study provides the first empirical results suggesting the possibility of using speech data that can be collected in everyday life for the automatic assessment of loneliness in older adults, which could help develop monitoring technologies for early detection and intervention for mitigating loneliness.
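Pause-related features like those used above can be derived from voiced-segment timestamps produced by any voice activity detector. A minimal sketch, with a hypothetical segmentation of one spoken response (the segment boundaries are invented, and the feature set is only a simplified stand-in for the study's):

```python
def pause_features(segments, total_duration):
    """Prosodic pause features from voiced-segment (start, end) timestamps
    in seconds: number of pauses, mean pause duration, pause-time ratio."""
    pauses = [b2 - e1 for (_, e1), (b2, _) in zip(segments, segments[1:])]
    pause_time = sum(pauses)
    mean_pause = pause_time / len(pauses) if pauses else 0.0
    return len(pauses), mean_pause, pause_time / total_duration

# Hypothetical voiced segments (start, end) from one 5-second response
segments = [(0.0, 1.2), (1.8, 3.0), (3.5, 5.0)]
print(pause_features(segments, total_duration=5.0))
```

Aggregating such features per response, alongside acoustic and lexical ones, gives the feature vectors fed to the regression and classification models.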
Affiliation(s)
- Miyuki Nemoto
- Dementia Medical Center, University of Tsukuba Hospital, Tsukuba, Japan
- Tetsuaki Arai
- Division of Clinical Medicine, Department of Psychiatry, Faculty of Medicine, University of Tsukuba, Tsukuba, Japan
20
Castillo-Allendes A, Contreras-Ruston F, Cantor L, Codino J, Guzman M, Malebran C, Manzano C, Pavez A, Vaiano T, Wilder F, Behlau M. Terapia de voz en el contexto de la pandemia covid-19; recomendaciones para la práctica clínica. J Voice 2021; 35:808.e1-808.e12. [PMID: 32917457 PMCID: PMC7442931 DOI: 10.1016/j.jvoice.2020.08.018] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
INTRODUCTION Since the beginning of the new COVID-19 pandemic, health services have had to face a new scenario. Voice therapy faces a double challenge: delivering interventions using telepractice, and providing rehabilitation services to a growing population of patients at risk of functional impairment related to the COVID-19 disease. Moreover, as COVID-19 is transmitted through droplets, it is critical to understand how to mitigate these risks during assessment and treatment. OBJECTIVE To promote safe and effective clinical practice in voice assessment and rehabilitation for speech-language pathologists in the COVID-19 pandemic context. METHODS A group of 11 experts in voice and swallowing disorders from five different countries developed consensus recommendations following the American Academy of Otolaryngology-Head and Neck Surgery rules, building a clinical guide for speech-language pathologists during this pandemic. RESULTS The clinical guide provides 79 recommendations for clinicians on the management of voice disorders during the pandemic, including advice on assessment, direct treatment, telepractice, and teamwork. Consensus of 95% was reached for all topics. CONCLUSION This guideline should be taken only as recommendations; each clinician must attempt to mitigate the risk of infection and achieve the best therapeutic results, taking into account the patient's particular reality.
Affiliation(s)
- Adrián Castillo-Allendes
- Department of Communicative Sciences and Disorders, Michigan State University, East Lansing, Michigan
- Francisco Contreras-Ruston
- Speech-Language Pathology and Audiology Department, Universidad de Valparaíso, San Felipe, Chile
- Lady Cantor
- Department of Collective Health, Universidad Nacional de Colombia, Bogotá, Colombia; Program of Speech and Language Pathology, Universidad Manuela Beltrán, Bogotá, Colombia
- Juliana Codino
- Department of Communicative Sciences and Disorders, Michigan State University, East Lansing, Michigan; Lakeshore Professional Voice Center, Lakeshore Ear, Nose, and Throat Center, St. Clair Shores, Michigan
- Marco Guzman
- Universidad de los Andes, Chile, Santiago, Chile
- Celina Malebran
- Escuela de Fonoaudiología, Universidad Católica Silva Henríquez, Santiago, Chile
- Carlos Manzano
- Hospital Médica Sur, Ciudad de México, México; Centro Médico ABC, Ciudad de México, México
- Axel Pavez
- Physical Medicine and Rehabilitation Service, Hospital de Urgencia Asistencia Pública, Santiago, Chile
- Thays Vaiano
- CEV - Centro de Estudos da Voz, São Paulo, Brazil; Speech-Language Pathology and Audiology Department, Escola Paulista de Medicina, Federal University of São Paulo, São Paulo, Brazil
- Fabiana Wilder
- Carrera de Fonoaudiología, Facultad de Medicina, Universidad de Buenos Aires, Buenos Aires, Argentina; Servicio de Fonoaudiología, Hospital de Clínicas "José de San Martin", Buenos Aires, Argentina
- Mara Behlau
- CEV - Centro de Estudos da Voz, São Paulo, Brazil; Speech-Language Pathology and Audiology Department, Escola Paulista de Medicina, Federal University of São Paulo, São Paulo, Brazil
21
Castillo-Allendes A, Contreras-Ruston F, Cantor L, Codino J, Guzman M, Malebran C, Manzano C, Pavez A, Vaiano T, Wilder F, Behlau M. Terapia Vocal No Contexto Da Pandemia Do Covid-19; Orientações Para A Prática Clínica. J Voice 2021; 35:808.e13-808.e24. [PMID: 32917460 PMCID: PMC7439998 DOI: 10.1016/j.jvoice.2020.08.019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
INTRODUCTION Since the beginning of the new Coronavirus Disease 2019 (COVID-19) pandemic, health services have had to face a new scenario. Voice therapy faces a double challenge: delivering interventions using telepractice, and providing rehabilitation services to a growing population of patients at risk of functional impairment related to the COVID-19 disease. Moreover, as COVID-19 is transmitted through droplets, it is critical to understand how to mitigate these risks during assessment and treatment. OBJECTIVE To promote safe and effective clinical practice in voice assessment and rehabilitation for speech-language pathologists in the COVID-19 pandemic context. METHODS A group of 11 experts in voice and swallowing disorders from five different countries developed consensus recommendations following the American Academy of Otolaryngology-Head and Neck Surgery rules, building a clinical guide for speech-language pathologists during this pandemic. RESULTS The clinical guide provides 79 recommendations for clinicians on the management of voice disorders during the pandemic, including advice on assessment, direct treatment, telepractice, and teamwork. Consensus of 95% was reached for all topics. CONCLUSION This guideline should be taken only as recommendations; each clinician must attempt to mitigate the risk of infection and achieve the best therapeutic results, taking into account the patient's particular reality.
Affiliation(s)
- Adrián Castillo-Allendes
- Department of Communicative Sciences and Disorders, Michigan State University, East Lansing, Michigan
- Francisco Contreras-Ruston
- Speech-Language Pathology and Audiology Department, Universidad de Valparaíso, San Felipe, Chile
- Lady Cantor
- Department of Collective Health, Universidad Nacional de Colombia, Bogotá, Colombia; Program of Speech and Language Pathology, Universidad Manuela Beltrán, Bogotá, Colombia
- Juliana Codino
- Department of Communicative Sciences and Disorders, Michigan State University, East Lansing, Michigan; Lakeshore Professional Voice Center, Lakeshore Ear, Nose, and Throat Center, St. Clair Shores, Michigan
- Marco Guzman
- Universidad de los Andes, Chile, Santiago, Chile
- Celina Malebran
- Escuela de Fonoaudiología, Universidad Católica Silva Henríquez, Santiago, Chile
- Carlos Manzano
- Hospital Médica Sur, Ciudad de México, México; Centro Médico ABC, Ciudad de México, México
- Axel Pavez
- Physical Medicine and Rehabilitation Service, Hospital de Urgencia Asistencia Pública, Santiago, Chile
- Thays Vaiano
- CEV - Centro de Estudos da Voz, São Paulo, Brazil; Speech-Language Pathology and Audiology Department, Escola Paulista de Medicina, Federal University of São Paulo, São Paulo, Brazil
- Fabiana Wilder
- Carrera de Fonoaudiología, Facultad de Medicina, Universidad de Buenos Aires, Buenos Aires, Argentina; Servicio de Fonoaudiología, Hospital de Clínicas "José de San Martin", Buenos Aires, Argentina
- Mara Behlau
- CEV - Centro de Estudos da Voz, São Paulo, Brazil; Speech-Language Pathology and Audiology Department, Escola Paulista de Medicina, Federal University of São Paulo, São Paulo, Brazil
22
Castillo-Allendes A, Contreras-Ruston F, Cantor-Cutiva LC, Codino J, Guzman M, Malebran C, Manzano C, Pavez A, Vaiano T, Wilder F, Behlau M. Voice Therapy in the Context of the COVID-19 Pandemic: Guidelines for Clinical Practice. J Voice 2021; 35:717-727. [PMID: 32878736 PMCID: PMC7413113 DOI: 10.1016/j.jvoice.2020.08.001] [Citation(s) in RCA: 36] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2020] [Revised: 07/30/2020] [Accepted: 08/03/2020] [Indexed: 01/14/2023]
Abstract
INTRODUCTION Since the beginning of the new COVID-19 pandemic, health services have had to face a new scenario. Voice therapy faces a double challenge: delivering interventions using telepractice, and providing rehabilitation services to a growing population of patients at risk of functional impairment related to the COVID-19 disease. Moreover, as COVID-19 is transmitted through droplets, it is critical to understand how to mitigate these risks during assessment and treatment. OBJECTIVE To promote safe and effective clinical practice in voice assessment and rehabilitation for speech-language pathologists in the COVID-19 pandemic context. METHODS A group of 11 experts in voice and swallowing disorders from five different countries developed consensus recommendations following the American Academy of Otolaryngology-Head and Neck Surgery rules, building a clinical guide for speech-language pathologists during this pandemic. RESULTS The clinical guide provides 65 recommendations for clinicians on the management of voice disorders during the pandemic, including advice on assessment, direct treatment, telepractice, and teamwork. Consensus of 95% was reached for all topics. CONCLUSION This guideline should be taken only as recommendations; each clinician must attempt to mitigate the risk of infection and achieve the best therapeutic results, taking into account the patient's particular reality.
Affiliation(s)
- Adrián Castillo-Allendes: Department of Communicative Sciences and Disorders, Michigan State University, East Lansing, Michigan
- Lady Catherine Cantor-Cutiva: Department of Collective Health, Universidad Nacional de Colombia, Bogotá, Colombia; Program of Speech and Language Pathology, Universidad Manuela Beltrán, Bogotá, Colombia
- Juliana Codino: Department of Communicative Sciences and Disorders, Michigan State University, East Lansing, Michigan; Lakeshore Professional Voice Center, Lakeshore Ear, Nose, and Throat Center, St. Clair Shores, Michigan
- Marco Guzman: Universidad de los Andes, Chile, Santiago, Chile
- Celina Malebran: Escuela de Fonoaudiología, Universidad Católica Silva Henríquez, Santiago, Chile
- Carlos Manzano: Hospital Médica Sur, Ciudad de México, México; Centro Médico ABC, Ciudad de México, México
- Axel Pavez: Physical Medicine and Rehabilitation Service, Hospital de Urgencia Asistencia Pública, Santiago, Chile
- Thays Vaiano: CEV - Centro de Estudos da Voz, São Paulo, Brazil; Speech-Language Pathology and Audiology Department, Escola Paulista de Medicina, Federal University of São Paulo, São Paulo, Brazil
- Fabiana Wilder: Carrera de Fonoaudiología, Facultad de Medicina, Universidad de Buenos Aires, Buenos Aires, Argentina; Servicio de Fonoudiología, Hospital de Clínicas "José de San Martin", Buenos Aires, Argentina
- Mara Behlau: CEV - Centro de Estudos da Voz, São Paulo, Brazil; Speech-Language Pathology and Audiology Department, Escola Paulista de Medicina, Federal University of São Paulo, São Paulo, Brazil
23
Zhang C, Jepson K, Lohfink G, Arvaniti A. Comparing acoustic analyses of speech data collected remotely. J Acoust Soc Am 2021; 149:3910. [PMID: 34241427 PMCID: PMC8269758 DOI: 10.1121/10.0005132]
Abstract
Face-to-face speech data collection has been next to impossible globally as a result of COVID-19 restrictions. To address this problem, simultaneous recordings of three repetitions of the cardinal vowels were made using a Zoom H6 Handy Recorder with an external microphone (henceforth, H6) and compared with two alternatives accessible to potential participants at home: the Zoom meeting application (henceforth, Zoom) and two lossless mobile phone applications (Awesome Voice Recorder and Recorder; henceforth, Phone). F0 was tracked accurately by all of the devices; for formant analysis (F1, F2, F3), however, Phone performed better than Zoom, i.e., more similarly to H6, although the data extraction method (VoiceSauce, Praat) also resulted in differences. In addition, Zoom recordings exhibited unexpected drops in intensity. The results suggest that lossless-format phone recordings present a viable option for at least some phonetic studies.
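As a rough illustration of the F0 tracking this study found robust across recording devices, the sketch below estimates F0 from a synthetic vowel-like frame via autocorrelation. This is not the authors' pipeline (they used VoiceSauce and Praat); the signal, sample rate, search range, and voicing threshold are illustrative assumptions.

```python
import math

SR = 8000  # sample rate (Hz), assumed for the synthetic example

def estimate_f0(frame, sr, fmin=75.0, fmax=500.0):
    """Estimate F0 by picking the autocorrelation peak within [fmin, fmax]."""
    lo, hi = int(sr / fmax), int(sr / fmin)
    energy = sum(s * s for s in frame)
    best_lag, best_r = 0, 0.0
    for lag in range(lo, hi + 1):
        r = sum(frame[i] * frame[i - lag] for i in range(lag, len(frame)))
        if r > best_r:
            best_lag, best_r = lag, r
    # Require a strong periodicity peak before reporting a pitch.
    return sr / best_lag if best_lag and best_r > 0.5 * energy else None

# Synthetic 200 Hz "vowel": fundamental plus two weaker harmonics.
frame = [sum(math.sin(2 * math.pi * 200 * k * n / SR) / k for k in (1, 2, 3))
         for n in range(1024)]
print(round(estimate_f0(frame, SR)))  # 200
```

A production pitch tracker would add windowing, normalization, and octave-error checks; this only shows the core peak-picking idea.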
Affiliation(s)
- Cong Zhang: Faculty of Arts, Radboud University, Nijmegen, Gelderland, 6500 HD, The Netherlands
- Kathleen Jepson: Faculty of Arts, Radboud University, Nijmegen, Gelderland, 6500 HD, The Netherlands
- Georg Lohfink: School of European Culture and Languages, University of Kent, Canterbury, Kent, CT2 7NF, United Kingdom
- Amalia Arvaniti: Faculty of Arts, Radboud University, Nijmegen, Gelderland, 6500 HD, The Netherlands
24
Yamada Y, Shinkawa K, Kobayashi M, Takagi H, Nemoto M, Nemoto K, Arai T. Using Speech Data From Interactions With a Voice Assistant to Predict the Risk of Future Accidents for Older Drivers: Prospective Cohort Study. J Med Internet Res 2021; 23:e27667. [PMID: 33830066 PMCID: PMC8063093 DOI: 10.2196/27667]
Abstract
Background With the rapid growth of the older adult population worldwide, car accidents involving this population group have become an increasingly serious problem. Cognitive impairment, which is assessed using neuropsychological tests, has been reported as a risk factor for being involved in car accidents; however, it remains unclear whether this risk can be predicted using daily behavior data. Objective The objective of this study was to investigate whether speech data that can be collected in everyday life can be used to predict the risk of an older driver being involved in a car accident. Methods At baseline, we collected (1) speech data during interactions with a voice assistant and (2) cognitive assessment data—neuropsychological tests (Mini-Mental State Examination, revised Wechsler immediate and delayed logical memory, Frontal Assessment Battery, Trail Making Test parts A and B, and Clock Drawing Test), Geriatric Depression Scale, magnetic resonance imaging, and demographics (age, sex, education)—from older adults. Approximately one and a half years later, we followed up to collect information about their driving experiences (with respect to car accidents) using a questionnaire. We investigated the association between speech data and future accident risk using statistical analysis and machine learning models. Results We found that older drivers (n=60) with accident or near-accident experiences had statistically discernible differences in speech features that suggest cognitive impairment, such as reduced speech rate (P=.048) and increased response time (P=.040). Moreover, the model that used speech features could predict future accident or near-accident experiences with 81.7% accuracy, which was 6.7% higher than that using cognitive assessment data, and could achieve up to 88.3% accuracy when the model used both types of data. Conclusions Our study provides the first empirical results suggesting that analysis of speech data recorded during interactions with voice assistants could help predict future accident risk for older drivers by capturing subtle impairments in cognitive function.
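The two speech features the study highlights, speech rate and response time, can be derived from timestamped interaction logs. The log format below is a hypothetical sketch, not the study's actual data schema.

```python
# Hypothetical interaction log, one tuple per voice-assistant turn:
# (prompt_end, reply_start, reply_end, n_words), all times in seconds.
turns = [
    (0.0, 0.8, 3.2, 7),
    (5.0, 6.1, 9.5, 8),
    (12.0, 12.9, 15.0, 5),
]

# Response time: silence between the assistant's prompt and the reply onset.
response_times = [start - prompt for prompt, start, _, _ in turns]
# Speech rate: words uttered per second of reply duration.
speech_rates = [n / (end - start) for _, start, end, n in turns]

mean_rt = sum(response_times) / len(response_times)
mean_sr = sum(speech_rates) / len(speech_rates)
print(f"mean response time: {mean_rt:.2f} s, mean speech rate: {mean_sr:.2f} words/s")
# mean response time: 0.93 s, mean speech rate: 2.55 words/s
```

In the study these kinds of features fed a machine learning classifier; the averaging here only illustrates how the raw measures come out of the log.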
Affiliation(s)
- Miyuki Nemoto: Department of Psychiatry, University of Tsukuba Hospital, Ibaraki, Japan
- Kiyotaka Nemoto: Department of Psychiatry, Faculty of Medicine, University of Tsukuba, Ibaraki, Japan
- Tetsuaki Arai: Department of Psychiatry, Faculty of Medicine, University of Tsukuba, Ibaraki, Japan
25
Angelakis E, Andreopoulou A, Georgaki A. Multisensory biofeedback: Promoting the recessive somatosensory control in operatic singing pedagogy. Biomed Signal Process Control 2021; 66:102400. [DOI: 10.1016/j.bspc.2020.102400]
26
Uloza V, Ulozaitė-Stanienė N, Petrauskas T, Kregždytė R. Accuracy of Acoustic Voice Quality Index Captured With a Smartphone - Measurements With Added Ambient Noise. J Voice 2021; 37:465.e19-465.e26. [PMID: 33676807 DOI: 10.1016/j.jvoice.2021.01.025]
Abstract
OBJECTIVE To evaluate the accuracy of Acoustic Voice Quality Index (AVQI) measures obtained from voice recordings made simultaneously with oral and smartphone microphones in a sound-proof room, and to compare them with AVQIs obtained from the same smartphone voice recordings with added ambient noise. METHODS A study group of 183 subjects with normal voices (n = 86) and various voice disorders (n = 97) was asked to read aloud a standard text and sustain the vowel /a/. Controlled ambient noise averaging 29.61 dB SPL was added digitally to the smartphone voice recordings. Repeated-measures analysis of variance (ANOVA) with Greenhouse-Geisser correction was used to evaluate AVQI changes within subjects. Bland-Altman plots were used to evaluate the level of agreement between AVQI measurements obtained from the different voice recordings. RESULTS Repeated-measures ANOVA showed that differences among AVQI results obtained from recordings made with an oral studio microphone, recordings made with a smartphone microphone, and smartphone recordings with added ambient noise were not statistically significant (P = 0.07). No significant systematic differences, and an acceptable level of random error, were revealed in AVQI measurements from recordings made with oral and smartphone microphones (including added noise). CONCLUSION The AVQI measures obtained from smartphone-microphone voice recordings with experimentally added ambient noise showed acceptable agreement with the results of oral microphone recordings, suggesting the suitability of smartphone recordings for AVQI estimation even in the presence of acceptable ambient noise.
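A Bland-Altman analysis like the one used here reduces to a bias (the mean paired difference) and 95% limits of agreement (bias ± 1.96 × SD of the differences). A minimal sketch with hypothetical AVQI value pairs, not the study's data:

```python
import statistics

def bland_altman(a, b):
    """Bias (mean difference) and 95% limits of agreement for paired measures."""
    diffs = [x - y for x, y in zip(a, b)]
    bias = statistics.mean(diffs)
    sd = statistics.stdev(diffs)  # sample SD of the paired differences
    return bias, (bias - 1.96 * sd, bias + 1.96 * sd)

# Hypothetical AVQI values: studio oral microphone vs. smartphone microphone.
studio = [2.1, 4.8, 6.3, 3.0, 5.5, 7.2]
phone  = [2.3, 4.6, 6.5, 3.1, 5.9, 7.0]

bias, (lo, hi) = bland_altman(studio, phone)
print(f"bias = {bias:.2f}, 95% limits of agreement = ({lo:.2f}, {hi:.2f})")
```

The plot itself adds the per-pair means on the x-axis; the agreement statistics above are what "acceptable level of random error" refers to.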
Affiliation(s)
- Virgilijus Uloza: Department of Otorhinolaryngology, Lithuanian University of Health Sciences, Kaunas, Lithuania
- Nora Ulozaitė-Stanienė: Department of Otorhinolaryngology, Lithuanian University of Health Sciences, Kaunas, Lithuania
- Tadas Petrauskas: Department of Otorhinolaryngology, Lithuanian University of Health Sciences, Kaunas, Lithuania
- Rima Kregždytė: Department of Preventive Medicine, Lithuanian University of Health Sciences, Kaunas, Lithuania
27
Tabatabaei SAH, Fischer P, Schneider H, Koehler U, Gross V, Sohrabi K. Methods for Adventitious Respiratory Sound Analyzing Applications Based on Smartphones: A Survey. IEEE Rev Biomed Eng 2021; 14:98-115. [PMID: 32746364 DOI: 10.1109/rbme.2020.3002970]
Abstract
Detection and classification of adventitious lung sounds plays an important role in diagnosing, monitoring, controlling, and caring for patients with lung diseases. Such systems can be delivered on different platforms, such as medical devices, standalone software, or smartphone applications. The ubiquity of smartphones and the widespread use of their applications make them an attractive platform for hosting detection and classification systems for adventitious lung sounds. In this paper, smartphone-based systems for automatic detection and classification of adventitious lung sounds are surveyed. Such adventitious sounds include cough, wheeze, crackle, and snore; relevant sounds related to abnormal respiratory activities are considered as well. The methods are described briefly and their analysis algorithms explained; the analysis includes detection and/or classification of the sound events. For comparison, a summary of the main surveyed methods, together with their classification parameters and the features used, is given. Existing challenges, open issues, and future trends are discussed as well.
28
Ditthapron A, Agu EO, Lammert AC. Privacy-Preserving Deep Speaker Separation for Smartphone-Based Passive Speech Assessment. IEEE Open J Eng Med Biol 2021; 2:304-313. [PMID: 35402977 PMCID: PMC8940203 DOI: 10.1109/ojemb.2021.3063994]
Abstract
Goal: Smartphones can be used to passively assess and monitor patients’ speech impairments caused by ailments such as Parkinson’s disease, Traumatic Brain Injury (TBI), Post-Traumatic Stress Disorder (PTSD) and neurodegenerative diseases such as Alzheimer’s disease and dementia. However, passive audio recordings in natural settings often capture the speech of non-target speakers (cross-talk). Consequently, speaker separation, which identifies the target speakers’ speech in audio recordings with two or more speakers’ voices, is a crucial pre-processing step in such scenarios. Prior speech separation methods analyzed raw audio. However, in order to preserve speaker privacy, passively recorded smartphone audio and machine learning-based speech assessment are often performed on derived speech features such as Mel-Frequency Cepstral Coefficients (MFCCs). In this paper, we propose a novel Deep MFCC bAsed SpeaKer Separation (Deep-MASKS). Methods: Deep-MASKS uses an autoencoder to reconstruct MFCC components of an individual’s speech from an i-vector, x-vector or d-vector representation of their speech learned during the enrollment period. Deep-MASKS utilizes a Deep Neural Network (DNN) for MFCC signal reconstructions, which yields a more accurate, higher-order function compared to prior work that utilized a mask. Unlike prior work that operates on utterances, Deep-MASKS operates on continuous audio recordings. Results: Deep-MASKS outperforms baselines, reducing the Mean Squared Error (MSE) of MFCC reconstruction by up to 44% and the number of additional bits required to represent clean speech entropy by 36%.
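Deep-MASKS operates on MFCCs rather than raw audio; the first step of any MFCC front end is warping frequency onto the mel scale before filterbank and cepstral stages. A minimal sketch of that warping using the standard HTK-style formula (this illustrates the general MFCC ingredient, not the paper's specific configuration):

```python
import math

def hz_to_mel(f):
    """Standard HTK-style mel scale used when building MFCC filterbanks."""
    return 2595.0 * math.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    """Inverse mapping, needed to place filterbank edges back in Hz."""
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

# Center frequencies of a 10-filter mel filterbank spanning 0-4000 Hz:
# evenly spaced in mel, hence increasingly far apart in Hz.
n_filters, f_max = 10, 4000.0
m_max = hz_to_mel(f_max)
centers = [mel_to_hz(m_max * (i + 1) / (n_filters + 1)) for i in range(n_filters)]
print([round(c) for c in centers])
```

The uneven Hz spacing of the printed centers is the point: MFCCs allocate resolution the way human hearing does, which is also why reconstructing speech content from them (as Deep-MASKS does) is lossy by design.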
Affiliation(s)
- Apiwat Ditthapron: Computer Science Department, Worcester Polytechnic Institute, Worcester, MA 01609, USA
- Emmanuel O Agu: Computer Science Department, Worcester Polytechnic Institute, Worcester, MA 01609, USA
- Adam C Lammert: Biomedical Engineering Department, Worcester Polytechnic Institute, Worcester, MA 01609, USA
29
Robin J, Harrison JE, Kaufman LD, Rudzicz F, Simpson W, Yancheva M. Evaluation of Speech-Based Digital Biomarkers: Review and Recommendations. Digit Biomark 2020; 4:99-108. [PMID: 33251474 DOI: 10.1159/000510820]
Abstract
Speech represents a promising novel biomarker by providing a window into brain health, as shown by its disruption in various neurological and psychiatric diseases. As with many novel digital biomarkers, however, rigorous evaluation is currently lacking and is required for these measures to be used effectively and safely. This paper outlines and provides examples from the literature of evaluation steps for speech-based digital biomarkers, based on the recent V3 framework (Goldsack et al., 2020). The V3 framework describes 3 components of evaluation for digital biomarkers: verification, analytical validation, and clinical validation. Verification includes assessing the quality of speech recordings and comparing the effects of hardware and recording conditions on the integrity of the recordings. Analytical validation includes checking the accuracy and reliability of data processing and computed measures, including understanding test-retest reliability, demographic variability, and comparing measures to reference standards. Clinical validation involves verifying the correspondence of a measure to clinical outcomes, which can include diagnosis, disease progression, or response to treatment. For each of these sections, we provide recommendations for the types of evaluation necessary for speech-based biomarkers and review published examples. The examples in this paper focus on speech-based biomarkers, but they can be used as a template for digital biomarker development more generally.
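Analytical validation as described above includes test-retest reliability. One simple way to quantify it is the correlation between repeated sessions of the same measure; the sketch below uses Pearson correlation on hypothetical values for an illustrative speech-rate measure (reliability studies more commonly report an intraclass correlation coefficient, which is not shown here):

```python
import statistics

def pearson_r(x, y):
    """Pearson correlation, a simple test-retest reliability check."""
    mx, my = statistics.mean(x), statistics.mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    # Sum of cross-products over n * population SDs equals Pearson's r.
    return cov / (statistics.pstdev(x) * statistics.pstdev(y) * len(x))

# Hypothetical speech-rate values (words/s) from two sessions a week apart.
session1 = [3.1, 2.4, 4.0, 3.5, 2.8]
session2 = [3.0, 2.6, 3.9, 3.6, 2.7]
print(round(pearson_r(session1, session2), 3))  # ≈ 0.975
```

High test-retest agreement like this is necessary before demographic variability or clinical validity can be meaningfully assessed.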
Affiliation(s)
- John E Harrison: Metis Cognition Ltd., Park House, Kilmington Common, Warminster, United Kingdom; Alzheimer Center, AUmc, Amsterdam, The Netherlands; Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, United Kingdom
- Frank Rudzicz: Li Ka Shing Knowledge Institute, St Michael's Hospital, Toronto, Ontario, Canada; Department of Computer Science, University of Toronto, Toronto, Ontario, Canada; Vector Institute for Artificial Intelligence, Toronto, Ontario, Canada
- William Simpson: Winterlight Labs, Toronto, Ontario, Canada; Department of Psychiatry and Behavioural Neuroscience, McMaster University, Hamilton, Ontario, Canada
30
Baldanzi C, Crispiatico V, Foresti S, Groppo E, Rovaris M, Cattaneo D, Vitali C. Effects of Intensive Voice Treatment (The Lee Silverman Voice Treatment [LSVT LOUD]) in Subjects With Multiple Sclerosis: A Pilot Study. J Voice 2020; 36:585.e1-585.e13. [PMID: 32819780 DOI: 10.1016/j.jvoice.2020.07.025]
Abstract
AIM The rehabilitation of voice disorders is an unmet need in multiple sclerosis (MS). The Lee Silverman Voice Treatment (LSVT LOUD) is a well-documented and effective speech treatment developed to treat voice disorders in Parkinson disease. The purpose of the present study was to examine the viability of applying LSVT LOUD to individuals with MS and to verify short- and long-term improvements in acoustic and perceptual voice parameters. METHODS A single-subject design was used with a consecutive sample of 8 subjects with MS. The subjects' voices were recorded with the PRAAT software for 5 days at baseline, during the 16 treatment sessions, and at follow-up (FU) 6/12 months later. PRAAT provided data on the voice intensity (SPL/a/) and maximum phonation time (MPT/a/) of sustained /a/, and on the voice intensity of functional sentences. In addition, the self-assessment Voice Handicap Index questionnaire, the perceptual GIRBAS scale, and intensity of monologue were collected on the first day of baseline, post-treatment, and at FU. In the treatment phase each subject received treatment according to the LSVT LOUD protocol. Visual analysis of the daily acoustic variables was used to determine baseline stability and analyse changes following treatment. The Wilcoxon test was used to assess statistically significant differences between baseline and post-treatment. RESULTS All participants completed the LSVT LOUD programme; one participant dropped out at FU. Improvements in the acoustic analysis were found: SPL/a/ improved on average (± standard deviation) by 11.64 ± 4.19 dB, with 7 subjects showing statistically significant improvement (P < 0.05); MPT/a/ improved on average by 1.2 ± 1.53 seconds and intensity of functional sentences by 8.11 ± 3.46 dB, with 4 and 5 subjects, respectively, showing statistically significant improvement. Intensity of monologue improved by 14.90 ± 3.33 dB. Acoustic values were maintained or increased at FU with respect to baseline. All subjects improved perceptual ratings on the Voice Handicap Index, and results were maintained at FU. These changes were associated with improvements on five parameters of the GIRBAS scale at post-treatment; however, no further improvements were observed at FU. CONCLUSION Intensive LSVT LOUD treatment is a viable approach to treat hypophonia in MS. LSVT LOUD improved both quantitative-instrumental and perceptive-subjective assessments. Randomised controlled trials are needed to provide firm support for the effectiveness of LSVT LOUD in MS.
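The Wilcoxon signed-rank test used for the baseline vs. post-treatment comparisons is built on a simple statistic: rank the absolute paired differences and take the smaller of the positive and negative rank sums. A sketch with hypothetical SPL values (statistic only, no p-value; real analyses would use a statistics package):

```python
def wilcoxon_w(pre, post):
    """Wilcoxon signed-rank statistic: smaller of positive/negative rank sums."""
    diffs = [b - a for a, b in zip(pre, post) if b != a]  # drop zero differences
    ranked = sorted(diffs, key=abs)
    # Assign average ranks to tied |differences|.
    ranks = {}
    i = 0
    while i < len(ranked):
        j = i
        while j < len(ranked) and abs(ranked[j]) == abs(ranked[i]):
            j += 1
        for k in range(i, j):
            ranks[k] = (i + j + 1) / 2  # mean of 1-based ranks i+1 .. j
        i = j
    w_pos = sum(ranks[k] for k, d in enumerate(ranked) if d > 0)
    w_neg = sum(ranks[k] for k, d in enumerate(ranked) if d < 0)
    return min(w_pos, w_neg)

# Hypothetical SPL (dB) for 8 subjects before and after intensive treatment.
pre  = [62.0, 58.5, 60.1, 64.2, 59.0, 61.3, 63.5, 57.8]
post = [73.9, 70.0, 71.5, 75.0, 68.2, 72.8, 74.1, 69.9]
print(wilcoxon_w(pre, post))  # 0: every subject improved
```

A statistic of 0 (all differences in one direction) is the strongest possible result for n paired observations, which is the pattern the SPL/a/ findings describe.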
Affiliation(s)
- Elisabetta Groppo: Ospedale San Paolo - Azienda Socio-Sanitaria Territoriale (ASST), Milano, Italy
31
Petrizzo D, Popolo PS. Smartphone Use in Clinical Voice Recording and Acoustic Analysis: A Literature Review. J Voice 2020; 35:499.e23-499.e28. [PMID: 32736910 DOI: 10.1016/j.jvoice.2019.10.006]
Abstract
OBJECTIVE With the increase in smartphone use and availability over the last decade, mobile healthcare applications have become more accessible. Many of these applications allow users to track behaviors and goals, and to acquire feedback and information while on the go. Recent studies in the literature suggest that smartphones may offer a means of augmenting clinical voice assessment by recording individuals with voice disorders outside the clinic for the purpose of extracting acoustic characteristics. This review examines the effectiveness of smartphones in clinical voice assessment and treatment, as reported in the current literature. METHODS The PubMed database was searched using combinations and variations of different terms related to smartphones, voice, and recording apps, in order to find articles that address the role of smartphones in clinical voice recording and assessment. RESULTS AND CONCLUSION Six studies published in the last 3 years were reviewed and examined in terms of the types of devices and operating systems used, the types of subjects and disorders studied, the voice parameters extracted, and the microphones used. Considerations such as the impact of environmental noise and privacy and security issues are also examined. While smartphones and mobile apps have the potential to be valuable tools in voice assessment outside the clinic, further efforts are needed for them to be used effectively in a clinical setting.
Affiliation(s)
- Danielle Petrizzo: Department of Communication Sciences and Disorders, Montclair State University, Montclair, New Jersey
32
Illner V, Sovka P, Rusz J. Validation of freely-available pitch detection algorithms across various noise levels in assessing speech captured by smartphone in Parkinson's disease. Biomed Signal Process Control 2020. [DOI: 10.1016/j.bspc.2019.101831]
33
Yamada Y, Shinkawa K, Shimmei K. Atypical Repetition in Daily Conversation on Different Days for Detecting Alzheimer Disease: Evaluation of Phone-Call Data From Regular Monitoring Service. JMIR Ment Health 2020; 7:e16790. [PMID: 31934870 PMCID: PMC6996758 DOI: 10.2196/16790]
Abstract
BACKGROUND Identifying signs of Alzheimer disease (AD) through longitudinal and passive monitoring techniques has become increasingly important. Previous studies have succeeded in quantifying language dysfunctions and identifying AD from speech data collected during neuropsychological tests. However, whether and how language dysfunction can be quantified in daily conversation remains unexplored. OBJECTIVE The objective of this study was to explore the linguistic features that can be used to differentiate patients with AD on the basis of daily conversations. METHODS We analyzed daily conversational data of seniors with and without AD obtained through longitudinal follow-up in a regular monitoring service (n=15 individuals, including 2 patients with AD, with an average follow-up period of 16.1 months; 1032 conversational data items obtained during phone calls, totaling approximately 221 person-hours). In addition to the standard linguistic features used in previous studies on connected speech collected during neuropsychological tests, we extracted novel features related to atypical repetition of words and topics, reported by previous observational and descriptive studies as one of the prominent characteristics of the everyday conversations of patients with AD. RESULTS When we compared discriminative power for AD, we found that atypical repetition across two conversations on different days outperformed the linguistic features used in previous studies on speech data from neuropsychological tests. It was also a better indicator than atypical repetition within single conversations, as well as repetition across two conversations separated by a specific number of conversations. CONCLUSIONS Our results show how linguistic features related to atypical repetition across days could be used to detect AD from daily conversations in a passive manner by taking advantage of longitudinal data.
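A cross-day repetition feature of this kind can be approximated by vocabulary overlap between two conversations. The sketch below uses a Jaccard index over content words; the stopword list and sentences are invented for illustration and are far simpler than the paper's actual features.

```python
# Tiny illustrative stopword list (a real one would be much larger).
STOPWORDS = {"the", "a", "and", "i", "to", "it", "was", "of"}

def content_words(text):
    """Lowercased, punctuation-stripped words minus stopwords."""
    return {w.strip(".,!?").lower() for w in text.split()} - STOPWORDS

def repetition_score(conv_a, conv_b):
    """Jaccard overlap of content words between conversations on different days."""
    a, b = content_words(conv_a), content_words(conv_b)
    return len(a & b) / len(a | b) if a | b else 0.0

day1 = "I went to the market and bought fresh bread."
day2 = "I went to the market and bought fresh bread today."
day3 = "My grandson visited and we talked about his school."

print(round(repetition_score(day1, day2), 2))  # high overlap: near-verbatim retelling
print(round(repetition_score(day1, day3), 2))  # low overlap: different topic
```

A persistently high score across days would flag the atypical retelling behavior the study describes; the paper's features additionally handle topics, not just word sets.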
Affiliation(s)
- Keita Shimmei: IBM Research, Tokyo, Japan; Poverty and Equity Global Practice, The World Bank, Washington, DC, United States
34
Ulozaite-Staniene N, Petrauskas T, Šaferis V, Uloza V. Exploring the feasibility of the combination of acoustic voice quality index and glottal function index for voice pathology screening. Eur Arch Otorhinolaryngol 2019; 276:1737-1745. [DOI: 10.1007/s00405-019-05433-5]
35
Jannetts S, Schaeffler F, Beck J, Cowen S. Assessing voice health using smartphones: bias and random error of acoustic voice parameters captured by different smartphone types. Int J Lang Commun Disord 2019; 54:292-305. [PMID: 30779425 DOI: 10.1111/1460-6984.12457]
Abstract
BACKGROUND Occupational voice problems constitute a serious public health issue with substantial financial and human consequences for society. Modern mobile technologies such as smartphones have the potential to enhance approaches to prevention and management of voice problems. This paper addresses an important aspect of smartphone-assisted voice care: the reliability of smartphone-based acoustic analysis for voice health state monitoring. AIM To assess the reliability of acoustic parameter extraction for a range of commonly used smartphones by comparison with studio recording equipment. METHODS & PROCEDURES Twenty-two vocally healthy speakers (12 female, 10 male) were recorded producing sustained vowels and connected speech under studio conditions using a high-quality studio microphone and an array of smartphones. For both types of utterance, Bland-Altman analysis was used to assess overall reliability for mean F0, cepstral peak prominence (CPPS), Jitter (RAP), and Shimmer %. OUTCOMES & RESULTS Analysis of the systematic and random error indicated significant bias for CPPS across both sustained vowels and passage reading. Analysis of the random error of the devices indicated that mean F0 and CPPS showed acceptable random error size, while jitter and shimmer random error was judged problematic. CONCLUSIONS & IMPLICATIONS Confidence in the feasibility of smartphone-based voice assessment is increased by the experimental finding of high levels of reliability for some clinically relevant acoustic parameters, while the use of other parameters is discouraged. We also challenge the practice of using statistical tests (e.g., t-tests) for measurement reliability assessment.
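Jitter (RAP) and Shimmer %, the two parameters judged unreliable here, are defined as relative perturbations of glottal cycle periods and amplitudes, which is exactly why they are sensitive to recording-chain noise. A minimal sketch using their standard definitions on hypothetical cycle data (real analyses extract periods and amplitudes with a tool such as Praat):

```python
def jitter_rap(periods):
    """Relative Average Perturbation (%): each period compared with the
    3-point moving average centred on it, relative to the mean period."""
    n = len(periods)
    pert = sum(abs(periods[i] - sum(periods[i - 1:i + 2]) / 3)
               for i in range(1, n - 1))
    return 100 * (pert / (n - 2)) / (sum(periods) / n)

def shimmer_percent(amps):
    """Local shimmer (%): mean absolute amplitude difference between
    consecutive cycles, relative to the mean amplitude."""
    n = len(amps)
    diff = sum(abs(amps[i] - amps[i + 1]) for i in range(n - 1))
    return 100 * (diff / (n - 1)) / (sum(amps) / n)

# Hypothetical glottal cycle periods (ms) and peak amplitudes from a sustained vowel.
periods = [5.00, 5.02, 4.98, 5.01, 4.99, 5.00]
amps = [0.80, 0.82, 0.79, 0.81, 0.80, 0.78]
print(f"jitter RAP: {jitter_rap(periods):.3f}%  shimmer: {shimmer_percent(amps):.2f}%")
```

Because both measures are built from tiny cycle-to-cycle differences, small codec or microphone distortions inflate them, consistent with the problematic random error reported for smartphones.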
Affiliation(s)
- Janet Beck: CASL Research Centre, Queen Margaret University, Edinburgh, UK
- Steve Cowen: CASL Research Centre, Queen Margaret University, Edinburgh, UK
36
Munnings AJ. The Current State and Future Possibilities of Mobile Phone "Voice Analyser" Applications, in Relation to Otorhinolaryngology. J Voice 2019; 34:527-532. [PMID: 30655018 DOI: 10.1016/j.jvoice.2018.12.018]
Abstract
BACKGROUND A large proportion of the population suffers from voice disorders. The use of mobile phone technology in healthcare is increasing, and this includes applications that can analyze voice. OBJECTIVE This study aimed to review the potential for voice analyzer applications to aid the management of voice disorders. METHODS A literature search was conducted, yielding eight studies that were further analyzed. RESULTS Seven of the eight studies concluded that smartphone assessments were comparable to current techniques. Nevertheless, some common issues remained with using applications, such as the voice parameters used, the voice pathologies tested, smartphone software consistency, and microphone specifications. CONCLUSIONS It is clear that further developments are required before a mobile application can be used widely in voice analysis. However, promising results have been obtained thus far, and the benefits of mobile technology in this field, particularly in voice rehabilitation, warrant further research into its widespread implementation.
37
Rusz J, Hlavnicka J, Tykalova T, Novotny M, Dusek P, Sonka K, Ruzicka E. Smartphone Allows Capture of Speech Abnormalities Associated With High Risk of Developing Parkinson's Disease. IEEE Trans Neural Syst Rehabil Eng 2018; 26:1495-1507. [DOI: 10.1109/tnsre.2018.2851787]
38
Lebacq J, Schoentgen J, Cantarella G, Bruss FT, Manfredi C, DeJonckere P. Maximal Ambient Noise Levels and Type of Voice Material Required for Valid Use of Smartphones in Clinical Voice Research. J Voice 2017; 31:550-556. [DOI: 10.1016/j.jvoice.2017.02.017]