1
|
Kent RD. The Feel of Speech: Multisystem and Polymodal Somatosensation in Speech Production. J Speech Lang Hear Res 2024; 67:1424-1460. [PMID: 38593006 DOI: 10.1044/2024_jslhr-23-00575] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/11/2024]
Abstract
PURPOSE The oral structures such as the tongue and lips have remarkable somatosensory capacities, but understanding the roles of somatosensation in speech production requires a more comprehensive knowledge of somatosensation in the speech production system in its entirety, including the respiratory, laryngeal, and supralaryngeal subsystems. This review was conducted to summarize the system-wide somatosensory information available for speech production. METHOD The search was conducted with PubMed/Medline and Google Scholar for articles published until November 2023. Numerous search terms were used in conducting the review, which covered the topics of psychophysics, basic and clinical behavioral research, neuroanatomy, and neuroscience. RESULTS AND CONCLUSIONS The current understanding of speech somatosensation rests primarily on the two pillars of psychophysics and neuroscience. The confluence of polymodal afferent streams supports the development, maintenance, and refinement of speech production. Receptors are both canonical and noncanonical, with the latter occurring especially in the muscles innervated by the facial nerve. Somatosensory representation in the cortex is disproportionately large and provides for sensory interactions. Speech somatosensory function is robust over the lifespan, with possible declines in advanced aging. The understanding of somatosensation in speech disorders is largely disconnected from research and theory on speech production. A speech somatoscape is proposed as the generalized, system-wide sensation of speech production, with implications for speech development, speech motor control, and speech disorders.
Collapse
|
2
|
Vorperian HK, Kent RD, Lee Y, Buhr KA. Vowel Production in Children and Adults With Down Syndrome: Fundamental and Formant Frequencies of the Corner Vowels. J Speech Lang Hear Res 2023; 66:1208-1239. [PMID: 37015000 PMCID: PMC10187968 DOI: 10.1044/2022_jslhr-22-00510] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/03/2022] [Revised: 12/01/2022] [Accepted: 12/21/2022] [Indexed: 05/18/2023]
Abstract
PURPOSE Atypical vowel production contributes to reduced speech intelligibility in children and adults with Down syndrome (DS). This study compares the acoustic data of the corner vowels /i/, /u/, /æ/, and /ɑ/ from speakers with DS against typically developing/developed (TD) speakers. METHOD Measurements of the fundamental frequency (f o) and first four formant frequencies (F1-F4) were obtained from single word recordings containing the target vowels from 81 participants with DS (ages 3-54 years) and 293 TD speakers (ages 4-92 years), all native speakers of English. The data were used to construct developmental trajectories and to determine interspeaker and intraspeaker variability. RESULTS Trajectories for DS differed from TD based on age and sex, but the groups were similar with the striking change in f o and F1-F4 frequencies around age 10 years. Findings confirm higher f o in DS, and vowel-specific differences between DS and TD in F1 and F2 frequencies, but not F3 and F4. The measure of F2 differences of front-versus-back vowels was more sensitive of compression than reduced vowel space area/centralization across age and sex. Low vowels had more pronounced F2 compression as related to reduced speech intelligibility. Intraspeaker variability was significantly greater for DS than TD for nearly all frequency values across age. DISCUSSION Vowel production differences between DS and TD are age- and sex-specific, which helps explain contradictory results in previous studies. Increased intraspeaker variability across age in DS confirms the presence of a persisting motor speech disorder. Atypical vowel production in DS is common and related to dysmorphology, delayed development, and disordered motor control.
Collapse
Affiliation(s)
- Houri K. Vorperian
- Vocal Tract Development Lab, Waisman Center, University of Wisconsin–Madison
| | - Raymond D. Kent
- Vocal Tract Development Lab, Waisman Center, University of Wisconsin–Madison
| | - Yen Lee
- Department of Educational Leadership, Edgewood College, Madison, Wisconsin
| | - Kevin A. Buhr
- Department of Biostatistics and Medical Informatics, University of Wisconsin–Madison
| |
Collapse
|
3
|
Hendrix JA, Amon A, Abbeduto L, Agiovlasitis S, Alsaied T, Anderson HA, Bain LJ, Baumer N, Bhattacharyya A, Bogunovic D, Botteron KN, Capone G, Chandan P, Chase I, Chicoine B, Cieuta-Walti C, DeRuisseau LR, Durand S, Esbensen A, Fortea J, Giménez S, Granholm AC, Hahn LJ, Head E, Hillerstrom H, Jacola LM, Janicki MP, Jasien JM, Kamer AR, Kent RD, Khor B, Lawrence JB, Lemonnier C, Lewanda AF, Mobley W, Moore PE, Nelson LP, Oreskovic NM, Osorio RS, Patterson D, Rasmussen SA, Reeves RH, Roizen N, Santoro S, Sherman SL, Talib N, Tapia IE, Walsh KM, Warren SF, White AN, Wong GW, Yi JS. Opportunities, barriers, and recommendations in down syndrome research. Transl Sci Rare Dis 2021; 5:99-129. [PMID: 34268067 PMCID: PMC8279178 DOI: 10.3233/trd-200090] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
BACKGROUND Recent advances in medical care have increased life expectancy and improved the quality of life for people with Down syndrome (DS). These advances are the result of both pre-clinical and clinical research but much about DS is still poorly understood. In 2020, the NIH announced their plan to update their DS research plan and requested input from the scientific and advocacy community. OBJECTIVE The National Down Syndrome Society (NDSS) and the LuMind IDSC Foundation worked together with scientific and medical experts to develop recommendations for the NIH research plan. METHODS NDSS and LuMind IDSC assembled over 50 experts across multiple disciplines and organized them in eleven working groups focused on specific issues for people with DS. RESULTS This review article summarizes the research gaps and recommendations that have the potential to improve the health and quality of life for people with DS within the next decade. CONCLUSIONS This review highlights many of the scientific gaps that exist in DS research. Based on these gaps, a multidisciplinary group of DS experts has made recommendations to advance DS research. This paper may also aid policymakers and the DS community to build a comprehensive national DS research strategy.
Collapse
Affiliation(s)
| | - Angelika Amon
- Deceased. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA, USA; Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA; Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - Leonard Abbeduto
- Department of Psychiatry and Behavioral Sciences, University of California, Davis, CA, USA; MIND Institute, University of California, Davis, CA, USA
| | | | - Tarek Alsaied
- Heart Institute Department of Pediatrics Cincinnati Children’s Hospital Medical Center University of Cincinnati, Cincinnati, OH, USA
| | | | | | - Nicole Baumer
- Department of Neurology, Boston Children’s Hospital, Harvard Medical School, Boston, MA, USA; Down Syndrome Program, Developmental Medicine Center, Boston Children’s Hospital, Boston, MA, USA
| | - Anita Bhattacharyya
- Waisman Center, University of Wisconsin-Madison, Madison, WI, USA
- Department of Cell and Regenerative Biology, School of Medicine and Public Health, University of Wisconsin-Madison, Madison, WI, USA
| | - Dusan Bogunovic
- Department of Microbiology, Icahn School of Medicine at Mt. Sinai, New York, NY, USA; Department of Pediatrics, Icahn School of Medicine at Mt. Sinai, New York, NY; Precision Immunology Institute, Icahn School of Medicine at Mt. Sinai, New York, NY, USA; Mindich Child Health and Development Institute, Icahn School of Medicine at Mt. Sinai, New York, NY, USA
| | - Kelly N. Botteron
- Department of Psychiatry, Washington University School of Medicine, St. Louis, MO, USA; Mallinckrodt Institute of Radiology, Washington University School of Medicine, St. Louis, MO, USA
| | | | - Priya Chandan
- Department of Neurosurgery, Division of Physical Medicine and Rehabilitation, University of Louisville School of Medicine, Louisville, KY, USA
| | - Isabelle Chase
- Department of Pediatric Dentistry, Boston Children’s Hospital, Boston, MA, USA
| | - Brian Chicoine
- Advocate Medical Group Adult Down Syndrome Center, Park Ridge, IL, USA
| | | | | | | | - Anna Esbensen
- Department of Pediatrics, University of Cincinnati College of Medicine & Division of Developmental and Behavioral Pediatrics, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, USA
| | - Juan Fortea
- Barcelona Down Medical Center, Fundació Catalana de Síndrome de Down, Barcelona, Spain; Sant Pau Memory Unit, Department of Neurology, Hospital de la Santa Creu i Sant Pau, Biomedical Research Institute Sant Pau, Universitat Autònoma de Barcelona, Barcelona, Spain; Center of Biomedical Investigation Network for Neurodegenerative Diseases, Madrid, Spain
| | - Sandra Giménez
- Multidisciplinary Sleep Unit, Respiratory Department, Hospital de la Santa Creu i Sant Pau, Barcelona, Spain
| | - Ann-Charlotte Granholm
- Knoebel Institute for Healthy Aging, University of Denver, Denver, CO, USA
- Department of Neurobiology, Care Sciences and Society (NVS), Karolinska Institutet, Stockholm, Sweden
| | - Laura J. Hahn
- Department of Speech and Hearing Science, University of Illinois Urbana Champaign, Champaign, IL, USA
| | - Elizabeth Head
- Department of Pathology and Laboratory Medicine, UC Irvine School of Medicine, Orange, CA, USA
| | | | - Lisa M. Jacola
- Department of Psychology, St Jude Children’s Research Hospital, Memphis, TN, USA
| | | | - Joan M. Jasien
- Division of Pediatric Neurology, Duke University Health System, Durham, NC, USA
| | - Angela R. Kamer
- Department of Periodontology and Implant Dentistry, New York University, College of Dentistry, New York, NY, USA
| | - Raymond D. Kent
- Waisman Center, University of Wisconsin-Madison, Madison, WI, USA
| | - Bernard Khor
- Benaroy Research Institute at Virginia Mason, Seattle, WA, USA
| | - Jeanne B. Lawrence
- Department of Neurology, University of Massachusetts Medical School, Worcester, MA, USA; Department of Pediatrics, University of Massachusetts Medical School, Worcester, MA, USA
| | | | - Amy Feldman Lewanda
- Children s National Rare Disease Institute, Children’s National Health System, Washington, DC., USA
| | - William Mobley
- Department of Neurosciences, University of California, San Diego, CA, USA
| | - Paul E. Moore
- Division of Allergy, Immunology, and Pulmonology, Vanderbilt University Medical Center, Nashville, TN, USA; Department of Pediatrics, Vanderbilt University Medical Center, Nashville, TN, USA
| | | | - Nicolas M. Oreskovic
- Department of Pediatrics, Massachusetts General Hospital, Boston, MA, USA; Department of Internal Medicine, Massachusetts General Hospital, Boston, Mass
- Department of Pediatrics, Harvard Medical School, Boston, MA, USA
| | - Ricardo S. Osorio
- Center for Brain Health, Department of Psychiatry, NYU Langone Medical Center, New York, NY, USA
| | - David Patterson
- Knoebel Institute for Healthy Aging, University of Denver, Denver, CO, USA
- Eleanor Roosevelt Institute, University of Denver, Denver, CO, USA; Department of Biological Sciences, University of Denver, Denver, CO, USA; Molecular and Cellular Biophysics Program, University of Denver, Denver, CO, USA
| | - Sonja A. Rasmussen
- Department of Pediatrics, University of Florida College of Medicine, Gainesville, FL; Department of Epidemiology, University of Florida College of Public Health and Health Professions and College of Medicine, Gainesville, FL
| | - Roger H. Reeves
- Department of Physiology, Johns Hopkins University School of Medicine, Baltimore, MD, USA; McKusick-Nathans Department of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Nancy Roizen
- Department of Pediatrics, UH/Rainbow Babies and Children’s Hospital and Department of Pediatrics, Case Western Reserve University, Cleveland, OH, USA
| | - Stephanie Santoro
- Department of Pediatrics, Harvard Medical School, Boston, MA, USA
- Down Syndrome Program, Division of Medical Genetics and Metabolism, Department of Pediatrics, Massachusetts General Hospital, Boston, MA, USA
| | - Stephanie L. Sherman
- Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, USA
| | - Nasreen Talib
- Division of General Pediatrics, Children’s Mercy Kansas City, 2401 Gillham Road, Kansas City, MO, USA
| | - Ignacio E. Tapia
- Sleep Center, Division of Pulmonary Medicine, Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | - Kyle M. Walsh
- Division of Neuro-epidemiology, Department of Neurosurgery, Duke University, Durham, NC, USA
| | - Steven F. Warren
- Institute for Life Span Studies, University of Kansas, Lawrence, KS, USA
| | - A. Nicole White
- Research Foundation, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, USA
| | - Guang William Wong
- Department of Physiology, Johns Hopkins University School of Medicine, Baltimore, MD, USA; McKusick-Nathans Department of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Center for Metabolism and Obesity Research, The Johns Hopkins University School of Medicine, Baltimore, Maryland, USA
| | - John S. Yi
- Division of Surgical Sciences, Department of Surgery, Duke University Medical Center, Durham, NC, USA
| |
Collapse
|
4
|
Kent RD, Eichhorn J, Wilson EM, Suk Y, Bolt DM, Vorperian HK. Auditory-Perceptual Features of Speech in Children and Adults With Down Syndrome: A Speech Profile Analysis. J Speech Lang Hear Res 2021; 64:1157-1175. [PMID: 33789057 PMCID: PMC8608145 DOI: 10.1044/2021_jslhr-20-00617] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/20/2020] [Revised: 01/02/2021] [Accepted: 01/03/2021] [Indexed: 05/17/2023]
Abstract
Purpose The aim of this study was to determine how the speech disorder profiles in Down syndrome (DS) relate to reduced intelligibility, atypical overall quality, and impairments in the subsystems of speech production (phonation, articulation, resonance, and prosody). Method Auditory-perceptual ratings of intelligibility, overall quality, and features associated with the subsystems of speech production were obtained from recordings of 79 children and adults with DS. Ratings were made for sustained vowels (62 of 79 speakers) and short sentences (79 speakers). The data were analyzed to determine the severity of the affected features in each speaking task and to detect patterns in the group data by means of principal components analysis. Results Reduced intelligibility was noted in 90% of the speakers, and atypical overall speech quality was noted in 100%. Affected speech features were distributed across the speech production subsystems. Principal components analysis revealed four components each for the vowel and sentence tasks, showing that individuals with DS are not homogeneous in the features of their speech disorder. Discussion The speech disorder in DS is complex in its perceptual features and reflects impairments across the subsystems of speech production, but the pattern is not uniform across individuals, indicating that attention must be given to individual variation in designing treatments.
Collapse
|
5
|
Kent RD, Eichhorn JT, Vorperian HK. Acoustic parameters of voice in typically developing children ages 4-19 years. Int J Pediatr Otorhinolaryngol 2021; 142:110614. [PMID: 33450527 PMCID: PMC7902385 DOI: 10.1016/j.ijporl.2021.110614] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/19/2020] [Revised: 12/31/2020] [Accepted: 12/31/2020] [Indexed: 11/26/2022]
Abstract
OBJECTIVES Report data on acoustic measures of voice in sustained vowels produced by typically developing children, aged 4-19 years, to add to the cross-sectional reference values in a pediatric database. METHODS Recordings of sustained vowel/ɑ/phonation were obtained from 158 children (80 males, 78 females) aged 4-19 years who were judged to be typically developing with respect to speech and voice. Acoustic analyses were performed with the Multidimensional Voice Program (MDVP™) and the Analysis of Dysphonia in Speech and Voice (ADSV™), both from Pentax Medical. RESULTS Values from both MDVP and ADSV are reported for children in the following age cohorts: 4-6 years, 7-9 years, 10-12 years, 13-15 years, and 16-19 years. CONCLUSION The data in this study complement previously published data and contribute to a pediatric reference database useful for research and for clinical practice related to children's voice. Acoustic parameters most sensitive to age and sex are identified.
Collapse
Affiliation(s)
- Raymond D. Kent
- Waisman Center, University of Wisconsin-Madison, 1500 Highland Ave., Madison, WI 53705
| | - Julie T. Eichhorn
- Waisman Center, University of Wisconsin-Madison, 1500 Highland Ave., Madison, WI 53705
| | - Houri K. Vorperian
- Waisman Center, University of Wisconsin-Madison, 1500 Highland Ave., Madison, WI 53705
| |
Collapse
|
6
|
Abstract
Purpose Speech production deficits and reduced intelligibility are frequently noted in individuals with Down syndrome (DS) and are attributed to a combination of several factors. This study reports acoustic data on vowel production in young adults with DS and relates these findings to perceptual analysis of speech intelligibility. Method Participants were eight young adults with DS as well as eight age- and gender-matched typically developing (TD) controls. Several different acoustic measures of vowel centralization and variability were applied to tokens of corner vowels (/ɑ/, /æ/, /i/, /u/) produced in common English words. Intelligibility was assessed for single-word productions of speakers with DS, by means of transcriptions from 14 adult listeners. Results Group differentiation was found for some, but not all, of the acoustic measures. Low vowels were more acoustically centralized and variable in speakers with DS than TD controls. Acoustic findings were associated with overall intelligibility scores. Vowel formant dispersion was the most sensitive measure in distinguishing DS and TD formant data. Conclusion Corner vowels are differentially affected in speakers with DS. The acoustic characterization of vowel production and its association with speech intelligibility scores within the DS group support the conclusion of motor control deficits in the overall speech impairment. Implications are discussed for effective treatment planning.
Collapse
Affiliation(s)
- Micalle Carl
- Program in Speech-Language-Hearing Sciences, City University of New York Graduate Center, New York
| | | | - Erika S Levy
- Teachers College, Columbia University, New York, NY
| | - D H Whalen
- Program in Speech-Language-Hearing Sciences, City University of New York Graduate Center, New York
- Haskins Laboratories, New Haven, CT
- Yale University, New Haven, CT
| |
Collapse
|
7
|
Milenkovic PH, Wagner M, Kent RD, Story BH, Vorperian HK. Effects of sampling rate and type of anti-aliasing filter on linear-predictive estimates of formant frequencies in men, women, and children. J Acoust Soc Am 2020; 147:EL221. [PMID: 32237805 PMCID: PMC7056453 DOI: 10.1121/10.0000824] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/08/2019] [Revised: 01/31/2020] [Accepted: 02/10/2020] [Indexed: 06/11/2023]
Abstract
The purpose of this study was to assess the effect of downsampling the acoustic signal on the accuracy of linear-predictive (LPC) formant estimation. Based on speech produced by men, women, and children, the first four formant frequencies were estimated at sampling rates of 48, 16, and 10 kHz using different anti-alias filtering. With proper selection of number of LPC coefficients, anti-alias filter and between-frame averaging, results suggest that accuracy is not improved by rates substantially below 48 kHz. Any downsampling should not go below 16 kHz with a filter cut-off centered at 8 kHz.
Collapse
Affiliation(s)
- Paul H Milenkovic
- Department of Electrical and Computer Engineering, University of Wisconsin-Madison, 1415 Engineering Drive, Madison, Wisconsin 53706, USA
| | - Madison Wagner
- Vocal Tract Development Laboratory, Waisman Center, University of Wisconsin-Madison, 1500 Highland Avenue, Madison, Wisconsin 53706, USA
| | - Raymond D Kent
- Vocal Tract Development Laboratory, Waisman Center, University of Wisconsin-Madison, 1500 Highland Avenue, Madison, Wisconsin 53706, USA
| | - Brad H Story
- Speech, Language, and Hearing Sciences, University of Arizona, Tucson, Arizona 85718, , , , ,
| | - Houri K Vorperian
- Vocal Tract Development Laboratory, Waisman Center, University of Wisconsin-Madison, 1500 Highland Avenue, Madison, Wisconsin 53706, USA
| |
Collapse
|
8
|
Vorperian HK, Kent RD, Lee Y, Bolt DM. Corner vowels in males and females ages 4 to 20 years: Fundamental and F1-F4 formant frequencies. J Acoust Soc Am 2019; 146:3255. [PMID: 31795713 PMCID: PMC6850954 DOI: 10.1121/1.5131271] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/21/2019] [Revised: 09/06/2019] [Accepted: 10/08/2019] [Indexed: 05/29/2023]
Abstract
The purpose of this study was to determine the developmental trajectory of the four corner vowels' fundamental frequency (fo) and the first four formant frequencies (F1-F4), and to assess when speaker-sex differences emerge. Five words per vowel, two of which were produced twice, were analyzed for fo and estimates of the first four formants frequencies from 190 (97 female, 93 male) typically developing speakers ages 4-20 years old. Findings revealed developmental trajectories with decreasing values of fo and formant frequencies. Sex differences in fo emerged at age 7. The decrease of fo was larger in males than females with a marked drop during puberty. Sex differences in formant frequencies appeared at the earliest age under study and varied with vowel and formant. Generally, the higher formants (F3-F4) were sensitive to sex differences. Inter- and intra-speaker variability declined with age but had somewhat different patterns, likely reflective of maturing motor control that interacts with the changing anatomy. This study reports a source of developmental normative data on fo and the first four formants in both sexes. The different developmental patterns in the first four formants and vowel-formant interactions in sex differences likely point to anatomic factors, although speech-learning phenomena cannot be discounted.
Collapse
Affiliation(s)
- Houri K Vorperian
- Vocal Tract Development Laboratory, Waisman Center, University of Wisconsin-Madison, 1500 Highland Avenue, Madison, Wisconsin 53705, USA
| | - Raymond D Kent
- Vocal Tract Development Laboratory, Waisman Center, University of Wisconsin-Madison, 1500 Highland Avenue, Madison, Wisconsin 53705, USA
| | - Yen Lee
- Department of Educational Psychology, University of Wisconsin-Madison, 1086 Educational, Sciences Building, 1025 West Johnson Street, Madison, Wisconsin 53706, USA
| | - Daniel M Bolt
- Department of Educational Psychology, University of Wisconsin-Madison, 1086 Educational, Sciences Building, 1025 West Johnson Street, Madison, Wisconsin 53706, USA
| |
Collapse
|
9
|
Penttilä N, Korpijaakko-Huuhka AM, Kent RD. Disfluency clusters in speakers with and without neurogenic stuttering following traumatic brain injury. J Fluency Disord 2019; 59:33-51. [PMID: 30641458 DOI: 10.1016/j.jfludis.2019.01.001] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/12/2018] [Revised: 12/01/2018] [Accepted: 01/07/2019] [Indexed: 06/09/2023]
Abstract
PURPOSE Analyze the characteristics and rate of disfluency clusters in adults with and without neurogenic stuttering after traumatic brain injury (TBI). METHOD Twenty adults with TBI participated in this study, including 10 with neurogenic stuttering (Group B) and 10 without -stuttering (Group A). Disfluency clusters in speech samples were classified into three types: Stuttering-like (SLD), other (OD), and mixed (MIX). RESULTS Speakers with and without neurogenic stuttering produced the same mean number of disfluency clusters. In addition, the mean length of clusters did not differ between these speaker groups although the longest clusters did. The most frequently occurring cluster type for people with neurogenic stuttering was MIX and OD for people without stuttering. Although the speakers in Group A produced stuttering-like disfluencies, these never occurred together to form a SLD type cluster. For Group B, the starter units of the clusters were usually stuttering-like disfluencies, while for Group A, the starter units were mostly interruptions. CONCLUSIONS Compared to non-stuttering speakers, stuttering after TBI did not increase the number of clusters, but rather lengthened them. In speakers with neurogenic stuttering, the number and length of clusters were related to the manifestation of other communication deficits, not to the frequency of stuttering-like disfluencies. Still, SLD clusters occurred only in those people with neurogenic stuttering. These findings raise questions about the nature of both neurogenic stuttering and the dynamics of disfluency clustering.
Collapse
Affiliation(s)
- Nelly Penttilä
- Faculty of Social Sciences, University of Tampere, 33610 Tampere, Finland.
| | | | - Raymond D Kent
- Waisman Center, University of Wisconsin-Madison, 53705 Madison WI, United States
| |
Collapse
|
10
|
Kent RD, Vorperian HK. Static measurements of vowel formant frequencies and bandwidths: A review. Journal of Communication Disorders 2018; 74:74-97. [PMID: 29891085 PMCID: PMC6002811 DOI: 10.1016/j.jcomdis.2018.05.004] [Citation(s) in RCA: 60] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/28/2017] [Revised: 04/23/2018] [Accepted: 05/27/2018] [Indexed: 05/05/2023]
Abstract
PURPOSE Data on vowel formants have been derived primarily from static measures representing an assumed steady state. This review summarizes data on formant frequencies and bandwidths for American English and also addresses (a) sources of variability (focusing on speech sample and time sampling point), and (b) methods of data reduction such as vowel area and dispersion. METHOD Searches were conducted with CINAHL, Google Scholar, MEDLINE/PubMed, SCOPUS, and other online sources including legacy articles and references. The primary search items were vowels, vowel space area, vowel dispersion, formants, formant frequency, and formant bandwidth. RESULTS Data on formant frequencies and bandwidths are available for both sexes over the lifespan, but considerable variability in results across studies affects even features of the basic vowel quadrilateral. Origins of variability likely include differences in speech sample and time sampling point. The data reveal the emergence of sex differences by 4 years of age, maturational reductions in formant bandwidth, and decreased formant frequencies with advancing age in some persons. It appears that a combination of methods of data reduction provide for optimal data interpretation. CONCLUSION The lifespan database on vowel formants shows considerable variability within specific age-sex groups, pointing to the need for standardized procedures.
Collapse
Affiliation(s)
- Raymond D Kent
- Waisman Center, University of Wisconsin-Madison, United States.
| | | |
Collapse
|
11
|
Workinger MS, Kent RD, Meilahn JR. The effect of botulinum toxin A (Botox) injections used to treat limb spasticity on speech patterns in children with dysarthria and cerebral palsy: A report of two cases. J Pediatr Rehabil Med 2017; 10:137-143. [PMID: 28582880 DOI: 10.3233/prm-170433] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Botulinum toxin A (Btx-A) injections are used to treat limb spasticity in children with cerebral palsy (CP) resulting in improved gross and fine motor control. This treatment has also been reported to have additional functional effects, but the effect of treatment on speech has not been reported. This report presents results of longitudinal speech evaluation of two children with CP given injections of Btx-A for treatment of limb spasticity. Speech evaluations were accomplished at baseline (date of injections) and 4- and 10-weeks post-injections. Improvements in production of consonants, loudness control, and syllables produced per breath were found. Parental survey also suggested improvements in subjects' speech production and willingness to speak outside the testing situation. Future larger studies are warranted to assess the nature of the changes observed related to Btx-A.
Collapse
Affiliation(s)
| | | | - Jill R Meilahn
- Department of Physical Medicine and Rehabilitation, Marshfield Clinic, Marshfield, WI, USA
| |
Collapse
|
12
|
Titze IR, Baken RJ, Bozeman KW, Granqvist S, Henrich N, Herbst CT, Howard DM, Hunter EJ, Kaelin D, Kent RD, Kreiman J, Kob M, Löfqvist A, McCoy S, Miller DG, Noé H, Scherer RC, Smith JR, Story BH, Švec JG, Ternström S, Wolfe J. Toward a consensus on symbolic notation of harmonics, resonances, and formants in vocalization. J Acoust Soc Am 2015; 137:3005-7. [PMID: 25994732 PMCID: PMC5392060 DOI: 10.1121/1.4919349] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/05/2023]
Affiliation(s)
- Ingo R Titze
- National Center for Voice and Speech, 136 South Main Street, Suite 320, Salt Lake City, Utah 84101, USA
| | - Ronald J Baken
- Department of Otolaryngology, New York Eye and Ear Infirmary of Mount Sinai, 310 East 14th Street, New York, New York 10003, USA
| | - Kenneth W Bozeman
- Lawrence University Conservatory of Music, 711 East Boldt Way, Appleton, Wisconsin 54911, USA
| | - Svante Granqvist
- Royal Institute of Technology (KTH), School of Technology and Health (STH), Basic Science and Biomedicine, Campus Haninge, SE-136 40 Handen, Sweden
| | - Nathalie Henrich
- GIPSA-lab, Département Parole et Cognition, Domaine Universitaire, 11 rue des Mathématiques, BP 46 38402 Saint Martin d'Hères cedex, France
| | - Christian T Herbst
- Laboratory of Biophysics, Department of Experimental Physics, Palacký University Olomouc, Czech Republic
| | - David M Howard
- Department of Electronics, University of York, Heslington, York, United Kingdom
| | - Eric J Hunter
- Communicative Sciences and Disorders, Michigan State University, 1026 Red Cedar Road, East Lansing, Michigan 48824, USA
| | - Dean Kaelin
- Dean Kaelin Vocal Studio, 2539 East 4430 Street, Holladay, Utah 84124, USA
| | - Raymond D Kent
- The Waisman Center, University of Wisconsin-Madison, 1500 Highland Avenue, Madison, Wisconsin 53705, USA
| | - Jody Kreiman
- Head and Neck Surgery, University of California-Los Angeles School of Medicine, 1000 Veteran Avenue, Los Angeles, California 90095, USA
| | - Malte Kob
- Musikalische Akustik and Theorie der Musikübertragung Erich-Thienhaus-Institut Hochschule für Musik Detmold, Neustadt 22, 32756 Detmold, Germany
| | - Anders Löfqvist
- Haskins Laboratories, Yale University, 300 George Street, Suite 900, New Haven, Connecticut 06511, USA
| | - Scott McCoy
- School of Music, Ohio State University, 304 Mershon Auditorium, 1871 North High Street, Columbus, Ohio 43201, USA
| | - Donald G Miller
- Voce Vista, Achterste Kamp 9, 9301 RB Roden, the Netherlands
| | - Hubert Noé
- Klostergasse 30, 8280 Fürstenfeld, Austria
| | - Ronald C Scherer
- Communication Sciences and Disorders, Bowling Green State University, 200 Health and Human Services Building, Bowling Green, Ohio 43403, USA
| | - John R Smith
- School of Physics, University of New South Wales, Sydney, New South Wales 2052, Australia
| | - Brad H Story
- Speech, Language, and Hearing Sciences, University of Arizona, P.O. Box 210071, Tucson, Arizona 85721, USA
| | - Jan G Švec
- Department of Biophysics, Palacký University, Olomouc, the Czech Republic
| | - Sten Ternström
- Department of Speech, Music and Hearing (KTH), Lindstedtsvägen 24, S-100 44 Stockholm, Sweden
| | - Joe Wolfe
- School of Physics, University of New South Wales, Sydney 2052, Australia
| |
Collapse
|
13
|
Abstract
Clinical Linguistics & Phonetics (CLP) and its namesake field have accomplished a great deal in the last quarter of a century. The success of the journal parallels the growth and vitality of the field it represents. The markers of journal achievement are several, including increased number of journal pages published annually; greater diversity of topics related to the core mission of the journal; expanding cross-language coverage; and healthy interactions among editors, reviewers and contributors; and - for better or worse - journal impact factors. A journal is in a competitive dynamic with other journals that share its general domain of scholarship, which is a major reason why an apparent imbalance may emerge in the topic content of any particular journal. The content of a journal is determined by the nature and number of submitted manuscripts. As far as linguistic content goes, CLP's centre of gravity appears to have been mostly in phonology and phonetics, but certainly not to the exclusion of syntax, semantics and pragmatics. The clinical scope is broad, both in terms of concepts and types of disorder. CLP has secured its place among journals in the field, and it is an outlet of choice for many researchers throughout the world.
Collapse
Affiliation(s)
- Raymond D Kent
- Waisman Center, University of Wisconsin-Madison, Madison, WI 53705-2280, USA.
| |
Collapse
|
14
|
Kim Y, Kent RD, Weismer G. An acoustic study of the relationships among neurologic disease, dysarthria type, and severity of dysarthria. J Speech Lang Hear Res 2011; 54:417-429. [PMID: 20884780 DOI: 10.1044/1092-4388(2010/10-0020)] [Citation(s) in RCA: 65] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]
Abstract
PURPOSE This study examined acoustic predictors of speech intelligibility in speakers with several types of dysarthria secondary to different diseases and conducted classification analysis solely by acoustic measures according to 3 variables (disease, speech severity, and dysarthria type). METHOD Speech recordings from 107 speakers with dysarthria due to Parkinson's disease, stroke, traumatic brain injury, and multiple system atrophy were used for acoustic analysis and for perceptual judgment of speech intelligibility. Acoustic analysis included 8 segmental/suprasegmental features: 2nd formant frequency slope, articulation rate, voiceless interval duration, 1st moment analysis for fricatives, vowel space, F0, intensity range, and Pairwise Variability Index. RESULTS The results showed that (a) acoustic predictors of speech intelligibility differed slightly across diseases and (b) classification accuracy by dysarthria type was typically worse than by disease type or severity. CONCLUSIONS These findings were discussed with respect to (a) the relationship between acoustic characteristics and speech intelligibility and (b) dysarthria classification.
Collapse
Affiliation(s)
- Yunjung Kim
- Louisiana State University, Baton Rouge, USA.
| | | | | |
Collapse
|
15
|
Kent RD. In Memoriam: Thomas J. Hixon, PhD. J Voice 2010. [DOI: 10.1016/j.jvoice.2009.07.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
16
|
Abstract
The early development of vocalic and consonantal production in Mandarin-learning infants was studied at the transition from babbling to producing first words. Spontaneous vocalizations were recorded for 24 infants grouped by age: G1 (0 ; 7 to 1 ; 0) and G2 (1 ; 1 to 1 ; 6). Additionally, the infant-directed speech of 24 caregivers was recorded during natural infant-adult interactions to infer language-specific effects. Data were phonetically transcribed according to broad categories of vowels and consonants. Vocalic development, in comparison with reports for children of other linguistic environments, exhibited two universal patterns: the prominence of [symbol: see text] and [symbol: see text], and the predominance of low and mid vowels over high vowels. Language-specific patterns were also found, e.g. the early appearance and acquisition of low vowels [symbol: see text]. Vowel production was similar in G1 and G2, and a continuum of developmental changes brought infants' vocalization closer to the adult model. Consonantal development showed two universal patterns: labials and alveolars occurred more frequently than velars; and nasals developed earlier than fricatives, affricates and liquids. We also found two language-specific patterns: alveolars were more prominent than labials and affricates developed early. Universal and language-specific characteristics in G1 continued to be prominent in G2. These data indicate that infants are sensitive to the ambient language at an early age, and this sensitivity influences the nature of their vocalizations.
Collapse
|
17
|
Milenkovic PH, Yaddanapudi S, Vorperian HK, Kent RD. Effects of a curved vocal tract with grid-generated tongue profile on low-order formants. J Acoust Soc Am 2010; 127:1002-1013. [PMID: 20136222 PMCID: PMC2830264 DOI: 10.1121/1.3277214] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/04/2008] [Revised: 10/22/2009] [Accepted: 11/30/2009] [Indexed: 05/28/2023]
Abstract
A hyperbolic grid-generation algorithm allows investigation of the effect of vocal-tract curvature on low-order formants. A smooth two-dimensional (2D) curve represents the combined lower lip, tongue, and anterior pharyngeal wall profile as displacements from the combined upper lip, palate, and posterior pharyngeal wall outline. The algorithm is able to generate tongue displacements beyond the local radius of strongly curved sections of the palate. The 2D grid, along with transverse profiles of the lip, oral-pharyngeal, and epilarynx regions, specifies a vocal conduit from which an effective area function may be determined using corrections to acoustic parameters resulting from duct curvature; the effective area function in turn determines formant frequencies through an acoustic transmission-line calculation. Results of the corrected transmission line are compared with a three-dimensional finite element model. The observed effects of the curved vocal tract on formants F1 and F2 are in order of importance, as follows: (1) reduction in midline distances owing to curvature of the palate and the bend joining the palate to the pharynx, (2) the curvature correction to areas and section lengths, and (3) adjustments to the palate-tongue distance required to produce smooth tongue shapes at large displacements from the palate.
Collapse
Affiliation(s)
- Paul H Milenkovic
- Department of Electrical and Computer Engineering, University of Wisconsin-Madison, 1415 Engineering Drive, Madison, Wisconsin 53706, USA.
| | | | | | | |
Collapse
|
18
|
Abstract
OBJECTIVE This study investigated the distribution of second-formant (F2) slopes in a relatively large number of speakers with dysarthria associated with two different underlying diseases. PATIENTS AND METHODS Forty speakers with dysarthria (20 with Parkinson's disease, PD; 20 with stroke) and 5 control speakers without a history of neurological disease were asked to repeat six words (coat, hail, sigh, shoot, row and wax) 10 times. Acoustic analysis was performed to derive F2 slope, and speech intelligibility data were collected using a direct magnitude estimate technique to examine its relationship to F2 slope. RESULTS Statistical analysis revealed that both clinical groups showed significantly reduced F2 slopes compared to healthy speakers for all words but row. No group difference was found between speakers with PD and stroke; however, different words showed varying sensitivity to the speech motor control problems. The F2 slopes of only two words, shoot and wax, were significantly correlated with scaled speech intelligibility. CONCLUSION The findings support the idea that distributional characteristics of acoustic variables, such as F2 slope, could be used to develop a quantitative metric of severity of speech motor control deficits in dysarthria, when the materials are appropriately selected and additional distributional characteristics are studied.
Collapse
Affiliation(s)
- Yunjung Kim
- Department of Communication Sciences and Disorders, Louisiana State University, Baton Rouge, LA 70803, USA.
| | | | | | | |
Collapse
|
19
|
|
20
|
Abstract
Early prosodic development (f0 variation) was systematically measured in Mandarin-learning infants at the transition from babbling to producing first words. Spontaneous vocalizations of twenty-four infants aged 0 ; 7 to 1 ; 6 were recorded in 45-minute sessions. The speech production of twenty-four caregivers was also audio-recorded during caregiver-infant natural daily interactions at home. All recordings were transcribed using broad prosodic patterns. Analysis revealed four major findings: (1) falling f0 contours were more prominently produced than level and rising contours; (2) high prosodic patterns occurred more frequently than mid and low prosodic patterns; (3) these distribution patterns of f0 contours showed significant similarities in babbling and early words; and (4) these distribution patterns were also similar in infants' and their caregivers' data.
Collapse
|
21
|
Abstract
PURPOSE Darley, Aronson, and Brown (1969a, 1969b) detailed methods and results of auditory-perceptual assessment for speakers with dysarthrias of varying etiology. They reported adequate listener reliability for use of the rating system as a tool for differential diagnosis, but several more recent studies have raised concerns about listener reliability using this approach. METHOD In the present study, the authors examined intrarater and interrater agreement for perceptual ratings of 47 speakers with various dysarthria types by 2 listener groups (inexperienced and experienced). The entire set of perceptual features proposed by Darley et al. was rated based on a 40-s conversational speech sample. RESULTS No differences in levels of agreement were found between the listener groups. Agreement was within 1 scale value or better for 67% of the pairwise comparisons. Levels of agreement were lower when the average rating fell in the mid-range of the scale compared with samples that had an average rating near either of the scale endpoints; agreement was above chance level. No significant differences in agreement were found between the perceptual features. DISCUSSION The levels of listener agreement that were found indicate that auditory-perceptual ratings show promise during clinical assessment for identifying salient features of dysarthria for speakers with various etiologies.
Collapse
|
22
|
Rosen KM, Kent RD, Delaney AL, Duffy JR. Parametric quantitative acoustic analysis of conversation produced by speakers with dysarthria and healthy speakers. J Speech Lang Hear Res 2006; 49:395-411. [PMID: 16671852 DOI: 10.1044/1092-4388(2006/031)] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/21/2005] [Revised: 08/23/2005] [Accepted: 08/25/2005] [Indexed: 05/09/2023]
Abstract
PURPOSE This study's main purpose was to (a) identify acoustic signatures of hypokinetic dysarthria (HKD) that are robust to phonetic variation in conversational speech and (b) determine specific characteristics of the variability associated with HKD. METHOD Twenty healthy control (HC) participants and 20 participants with HKD associated with idiopathic Parkinson's disease (PD) repeated 3 isolated sentences (controlled phonetic content) and 2 min of conversational speech (phonetic content treated as a random variable). A MATLAB-based program automatically calculated measures of contrastivity: speech-pause ratio, intensity variation, median and maximum formant slope, formant range, change in the upper and lower spectral envelope, and range of the spectral envelope. t tests were used to identify which measures were sensitive to HKD and which measures differed by task. Discriminant analysis was used to identify the combination of measures that best predicted HKD, and this analysis was then used as a general measure of contrastivity (Contrastivity Index). Differential effects of HKD on maximum and typical contrastivity levels were tested with interaction of maximum, minimum, and median observations of individual speakers and with pairwise comparisons of skewness and kurtosis of the contrastivity index distributions. RESULTS Group differences were detected with pairwise comparisons with t tests in 8 of the 9 measures. Percentage pause time and spectral range were identified as the most specific (95%) and accurate (95%) differentiators of HKD and HC conversational speech. Sentence repetition elicited significantly higher levels of contrastivity than conversational speech in both HC and HKD speakers. Maximum and minimum contrastivities were significantly lower in HKD speech, but there was no evidence that HKD affects maximum contrastivity levels more than median contrastivity levels. The HKD speakers' contrastivity distributions were significantly more skewed to lower levels of production. CONCLUSION HKD can be consistently distinguished from HC speech in both sentence repetition and conversational speech on the basis of intensity variation and spectral range. Although speakers with HKD were effectively able to produce higher contrastivity levels in sentence repetition tasks, they habitually performed closer to the lower end of their production ranges.
Collapse
Affiliation(s)
- Kristin M Rosen
- Waisman Research, Center University of Wisconsin--Madison, 53705, USA.
| | | | | | | |
Collapse
|
23
|
Potter NL, Kent RD, Lindstrom MJ, Lazarus JAC. Power and precision grip force control in three-to-five-year-old children: velocity control precedes amplitude control in development. Exp Brain Res 2006; 172:246-60. [PMID: 16432697 DOI: 10.1007/s00221-005-0322-5] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2005] [Accepted: 12/05/2005] [Indexed: 11/27/2022]
Abstract
The aim of this study was to examine the development of underlying motor control strategies in young children by characterizing the changes in performance of a visually guided force regulation task using two different grip formations; a whole-hand power grip (developmentally easier) and a thumb-index finger precision grip (developmentally more advanced). Typically developing preschool children (n=50, 3.0-5.5 years) used precision and power grips to perform a ramp and hold task with their dominant and non-dominant hands. Participants performed five trials with each hand and grip holding the force at 30% of their maximum volitional contraction for 3 s. The data were examined for both age-related and performance-related changes in motor performance. Across ages, children increased in strength, decreased in initial overshoot of the target force level, and decreased in rate of force release. Results of a cluster analysis suggest non-linear changes in the development of force control in preschool children, with a plateau in (or maturation of) velocity measures (rate of force increase and force decrease) earlier than in amplitude-related measures (initial force overshoot and force variability).
Collapse
Affiliation(s)
- Nancy L Potter
- Department of Speech and Hearing Sciences, Washington State University, Spokane, WA 99210-1495, USA.
| | | | | | | |
Collapse
|
24
|
Abstract
Most studies on CV co-occurrence patterns in early phonetic development have been based on Indo-European languages. Data from infants learning Mandarin, which has a substantially different phonological system from Indo-European languages, can confirm or refute the findings of previous studies, thus shedding further light on the theoretical bases of CV association. Spontaneous vocalizations were recorded in 45-minute sessions for each of 24 Mandarin-learning infants aged 0;7 to 1;6. In addition, the speech production of 24 caregivers was audio-recorded during their natural daily interactions with the infants at home. Both infants' vocalizations and adults' speech were transcribed and analysed for consonant-vowel co-occurrence patterns. These patterns were similar to those in other language groups, but language-specific patterns were evident by 1;0. Combinations of alveolars+front vowels and velars+back vowels confirm Davis & MacNeilage's (1990, 1995) frame-then-content theory and Clements's (1991) model of unified features for consonants and vowels. However, our finding of a language-specific pattern (labials+back vowels) suggests the need to reexamine the 'pure frame' of Davis & MacNeilage and Clements's grouping of labials and rounded vowels.
Collapse
|
25
|
Abstract
This study examines intensity decay in the phonation of persons with Parkinson's disease (PD). The decline in vocal intensity (determined by linear regression of the intensity envelope) was compared across the following tasks: vowel prolongation, syllable repetition (diadochokinesis, DDK), isolated sentences and conversation. In contrast to previous studies, PD speakers demonstrated no significant differences in intensity decline from healthy speakers in vowel prolongation. The vocal intensity of speakers with PD declined more rapidly than that of controls in DDK tasks. While intensity slopes in conversation were more variable in both groups, some participants with PD exhibited abrupt changes in intensity. Results indicate that the DDK is particularly useful for describing intensity decay associated with PD. However, considering the inconsistent group differences across tasks, and the discrepant findings from previous studies, intensity decay is not a robust symptom of PD.
Collapse
|
26
|
Shriberg LD, Kent RD, Karlsson HB, McSweeny JL, Nadler CJ, Brown RL. A diagnostic marker for speech delay associated with otitis media with effusion: backing of obstruents. Clin Linguist Phon 2003; 17:529-547. [PMID: 14608798 DOI: 10.1080/0269920031000138132] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
A companion paper in this issue reported diagnostic accuracy findings for a marker (the Intelligibility-Speech Gap) to identify speech delay associated with otitis media with effusion (SD-OME). The present paper reports findings for another possible diagnostic marker for SD-OME--Backing of Obstruents. Conversational speech samples and citation forms from 48 speech-delayed children with positive and negative histories for OME were analysed using both perceptual and acoustic methods. The perceptual findings indicated significant trends for backing to be more prevalent in children with positive compared to negative histories of OME. Among a number of candidate speech error variables, lowered first spectral moments on lingual stops and sibilant fricatives as obtained from moments analysis emerged as a promising acoustic correlate of backing. Positive and negative predictive values, and sensitivity and specificity values for the acoustic marker, were above 75% for each of three stimulus words targeting /k/, /z/ and /f/. Discussion considers alternative explanatory perspectives on the ontogenetic development of backing in children who have experienced the fluctuant hearing loss associated with early recurrent OME.
Collapse
Affiliation(s)
- Lawrence D Shriberg
- Phonology Project, Waisman Center, University of Wisconsin-Madison, 1500 Highland Avenue, Madison, WI 53705, USA.
| | | | | | | | | | | |
Collapse
|
27
|
Abstract
Previous investigations indicated that pauses in ataxic dysarthric speech are abnormal, but little is known about the nature of their variation or whether it can be characterized by a common distribution. The following types of pauses were measured in spontaneous monologues of 16 speakers with ataxic dysarthria: pauses that contained evidence of respiration, pauses that occurred between words, between phonemes, within phonemes, and those associated with stop consonants. Results showed that the duration of pauses not associated with stop consonants can be modeled with two distinct lognormal distributions. The lognormal distribution predicted duration of across types within 1.67% standard error, with the exception of pauses that occurred within phonemes. Distributional parameters pointed to some fundamental similarities between pauses that occur within words. The robustness of the lognormal distribution across pause types strongly supports this distribution as a viable and useful method of analysis. We hope this work will encourage others to consider the lognormal distribution.
Collapse
Affiliation(s)
- Kristin M Rosen
- Waisman Center, University of Wisonsin, 1500 Highland Avenue, Madison, WI 53705-2280, USA.
| | | | | |
Collapse
|
28
|
Abstract
Phonatory dysfunction is a frequent component of dysarthria and often is a primary feature noted in clinical assessment. But the vocal impairment can be difficult to assess because (a). the analysis of voice disorder of any kind can be challenging, and (b). the voice disorder in dysarthria often occurs along with other impairments affecting articulation, resonance, and respiration. A promising assessment tool is multi-parameter acoustic analysis, such as the Multi-Dimensional Voice Program (MDVP). Part 1 of this paper recommends procedures and standards for the acoustic analysis of voice, including (1). selection of the sample to be analyzed, (2). signal quality requirements, (3). availability of normative data for both genders and different ages of speakers, (4). reliability of analysis, and (5). correlation of acoustic results with results from other methods of analysis. In Part 2, acoustic data are reviewed for the dysarthria associated with Parkinson disease (PD), cerebellar disease, amyotrophic lateral sclerosis (ALS), traumatic brain injury (TBI), unilateral hemispheric stroke, and essential tremor. Tentative profiles of voice disorder are described for these conditions. These profiles may serve as hypotheses for future research. Although several issues remain to be resolved in the acoustic analysis of voice disorder in dysarthria, steps can be taken now to promote the reliability, validity, and clinical utility of such analyses. (1). As a result of this activity, the participant will be able to describe ways in which an optimal multi-dimensional analysis of voice can be performed with modern acoustic analysis systems. (2). As a result of this activity, the participant will be able to apply multi-dimensional acoustic analysis of voice to individuals who have a dysarthria-related voice disorder. (3). As a result of this activity, the participant will be able to identify major sources of normative data on the Multi-Dimensional Voice Program.
Collapse
Affiliation(s)
- R D Kent
- Waisman Center, University of Wisconsin-Madison, Rm 435, 1500 Highland Avenue, Madison, WI 53705-2280, USA.
| | | | | | | |
Collapse
|
29
|
Abstract
This study sought to develop a quantitative kinematic description of tongue movement for liquid swallowing in a group of 12 healthy subjects. X-ray microbeam technology was used to track the positions of six small pellets attached to the tongue and jaw while subjects swallowed water at 2- and 10-mL bolus volumes. A feature common to all subjects was a prominent rostral movement of the dorsal region of the tongue. In addition, all subjects consistently increased the displacement and maximum speed of this tongue movement with increased bolus volume. However, detailed movement analysis showed a variety of tongue movement patterns for the group. This variability across subjects was large enough that it was surprisingly difficult to provide a low-dimension quantitative description of the tongue kinematics during liquid swallowing.
Collapse
Affiliation(s)
- Stephen M Tasko
- Army Audiology and Speech Center, Walter Reed Army Medical Center, Washington, DC 20307-5001, USA.
| | | | | |
Collapse
|
30
|
Kent RD, Duffy JR, Slama A, Kent JF, Clift A. Clinicoanatomic studies in dysarthria: review, critique, and directions for research. J Speech Lang Hear Res 2001; 44:535-551. [PMID: 11407559 DOI: 10.1044/1092-4388(2001/042)] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
More than 30 years ago, Darley, Aronson, and Brown (1969) proposed clinicoanatomic correlations for seven perceptual types of dysarthria. These correlations have not been systematically re-examined even though imaging technologies developed in recent years provide the means to do so. This review considers data from published imaging studies as well as data from selected medical interventions to evaluate the current state of knowledge that relates lesion site to the nature of a speech disturbance. Although the extant data are not sufficient to allow a complete evaluation of the seven types of dysarthria described by Darley et al., relevant information has been reported on lesions of the pyramidal pathway, extrapyramidal pathway, and cerebellum. In general, the results are best explained by an equivalence mode of brain-behavior relationship in which a type of dysarthria is associated with a lesion in one of two or more brain structures. Criteria also are proposed for future studies of clinicoanatomic relationships in neurogenic communication disorders.
Collapse
Affiliation(s)
- R D Kent
- University of Wisconsin-Madison, 53705-2280, USA.
| | | | | | | | | |
Collapse
|
31
|
Weismer G, Jeng JY, Laures JS, Kent RD, Kent JF. Acoustic and intelligibility characteristics of sentence production in neurogenic speech disorders. Folia Phoniatr Logop 2001; 53:1-18. [PMID: 11125256 DOI: 10.1159/000052649] [Citation(s) in RCA: 200] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
The purpose of this study was to examine the relationship between scaled speech intelligibility and selected acoustic variables in persons with dysarthria. Control speakers and speakers with amyotrophic lateral sclerosis (ALS) and Parkinson's disease (PD) produced sentences which were analyzed acoustically and perceptually. The acoustic variables included total utterance durations, segment durations, estimates of the acoustic vowel space, and slopes of formant transitions; the perceptual variables included scaled speech intelligibility and severity of speech involvement. Results indicated that the temporal variables typically differentiated the ALS group, but not the PD group, from the controls, and that vowel spaces were smaller for both neurogenic groups as compared to controls, but only significantly so for the ALS speakers. The relation of these acoustic measures to scaled speech intelligibility is shown to be complex, and the composite results are discussed in terms of sentence vs. single-word intelligibility estimates and their underlying acoustic bases.
Collapse
Affiliation(s)
- G Weismer
- Department of Communicative Disorders, Waisman Center, University of Wisconsin-Madison, Madison, WI 53706, USA.
| | | | | | | | | |
Collapse
|
32
|
Abstract
Although ataxic dysarthria has been studied with various methods in several languages, questions remain concerning which features of the disorder are most consistent, which speaking tasks are most sensitive to the disorder, and whether the different speech production subsystems are uniformly affected. Perceptual and acoustic data were obtained from 14 individuals (seven men, seven women) with ataxic dysarthria for several speaking tasks, including sustained vowel phonation, syllable repetition, sentence recitation, and conversation. Multidimensional acoustic analyses of sustained vowel phonation showed that the largest and most frequent abnormality for both men and women was a long-term variability of fundamental frequency. Other measures with a high frequency of abnormality were shimmer and peak amplitude variation (for both sexes) and jitter (for women). Syllable alternating motion rate (AMR) was typically slow and irregular in its temporal pattern. In addition, the energy maxima and minima often were highly variable across repeated syllables, and this variability is thought to reflect poorly coordinated respiratory function and inadequate articulatory/voicing control. Syllable rates tended to be slower for sentence recitation and conversation than for AMR, but the three rates were highly similar. Formant-frequency ranges during sentence production were essentially normal, showing that articulatory hypometria is not a pervasive problem. Conversational samples varied considerably across subjects in intelligibility and number of words/ morphemes in a breath group. Qualitative analyses of unintelligible episodes in conversation showed that these samples generally had a fairly well-defined syllable pattern but subjects differed in the degree to which the acoustic contrasts typical of consonant and vowel sequences were maintained. For some individuals, an intelligibility deficit occurred in the face of highly distinctive (and contrastive) acoustic patterns.
Collapse
Affiliation(s)
- R D Kent
- University of Wisconsin-Madison, 53705-2280, USA.
| | | | | | | | | | | |
Collapse
|
33
|
Abstract
This paper reviews issues in speech motor control and a class of communication disorders known as motor speech disorders. Speech motor control refers to the systems and strategies that regulate the production of speech, including the planning and preparation of movements (sometimes called motor programming) and the execution of movement plans to result in muscle contractions and structural displacements. Traditionally, speech motor control is distinguished from phonologic operations, but in some recent phonologic theories, there is a deliberate blurring of the boundaries between phonologic representation and motor functions. Moreover, there is continuing discussion in the literature as to whether a given motor speech disorder (especially apraxia of speech and stuttering) should be understood at the phonologic level, the motoric level, or both of these. The motor speech disorders considered here include: the dysarthrias, apraxia of speech, developmental apraxia of speech, developmental stuttering, acquired (neurogenic and psychogenic) stuttering, and cluttering.
Collapse
Affiliation(s)
- R D Kent
- Waisman Center, University of Wisconsin-Madison, 53705-2280, USA.
| |
Collapse
|
34
|
Weismer G, Laures JS, Jeng JY, Kent RD, Kent JF. Effect of speaking rate manipulations on acoustic and perceptual aspects of the dysarthria in amyotrophic lateral sclerosis. Folia Phoniatr Logop 2000; 52:201-19. [PMID: 10965174 DOI: 10.1159/000021536] [Citation(s) in RCA: 73] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
The current study explored the acoustic and perceptual effects of speaking rate adjustments in persons with amyotrophic lateral sclerosis (ALS) and in neurologically normal individuals. Sentence utterances were obtained from the participants at two self-selected speaking rates: habitual and fast. Total utterance durations, segment durations, and vowel formant frequencies comprised the acoustic measures, whereas magnitude estimates of speech intelligibility and severity of speech involvement were the perceptual measures. Results showed that participants in both the neurologically normal and ALS groups were able to increase their speaking rate when asked to do so, but that the participants with ALS were significantly slower than the neurologically normal participants at both rates. Both groups of participants also showed compression of the acoustic vowel space with increased speaking rate, with the vowel spaces of participants with ALS generally being more compressed than the vowel spaces of neurologically normal participants, at either rate. Most importantly, the perceptual measures failed to show any effect of the speaking rate adjustment on scaled intelligibility or severity, for either group. These findings are discussed relative to the general issue of slow habitual speaking rates among many speakers with dysarthria, and possible explanations for the slowness. The lack of an effect of increased rate on the perception of the speech deficit among speakers with ALS argues against the idea that the habitually slow rates are a form of compensation to reduce the complexity of speech production.
Collapse
Affiliation(s)
- G Weismer
- Department of Communicative Disorders, Waisman Center, University of Wisconsin-Madison, Madison, WI 53706, USA.
| | | | | | | | | |
Collapse
|
35
|
Callan DE, Kent RD, Guenther FH, Vorperian HK. An auditory-feedback-based neural network model of speech production that is robust to developmental changes in the size and shape of the articulatory system. J Speech Lang Hear Res 2000; 43:721-736. [PMID: 10877441 DOI: 10.1044/jslhr.4303.721] [Citation(s) in RCA: 55] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
The purpose of this article is to demonstrate that self-produced auditory feedback is sufficient to train a mapping between auditory target space and articulator space under conditions in which the structures of speech production are undergoing considerable developmental restructuring. One challenge for competing theories that propose invariant constriction targets is that it is unclear what teaching signal could specify constriction location and degree so that a mapping between constriction target space and articulator space can be learned. It is predicted that a model trained by auditory feedback will accomplish speech goals, in auditory target space, by continuously learning to use different articulator configurations to adapt to the changing acoustic properties of the vocal tract during development. The Maeda articulatory synthesis part of the DIVA neural network model (Guenther et al., 1998) was modified to reflect the development of the vocal tract by using measurements taken from MR images of children. After training, the model was able to maintain the 11 English vowel targets in auditory planning space, utilizing varying articulator configurations, despite morphological changes that occur during development. The vocal-tract constriction pattern (derived from the vocal-tract area function) as well as the formant values varied during the course of development in correspondence with morphological changes in the structures involved with speech production. Despite changes in the acoustical properties of the vocal tract that occur during the course of development, the model was able to demonstrate motor-equivalent speech production under lip-restriction conditions. The model accomplished this in a self-organizing manner even though there was no prior experience with lip restriction during training.
Collapse
Affiliation(s)
- D E Callan
- ATR Human Information Processing Research Laboratories, Kyoto, Japan.
| | | | | | | |
Collapse
|
36
|
Bunton K, Kent RD, Kent JF, Rosenbek JC. Perceptuo-acoustic assessment of prosodic impairment in dysarthria. Clin Linguist Phon 2000; 14:13-24. [PMID: 22091695 DOI: 10.1080/026992000298922] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Dysprosody was studied in four groups of male subjects: subjects with amyotrophic lateral sclerosis (ALS) and mild intelligibility impairment, subjects with ALS and a more severe intelligibility loss, subjects with cerebellar disease, and neurologically normal controls. Dysprosody was assessed with perceptual ratings and acoustic measures pertaining to the regulation of duration, ƒ(0), and intensity within tone units of conversational samples. Intelligibility reduction and prosodic disturbance were not necessarily equally impaired in all subjects, and it is concluded that these are complementary indices of severity of dysarthria. Compared to the neurologically normal control group, the clinical groups tended to decrease the overall duration of tone units, produce fewer words in a tone unit, and use smaller variations in ƒ(0). Recommendations are offered for the assessement of.
Collapse
|
37
|
Abstract
The dysarthrias are associated with a variety of motor disturbances distributed over several motor systems of speech production. The features of a given dysarthria often vary with the speaking task, and this task-dependency affords insights into the responsible neural lesion and its effects on the motor regulation of speech. Each task also is amenable to quantitative analyses with acoustic or physiologic methods, and these analyses may redefine the value of these speaking tasks. This article considers task-based analyses for the dysarthrias associated with Parkinson's disease, cerebellar disease, and stroke.
Collapse
Affiliation(s)
- R D Kent
- Waisman Center, University of Wisconsin, Madison, WI 53705-2280, USA.
| | | |
Collapse
|
38
|
Vorperian HK, Kent RD, Gentry LR, Yandell BS. Magnetic resonance imaging procedures to study the concurrent anatomic development of vocal tract structures: preliminary results. Int J Pediatr Otorhinolaryngol 1999; 49:197-206. [PMID: 10519699 DOI: 10.1016/s0165-5876(99)00208-6] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
The vocal tract structures undergo drastic anatomic restructuring during the course of development from infancy to adulthood. This study demonstrates the feasibility of using MRI to examine the growth processes of the vocal tract. This method affords precise and detailed visualization of the soft tissues in the oro-pharyngeal region, while also providing images of related bony and cartilaginous structures. Information on anatomic restructuring contributes to the understanding of how speech emerges and develops, and it also establishes normative information that can be used in the assessment of developmental anomalies. This paper describes the method used to measure and examine the concurrent anatomic development of the various vocal tract structures during early childhood. Preliminary results from two pediatric subjects indicate that there is synchrony of growth in the different structures-both soft and hard tissues-, and that such synchronous growth appears to persist during periods of growth spurts.
Collapse
Affiliation(s)
- H K Vorperian
- Department of Communicative Disorders, Waisman Center, University of Wisconsin-Madison, USA.
| | | | | | | |
Collapse
|
39
|
Abstract
(1) The reader will be able to describe the major types of acoustic analysis available for the study of speech, (2) specify the components needed for a modern speech analysis laboratory, including equipment for recording and analysis, and (3) list possible measurements for various aspects of phonation, articulation and resonance, as they might be manifest in neurologically disordered speech.
Collapse
Affiliation(s)
- R D Kent
- Waisman Center, University of Wisconsin-Madison 53705-2280, USA.
| | | | | | | | | |
Collapse
|
40
|
Abstract
The goal of this research was to train a self-organizing map (SOM) on various acoustic measures (amplitude perturbation quotient, degree of voice breaks, rahmonic amplitude, soft phonation index, standard deviation of the fundamental frequency, and peak amplitude variation) of the sustained vowel /a/ to enhance visualization of the multidimensional nonlinear regularities inherent in the input data space. The SOM was trained using 30 spasmodic dysphonia exemplars, 30 pretreatment functional dysphonia exemplars, 30 post-treatment functional dysphonia exemplars, and 30 normal voice exemplars. After training, the classification performance of the SOM was evaluated. The results indicated that the SOM had better classification performance than that of a stepwise discriminant analysis over the original data. Analysis of the weight values across the SOM, by means of stepwise discriminant analysis, revealed the relative importance of the acoustic measures in classification of the various groups. The SOM provided both an easy way to visualize multidimensional data, and enhanced statistical predictability at distinguishing between the various groups (over that conducted on the original data set). We regard the results of this study as a promising initial step into the use of SOMs with multiple acoustic measures to assess phonatory function.
Collapse
Affiliation(s)
- D E Callan
- Department of Communicative Disorders, University of Wisconsin-Madison, USA.
| | | | | | | |
Collapse
|
41
|
Kent RD. Neuroimaging studies of brain activation for language, with an emphasis on functional magnetic resonance imaging: a review. Folia Phoniatr Logop 1998; 50:291-304. [PMID: 9925952 DOI: 10.1159/000021471] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
Neuroimaging studies have greatly enhanced the potential to understand brain-behavior relationships in complex behaviors such as language. The method of functional magnetic resonance imaging (fMRI) is one of the newest tools for neuroimaging, and it will in all likelihood contribute substantially to new knowledge about brain activation for language processing. This review summarizes basic information about fMRI, including principles of operation, experimental pitfalls and examples of application to language.
Collapse
Affiliation(s)
- R D Kent
- Waisman Center, University of Wisconsin-Madison, 53705-2250, USA.
| |
Collapse
|
42
|
Hustad KC, Kent RD, Beukelman DR. DECTalk and MacinTalk speech synthesizers: intelligibility differences for three listener groups. J Speech Lang Hear Res 1998; 41:744-752. [PMID: 9712123 DOI: 10.1044/jslhr.4104.744] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
This study examined word level intelligibility differences between DECTalk and MacinTalk speech synthesizers using the Modified Rhyme Test in an open format transcription task. Three groups of listeners participated: inexperienced, speech-language pathologists, and speech synthesis experts. Results for between-subjects ANOVA showed that the expert group correctly identified a significantly higher number of words than each of the other listener groups. For the within-subjects factor of voice, simple effects ANOVA and post hoc contrasts within each group showed that listeners had higher intelligibility scores for the DECTalk male voice, Perfect Paul, than for the MacinTalk male voice, Bruce. No other pairwise gender/age-matched differences were found between the two synthesizers.
Collapse
Affiliation(s)
- K C Hustad
- Department of Special Education and Communication Disorders, University of Nebraska, Lincoln 68583-0731, USA.
| | | | | |
Collapse
|
43
|
Kent RD. Insights from memoirs of illness and disability. ASHA 1998; 40:22-5. [PMID: 9670572] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Affiliation(s)
- R D Kent
- University of Wisconsin-Madison, USA.
| |
Collapse
|
44
|
Abstract
Cerebellar disease affects a number of skilled movements, including those in speech. Ataxic dysarthria, the speech disorder that typically accompanies cerebellar disease, was studied by acoustic methods. Control subjects and subjects with ataxic dysarthria were recorded while performing a number of speaking tasks, including sustained vowel phonation, syllable repetition, monosyllabic word production (intelligibility test), sentence recitation, and conversation. Acoustic data derived from the speech samples confirmed the hypothesis that temporal dysregulation is a primary component of the speech disorder. The data also show that the nature of the disorder varies with the speaking task. This result agrees with observations on other motor systems in subjects with cerebellar disease and may be evidence of a dissociation of impairments. Suggestions are offered on the selection of measures for a given task and on the role of the cerebellum in the regulation of speaking.
Collapse
Affiliation(s)
- R D Kent
- Waisman Center, University of Wisconsin-Medison 53705-2280, USA
| | | | | | | | | |
Collapse
|
45
|
Buder EH, Kent RD, Kent JF, Milenkovic P, Workinger M. Formoffa: An automated formant, moment, fundamental frequency, amplitude analysis of normal and disordered speech. Clin Linguist Phon 1996; 10:31-54. [PMID: 20426513 DOI: 10.3109/02699209608985160] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]
Abstract
A system for semi-automatic, multi-parameter acoustic analysis is described. The system, called FORMOFFA (For = FORmants, Mo = MOments, FF = Fundamental Frequency, A = Amplitude), operates on a PC microcomputer by adaptations of commercially available software. Data displays include a deterministic time record of instantaneous values, and an ergodic time-compressed distribution. In this report the technique is developed with a one-word example, and some measurement and reliability issues are described. The analysis possibilities are then illustrated with several applications: (1) segmental analysis of normal speech, (2) acoustic assessment of the effects of a progressive neurological disease (amyotrophic lateral sclerosis) on sentence production, (3) acoustic study of palatal lift management of a patient with traumatic brain injury, and (4) phonetic assessment of word production by a subject with dysarthria. Although the current technique is recommended as a research tool, this kind of analysis promises several advantages for clinical application, including semi-automaticity, efficiency, parsimony, and relevance to both segmental and suprasegmental levels of analysis.
Collapse
Affiliation(s)
- E H Buder
- Department of Speech and Hearing Sciences, University of Washington, Seattle, WA, 98195, USA
| | | | | | | | | |
Collapse
|
46
|
|
47
|
|
48
|
Abstract
This paper considers the topic of vocal tract acoustics from the three perspectives: (a) the acoustic theory of speech production; (b) contemporary laboratory methods for acoustic analysis, and (c) measurement of the acoustic signal of speech. Linear source-filter theory is the standard acoustic theory of speech production and is the foundation for remarkable advances in the analysis and synthesis of speech. Digital signal processing, the dominant laboratory method for speech analysis, enables the acquisition and recording of the acoustic speech signal but also implements quantitative algorithms largely based on linear source-filter theory. Measurements of the acoustic signal reflect the acoustic theory of speech production, laboratory methods for signal analysis, and principles of experimental phonetics. Basic issues in the three domains of theory, laboratory methods, and measurement are summarized as they pertain to the interests of the voice scientist, voice clinician, and voice teacher.
Collapse
Affiliation(s)
- R D Kent
- Waisman Center, University of Wisconsin-Madison 53705-2280
| |
Collapse
|
49
|
Abstract
The effects of speaking rate on the velocity profiles of movements of the lower lip and tongue tip during the production of stop consonants were examined using an x-ray microbeam system. Five young adults used a magnitude production task to produce five speaking rates that ranged from very fast to very slow. Results indicated that changes in speaking rate were associated with changes in the topology of the speech movement velocity-time function. Specifically, the velocity profile changed from a symmetrical, single-peaked function at the fast speaking rates to an asymmetrical and multi-peaked function at the slow speaking rates. This variation in velocity profile shape is interpreted as support for the view that alterations in speaking rate are associated with changes in motor control strategies. In particular, the control strategy for speech gestures produced at fast speaking rates appears to involve unitary movements that may be predominantly preprogrammed, whereas gestures produced at slow speaking rates consist of multiple submovements that may be influenced by feedback mechanisms.
Collapse
Affiliation(s)
- S G Adams
- Speech and Swallowing Lab, Toronto Hospital, Ontario, Canada
| | | | | |
Collapse
|
50
|
Kent RD, Schlesinger M. Applications of the unitary-group approach to variational calculations in many-electron atoms. Phys Rev A 1992; 46:6881-6887. [PMID: 9908018 DOI: 10.1103/physreva.46.6881] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
|