1
|
Vitevitch MS, Lachs L. Using network science to examine audio-visual speech perception with a multi-layer graph. PLoS One 2024; 19:e0300926. [PMID: 38551907 PMCID: PMC10980250 DOI: 10.1371/journal.pone.0300926] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2023] [Accepted: 03/05/2024] [Indexed: 04/01/2024] Open
Abstract
To examine visual speech perception (i.e., lip-reading), we created a multi-layer network (the AV-net) that contained: (1) an auditory layer with nodes representing phonological word-forms and edges connecting words that were phonologically related, and (2) a visual layer with nodes representing the viseme representations of words and edges connecting viseme representations that differed by a single viseme (and additional edges to connect related nodes in the two layers). The results of several computer simulations (in which activation diffused across the network to simulate word identification) are reported and compared to the performance of human participants who identified the same words in a condition in which audio and visual information were both presented (Simulation 1), in an audio-only presentation condition (Simulation 2), and a visual-only presentation condition (Simulation 3). Another simulation (Simulation 4) examined the influence of phonological information on visual speech perception by comparing performance in the multi-layer AV-net to a single-layer network that contained only a visual layer with nodes representing the viseme representations of words and edges connecting viseme representations that differed by a single viseme. We also report the results of several analyses of the errors made by human participants in the visual-only presentation condition. The results of our analyses have implications for future research and training of lip-reading, and for the development of automatic lip-reading devices and software for individuals with certain developmental or acquired disorders or for listeners with normal hearing in noisy conditions.
Collapse
Affiliation(s)
| | - Lorin Lachs
- California State University, Fresno, Fresno, CA, United States of America
| |
Collapse
|
2
|
Stella M, Citraro S, Rossetti G, Marinazzo D, Kenett YN, Vitevitch MS. Cognitive modelling of concepts in the mental lexicon with multilayer networks: Insights, advancements, and future challenges. Psychon Bull Rev 2024:10.3758/s13423-024-02473-9. [PMID: 38438713 DOI: 10.3758/s13423-024-02473-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/28/2024] [Indexed: 03/06/2024]
Abstract
The mental lexicon is a complex cognitive system representing information about the words/concepts that one knows. Over decades psychological experiments have shown that conceptual associations across multiple, interactive cognitive levels can greatly influence word acquisition, storage, and processing. How can semantic, phonological, syntactic, and other types of conceptual associations be mapped within a coherent mathematical framework to study how the mental lexicon works? Here we review cognitive multilayer networks as a promising quantitative and interpretative framework for investigating the mental lexicon. Cognitive multilayer networks can map multiple types of information at once, thus capturing how different layers of associations might co-exist within the mental lexicon and influence cognitive processing. This review starts with a gentle introduction to the structure and formalism of multilayer networks. We then discuss quantitative mechanisms of psychological phenomena that could not be observed in single-layer networks and were only unveiled by combining multiple layers of the lexicon: (i) multiplex viability highlights language kernels and facilitative effects of knowledge processing in healthy and clinical populations; (ii) multilayer community detection enables contextual meaning reconstruction depending on psycholinguistic features; (iii) layer analysis can mediate latent interactions of mediation, suppression, and facilitation for lexical access. By outlining novel quantitative perspectives where multilayer networks can shed light on cognitive knowledge representations, including in next-generation brain/mind models, we discuss key limitations and promising directions for cutting-edge future research.
Collapse
Affiliation(s)
- Massimo Stella
- CogNosco Lab, Department of Psychology and Cognitive Science, University of Trento, Trento, Italy.
| | - Salvatore Citraro
- Institute of Information Science and Technologies, National Research Council, Pisa, Italy
| | - Giulio Rossetti
- Institute of Information Science and Technologies, National Research Council, Pisa, Italy
| | - Daniele Marinazzo
- Faculty of Psychology and Educational Sciences, Department of Data Analysis, University of Ghent, Ghent, Belgium
| | - Yoed N Kenett
- Faculty of Data and Decision Sciences, Technion - Israel Institute of Technology, Haifa, Israel
| | - Michael S Vitevitch
- Department of Speech Language Hearing, University of Kansas, Lawrence, KS, USA
| |
Collapse
|
3
|
Vitevitch MS, Pisoni DB, Soehlke L, Foster TA. Using Complex Networks in the Hearing Sciences. Ear Hear 2024; 45:1-9. [PMID: 37316992 PMCID: PMC10721731 DOI: 10.1097/aud.0000000000001395] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
In this Point of View, we review a number of recent discoveries from the emerging, interdisciplinary field of Network Science , which uses graph theoretic techniques to understand complex systems. In the network science approach, nodes represent entities in a system, and connections are placed between nodes that are related to each other to form a web-like network . We discuss several studies that demonstrate how the micro-, meso-, and macro-level structure of a network of phonological word-forms influence spoken word recognition in listeners with normal hearing and in listeners with hearing loss. Given the discoveries made possible by this new approach and the influence of several complex network measures on spoken word recognition performance we argue that speech recognition measures-originally developed in the late 1940s and routinely used in clinical audiometry-should be revised to reflect our current understanding of spoken word recognition. We also discuss other ways in which the tools of network science can be used in Speech and Hearing Sciences and Audiology more broadly.
Collapse
|
4
|
Hearnshaw S, Baker E, Pomper R, McGregor KK, Edwards J, Munro N. The Relationship Between Speech Perception, Speech Production, and Vocabulary Abilities in Children: Insights From By-Group and Continuous Analyses. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2023; 66:1173-1191. [PMID: 36940475 DOI: 10.1044/2022_jslhr-22-00441] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
PURPOSE The purpose of this study was to explore the relationship between speech perception, speech production, and vocabulary abilities in children with and without speech sound disorders (SSDs), analyzing the data both by group and continuously. METHOD Sixty-one Australian English-speaking children aged 48-69 months participated in this study. Children's speech production abilities ranged along the continuum from SSDs through to typical speech. Their vocabulary abilities ranged along the continuum from typical to above average ("lexically precocious"). Children completed routine speech and language assessments in addition to an experimental Australian English lexical and phonetic judgment task. RESULTS When analyzing data by group, there was no significant difference between the speech perception ability of children with SSDs and that of children without SSDs. Children with above-average vocabularies had significantly better speech perception ability than children with average vocabularies. When analyzing data continuously, speech production and vocabulary were both significant positive predictors of variance in speech perception ability, both individually in simple linear regression and when combined in multiple linear regression. There was also a significant positive correlation between perception and production of two of the four target phonemes tested (i.e., /k/ and /ʃ/) among children in the SSD group. CONCLUSIONS Results from this study provide further insight into the complex relationship between speech perception, speech production, and vocabulary abilities in children. While there is a clinical and important need for categorical distinctions between SSDs and typically developing speech, findings further highlight the value of investigating speech production and vocabulary abilities continuously and categorically. By capturing the heterogeneity among children's speech production and vocabulary abilities, we can advance our understanding of SSDs in children. SUPPLEMENTAL MATERIAL https://doi.org/10.23641/asha.22229674.
Collapse
Affiliation(s)
| | - Elise Baker
- Western Sydney University, New South Wales, Australia
- South Western Sydney Local Health District, New South Wales, Australia
- Ingham Institute for Applied Medical Research, Liverpool, New South Wales, Australia
| | - Ron Pomper
- Boys Town National Research Hospital, Omaha, NE
| | | | - Jan Edwards
- Department of Hearing and Speech Sciences, University of Maryland, College Park
| | - Natalie Munro
- The University of Sydney, New South Wales, Australia
| |
Collapse
|
5
|
Castro N, Vitevitch MS. Using Network Science and Psycholinguistic Megastudies to Examine the Dimensions of Phonological Similarity. LANGUAGE AND SPEECH 2023; 66:143-174. [PMID: 35586894 DOI: 10.1177/00238309221095455] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Network science was used to examine different dimensions of phonological similarity in English. Data from a phonological associate task and an identification of words in noise task were used to create a phonological association network and a misperception network. These networks were compared to a network formed by a computational metric widely used to assess phonological similarity (i.e., one-phoneme metric). The phonological association network and the misperception network were topographically more similar to each other than either were to the one-phoneme metric network, but there were several network features in common between the one-phoneme metric network and the phonological association network. To assess the influence of network structure on processing, we compared the influence of degree (i.e., neighborhood density) from each of the networks on visual and auditory lexical decision reaction times obtained from two psycholinguistic megastudies. The effect of degree differed across network types and tasks. We discuss the use of each approach to determine phonological similarity and a possible direction forward for language research through the use of multiplex networks.
Collapse
Affiliation(s)
- Nichol Castro
- Department of Psychology, The University of Kansas, USA; Department of Communicative Disorders and Sciences, University at Buffalo, USA
| | | |
Collapse
|
6
|
Brown KS, Allopenna PD, Hunt WR, Steiner R, Saltzman E, McRae K, Magnuson JS. Universal Features in Phonological Neighbor Networks. ENTROPY 2018; 20:e20070526. [PMID: 33265615 PMCID: PMC7513050 DOI: 10.3390/e20070526] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/22/2018] [Revised: 06/29/2018] [Accepted: 07/10/2018] [Indexed: 11/16/2022]
Abstract
Human speech perception involves transforming a countinuous acoustic signal into discrete linguistically meaningful units (phonemes) while simultaneously causing a listener to activate words that are similar to the spoken utterance and to each other. The Neighborhood Activation Model posits that phonological neighbors (two forms [words] that differ by one phoneme) compete significantly for recognition as a spoken word is heard. This definition of phonological similarity can be extended to an entire corpus of forms to produce a phonological neighbor network (PNN). We study PNNs for five languages: English, Spanish, French, Dutch, and German. Consistent with previous work, we find that the PNNs share a consistent set of topological features. Using an approach that generates random lexicons with increasing levels of phonological realism, we show that even random forms with minimal relationship to any real language, combined with only the empirical distribution of language-specific phonological form lengths, are sufficient to produce the topological properties observed in the real language PNNs. The resulting pseudo-PNNs are insensitive to the level of lingustic realism in the random lexicons but quite sensitive to the shape of the form length distribution. We therefore conclude that “universal” features seen across multiple languages are really string universals, not language universals, and arise primarily due to limitations in the kinds of networks generated by the one-step neighbor definition. Taken together, our results indicate that caution is warranted when linking the dynamics of human spoken word recognition to the topological properties of PNNs, and that the investigation of alternative similarity metrics for phonological forms should be a priority.
Collapse
Affiliation(s)
- Kevin S. Brown
- Department of Biomedical Engineering, University of Connecticut, Storrs, CT 06269, USA
- Department of Physics, University of Connecticut, Storrs, CT 06269, USA
- Institute for Systems Genomics, University of Connecticut, Storrs, CT 06269, USA
- Connecticut Institute for the Brain & Cognitive Sciences, Storrs, CT 06269, USA
- Correspondence: ; Tel.: +1-860-486-6975
| | - Paul D. Allopenna
- Department of Psychological Sciences, University of Connecticut, Storrs, CT 06269, USA
| | - William R. Hunt
- Department of Biomedical Engineering, University of Connecticut, Storrs, CT 06269, USA
| | - Rachael Steiner
- Department of Psychological Sciences, University of Connecticut, Storrs, CT 06269, USA
| | - Elliot Saltzman
- Department of Physical Therapy and Athletic Training, Boston University, Boston, MA 02215, USA
| | - Ken McRae
- Department of Psychology, University of Western Ontario, London, ON N6A 5C2, Canada
- Brain & Mind Institute, University of Western Ontario, London, ON N6A 5C2, Canada
| | - James S. Magnuson
- Connecticut Institute for the Brain & Cognitive Sciences, Storrs, CT 06269, USA
- Department of Psychological Sciences, University of Connecticut, Storrs, CT 06269, USA
| |
Collapse
|
7
|
Beckage NM, Colunga E. Language Networks as Models of Cognition: Understanding Cognition through Language. UNDERSTANDING COMPLEX SYSTEMS 2016. [DOI: 10.1007/978-3-662-47238-5_1] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
|
8
|
Gruenenfelder TM, Recchia G, Rubin T, Jones MN. Graph‐Theoretic Properties of Networks Based on Word Association Norms: Implications for Models of Lexical Semantic Memory. Cogn Sci 2015; 40:1460-95. [PMID: 26453571 DOI: 10.1111/cogs.12299] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2013] [Revised: 05/17/2015] [Accepted: 05/23/2015] [Indexed: 11/30/2022]
Affiliation(s)
| | - Gabriel Recchia
- Department of Psychological and Brain Sciences Indiana University
| | - Tim Rubin
- Department of Psychological and Brain Sciences Indiana University
| | - Michael N. Jones
- Department of Psychological and Brain Sciences Indiana University
| |
Collapse
|
9
|
Carlson MT, Sonderegger M, Bane M. How children explore the phonological network in child-directed speech: A survival analysis of children's first word productions. JOURNAL OF MEMORY AND LANGUAGE 2014; 75:159-180. [PMID: 25089073 PMCID: PMC4115338 DOI: 10.1016/j.jml.2014.05.005] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/28/2023]
Abstract
We explored how phonological network structure influences the age of words' first appearance in children's (14-50 months) speech, using a large, longitudinal corpus of spontaneous child-caregiver interactions. We represent the caregiver lexicon as a network in which each word is connected to all of its phonological neighbors, and consider both words' local neighborhood density (degree), and also their embeddedness among interconnected neighborhoods (clustering coefficient and coreness). The larger-scale structure reflected in the latter two measures is implicated in current theories of lexical development and processing, but its role in lexical development has not yet been explored. Multilevel discrete-time survival analysis revealed that children are more likely to produce new words whose network properties support lexical access for production: high degree, but low clustering coefficient and coreness. These effects appear to be strongest at earlier ages and largely absent from 30 months on. These results suggest that both a word's local connectivity in the lexicon and its position in the lexicon as a whole influences when it is learned, and they underscore how general lexical processing mechanisms contribute to productive vocabulary development.
Collapse
Affiliation(s)
- Matthew T Carlson
- University of Chicago, Department of Psychology, 5848 S. University Ave., Chicago, IL 60637, USA
| | - Morgan Sonderegger
- University of Chicago, Department of Linguistics, 1010 E. 59 St., Chicago, IL 60637, USA ; University of Chicago, Department of Computer Science, 1100 E. 58 St., Chicago, IL 60637, USA
| | - Max Bane
- University of Chicago, Department of Linguistics, 1010 E. 59 St., Chicago, IL 60637, USA
| |
Collapse
|
10
|
Vitevitch MS, Chan KY, Goldstein R. Insights into failed lexical retrieval from network science. Cogn Psychol 2014; 68:1-32. [PMID: 24269488 PMCID: PMC3891304 DOI: 10.1016/j.cogpsych.2013.10.002] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2013] [Revised: 10/30/2013] [Accepted: 10/31/2013] [Indexed: 01/22/2023]
Abstract
Previous network analyses of the phonological lexicon (Vitevitch, 2008) observed a web-like structure that exhibited assortative mixing by degree: words with dense phonological neighborhoods tend to have as neighbors words that also have dense phonological neighborhoods, and words with sparse phonological neighborhoods tend to have as neighbors words that also have sparse phonological neighborhoods. Given the role that assortative mixing by degree plays in network resilience, we examined instances of real and simulated lexical retrieval failures in computer simulations, analysis of a slips-of-the-ear corpus, and three psycholinguistic experiments for evidence of this network characteristic in human behavior. The results of the various analyses support the hypothesis that the structure of words in the mental lexicon influences lexical processing. The implications of network science for current models of spoken word recognition, language processing, and cognitive psychology more generally are discussed.
Collapse
Affiliation(s)
| | - Kit Ying Chan
- Department of Psychology, University of Kansas, United States
| | | |
Collapse
|
11
|
Baronchelli A, Ferrer-i-Cancho R, Pastor-Satorras R, Chater N, Christiansen MH. Networks in Cognitive Science. Trends Cogn Sci 2013; 17:348-60. [PMID: 23726319 DOI: 10.1016/j.tics.2013.04.010] [Citation(s) in RCA: 209] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2012] [Revised: 04/16/2013] [Accepted: 04/17/2013] [Indexed: 01/14/2023]
|
12
|
Altieri N, Gruenenfelder T, Pisoni DB. Clustering coefficients of lexical neighborhoods: Does neighborhood structure matter in spoken word recognition? THE MENTAL LEXICON 2010; 5:1-21. [PMID: 21423865 PMCID: PMC3060033 DOI: 10.1075/ml.5.1.01alt] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
High neighborhood density reduces the speed and accuracy of spoken word recognition. The two studies reported here investigated whether Clustering Coefficient (CC) - a graph theoretic variable measuring the degree to which a word's neighbors are neighbors of one another, has similar effects on spoken word recognition. In Experiment 1, we found that high CC words were identified less accurately when spectrally degraded than low CC words. In Experiment 2, using a word repetition procedure, we observed longer response latencies for high CC words compared to low CC words. Taken together, the results of both studies indicate that higher CC leads to slower and less accurate spoken word recognition. The results are discussed in terms of activation-plus-competition models of spoken word recognition.
Collapse
|