1
|
Feature-rich multiplex lexical networks reveal mental strategies of early language learning. Sci Rep 2023; 13:1474. [PMID: 36702869 PMCID: PMC9879964 DOI: 10.1038/s41598-022-27029-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Accepted: 12/23/2022] [Indexed: 01/27/2023] Open
Abstract
Knowledge in the human mind exhibits a dualistic vector/network nature. Modelling words as vectors is key to natural language processing, whereas networks of word associations can map the nature of semantic memory. We reconcile these paradigms-fragmented across linguistics, psychology and computer science-by introducing FEature-Rich MUltiplex LEXical (FERMULEX) networks. This novel framework merges structural similarities in networks and vector features of words, which can be combined or explored independently. Similarities model heterogenous word associations across semantic/syntactic/phonological aspects of knowledge. Words are enriched with multi-dimensional feature embeddings including frequency, age of acquisition, length and polysemy. These aspects enable unprecedented explorations of cognitive knowledge. Through CHILDES data, we use FERMULEX networks to model normative language acquisition by 1000 toddlers between 18 and 30 months. Similarities and embeddings capture word homophily via conformity, which measures assortative mixing via distance and features. Conformity unearths a language kernel of frequent/polysemous/short nouns and verbs key for basic sentence production, supporting recent evidence of children's syntactic constructs emerging at 30 months. This kernel is invisible to network core-detection and feature-only clustering: It emerges from the dual vector/network nature of words. Our quantitative analysis reveals two key strategies in early word learning. Modelling word acquisition as random walks on FERMULEX topology, we highlight non-uniform filling of communicative developmental inventories (CDIs). Biased random walkers lead to accurate (75%), precise (55%) and partially well-recalled (34%) predictions of early word learning in CDIs, providing quantitative support to previous empirical findings and developmental theories.
Collapse
|
2
|
Karuza EA. The Value of Statistical Learning to Cognitive Network Science. Top Cogn Sci 2022; 14:78-92. [PMID: 34165881 DOI: 10.1111/tops.12558] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2020] [Revised: 06/08/2021] [Accepted: 06/08/2021] [Indexed: 12/28/2022]
Abstract
To study the human mind is to consider the nature of associations-how are they learned, what are their constituent parts, and how can they be severed or adjusted? The manipulation of associations stands as a pillar of statistical learning (SL) research, which strongly suggests that processes as diverse as word segmentation, learning of grammatical patterns, and event perception can be explained by the learner's sensitivity to simple temporal dependencies (among other regularities). Used to determine the edges of a network, associations are similarly crucial to consider when quantifying the graph-theoretical properties of various cognitive systems. With this point of convergence in mind, the present work reaffirms the unique value of network science in illuminating the broad-level architectures of complex cognitive systems. However, I also describe how insights from the SL literature, coupled with insights from psycholinguistics more broadly, offer a strong theoretical backbone upon which we can develop and study networks that reflect, as closely as possible, the psychological realities of learning.
Collapse
|
3
|
Hills TT, Kenett YN. Is the Mind a Network? Maps, Vehicles, and Skyhooks in Cognitive Network Science. Top Cogn Sci 2021; 14:189-208. [PMID: 34435461 DOI: 10.1111/tops.12570] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2020] [Revised: 07/28/2021] [Accepted: 07/28/2021] [Indexed: 11/28/2022]
Abstract
Cognitive researchers often carve cognition up into structures and processes. Cognitive processes operate on structures, like vehicles driving over a map. Language alongside semantic and episodic memory are proposed to have structure, as are perceptual systems. Over these structures, processes operate to construct memory and solve problems by retrieving and manipulating information. Network science offers an approach to representing cognitive structures and has made tremendous inroads into understanding the nature of cognitive structure and process. But is the mind a network? If so, what kind? In this article, we briefly review the main metaphors, assumptions, and pitfalls prevalent in cognitive network science (maps and vehicles; one network/process to rule them all), highlight the need for new metaphors that elaborate on the map-and-vehicle framework (wormholes, skyhooks, and generators), and present open questions in studying the mind as a network (the challenge of capturing network change, what should the edges of cognitive networks be made of, and aggregated vs. individual-based networks). One critical lesson of this exercise is that the richness of the mind as network approach makes it a powerful tool in its own right; it has helped to make our assumptions more visible, generating new and fascinating questions, and enriching the prospects for future research. A second lesson is that the mind as a network-though useful-is incomplete. The mind is not a network, but it may contain them.
Collapse
Affiliation(s)
| | - Yoed N Kenett
- Faculty of Industrial Engineering and Management, Technion - Israel Institute of Technology
| |
Collapse
|
4
|
Fourtassi A, Bian Y, Frank MC. The Growth of Children's Semantic and Phonological Networks: Insight From 10 Languages. Cogn Sci 2021; 44:e12847. [PMID: 32621305 DOI: 10.1111/cogs.12847] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2019] [Revised: 04/03/2020] [Accepted: 04/08/2020] [Indexed: 12/01/2022]
Abstract
Children tend to produce words earlier when they are connected to a variety of other words along the phonological and semantic dimensions. Though these semantic and phonological connectivity effects have been extensively documented, little is known about their underlying developmental mechanism. One possibility is that learning is driven by lexical network growth where highly connected words in the child's early lexicon enable learning of similar words. Another possibility is that learning is driven by highly connected words in the external learning environment, instead of highly connected words in the early internal lexicon. The present study tests both scenarios systematically in both the phonological and semantic domains across 10 languages. We show that phonological and semantic connectivity in the learning environment drives growth in both production- and comprehension-based vocabularies, even controlling for word frequency and length. This pattern of findings suggests a word learning process where children harness their statistical learning abilities to detect and learn highly connected words in the learning environment.
Collapse
Affiliation(s)
| | - Yuan Bian
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology
| | | |
Collapse
|
5
|
Siew CSQ. Global and Local Feature Distinctiveness Effects in Language Acquisition. Cogn Sci 2021; 45:e13008. [PMID: 34213787 DOI: 10.1111/cogs.13008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2019] [Revised: 05/27/2021] [Accepted: 05/31/2021] [Indexed: 11/26/2022]
Abstract
Various aspects of semantic features drive early vocabulary development, but less is known about how the global and local structure of the overall semantic feature space influences language acquisition. A feature network of English words was constructed from a large database of adult feature production norms such that edges in the network represented feature distances between words (i.e., Manhattan distances of probability distributions of features elicited for each pair of words). A word's global feature distinctiveness is measured with respect to all other words in the network and a word's local feature distinctiveness is measured relative to words in sub-networks derived from clustering analyses. This paper investigates how feature distinctiveness of individual words at local and global scales of the network influences language acquisition. Regression analyses indicate that global feature distinctiveness was associated with earlier age of acquisition ratings, and was a stronger predictor of age of acquisition than local feature distinctiveness. These results suggest that the global structure of the semantic feature network could play an important role in language acquisition, whereby globally distinctive concepts help to structure vocabulary development over the lifespan.
Collapse
|
6
|
Castro N, Siew CSQ. Contributions of modern network science to the cognitive sciences: revisiting research spirals of representation and process. Proc Math Phys Eng Sci 2020; 476:20190825. [PMID: 32831584 PMCID: PMC7428042 DOI: 10.1098/rspa.2019.0825] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2019] [Accepted: 05/12/2020] [Indexed: 12/13/2022] Open
Abstract
Modelling the structure of cognitive systems is a central goal of the cognitive sciences-a goal that has greatly benefitted from the application of network science approaches. This paper provides an overview of how network science has been applied to the cognitive sciences, with a specific focus on the two research 'spirals' of cognitive sciences related to the representation and processes of the human mind. For each spiral, we first review classic papers in the psychological sciences that have drawn on graph-theoretic ideas or frameworks before the advent of modern network science approaches. We then discuss how current research in these areas has been shaped by modern network science, which provides the mathematical framework and methodological tools for psychologists to (i) represent cognitive network structure and (ii) investigate and model the psychological processes that occur in these cognitive networks. Finally, we briefly comment on the future of, and the challenges facing, cognitive network science.
Collapse
Affiliation(s)
- Nichol Castro
- Department of Speech and Hearing Sciences, University of Washington, Seattle, WA, USA
| | - Cynthia S. Q. Siew
- Department of Psychology, National University of Singapore, Singapore, Republic of Singapore
| |
Collapse
|
7
|
Applications of Network Science to Education Research: Quantifying Knowledge and the Development of Expertise through Network Analysis. EDUCATION SCIENCES 2020. [DOI: 10.3390/educsci10040101] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
A fundamental goal of education is to inspire and instill deep, meaningful, and long-lasting conceptual change within the knowledge landscapes of students. This commentary posits that the tools of network science could be useful in helping educators achieve this goal in two ways. First, methods from cognitive psychology and network science could be helpful in quantifying and analyzing the structure of students’ knowledge of a given discipline as a knowledge network of interconnected concepts. Second, network science methods could be relevant for investigating the developmental trajectories of knowledge structures by quantifying structural change in knowledge networks, and potentially inform instructional design in order to optimize the acquisition of meaningful knowledge as the student progresses from being a novice to an expert in the subject. This commentary provides a brief introduction to common network science measures and suggests how they might be relevant for shedding light on the cognitive processes that underlie learning and retrieval, and discusses ways in which generative network growth models could inform pedagogical strategies to enable meaningful long-term conceptual change and knowledge development among students.
Collapse
|
8
|
Castro N. The multiplex structure of the mental lexicon influences picture naming in people with aphasia. JOURNAL OF COMPLEX NETWORKS 2019; 7:913-931. [PMID: 31984136 PMCID: PMC6961494 DOI: 10.1093/comnet/cnz012] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/26/2018] [Accepted: 02/25/2019] [Indexed: 06/10/2023]
Abstract
An emerging area of research in cognitive science is the utilization of networks to model the structure and processes of the mental lexicon in healthy and clinical populations, like aphasia. Previous research has focused on only one type of word similarity at a time (e.g., semantic relationships), even though words are multi-faceted. Here, we investigate lexical retrieval in a picture naming task from people with Broca's and Wernicke's aphasia and healthy controls by utilizing a multiplex network structure that accounts for the interplay between multiple semantic and phonological relationships among words in the mental lexicon. Extending upon previous work, we focused on the global network measure of closeness centrality which is known to capture spreading activation, an important process supporting lexical retrieval. We conducted a series of logistic regression models predicting the probability of correct picture naming. We tested whether multiplex closeness centrality was a better predictor of picture naming performance than single-layer closeness centralities, other network measures assessing local and meso-scale structure, psycholinguistic variables and group differences. We also examined production gaps, or the difference between the likelihood of producing a word with the lowest and highest closeness centralities. Our results indicated that multiplex closeness centrality was a significant predictor of picture naming performance, where words with high closeness centrality were more likely to be produced than words with low closeness centrality. Additionally, multiplex closeness centrality outperformed single-layer closeness centralities and other multiplex network measures, and remained a significant predictor after controlling for psycholinguistic variables and group differences. Furthermore, we found that the facilitative effect of closeness centrality was similar for both types of aphasia. Our results underline the importance of integrating multiple measures of word similarities in cognitive language networks for better understanding lexical retrieval in aphasia, with an eye towards future clinical applications.
Collapse
Affiliation(s)
- Nichol Castro
- School of Psychology, Georgia Institute of Technology, 654 Cherry Street, Atlanta, Georgia, 30332 USA
| |
Collapse
|
9
|
Stella M, de Nigris S, Aloric A, Siew CSQ. Forma mentis networks quantify crucial differences in STEM perception between students and experts. PLoS One 2019; 14:e0222870. [PMID: 31622351 PMCID: PMC6797169 DOI: 10.1371/journal.pone.0222870] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2019] [Accepted: 09/09/2019] [Indexed: 11/18/2022] Open
Abstract
In order to investigate how high school students and researchers perceive science-related (STEM) subjects, we introduce forma mentis networks. This framework models how people conceptually structure their stance, mindset or forma mentis toward a given topic. In this study, we build forma mentis networks revolving around STEM and based on psycholinguistic data, namely free associations of STEM concepts (i.e., which words are elicited first and associated by students/researchers reading "science"?) and their valence ratings concepts (i.e., is "science" perceived as positive, negative or neutral by students/researchers?). We construct separate networks for (Ns = 159) Italian high school students and (Nr = 59) interdisciplinary professionals and researchers in order to investigate how these groups differ in their conceptual knowledge and emotional perception of STEM. Our analysis of forma mentis networks at various scales indicate that, like researchers, students perceived "science" as a strongly positive entity. However, differently from researchers, students identified STEM subjects like "physics" and "mathematics" as negative and associated them with other negative STEM-related concepts. We call this surrounding of negative associations a negative emotional aura. Cross-validation with external datasets indicated that the negative emotional auras of physics, maths and statistics in the students' forma mentis network related to science anxiety. Furthermore, considering the semantic associates of "mathematics" and "physics" revealed that negative auras may originate from a bleak, dry perception of the technical methodology and mnemonic tools taught in these subjects (e.g., calculus rules). Overall, our results underline the crucial importance of emphasizing nontechnical and applied aspects of STEM disciplines, beyond purely methodological teaching. The quantitative insights achieved through forma mentis networks highlight the necessity of establishing novel pedagogic and interdisciplinary links between science, its real-world complexity, and creativity in science learning in order to enhance the impact of STEM education, learning and outreach activities.
Collapse
Affiliation(s)
- Massimo Stella
- Institute for Complex Systems Simulation, University of Southampton, Southampton, United Kingdom
- Complex Science Consulting, Lecce, Italy
| | - Sarah de Nigris
- Institute for Web Science and Technologies, University of Koblenz-Landau, Koblenz, Germany
| | - Aleksandra Aloric
- Scientific Computing Laboratory, Center for the Study of Complex Systems, Institute of Physics Belgrade, Belgrade, Serbia
| | - Cynthia S. Q. Siew
- Department of Psychology, University of Warwick, Coventry, United Kingdom
- Department of Psychology, National University of Singapore, Singapore, Singapore
| |
Collapse
|
10
|
Modelling Early Word Acquisition through Multiplex Lexical Networks and Machine Learning. BIG DATA AND COGNITIVE COMPUTING 2019. [DOI: 10.3390/bdcc3010010] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Early language acquisition is a complex cognitive task. Recent data-informed approaches showed that children do not learn words uniformly at random but rather follow specific strategies based on the associative representation of words in the mental lexicon, a conceptual system enabling human cognitive computing. Building on this evidence, the current investigation introduces a combination of machine learning techniques, psycholinguistic features (i.e., frequency, length, polysemy and class) and multiplex lexical networks, representing the semantics and phonology of the mental lexicon, with the aim of predicting normative acquisition of 529 English words by toddlers between 22 and 26 months. Classifications using logistic regression and based on four psycholinguistic features achieve the best baseline cross-validated accuracy of 61.7% when half of the words have been acquired. Adding network information through multiplex closeness centrality enhances accuracy (up to 67.7%) more than adding multiplex neighbourhood density/degree (62.4%) or multiplex PageRank versatility (63.0%) or the best single-layer network metric, i.e., free association degree (65.2%), instead. Multiplex closeness operationalises the structural relevance of words for semantic and phonological information flow. These results indicate that the whole, global, multi-level flow of information and structure of the mental lexicon influence word acquisition more than single-layer or local network features of words when considered in conjunction with language norms. The highlighted synergy of multiplex lexical structure and psycholinguistic norms opens new ways for understanding human cognition and language processing through powerful and data-parsimonious cognitive computing approaches.
Collapse
|