1
|
Tuckute G, Kanwisher N, Fedorenko E. Language in Brains, Minds, and Machines. Annu Rev Neurosci 2024; 47:277-301. [PMID: 38669478 DOI: 10.1146/annurev-neuro-120623-101142] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/28/2024]
Abstract
It has long been argued that only humans could produce and understand language. But now, for the first time, artificial language models (LMs) achieve this feat. Here we survey the new purchase LMs are providing on the question of how language is implemented in the brain. We discuss why, a priori, LMs might be expected to share similarities with the human language system. We then summarize evidence that LMs represent linguistic information similarly enough to humans to enable relatively accurate brain encoding and decoding during language processing. Finally, we examine which LM properties-their architecture, task performance, or training-are critical for capturing human neural responses to language and review studies using LMs as in silico model organisms for testing hypotheses about language. These ongoing investigations bring us closer to understanding the representations and processes that underlie our ability to comprehend sentences and express thoughts in language.
Collapse
Affiliation(s)
- Greta Tuckute
- Department of Brain and Cognitive Sciences and McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA;
| | - Nancy Kanwisher
- Department of Brain and Cognitive Sciences and McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA;
| | - Evelina Fedorenko
- Department of Brain and Cognitive Sciences and McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA;
| |
Collapse
|
2
|
Mahowald K, Ivanova AA, Blank IA, Kanwisher N, Tenenbaum JB, Fedorenko E. Dissociating language and thought in large language models. Trends Cogn Sci 2024; 28:517-540. [PMID: 38508911 DOI: 10.1016/j.tics.2024.01.011] [Citation(s) in RCA: 29] [Impact Index Per Article: 29.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Revised: 01/31/2024] [Accepted: 01/31/2024] [Indexed: 03/22/2024]
Abstract
Large language models (LLMs) have come closest among all models to date to mastering human language, yet opinions about their linguistic and cognitive capabilities remain split. Here, we evaluate LLMs using a distinction between formal linguistic competence (knowledge of linguistic rules and patterns) and functional linguistic competence (understanding and using language in the world). We ground this distinction in human neuroscience, which has shown that formal and functional competence rely on different neural mechanisms. Although LLMs are surprisingly good at formal competence, their performance on functional competence tasks remains spotty and often requires specialized fine-tuning and/or coupling with external modules. We posit that models that use language in human-like ways would need to master both of these competence types, which, in turn, could require the emergence of separate mechanisms specialized for formal versus functional linguistic competence.
Collapse
|
3
|
Huang Z, Zhang W. Forecasting carbon prices in China's pilot carbon market: A multi-source information approach with conditional generative adversarial networks. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2024; 359:120967. [PMID: 38723494 DOI: 10.1016/j.jenvman.2024.120967] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/25/2023] [Revised: 03/27/2024] [Accepted: 04/19/2024] [Indexed: 05/22/2024]
Abstract
In recent years, the Chinese government has actively pursued the implementation of its 'dual-carbon' strategy, concurrently establishing a national carbon emissions trading market. Accurate carbon price forecasts have become essential for policymakers and investors involved in related initiatives. Nevertheless, influenced by the interaction of various information sources, carbon trading prices exhibit non-linear and non-stationary characteristics, posing challenges for accurate prediction. Current research, centered around deep learning models, predominantly emphasizes intricate network structures, optimisation algorithms, and data decomposition. However, these models face a developmental bottleneck in extracting carbon price features and efficiently leveraging multi-source information. Consequently, novel ideas and methodologies are imperative. This study focuses on the Hubei and Guangdong regional carbon markets as research subjects. It develops a prediction framework based on a generative adversarial network model to capture the time-series change characteristics of carbon trading prices and the condition matrix. First, a generator prediction model is used to obtain the input matrix features and extract the time series features through a complex network to predict the carbon price data at the next moment using a fully connected layer. Second, a discriminator is utilised to distinguish between the actual values and the predicted values. The generator and the discriminator undergo continuous iterative training and alternate optimisation. This process aims to bring the generated prediction distributions closer to the actual sample data, resulting in more accurate final predictions. The empirical results convincingly show that the proposed model achieves unparalleled forecasting precision in both markets. The proposed model demonstrates the lowest MAE (0.804 and 0.839), lowest MAPE (0.023 and 0.018), lowest RMSE (1.174 and 1.383), and highest R2 (0.971 and 0.989) across both markets, indicating superior predictive accuracy. Additionally, the proposed model consistently outshines traditional forecasting approaches across one-step, five-step, and ten-step forecasts, affirming its robustness and universal applicability in modelling carbon trading price series. The findings suggest that this study can aid policymakers in optimizing the carbon pricing system. Furthermore, it offers a reference for policymakers to comprehensively leverage external factors, such as regulating traditional energy prices, leveraging international carbon market experiences, and monitoring economic dynamics. This comprehensive strategy can streamline the exploration and management of carbon price fluctuations, ultimately strengthening the carbon market's risk control system.
Collapse
Affiliation(s)
- Zhigang Huang
- Fujian Research Center for High Quality Economic Development, Fuzhou University, Fuzhou, 350108, China; Fujian Provincial Key Laboratory of Financial Science and Technology Innovation, Fuzhou University, Fuzhou, 350108, China.
| | - Weilan Zhang
- School of Economics and Management, Fuzhou University, Fuzhou, 350108, China.
| |
Collapse
|
4
|
Beguš G, Zhou A, Zhao TC. Encoding of speech in convolutional layers and the brain stem based on language experience. Sci Rep 2023; 13:6480. [PMID: 37081119 PMCID: PMC10119295 DOI: 10.1038/s41598-023-33384-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2022] [Accepted: 04/12/2023] [Indexed: 04/22/2023] Open
Abstract
Comparing artificial neural networks with outputs of neuroimaging techniques has recently seen substantial advances in (computer) vision and text-based language models. Here, we propose a framework to compare biological and artificial neural computations of spoken language representations and propose several new challenges to this paradigm. The proposed technique is based on a similar principle that underlies electroencephalography (EEG): averaging of neural (artificial or biological) activity across neurons in the time domain, and allows to compare encoding of any acoustic property in the brain and in intermediate convolutional layers of an artificial neural network. Our approach allows a direct comparison of responses to a phonetic property in the brain and in deep neural networks that requires no linear transformations between the signals. We argue that the brain stem response (cABR) and the response in intermediate convolutional layers to the exact same stimulus are highly similar without applying any transformations, and we quantify this observation. The proposed technique not only reveals similarities, but also allows for analysis of the encoding of actual acoustic properties in the two signals: we compare peak latency (i) in cABR relative to the stimulus in the brain stem and in (ii) intermediate convolutional layers relative to the input/output in deep convolutional networks. We also examine and compare the effect of prior language exposure on the peak latency in cABR and in intermediate convolutional layers. Substantial similarities in peak latency encoding between the human brain and intermediate convolutional networks emerge based on results from eight trained networks (including a replication experiment). The proposed technique can be used to compare encoding between the human brain and intermediate convolutional layers for any acoustic property and for other neuroimaging techniques.
Collapse
Affiliation(s)
- Gašper Beguš
- Department of Linguistics, University of California, Berkeley, USA.
| | - Alan Zhou
- Department of Cognitive Science, Johns Hopkins University, Baltimore, USA
| | - T Christina Zhao
- Institute for Learning and Brain Sciences, University of Washington, Seattle, USA
- Department of Speech and Hearing Sciences, University of Washington, Seattle, USA
| |
Collapse
|
5
|
Zhang XL, Xie L, Fosler-Lussier E, Vincent E. Guest editorial: Special issue on advances in deep learning based speech processing. Neural Netw 2023; 158:328-330. [PMID: 36493535 DOI: 10.1016/j.neunet.2022.11.033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/05/2022]
Affiliation(s)
- Xiao-Lei Zhang
- School of Marine Science and Technology, Northwestern Polytechnical University, Xi'an Shaanxi 710072, China.
| | - Lei Xie
- Audio, Speech and Language Processing Group (ASLP@NPU), School of Computer Science, Northwestern Polytechnical University, Xi'an Shaanxi 710072, China.
| | - Eric Fosler-Lussier
- Department of Computer Science and Engineering, The Ohio State University, Columbus, OH 43210, USA.
| | - Emmanuel Vincent
- Université de Lorraine, CNRS, Inria, LORIA, F-54000, Nancy, France.
| |
Collapse
|
7
|
Andreas J, Beguš G, Bronstein MM, Diamant R, Delaney D, Gero S, Goldwasser S, Gruber DF, de Haas S, Malkin P, Pavlov N, Payne R, Petri G, Rus D, Sharma P, Tchernov D, Tønnesen P, Torralba A, Vogt D, Wood RJ. Toward understanding the communication in sperm whales. iScience 2022; 25:104393. [PMID: 35663036 PMCID: PMC9160774 DOI: 10.1016/j.isci.2022.104393] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/01/2022] Open
Abstract
Machine learning has been advancing dramatically over the past decade. Most strides are human-based applications due to the availability of large-scale datasets; however, opportunities are ripe to apply this technology to more deeply understand non-human communication. We detail a scientific roadmap for advancing the understanding of communication of whales that can be built further upon as a template to decipher other forms of animal and non-human communication. Sperm whales, with their highly developed neuroanatomical features, cognitive abilities, social structures, and discrete click-based encoding make for an excellent model for advanced tools that can be applied to other animals in the future. We outline the key elements required for the collection and processing of massive datasets, detecting basic communication units and language-like higher-level structures, and validating models through interactive playback experiments. The technological capabilities developed by such an undertaking hold potential for cross-applications in broader communities investigating non-human communication and behavioral research.
Collapse
Affiliation(s)
- Jacob Andreas
- MIT CSAIL, Cambridge, MA, USA
- Project CETI, New York, NY, USA
| | - Gašper Beguš
- Department of Linguistics, University of California, Berkeley, CA, USA
- Project CETI, New York, NY, USA
| | - Michael M. Bronstein
- Department of Computer Science, University of Oxford, Oxford, UK
- IDSIA, University of Lugano, Lugano, Switzerland
- Twitter, London, UK
- Project CETI, New York, NY, USA
| | - Roee Diamant
- Leon H. Charney School of Marine Sciences, University of Haifa, Haifa, Israel
- Project CETI, New York, NY, USA
| | - Denley Delaney
- Exploration Technology Lab, National Geographic Society, Washington DC, USA
- Project CETI, New York, NY, USA
| | - Shane Gero
- Dominica Sperm Whale Project, Roseau, Commonwealth of Dominica
- Department of Biology, Carleton University, Ottawa, ON, Canada
- Project CETI, New York, NY, USA
| | - Shafi Goldwasser
- Simons Institute for the Theory of Computing, University of California, Berkeley, CA, USA
| | - David F. Gruber
- Department of Natural Sciences, Baruch College and The Graduate Center, PhD Program in Biology, City University of New York, New York, NY, USA
- Project CETI, New York, NY, USA
| | - Sarah de Haas
- Google Research, Mountain View, CA USA
- Project CETI, New York, NY, USA
| | - Peter Malkin
- Google Research, Mountain View, CA USA
- Project CETI, New York, NY, USA
| | | | | | - Giovanni Petri
- ISI Foundation, Turin, Italy
- Project CETI, New York, NY, USA
| | - Daniela Rus
- MIT CSAIL, Cambridge, MA, USA
- Project CETI, New York, NY, USA
| | | | - Dan Tchernov
- Leon H. Charney School of Marine Sciences, University of Haifa, Haifa, Israel
- Project CETI, New York, NY, USA
| | - Pernille Tønnesen
- Marine Bioacoustics Lab, Zoophysiology, Department of Biology, Aarhus University, Aarhus, Denmark
- Project CETI, New York, NY, USA
| | | | - Daniel Vogt
- School of Engineering and Applied Sciences, Harvard University, Cambridge, MA, USA
- Project CETI, New York, NY, USA
| | - Robert J. Wood
- School of Engineering and Applied Sciences, Harvard University, Cambridge, MA, USA
- Project CETI, New York, NY, USA
| |
Collapse
|