1
|
Martínez-Calvo A, Biviano MD, Christensen AH, Katifori E, Jensen KH, Ruiz-García M. The fluidic memristor as a collective phenomenon in elastohydrodynamic networks. Nat Commun 2024; 15:3121. [PMID: 38600060 PMCID: PMC11006656 DOI: 10.1038/s41467-024-47110-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Accepted: 03/19/2024] [Indexed: 04/12/2024] Open
Abstract
Fluid flow networks are ubiquitous and can be found in a broad range of contexts, from human-made systems such as water supply networks to living systems like animal and plant vasculature. In many cases, the elements forming these networks exhibit a highly non-linear pressure-flow relationship. Although we understand how these elements work individually, their collective behavior remains poorly understood. In this work, we combine experiments, theory, and numerical simulations to understand the main mechanisms underlying the collective behavior of soft flow networks with elements that exhibit negative differential resistance. Strikingly, our theoretical analysis and experiments reveal that a minimal network of nonlinear resistors, which we have termed a 'fluidic memristor', displays history-dependent resistance. This new class of element can be understood as a collection of hysteresis loops that allows this fluidic system to store information, and it can be directly used as a tunable resistor in fluidic setups. Our results provide insights that can inform other applications of fluid flow networks in soft materials science, biomedical settings, and soft robotics, and may also motivate new understanding of the flow networks involved in animal and plant physiology.
Collapse
Affiliation(s)
- Alejandro Martínez-Calvo
- Princeton Center for Theoretical Science, Princeton University, Princeton, NJ, 08544, USA
- Department of Chemical and Biological Engineering, Princeton University, Princeton, NJ, USA
| | - Matthew D Biviano
- Department of Physics, Technical University of Denmark, DK 2800, Kgs. Lyngby, Denmark
| | | | - Eleni Katifori
- Department of Physics and Astronomy, University of Pennsylvania, Philadelphia, PA, 19104, USA
- Center for Computational Biology, Flatiron Institute, New York, NY, 10010, USA
| | - Kaare H Jensen
- Department of Physics, Technical University of Denmark, DK 2800, Kgs. Lyngby, Denmark
| | - Miguel Ruiz-García
- Departamento de Estructura de la Materia, Física Térmica y Electrónica, Universidad Complutense Madrid, 28040, Madrid, Spain.
- GISC - Grupo Interdisciplinar de Sistemas Complejos, Universidad Complutense Madrid, 28040, Madrid, Spain.
- Department of Mathematics, Universidad Carlos III de Madrid, 28911, Leganés, Spain.
| |
Collapse
|
2
|
Stern M, Liu AJ, Balasubramanian V. Physical effects of learning. Phys Rev E 2024; 109:024311. [PMID: 38491658 DOI: 10.1103/physreve.109.024311] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2023] [Accepted: 01/31/2024] [Indexed: 03/18/2024]
Abstract
Interacting many-body physical systems ranging from neural networks in the brain to folding proteins to self-modifying electrical circuits can learn to perform diverse tasks. This learning, both in nature and in engineered systems, can occur through evolutionary selection or through dynamical rules that drive active learning from experience. Here, we show that learning in linear physical networks with weak input signals leaves architectural imprints on the Hessian of a physical system. Compared to a generic organization of the system components, (a) the effective physical dimension of the response to inputs decreases, (b) the response of physical degrees of freedom to random perturbations (or system "susceptibility") increases, and (c) the low-eigenvalue eigenvectors of the Hessian align with the task. Overall, these effects embody the typical scenario for learning processes in physical systems in the weak input regime, suggesting ways of discovering whether a physical network may have been trained.
Collapse
Affiliation(s)
- Menachem Stern
- Department of Physics and Astronomy, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
| | - Andrea J Liu
- Department of Physics and Astronomy, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, New York 10010, USA
| | - Vijay Balasubramanian
- Department of Physics and Astronomy, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
- Santa Fe Institute, 1399 Hyde Park Road, Santa Fe, New Mexico 87501, USA
- Theoretische Natuurkunde, Vrije Universiteit Brussel, Pleinlaan 2, B-1050 Brussels, Belgium
| |
Collapse
|
3
|
Ruiz-García M, Ozaita J, Pereda M, Alfonso A, Brañas-Garza P, Cuesta JA, Sánchez A. Triadic influence as a proxy for compatibility in social relationships. Proc Natl Acad Sci U S A 2023; 120:e2215041120. [PMID: 36947512 PMCID: PMC10068781 DOI: 10.1073/pnas.2215041120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2022] [Accepted: 02/14/2023] [Indexed: 03/23/2023] Open
Abstract
Networks of social interactions are the substrate upon which civilizations are built. Often, we create new bonds with people that we like or feel that our relationships are damaged through the intervention of third parties. Despite their importance and the huge impact that these processes have in our lives, quantitative scientific understanding of them is still in its infancy, mainly due to the difficulty of collecting large datasets of social networks including individual attributes. In this work, we present a thorough study of real social networks of 13 schools, with more than 3,000 students and 60,000 declared positive and negative relationships, including tests for personal traits of all the students. We introduce a metric-the "triadic influence"-that measures the influence of nearest neighbors in the relationships of their contacts. We use neural networks to predict the sign of the relationships in these social networks, extracting the probability that two students are friends or enemies depending on their personal attributes or the triadic influence. We alternatively use a high-dimensional embedding of the network structure to also predict the relationships. Remarkably, using the triadic influence (a simple one-dimensional metric) achieves the best accuracy, and adding the personal traits of the students does not improve the results, suggesting that the triadic influence acts as a proxy for the social compatibility of students. We postulate that the probabilities extracted from the neural networks-functions of the triadic influence and the personalities of the students-control the evolution of real social networks, opening an avenue for the quantitative study of these systems.
Collapse
Affiliation(s)
- Miguel Ruiz-García
- Departamento de Estructura de la Materia, Física Térmica y Electrónica, Universidad Complutense Madrid, Madrid28040, Spain
- Grupo Interdisciplinar de Sistemas Complejos (GISC), Madrid28911, Spain
- Departamento de Matemáticas, Universidad Carlos III de Madrid, Leganés28911, Spain
| | - Juan Ozaita
- Departamento de Matemáticas, Universidad Carlos III de Madrid, Leganés28911, Spain
| | - María Pereda
- Grupo Interdisciplinar de Sistemas Complejos (GISC), Madrid28911, Spain
- Grupo de Investigación Ingeniería de Organización y Logística (IOL), Departamento Ingeniería de Organización, Administración de empresas y Estadística, Escuela Técnica Superior de Ingenieros Industriales, Universidad Politécnica de Madrid, Madrid28006, Spain
| | - Antonio Alfonso
- LoyolaBehLAB, Department of Economics and Fundación ETEA, Universidad Loyola Andalucía, Córdoba14004, Spain
| | - Pablo Brañas-Garza
- LoyolaBehLAB, Department of Economics and Fundación ETEA, Universidad Loyola Andalucía, Córdoba14004, Spain
| | - José A. Cuesta
- Grupo Interdisciplinar de Sistemas Complejos (GISC), Madrid28911, Spain
- Departamento de Matemáticas, Universidad Carlos III de Madrid, Leganés28911, Spain
- Instituto de Biocomputación y Física de Sistemas Complejos (BIFI), Universidad de Zaragoza, Zaragoza50018, Spain
| | - Angel Sánchez
- Grupo Interdisciplinar de Sistemas Complejos (GISC), Madrid28911, Spain
- Departamento de Matemáticas, Universidad Carlos III de Madrid, Leganés28911, Spain
- Instituto de Biocomputación y Física de Sistemas Complejos (BIFI), Universidad de Zaragoza, Zaragoza50018, Spain
| |
Collapse
|
4
|
Saglietti L, Mannelli SS, Saxe A. An analytical theory of curriculum learning in teacher-student networks. JOURNAL OF STATISTICAL MECHANICS (ONLINE) 2022; 2022:114014. [PMID: 37817944 PMCID: PMC10561397 DOI: 10.1088/1742-5468/ac9b3c] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/04/2022] [Accepted: 10/13/2022] [Indexed: 10/12/2023]
Abstract
In animals and humans, curriculum learning-presenting data in a curated order-is critical to rapid learning and effective pedagogy. A long history of experiments has demonstrated the impact of curricula in a variety of animals but, despite its ubiquitous presence, a theoretical understanding of the phenomenon is still lacking. Surprisingly, in contrast to animal learning, curricula strategies are not widely used in machine learning and recent simulation studies reach the conclusion that curricula are moderately effective or even ineffective in most cases. This stark difference in the importance of curriculum raises a fundamental theoretical question: when and why does curriculum learning help? In this work, we analyse a prototypical neural network model of curriculum learning in the high-dimensional limit, employing statistical physics methods. We study a task in which a sparse set of informative features are embedded amidst a large set of noisy features. We analytically derive average learning trajectories for simple neural networks on this task, which establish a clear speed benefit for curriculum learning in the online setting. However, when training experiences can be stored and replayed (for instance, during sleep), the advantage of curriculum in standard neural networks disappears, in line with observations from the deep learning literature. Inspired by synaptic consolidation techniques developed to combat catastrophic forgetting, we propose curriculum-aware algorithms that consolidate synapses at curriculum change points and investigate whether this can boost the benefits of curricula. We derive generalisation performance as a function of consolidation strength (implemented as an L 2 regularisation/elastic coupling connecting learning phases), and show that curriculum-aware algorithms can yield a large improvement in test performance. Our reduced analytical descriptions help reconcile apparently conflicting empirical results, trace regimes where curriculum learning yields the largest gains, and provide experimentally-accessible predictions for the impact of task parameters on curriculum benefits. More broadly, our results suggest that fully exploiting a curriculum may require explicit adjustments in the loss.
Collapse
Affiliation(s)
- Luca Saglietti
- Institute for Data Science and Analytics, Bocconi University, Italy
| | - Stefano Sarao Mannelli
- Gatsby Computational Neuroscience Unit and Sainsbury Wellcome Centre, University College, London, United Kingdom
| | - Andrew Saxe
- Institute for Data Science and Analytics, Bocconi University, Italy
- FAIR, Meta AI, United States of America
| |
Collapse
|
5
|
Model architecture can transform catastrophic forgetting into positive transfer. Sci Rep 2022; 12:10736. [PMID: 35750768 PMCID: PMC9232654 DOI: 10.1038/s41598-022-14348-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2021] [Accepted: 06/06/2022] [Indexed: 11/10/2022] Open
Abstract
The work of McCloskey and Cohen popularized the concept of catastrophic interference. They used a neural network that tried to learn addition using two groups of examples as two different tasks. In their case, learning the second task rapidly deteriorated the acquired knowledge about the previous one. We hypothesize that this could be a symptom of a fundamental problem: addition is an algorithmic task that should not be learned through pattern recognition. Therefore, other model architectures better suited for this task would avoid catastrophic forgetting. We use a neural network with a different architecture that can be trained to recover the correct algorithm for the addition of binary numbers. This neural network includes conditional clauses that are naturally treated within the back-propagation algorithm. We test it in the setting proposed by McCloskey and Cohen and training on random additions one by one. The neural network not only does not suffer from catastrophic forgetting but it improves its predictive power on unseen pairs of numbers as training progresses. We also show that this is a robust effect, also present when averaging many simulations. This work emphasizes the importance that neural network architecture has for the emergence of catastrophic forgetting and introduces a neural network that is able to learn an algorithm.
Collapse
|