1
|
Torresan F, Baltieri M. Disentangled representations for causal cognition. Phys Life Rev 2024; 51:343-381. [PMID: 39500032 DOI: 10.1016/j.plrev.2024.10.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2024] [Accepted: 10/14/2024] [Indexed: 12/06/2024]
Abstract
Complex adaptive agents consistently achieve their goals by solving problems that seem to require an understanding of causal information, information pertaining to the causal relationships that exist among elements of combined agent-environment systems. Causal cognition studies and describes the main characteristics of causal learning and reasoning in human and non-human animals, offering a conceptual framework to discuss cognitive performances based on the level of apparent causal understanding of a task. Despite the use of formal intervention-based models of causality, including causal Bayesian networks, psychological and behavioural research on causal cognition does not yet offer a computational account that operationalises how agents acquire a causal understanding of the world seemingly from scratch, i.e. without a-priori knowledge of relevant features of the environment. Research on causality in machine and reinforcement learning, especially involving disentanglement as a candidate process to build causal representations, represents on the other hand a concrete attempt at designing artificial agents that can learn about causality, shedding light on the inner workings of natural causal cognition. In this work, we connect these two areas of research to build a unifying framework for causal cognition that will offer a computational perspective on studies of animal cognition, and provide insights in the development of new algorithms for causal reinforcement learning in AI.
Collapse
Affiliation(s)
- Filippo Torresan
- University of Sussex, Falmer, Brighton, BN1 9RH, United Kingdom.
| | - Manuel Baltieri
- University of Sussex, Falmer, Brighton, BN1 9RH, United Kingdom; Araya Inc., Chiyoda City, Tokyo, 101 0025, Japan.
| |
Collapse
|
2
|
Wolpert DH, Korbel J, Lynn CW, Tasnim F, Grochow JA, Kardeş G, Aimone JB, Balasubramanian V, De Giuli E, Doty D, Freitas N, Marsili M, Ouldridge TE, Richa AW, Riechers P, Roldán É, Rubenstein B, Toroczkai Z, Paradiso J. Is stochastic thermodynamics the key to understanding the energy costs of computation? Proc Natl Acad Sci U S A 2024; 121:e2321112121. [PMID: 39471216 PMCID: PMC11551414 DOI: 10.1073/pnas.2321112121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/01/2024] Open
Abstract
The relationship between the thermodynamic and computational properties of physical systems has been a major theoretical interest since at least the 19th century. It has also become of increasing practical importance over the last half-century as the energetic cost of digital devices has exploded. Importantly, real-world computers obey multiple physical constraints on how they work, which affects their thermodynamic properties. Moreover, many of these constraints apply to both naturally occurring computers, like brains or Eukaryotic cells, and digital systems. Most obviously, all such systems must finish their computation quickly, using as few degrees of freedom as possible. This means that they operate far from thermal equilibrium. Furthermore, many computers, both digital and biological, are modular, hierarchical systems with strong constraints on the connectivity among their subsystems. Yet another example is that to simplify their design, digital computers are required to be periodic processes governed by a global clock. None of these constraints were considered in 20th-century analyses of the thermodynamics of computation. The new field of stochastic thermodynamics provides formal tools for analyzing systems subject to all of these constraints. We argue here that these tools may help us understand at a far deeper level just how the fundamental thermodynamic properties of physical systems are related to the computation they perform.
Collapse
Affiliation(s)
- David H. Wolpert
- Santa Fe Institute, Santa Fe, NM87501
- Complexity Science Hub Vienna, Vienna1080, Austria
- School of Computing and Augmented Intelligence, Arizona State University, Tempe, AZ85287
- The Abdus Salam International Centre for Theoretical Physics, Trieste34151, Italy
- Albert Einstein Institute for Advanced Study in the Life Sciences, New York, NY10467
| | - Jan Korbel
- Complexity Science Hub Vienna, Vienna1080, Austria
- Institute for the Science of Complex Systems, Center for Medical Data Science (CeDAS), Medical University of Vienna, Vienna1090, Austria
| | - Christopher W. Lynn
- Center for the Physics of Biological Function, Princeton University, Princeton, NJ08544
- Center for the Physics of Biological Function, City University of New York, New York, NY10017
- Department of Physics, Yale University, New Haven, CT06520
| | | | - Joshua A. Grochow
- Department of Computer Science, University of Colorado Boulder, Boulder, CO80309
| | - Gülce Kardeş
- Santa Fe Institute, Santa Fe, NM87501
- Department of Computer Science, University of Colorado Boulder, Boulder, CO80309
| | | | - Vijay Balasubramanian
- Santa Fe Institute, Santa Fe, NM87501
- David Rittenhouse Laboratory, University of Pennsylvania, Philadelphia, PA19104
- Rudolf Peierls Centre for Theoretical Physics, University of Oxford, OX1 3PU, Oxford, United Kingdom
| | - Eric De Giuli
- Department of Physics, Toronto Metropolitan University, M5B 2K3, Toronto, ON, Canada
| | - David Doty
- Department of Computer Science, University of California, 95616, Davis, CA
| | - Nahuel Freitas
- Department of Physics, University of Buenos Aires, C1053, Buenos Aires, Argentina
| | - Matteo Marsili
- The Abdus Salam International Centre for Theoretical Physics, Trieste34151, Italy
| | - Thomas E. Ouldridge
- Department of Bioengineering, Imperial College London, SW7 2AZ, London, United Kingdom
- Centre for Synthetic Biology, Imperial College London, SW7 2AZ, London, United Kingdom
| | - Andréa W. Richa
- School of Computing and Augmented Intelligence, Arizona State University, Tempe, AZ85287
| | - Paul Riechers
- School of Physical and Mathematical Sciences, Nanyang Quantum Hub, Nanyang Technological University, Singapore639798, Singapore
| | - Édgar Roldán
- The Abdus Salam International Centre for Theoretical Physics, Trieste34151, Italy
| | | | - Zoltan Toroczkai
- Department of Physics and Astronomy, University of Notre Dame, Notre Dame, IN46556
| | - Joseph Paradiso
- Massachusetts Institute of Technology Media Lab, Massachusetts Institute of Technology, Cambridge, MA02139
| |
Collapse
|
3
|
Karbowski J. Information Thermodynamics: From Physics to Neuroscience. ENTROPY (BASEL, SWITZERLAND) 2024; 26:779. [PMID: 39330112 PMCID: PMC11431499 DOI: 10.3390/e26090779] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/06/2024] [Revised: 09/05/2024] [Accepted: 09/09/2024] [Indexed: 09/28/2024]
Abstract
This paper provides a perspective on applying the concepts of information thermodynamics, developed recently in non-equilibrium statistical physics, to problems in theoretical neuroscience. Historically, information and energy in neuroscience have been treated separately, in contrast to physics approaches, where the relationship of entropy production with heat is a central idea. It is argued here that also in neural systems, information and energy can be considered within the same theoretical framework. Starting from basic ideas of thermodynamics and information theory on a classic Brownian particle, it is shown how noisy neural networks can infer its probabilistic motion. The decoding of the particle motion by neurons is performed with some accuracy, and it has some energy cost, and both can be determined using information thermodynamics. In a similar fashion, we also discuss how neural networks in the brain can learn the particle velocity and maintain that information in the weights of plastic synapses from a physical point of view. Generally, it is shown how the framework of stochastic and information thermodynamics can be used practically to study neural inference, learning, and information storing.
Collapse
Affiliation(s)
- Jan Karbowski
- Institute of Applied Mathematics and Mechanics, Department of Mathematics, Informatics and Mechanics, University of Warsaw, 02-097 Warsaw, Poland
| |
Collapse
|
4
|
Cocconi L, Chen L. Efficiency of an autonomous, dynamic information engine operating on a single active particle. Phys Rev E 2024; 110:014602. [PMID: 39161009 DOI: 10.1103/physreve.110.014602] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2024] [Accepted: 06/10/2024] [Indexed: 08/21/2024]
Abstract
The Szilard engine stands as a compelling illustration of the intricate interplay between information and thermodynamics. While at thermodynamic equilibrium, the apparent breach of the second law of thermodynamics was reconciled by Landauer and Bennett's insights into memory writing and erasure, recent extensions of these concepts into regimes featuring active fluctuations have unveiled the prospect of exceeding Landauer's bound, capitalizing on information to divert free energy from dissipation towards useful work. To explore this question further, we investigate an autonomous dynamic information engine, addressing the thermodynamic consistency of work extraction and measurement costs by extending the phase space to incorporate an auxiliary system, which plays the role of an explicit measurement device. The nonreciprocal coupling between active particle and measurement device introduces a feedback control loop, and the cost of measurement is quantified through a suitably defined auxiliary entropy production. The study considers different measurement scenarios, highlighting the role of measurement precision in determining engine efficiency.
Collapse
|
5
|
Grelier M, Sivak DA, Ehrich J. Unlocking the potential of information flow: Maximizing free-energy transduction in a model of an autonomous rotary molecular motor. Phys Rev E 2024; 109:034115. [PMID: 38632770 DOI: 10.1103/physreve.109.034115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Accepted: 01/30/2024] [Indexed: 04/19/2024]
Abstract
Molecular motors fulfill critical functions within all living beings. Understanding their underlying working principles is therefore of great interest. Here we develop a simple model inspired by the two-component biomolecular motor F_{o}-F_{1} ATP synthase. We analyze its energetics and characterize information flows between the machine's components. At maximum output power we find that information transduction plays a minor role for free-energy transduction. However, when the two components are coupled to different environments (e.g., when in contact with heat baths at different temperatures), we show that information flow becomes a resource worth exploiting to maximize free-energy transduction. Our findings suggest that real-world powerful and efficient information engines could be found in machines whose components are subjected to fluctuations of different strength, since in this situation the benefit gained from using information for work extraction can outweigh the costs of information generation.
Collapse
Affiliation(s)
- Mathis Grelier
- Department of Physics, Simon Fraser University, Burnaby, BC, V5A 1S6 Canada
- PULS Group, Department of Physics, FAU Erlangen-Nürnberg, IZNF, 91058 Erlangen, Germany
| | - David A Sivak
- Department of Physics, Simon Fraser University, Burnaby, BC, V5A 1S6 Canada
| | - Jannik Ehrich
- Department of Physics, Simon Fraser University, Burnaby, BC, V5A 1S6 Canada
| |
Collapse
|
6
|
Saha TK, Ehrich J, Gavrilov M, Still S, Sivak DA, Bechhoefer J. Information Engine in a Nonequilibrium Bath. PHYSICAL REVIEW LETTERS 2023; 131:057101. [PMID: 37595211 DOI: 10.1103/physrevlett.131.057101] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/30/2022] [Accepted: 06/29/2023] [Indexed: 08/20/2023]
Abstract
Information engines can convert thermal fluctuations of a bath at temperature T into work at rates of order k_{B}T per relaxation time of the system. We show experimentally that such engines, when in contact with a bath that is out of equilibrium, can extract much more work. We place a heavy, micron-scale bead in a harmonic potential that ratchets up to capture favorable fluctuations. Adding a fluctuating electric field increases work extraction up to ten times, limited only by the strength of the applied field. Our results connect Maxwell's demon with energy harvesting and demonstrate that information engines in nonequilibrium baths can greatly outperform conventional engines.
Collapse
Affiliation(s)
- Tushar K Saha
- Department of Physics, Simon Fraser University, Burnaby, British Columbia, V5A 1S6 Canada
| | - Jannik Ehrich
- Department of Physics, Simon Fraser University, Burnaby, British Columbia, V5A 1S6 Canada
- Department of Physics and Astronomy, University of Hawaii at Mānoa, Honolulu, Hawaii 96822, USA
| | - Momčilo Gavrilov
- Department of Physics, Simon Fraser University, Burnaby, British Columbia, V5A 1S6 Canada
| | - Susanne Still
- Department of Physics and Astronomy, University of Hawaii at Mānoa, Honolulu, Hawaii 96822, USA
| | - David A Sivak
- Department of Physics, Simon Fraser University, Burnaby, British Columbia, V5A 1S6 Canada
| | - John Bechhoefer
- Department of Physics, Simon Fraser University, Burnaby, British Columbia, V5A 1S6 Canada
| |
Collapse
|
7
|
Lynn CW, Holmes CM, Bialek W, Schwab DJ. Decomposing the Local Arrow of Time in Interacting Systems. PHYSICAL REVIEW LETTERS 2022; 129:118101. [PMID: 36154397 PMCID: PMC9751844 DOI: 10.1103/physrevlett.129.118101] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/08/2022] [Revised: 06/03/2022] [Accepted: 06/24/2022] [Indexed: 05/30/2023]
Abstract
We show that the evidence for a local arrow of time, which is equivalent to the entropy production in thermodynamic systems, can be decomposed. In a system with many degrees of freedom, there is a term that arises from the irreversible dynamics of the individual variables, and then a series of non-negative terms contributed by correlations among pairs, triplets, and higher-order combinations of variables. We illustrate this decomposition on simple models of noisy logical computations, and then apply it to the analysis of patterns of neural activity in the retina as it responds to complex dynamic visual scenes. We find that neural activity breaks detailed balance even when the visual inputs do not, and that this irreversibility arises primarily from interactions between pairs of neurons.
Collapse
Affiliation(s)
- Christopher W Lynn
- Initiative for the Theoretical Sciences, The Graduate Center, City University of New York, New York, New York 10016, USA
- Joseph Henry Laboratories of Physics and Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey 08544, USA
| | - Caroline M Holmes
- Joseph Henry Laboratories of Physics and Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey 08544, USA
| | - William Bialek
- Initiative for the Theoretical Sciences, The Graduate Center, City University of New York, New York, New York 10016, USA
- Joseph Henry Laboratories of Physics and Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey 08544, USA
| | - David J Schwab
- Initiative for the Theoretical Sciences, The Graduate Center, City University of New York, New York, New York 10016, USA
| |
Collapse
|
8
|
Lynn CW, Holmes CM, Bialek W, Schwab DJ. Emergence of local irreversibility in complex interacting systems. Phys Rev E 2022; 106:034102. [PMID: 36266789 PMCID: PMC9751845 DOI: 10.1103/physreve.106.034102] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2022] [Accepted: 06/24/2022] [Indexed: 04/28/2023]
Abstract
Living systems are fundamentally irreversible, breaking detailed balance and establishing an arrow of time. But how does the evident arrow of time for a whole system arise from the interactions among its multiple elements? We show that the local evidence for the arrow of time, which is the entropy production for thermodynamic systems, can be decomposed. First, it can be split into two components: an independent term reflecting the dynamics of individual elements and an interaction term driven by the dependencies among elements. Adapting tools from nonequilibrium physics, we further decompose the interaction term into contributions from pairs of elements, triplets, and higher-order terms. We illustrate our methods on models of cellular sensing and logical computations, as well as on patterns of neural activity in the retina as it responds to visual inputs. We find that neural activity can define the arrow of time even when the visual inputs do not, and that the dominant contribution to this breaking of detailed balance comes from interactions among pairs of neurons.
Collapse
Affiliation(s)
- Christopher W Lynn
- Initiative for the Theoretical Sciences, Graduate Center, City University of New York, New York, New York 10016, USA
- Joseph Henry Laboratories of Physics and Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey 08544, USA
| | - Caroline M Holmes
- Joseph Henry Laboratories of Physics and Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey 08544, USA
| | - William Bialek
- Initiative for the Theoretical Sciences, Graduate Center, City University of New York, New York, New York 10016, USA
- Joseph Henry Laboratories of Physics and Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey 08544, USA
| | - David J Schwab
- Initiative for the Theoretical Sciences, Graduate Center, City University of New York, New York, New York 10016, USA
| |
Collapse
|
9
|
Tan AK, Tegmark M, Chuang IL. Pareto-Optimal Clustering with the Primal Deterministic Information Bottleneck. ENTROPY (BASEL, SWITZERLAND) 2022; 24:771. [PMID: 35741492 PMCID: PMC9222302 DOI: 10.3390/e24060771] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/15/2022] [Revised: 05/24/2022] [Accepted: 05/24/2022] [Indexed: 02/04/2023]
Abstract
At the heart of both lossy compression and clustering is a trade-off between the fidelity and size of the learned representation. Our goal is to map out and study the Pareto frontier that quantifies this trade-off. We focus on the optimization of the Deterministic Information Bottleneck (DIB) objective over the space of hard clusterings. To this end, we introduce the primal DIB problem, which we show results in a much richer frontier than its previously studied Lagrangian relaxation when optimized over discrete search spaces. We present an algorithm for mapping out the Pareto frontier of the primal DIB trade-off that is also applicable to other two-objective clustering problems. We study general properties of the Pareto frontier, and we give both analytic and numerical evidence for logarithmic sparsity of the frontier in general. We provide evidence that our algorithm has polynomial scaling despite the super-exponential search space, and additionally, we propose a modification to the algorithm that can be used where sampling noise is expected to be significant. Finally, we use our algorithm to map the DIB frontier of three different tasks: compressing the English alphabet, extracting informative color classes from natural images, and compressing a group theory-inspired dataset, revealing interesting features of frontier, and demonstrating how the structure of the frontier can be used for model selection with a focus on points previously hidden by the cloak of the convex hull.
Collapse
Affiliation(s)
- Andrew K. Tan
- Department of Physics, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; (M.T.); (I.L.C.)
- The NSF AI Institute for Artificial Intelligence and Fundamental Interactions, Cambridge, MA 02139, USA
| | - Max Tegmark
- Department of Physics, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; (M.T.); (I.L.C.)
- The NSF AI Institute for Artificial Intelligence and Fundamental Interactions, Cambridge, MA 02139, USA
| | - Isaac L. Chuang
- Department of Physics, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; (M.T.); (I.L.C.)
- The NSF AI Institute for Artificial Intelligence and Fundamental Interactions, Cambridge, MA 02139, USA
- Research Laboratory of Electronics, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| |
Collapse
|
10
|
Da Costa L, Friston K, Heins C, Pavliotis GA. Bayesian mechanics for stationary processes. Proc Math Phys Eng Sci 2022; 477:20210518. [PMID: 35153603 PMCID: PMC8652275 DOI: 10.1098/rspa.2021.0518] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2021] [Accepted: 10/27/2021] [Indexed: 01/02/2023] Open
Abstract
This paper develops a Bayesian mechanics for adaptive systems. Firstly, we model the interface between a system and its environment with a Markov blanket. This affords conditions under which states internal to the blanket encode information about external states. Second, we introduce dynamics and represent adaptive systems as Markov blankets at steady state. This allows us to identify a wide class of systems whose internal states appear to infer external states, consistent with variational inference in Bayesian statistics and theoretical neuroscience. Finally, we partition the blanket into sensory and active states. It follows that active states can be seen as performing active inference and well-known forms of stochastic control (such as PID control), which are prominent formulations of adaptive behaviour in theoretical biology and engineering.
Collapse
Affiliation(s)
- Lancelot Da Costa
- Department of Mathematics, Imperial College London, London SW7 2AZ, UK.,Wellcome Centre for Human Neuroimaging, University College London, London WC1N 3AR, UK
| | - Karl Friston
- Wellcome Centre for Human Neuroimaging, University College London, London WC1N 3AR, UK
| | - Conor Heins
- Department of Collective Behaviour, Max Planck Institute of Animal Behavior, Konstanz D-78457, Germany.,Centre for the Advanced Study of Collective Behaviour, University of Konstanz, Konstanz D-78457, Germany.,Department of Biology, University of Konstanz, Konstanz D-78457, Germany
| | | |
Collapse
|
11
|
Ali SY, Rafeek R, Mondal D. Geometric Brownian information engine: Upper bound of the achievable work under feedback control. J Chem Phys 2022; 156:014902. [PMID: 34998347 DOI: 10.1063/5.0069582] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
We design a geometric Brownian information engine by considering overdamped Brownian particles inside a two-dimensional monolobal confinement with irregular width along the transport direction. Under such detention, particles experience an effective entropic potential which has a logarithmic form. We employ a feedback control protocol as an outcome of error-free position measurement. The protocol comprises three stages: measurement, feedback, and relaxation. We reposition the center of the confinement to the measurement distance (xp) instantaneously when the position of the trapped particle crosses xp for the first time. Then, the particle is allowed for thermal relaxation. We calculate the extractable work, total information, and unavailable information associated with the feedback control using this equilibrium probability distribution function. We find the exact analytical value of the upper bound of extractable work as (53-2ln2)kBT. We introduce a constant force G downward to the transverse coordinate (y). A change in G alters the effective potential of the system and tunes the relative dominance of entropic and energetic contributions in it. The upper bound of the achievable work shows a crossover from (53-2ln2)kBT to 12kBT when the system changes from an entropy-dominated regime to an energy-dominated one. Compared to an energetic analog, the loss of information during the relaxation process is higher in the entropy-dominated region, which accredits the less value in achievable work. Theoretical predictions are in good agreement with the Langevin dynamics simulation studies.
Collapse
Affiliation(s)
- Syed Yunus Ali
- Department of Chemistry and Center for Molecular and Optical Sciences and Technologies, Indian Institute of Technology Tirupati, Yerpedu 517619, Andhra Pradesh, India
| | - Rafna Rafeek
- Department of Chemistry and Center for Molecular and Optical Sciences and Technologies, Indian Institute of Technology Tirupati, Yerpedu 517619, Andhra Pradesh, India
| | - Debasish Mondal
- Department of Chemistry and Center for Molecular and Optical Sciences and Technologies, Indian Institute of Technology Tirupati, Yerpedu 517619, Andhra Pradesh, India
| |
Collapse
|
12
|
Song J, Still S, Díaz Hernández Rojas R, Pérez Castillo I, Marsili M. Optimal work extraction and mutual information in a generalized Szilárd engine. Phys Rev E 2021; 103:052121. [PMID: 34134259 DOI: 10.1103/physreve.103.052121] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2019] [Accepted: 04/08/2021] [Indexed: 11/07/2022]
Abstract
A 1929 Gedankenexperiment proposed by Szilárd, often referred to as "Szilárd's engine", has served as a foundation for computing fundamental thermodynamic bounds to information processing. While Szilárd's original box could be partitioned into two halves and contains one gas molecule, we calculate here the maximal average work that can be extracted in a system with N particles and q partitions, given an observer which counts the molecules in each partition, and given a work extraction mechanism that is limited to pressure equalization. We find that the average extracted work is proportional to the mutual information between the one-particle position and the vector containing the counts of how many particles are in each partition. We optimize this quantity over the initial locations of the dividing walls, and find that there exists a critical number of particles N^{★}(q) below which the extracted work is maximized by a symmetric configuration of the q partitions, and above which the optimal partitioning is asymmetric. Overall, the average extracted work is maximized for a number of particles N[over ̂](q)<N^{★}(q), with a symmetric partition. We calculate asymptotic values for N→∞.
Collapse
Affiliation(s)
- Juyong Song
- Samsung Research, Samsung Electronics Co., Ltd., Seoul, 06765, Korea
| | - Susanne Still
- Department of Information and Computer Sciences, and Department of Physics and Astronomy, University of Hawai'i at Mānoa, Honolulu, Hawaii 96822, USA
| | | | - Isaac Pérez Castillo
- Departamento de Física, Universidad Autónoma Metropolitana-Iztapalapa, San Rafael Atlixco 186, Ciudad de México 09340, Mexico
| | - Matteo Marsili
- The Abdus Salam International Centre for Theoretical Physics (ICTP), Trieste 34151, Italy
| |
Collapse
|
13
|
Abstract
Information-driven engines that rectify thermal fluctuations are a modern realization of the Maxwell-demon thought experiment. We introduce a simple design based on a heavy colloidal particle, held by an optical trap and immersed in water. Using a carefully designed feedback loop, our experimental realization of an "information ratchet" takes advantage of favorable "up" fluctuations to lift a weight against gravity, storing potential energy without doing external work. By optimizing the ratchet design for performance via a simple theory, we find that the rate of work storage and velocity of directed motion are limited only by the physical parameters of the engine: the size of the particle, stiffness of the ratchet spring, friction produced by the motion, and temperature of the surrounding medium. Notably, because performance saturates with increasing frequency of observations, the measurement process is not a limiting factor. The extracted power and velocity are at least an order of magnitude higher than in previously reported engines.
Collapse
|
14
|
Ray KJ, Crutchfield JP. Variations on a demonic theme: Szilard's other engines. CHAOS (WOODBURY, N.Y.) 2020; 30:093105. [PMID: 33003907 DOI: 10.1063/5.0012052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/28/2020] [Accepted: 08/08/2020] [Indexed: 06/11/2023]
Abstract
Szilard's now-famous single-molecule engine was only the first of three constructions he introduced in 1929 to resolve several challenges arising from Maxwell's demon paradox. Given that it has been thoroughly analyzed, we analyze Szilard's remaining two demon models. We show that the second one, though a markedly different implementation employing a population of distinct molecular species and semipermeable membranes, is informationally and thermodynamically equivalent to an ideal gas of the single-molecule engines. One concludes that (i) it reduces to a chaotic dynamical system-called the Szilard Map, a composite of three piecewise linear maps and associated thermodynamic transformations that implement measurement, control, and erasure; (ii) its transitory functioning as an engine that converts disorganized heat energy to work is governed by the Kolmogorov-Sinai entropy rate; (iii) the demon's minimum necessary "intelligence" for optimal functioning is given by the engine's statistical complexity; and (iv) its functioning saturates thermodynamic bounds and so it is a minimal, optimal implementation. We show that Szilard's third construction is rather different and addresses the fundamental issue raised by the first two: the link between entropy production and the measurement task required to implement either of his engines. The analysis gives insight into designing and implementing novel nanoscale information engines by investigating the relationships between the demon's memory, the nature of the "working fluid," and the thermodynamic costs of erasure and measurement.
Collapse
Affiliation(s)
- Kyle J Ray
- Complexity Sciences Center and Physics Department, University of California at Davis, One Shields Avenue, Davis, California 95616, USA
| | - James P Crutchfield
- Complexity Sciences Center and Physics Department, University of California at Davis, One Shields Avenue, Davis, California 95616, USA
| |
Collapse
|