1
|
Grzywacz NM. Perceptual Complexity as Normalized Shannon Entropy. ENTROPY (BASEL, SWITZERLAND) 2025; 27:166. [PMID: 40003163 PMCID: PMC11854106 DOI: 10.3390/e27020166] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/06/2024] [Revised: 01/24/2025] [Accepted: 01/30/2025] [Indexed: 02/27/2025]
Abstract
Complexity is one of the most important variables in how the brain performs decision making based on esthetic values. Multiple definitions of perceptual complexity have been proposed, with one of the most fruitful being the Normalized Shannon Entropy one. However, the Normalized Shannon Entropy definition has theoretical gaps that we address in this article. Focusing on visual perception, we first address whether normalization fully corrects for the effects of measurement resolution on entropy. The answer is negative, but the remaining effects are minor, and we propose alternate definitions of complexity, correcting this problem. Related to resolution, we discuss the ideal spatial range in the computation of spatial complexity. The results show that this range must be small but not too small. Furthermore, it is suggested by the analysis of this range that perceptual spatial complexity is based solely on translational isometry. Finally, we study how the complexities of distinct visual variables interact. We argue that the complexities of the variables of interest to the brain's visual system may not interact linearly because of interclass correlation. But the interaction would be linear if the brain weighed complexities as in Kempthorne's λ-Bayes-based compromise problem. We finish by listing several experimental tests of these theoretical ideas on complexity.
Collapse
Affiliation(s)
- Norberto M. Grzywacz
- Department of Psychology, Loyola University Chicago, Chicago, IL 60660, USA; ; Tel.: +1-773-508-2970
- Department of Cognitive Science, Johns Hopkins University, Baltimore, MD 21218, USA
| |
Collapse
|
2
|
Silas S, Müllensiefen D, Kopiez R. Singing Ability Assessment: Development and validation of a singing test based on item response theory and a general open-source software environment for singing data. Behav Res Methods 2024; 56:4358-4384. [PMID: 37672190 PMCID: PMC11289018 DOI: 10.3758/s13428-023-02188-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/30/2023] [Indexed: 09/07/2023]
Abstract
We describe the development of the Singing Ability Assessment (SAA) open-source test environment. The SAA captures and scores different aspects of human singing ability and melodic memory in the context of item response theory. Taking perspectives from both melodic recall and singing accuracy literature, we present results from two online experiments (N = 247; N = 910). On-the-fly audio transcription is produced via a probabilistic algorithm and scored via latent variable approaches. Measures of the ability to sing long notes indicate a three-dimensional principal components analysis solution representing pitch accuracy, pitch volatility and changes in pitch stability (proportion variance explained: 35%; 33%; 32%). For melody singing, a mixed-effects model uses features of melodic structure (e.g., tonality, melody length) to predict overall sung melodic recall performance via a composite score [R2c = .42; R2m = .16]. Additionally, two separate mixed-effects models were constructed to explain performance in singing back melodies in a rhythmic [R2c = .42; R2m = .13] and an arhythmic [R2c = .38; R2m = .11] condition. Results showed that the yielded SAA melodic scores are significantly associated with previously described measures of singing accuracy, the long note singing accuracy measures, demographic variables, and features of participants' hardware setup. Consequently, we release five R packages which facilitate deploying melodic stimuli online and in laboratory contexts, constructing audio production tests, transcribing audio in the R environment, and deploying the test elements and their supporting models. These are published as open-source, easy to access, and flexible to adapt.
Collapse
Affiliation(s)
- Sebastian Silas
- Goldsmiths University of London, London, UK.
- Hanover Music Lab, Hanover University of Music, Drama and Media, Neues Haus 1, 30175, Hannover, Germany.
| | - Daniel Müllensiefen
- Goldsmiths University of London, London, UK
- Hanover Music Lab, Hanover University of Music, Drama and Media, Neues Haus 1, 30175, Hannover, Germany
| | - Reinhard Kopiez
- Hanover Music Lab, Hanover University of Music, Drama and Media, Neues Haus 1, 30175, Hannover, Germany
| |
Collapse
|
3
|
Senn O. A predictive coding approach to modelling the perceived complexity of popular music drum patterns. Heliyon 2023; 9:e15199. [PMID: 37123947 PMCID: PMC10130781 DOI: 10.1016/j.heliyon.2023.e15199] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2022] [Revised: 03/29/2023] [Accepted: 03/29/2023] [Indexed: 05/02/2023] Open
Abstract
This study presents a method to estimate the complexity of popular music drum patterns based on a core idea from predictive coding. Specifically, it postulates that the complexity of a drum pattern depends on the quantity of surprisal it causes in the listener. Surprisal, according to predictive coding theory, is a numerical measure that takes large values when the perceiver's internal model of the surrounding world fails to predict the actual stream of sensory data (i.e. when the perception surprises the perceiver), and low values if model predictions and sensory data agree. The proposed new method first approximates a listener's internal model of a popular music drum pattern (using ideas on enculturation and a Bayesian learning process). It then quantifies the listener's surprisal evaluating the discrepancies between the predictions of the internal model and the actual drum pattern. It finally estimates drum pattern complexity from surprisal. The method was optimised and tested using a set of forty popular music drum patterns, for which empirical perceived complexity measurements are available. The new method provided complexity estimates that had a good fit with the empirical measurements ( R 2 = . 852 ). The method was implemented as an R script that can be used to estimate the complexity of popular music drum patterns in the future. Simulations indicate that we can expect the method to predict perceived complexity with a good fit ( R 2 ≥ . 709 ) in 99% of drum pattern sets randomly drawn from the Western popular music repertoire. These results suggest that surprisal indeed captures essential aspects of complexity, and that it may serve as a basis for a general theory of perceived complexity.
Collapse
|
4
|
Basios V, Oikonomou T, De Gernier R. Symbolic dynamics of music from Europe and Japan. CHAOS (WOODBURY, N.Y.) 2021; 31:053122. [PMID: 34240945 DOI: 10.1063/5.0048396] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/22/2021] [Accepted: 04/27/2021] [Indexed: 06/13/2023]
Abstract
After a brief introduction to the theory underlying block-entropy and its relation to the dynamics of complex systems as well as certain information theory aspects, we study musical texts coming from two distinct musical traditions, Japanese and Western European, encoded via symbolic dynamics. We quantify their information content, also known as the degree of "non-randomness" which essentially defines the complexity of the text. We analyze the departure of "total randomness" to the constraints underlying the dynamics of the symbol generating process. Following Shannon on his attribution of these constraints as the key factors of the emergence of complexity, we observe that it can be accurately assessed by the texts' block-entropy vs block-length scaling laws.
Collapse
Affiliation(s)
- Vasileios Basios
- Service de Physique des Systèmes Complexes et Mécanique Statistique & Interdisciplinary Center for Nonlinear Phenomena and Complex Systems (C.P.231 CeNoLi-ULB), Université Libre de Bruxelles, Brussels 1050, Belgium
| | - Thomas Oikonomou
- College of Engineering and Computer Science, Vin University, 10000 Hanoi, Vietnam
| | - Robin De Gernier
- Faculté des Sciences Appliquées-École Polytechnique, Physique et Mathématique, Université Libre de Bruxelles, Brussels 1050, Belgium
| |
Collapse
|
5
|
Pesek M, Medvešek Š, Podlesek A, Tkalčič M, Marolt M. A Comparison of Human and Computational Melody Prediction Through Familiarity and Expertise. Front Psychol 2020; 11:557398. [PMID: 33362622 PMCID: PMC7756065 DOI: 10.3389/fpsyg.2020.557398] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2020] [Accepted: 11/13/2020] [Indexed: 11/16/2022] Open
Abstract
Melody prediction is an important aspect of music listening. The success of prediction, i.e., whether the next note played in a song is the same as the one predicted by the listener, depends on various factors. In the paper, we present two studies, where we assess how music familiarity and music expertise influence melody prediction in human listeners, and, expressed in appropriate data/algorithmic ways, computational models. To gather data on human listeners, we designed a melody prediction user study, where familiarity was controlled by two different music collections, while expertise was assessed by adapting the Music Sophistication Index instrument to Slovenian language. In the second study, we evaluated the melody prediction accuracy of computational melody prediction models. We evaluated two models, the SymCHM and the Implication-Realization model, which differ substantially in how they approach melody prediction. Our results show that both music familiarity and expertise affect the prediction accuracy of human listeners, as well as of computational models.
Collapse
Affiliation(s)
- Matevž Pesek
- Faculty of Computer and Information Science, University of Ljubljana, Ljubljana, Slovenia
| | - Špela Medvešek
- Faculty of Computer and Information Science, University of Ljubljana, Ljubljana, Slovenia
| | - Anja Podlesek
- Faculty of Arts, University of Ljubljana, Ljubljana, Slovenia
| | - Marko Tkalčič
- Faculty of Mathematics, Natural Sciences and Information Technologies, University of Primorska, Koper, Slovenia
| | - Matija Marolt
- Faculty of Computer and Information Science, University of Ljubljana, Ljubljana, Slovenia
| |
Collapse
|
6
|
Abstract
Given that complexity is critical for psychological processing, it is somewhat surprising that the field was dominated for a long time by probabilistic methods that focus on the quantitative aspects of the source/output. Although the more recent approaches based on the Minimum Description Length principle have produced interesting and useful models of psychological complexity, they have not directly defined the meaning and quantitative unit of complexity measurement. Contrasted to these mathematical approaches are various ad hoc measures based on different aspects of structure, which can work well but suffer from the same problem. The present manuscript is composed of two self-sufficient, yet related sections. In Section 1, we describe a complexity measure for binary strings which satisfies both these conditions (Aksentijevic–Gibson complexity; AG). We test the measure on a number of classic studies employing both short and long strings and draw attention to an important feature—a complexity profile—that could be of interest in modelling the psychological processing of structure as well as analysis of strings of any length. In Section 2 we discuss different factors affecting the complexity of visual form and showcase a 2D generalization of AG complexity. In addition, we provide algorithms in R that compute the AG complexity for binary strings and matrices and demonstrate their effectiveness on examples involving complexity judgments, symmetry perception, perceptual grouping, entropy, and elementary cellular automata. Finally, we enclose a repository of codes, data and stimuli for our example in order to facilitate experimentation and application of the measure in sciences outside psychology.
Collapse
|
7
|
Foldal MD, Blenkmann AO, Llorens A, Knight RT, Solbakk AK, Endestad T. The brain tracks auditory rhythm predictability independent of selective attention. Sci Rep 2020; 10:7975. [PMID: 32409738 PMCID: PMC7224206 DOI: 10.1038/s41598-020-64758-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2019] [Accepted: 04/07/2020] [Indexed: 11/16/2022] Open
Abstract
The brain responds to violations of expected rhythms, due to extraction- and prediction of the temporal structure in auditory input. Yet, it is unknown how probability of rhythm violations affects the overall rhythm predictability. Another unresolved question is whether predictive processes are independent of attention processes. In this study, EEG was recorded while subjects listened to rhythmic sequences. Predictability was manipulated by changing the stimulus-onset-asynchrony (SOA deviants) for given tones in the rhythm. When SOA deviants were inserted rarely, predictability remained high, whereas predictability was lower with more frequent SOA deviants. Dichotic tone-presentation allowed for independent manipulation of attention, as specific tones of the rhythm were presented to separate ears. Attention was manipulated by instructing subjects to attend to tones in one ear only, while keeping the rhythmic structure of tones constant. The analyses of event-related potentials revealed an attenuated N1 for tones when rhythm predictability was high, while the N1 was enhanced by attention to tones. Bayesian statistics revealed no interaction between predictability and attention. A right-lateralization of attention effects, but not predictability effects, suggested potentially different cortical processes. This is the first study to show that probability of rhythm violation influences rhythm predictability, independent of attention.
Collapse
Affiliation(s)
- Maja D Foldal
- Department of Psychology, University of Oslo, Oslo, Norway. .,RITMO Centre for Interdisciplinary Studies in Rhythm, Time and Motion, University of Oslo, Oslo, Norway.
| | - Alejandro O Blenkmann
- Department of Psychology, University of Oslo, Oslo, Norway.,RITMO Centre for Interdisciplinary Studies in Rhythm, Time and Motion, University of Oslo, Oslo, Norway
| | - Anaïs Llorens
- Department of Psychology, University of Oslo, Oslo, Norway.,Department of Neurosurgery, Oslo University Hospital, Oslo, Norway.,Department of Psychology and the Helen Wills Neuroscience Institute, University of California Berkeley, Berkeley, USA
| | - Robert T Knight
- Department of Psychology and the Helen Wills Neuroscience Institute, University of California Berkeley, Berkeley, USA
| | - Anne-Kristin Solbakk
- Department of Psychology, University of Oslo, Oslo, Norway.,RITMO Centre for Interdisciplinary Studies in Rhythm, Time and Motion, University of Oslo, Oslo, Norway.,Department of Neurosurgery, Oslo University Hospital, Oslo, Norway.,Department of Neuropsychology, Helgeland Hospital, Mosjøen, Norway
| | - Tor Endestad
- Department of Psychology, University of Oslo, Oslo, Norway.,RITMO Centre for Interdisciplinary Studies in Rhythm, Time and Motion, University of Oslo, Oslo, Norway.,Department of Neuropsychology, Helgeland Hospital, Mosjøen, Norway
| |
Collapse
|
8
|
Siegelman N, Bogaerts L, Frost R. What Determines Visual Statistical Learning Performance? Insights From Information Theory. Cogn Sci 2019; 43:e12803. [DOI: 10.1111/cogs.12803] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2019] [Revised: 10/17/2019] [Accepted: 11/05/2019] [Indexed: 12/31/2022]
Affiliation(s)
- Noam Siegelman
- Department of Psychology The Hebrew University of Jerusalem
- Haskins Laboratories
| | | | - Ram Frost
- Department of Psychology The Hebrew University of Jerusalem
- Haskins Laboratories
- Basque Center of Cognition, Brain and Language (BCBL)
| |
Collapse
|
9
|
Lumaca M, Trusbak Haumann N, Brattico E, Grube M, Vuust P. Weighting of neural prediction error by rhythmic complexity: A predictive coding account using mismatch negativity. Eur J Neurosci 2019; 49:1597-1609. [DOI: 10.1111/ejn.14329] [Citation(s) in RCA: 59] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2018] [Revised: 11/28/2018] [Accepted: 12/12/2018] [Indexed: 12/27/2022]
Affiliation(s)
- Massimo Lumaca
- Department of Clinical MedicineCenter for Music in the BrainAarhus University & The Royal Academy of Music Aarhus C Denmark
- SISSA International School for Advanced Studies Trieste Italy
| | - Niels Trusbak Haumann
- Department of Clinical MedicineCenter for Music in the BrainAarhus University & The Royal Academy of Music Aarhus C Denmark
| | - Elvira Brattico
- Department of Clinical MedicineCenter for Music in the BrainAarhus University & The Royal Academy of Music Aarhus C Denmark
| | - Manon Grube
- Department of Clinical MedicineCenter for Music in the BrainAarhus University & The Royal Academy of Music Aarhus C Denmark
| | - Peter Vuust
- Department of Clinical MedicineCenter for Music in the BrainAarhus University & The Royal Academy of Music Aarhus C Denmark
| |
Collapse
|