1
|
Lameira AR, Caneco B, Kershenbaum A, Santamaría-Bonfil G, Call J. Generative vocal plasticity in chimpanzees. iScience 2025; 28:112381. [PMID: 40322082 PMCID: PMC12049825 DOI: 10.1016/j.isci.2025.112381] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2024] [Revised: 01/15/2025] [Accepted: 04/04/2025] [Indexed: 05/08/2025] Open
Abstract
Modern theory posits that human-ape differences in voice command account for speech evolution. However, comparison has been indirect and conjectural based on vocal learning taxa far related from Hominids, instead of direct and quantitative based on great ape calls that, like all speech sounds, are local-specific and non-universal to the species. Moreover, the null hypothesis that the great ape voice command is purely reflexive has never been directly tested. Here, we show that in controlled, constant experimental settings, captive chimpanzees exhibit high-dimensional dexterity over voice activation and modulation in two atypical vowel-like calls. Subjects made unrestricted, multidimensional, and distinct voice changes within and between individuals, inducing parameter changes up to 10,000%, rejecting null hypothesis' predictions. Forecasting models indicated unmitigated voice novelty, altogether demonstrating emancipated and vast real-time voice control. Findings show that, contrary to traditional assumptions, speech and song evolution likely hinged on prolific voice command already available in ancestral ape-like ancestors.
Collapse
Affiliation(s)
| | | | - Arik Kershenbaum
- Department of Zoology, University of Cambridge, Cambridge, UK
- Girton College, University of Cambridge, Cambridge, UK
| | | | - Josep Call
- Department of Psychology and Neuroscience, University of St Andrews, St Andrews, UK
| |
Collapse
|
2
|
Lameira AR. Vocal Learning Versus Speech Evolution: Untangling a False Equivalence. Ecol Evol 2025; 15:e71241. [PMID: 40190798 PMCID: PMC11968774 DOI: 10.1002/ece3.71241] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2025] [Revised: 03/11/2025] [Accepted: 03/26/2025] [Indexed: 04/09/2025] Open
Abstract
The evolution of speech remains one of the most profound and unresolved questions in science. Despite significant advancements in comparative research, key assumptions about the evolutionary precursors of speech continue to be accepted with minimal scrutiny. One such assumption is the widely held belief that vocal learning-the ability to imitate and modify vocalizations-was an obligatory precondition for speech evolution. However, by the time ape-like human ancestors emerged amid Miocene's forests, the ancestors of vocal learning species already walked the Earth and flew the skies. A head-start of millions of years of vocal evolution didn't produce linguistic elephants, bats, or birds, suggesting that hominids' humble vocal beginnings were determinant for verbal evolution. Current evidence on extant great ape calls provides new details and insight into the extinct vocal forms and functions that allowed human ancestors to jump-start speech evolution. By reconsidering the evolutionary processes that led to speech, this paper advocates for a shift in focus toward the hominid biotope, body, brain, and behavior, rather than treating speech as the pinnacle endpoint of vocal learning evolution and drawing misleading parallels with far-related vocal learners.
Collapse
|
3
|
Ekström AG, Gärdenfors P, Snyder WD, Friedrichs D, McCarthy RC, Tsapos M, Tennie C, Strait DS, Edlund J, Moran S. Correlates of Vocal Tract Evolution in Late Pliocene and Pleistocene Hominins. HUMAN NATURE (HAWTHORNE, N.Y.) 2025; 36:22-69. [PMID: 40244547 PMCID: PMC12058909 DOI: 10.1007/s12110-025-09487-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Accepted: 02/20/2025] [Indexed: 04/18/2025]
Abstract
Despite decades of research on the emergence of human speech capacities, an integrative account consistent with hominin evolution remains lacking. We review paleoanthropological and archaeological findings in search of a timeline for the emergence of modern human articulatory morphological features. Our synthesis shows that several behavioral innovations coincide with morphological changes to the would-be speech articulators. We find that significant reductions of the mandible and masticatory muscles and vocal tract anatomy coincide in the hominin fossil record with the incorporation of processed and (ultimately) cooked food, the appearance and development of rudimentary stone tools, increases in brain size, and likely changes to social life and organization. Many changes are likely mutually reinforcing; for example, gracilization of the hominin mandible may have been maintainable in the lineage because food processing had already been outsourced to the hands and stone tools, reducing selection pressures for robust mandibles in the process. We highlight correlates of the evolution of craniofacial and vocal tract features in the hominin lineage and outline a timeline by which our ancestors became 'pre-adapted' for the evolution of fully modern human speech.
Collapse
Affiliation(s)
- Axel G Ekström
- Speech, Music & Hearing, KTH Royal Institute of Technology, Stockholm, Sweden.
- Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland.
| | - Peter Gärdenfors
- Department of Philosophy, Lund University, Lund, Sweden
- Paleo-Research Institute, University of Johannesburg, Johannesburg, South Africa
| | - William D Snyder
- Senckenberg Centre for Human Evolution and Palaeoenvironment, University of Tübingen, Tübingen, Germany
- Early Prehistory and Quaternary Ecology, Department of Geosciences, University of Tübingen, Tübingen, Germany
| | - Daniel Friedrichs
- Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland
- Linguistics Research Infrastructure (LiRI), University of Zurich, Zürich, Switzerland
| | - Robert C McCarthy
- Department of Biological Sciences, Benedictine University, Lisle, IL, US
| | - Melina Tsapos
- Department of Philosophy, Lund University, Lund, Sweden
| | - Claudio Tennie
- Early Prehistory and Quaternary Ecology, Department of Geosciences, University of Tübingen, Tübingen, Germany
| | - David S Strait
- Paleo-Research Institute, University of Johannesburg, Johannesburg, South Africa
- Department of Anthropology, Washington University in St. Louis, St. Louis, MO, US
- DFG Center for Advanced Studies "Words, Bones, Genes, Tools", University of Tübingen, Tübingen, Germany
| | - Jens Edlund
- Speech, Music & Hearing, KTH Royal Institute of Technology, Stockholm, Sweden
| | - Steven Moran
- Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland
- Linguistics Research Infrastructure (LiRI), University of Zurich, Zürich, Switzerland
- Department of Anthropology, University of Miami, Coral Gables, FL, US
| |
Collapse
|
4
|
Ekström AG, Gannon C, Edlund J, Moran S, Lameira AR. Chimpanzee utterances refute purported missing links for novel vocalizations and syllabic speech. Sci Rep 2024; 14:17135. [PMID: 39054330 PMCID: PMC11272771 DOI: 10.1038/s41598-024-67005-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2024] [Accepted: 07/08/2024] [Indexed: 07/27/2024] Open
Abstract
Nonhuman great apes have been claimed to be unable to learn human words due to a lack of the necessary neural circuitry. We recovered original footage of two enculturated chimpanzees uttering the word "mama" and subjected recordings to phonetic analysis. Our analyses demonstrate that chimpanzees are capable of syllabic production, achieving consonant-to-vowel phonetic contrasts via the simultaneous recruitment and coupling of voice, jaw and lips. In an online experiment, human listeners naive to the recordings' origins reliably perceived chimpanzee utterances as syllabic utterances, primarily as "ma-ma", among foil syllables. Our findings demonstrate that in the absence of direct data-driven examination, great ape vocal production capacities have been underestimated. Chimpanzees possess the neural building blocks necessary for speech.
Collapse
Affiliation(s)
- Axel G Ekström
- Speech, Music & Hearing, KTH Royal Institute of Technology, Lindstedtsvägen 24, 118 28, Stockholm, Sweden.
| | | | - Jens Edlund
- Speech, Music & Hearing, KTH Royal Institute of Technology, Lindstedtsvägen 24, 118 28, Stockholm, Sweden
| | - Steven Moran
- Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland
- Department of Anthropology, University of Miami, Coral Gables, USA
| | | |
Collapse
|
5
|
Lameira AR, Hardus ME, Ravignani A, Raimondi T, Gamba M. Recursive self-embedded vocal motifs in wild orangutans. eLife 2024; 12:RP88348. [PMID: 38252123 PMCID: PMC10945596 DOI: 10.7554/elife.88348] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2024] Open
Abstract
Recursive procedures that allow placing a vocal signal inside another of a similar kind provide a neuro-computational blueprint for syntax and phonology in spoken language and human song. There are, however, no known vocal sequences among nonhuman primates arranged in self-embedded patterns that evince vocal recursion or potential incipient or evolutionary transitional forms thereof, suggesting a neuro-cognitive transformation exclusive to humans. Here, we uncover that wild flanged male orangutan long calls feature rhythmically isochronous call sequences nested within isochronous call sequences, consistent with two hierarchical strata. Remarkably, three temporally and acoustically distinct call rhythms in the lower stratum were not related to the overarching rhythm at the higher stratum by any low multiples, which suggests that these recursive structures were neither the result of parallel non-hierarchical procedures nor anatomical artifacts of bodily constraints or resonances. Findings represent a case of temporally recursive hominid vocal combinatorics in the absence of syntax, semantics, phonology, or music. Second-order combinatorics, 'sequences within sequences', involving hierarchically organized and cyclically structured vocal sounds in ancient hominids may have preluded the evolution of recursion in modern language-able humans.
Collapse
Affiliation(s)
- Adriano R Lameira
- Department of Psychology, University of WarwickCoventryUnited Kingdom
| | | | - Andrea Ravignani
- Comparative Bioacoustics Group, Max Planck Institute for PsycholinguisticsNijmegenNetherlands
- Center for Music in the Brain, Department of Clinical Medicine, Aarhus University & The Royal Academy of Music Aarhus/AalborgAarhusDenmark
- Department of Human Neurosciences, Sapienza University of RomeRomeItaly
| | - Teresa Raimondi
- Department of Life Sciences and Systems Biology, University of TurinoTorinoItaly
| | - Marco Gamba
- Department of Life Sciences and Systems Biology, University of TurinoTorinoItaly
| |
Collapse
|
6
|
Gannon C, Hill RA, Lameira AR. Open plains are not a level playing field for hominid consonant-like versus vowel-like calls. Sci Rep 2023; 13:21138. [PMID: 38129443 PMCID: PMC10739746 DOI: 10.1038/s41598-023-48165-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Accepted: 11/22/2023] [Indexed: 12/23/2023] Open
Abstract
Africa's paleo-climate change represents an "ecological black-box" along the evolutionary timeline of spoken language; a vocal hominid went in and, millions of years later, out came a verbal human. It is unknown whether or how a shift from forested, dense habitats towards drier, open ones affected hominid vocal communication, potentially setting stage for speech evolution. To recreate how arboreal proto-vowels and proto-consonants would have interacted with a new ecology at ground level, we assessed how a series of orangutan voiceless consonant-like and voiced vowel-like calls travelled across the savannah. Vowel-like calls performed poorly in comparison to their counterparts. Only consonant-like calls afforded effective perceptibility beyond 100 m distance without requiring repetition, as is characteristic of loud calling behaviour in nonhuman primates, typically composed by vowel-like calls. Results show that proto-consonants in human ancestors may have enhanced reliability of distance vocal communication across a canopy-to-ground ecotone. The ecological settings and soundscapes experienced by human ancestors may have had a more profound impact on the emergence and shape of spoken language than previously recognized.
Collapse
Affiliation(s)
| | - Russell A Hill
- Department of Anthropology, Durham University, Durham, UK
- Primate and Predator Project, Soutpansberg Mountains, Thohoyandou, South Africa
- Department of Biological Sciences, University of Venda, Thohoyandou, South Africa
| | | |
Collapse
|
7
|
Bourjade M, Dafreville M, Scola C, Jover M. Six-month-old infants' communication in a comparative perspective: Do maternal attention and interaction matter? J Exp Child Psychol 2023; 231:105651. [PMID: 36842316 DOI: 10.1016/j.jecp.2023.105651] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2022] [Revised: 01/25/2023] [Accepted: 01/25/2023] [Indexed: 02/27/2023]
Abstract
Developmental precursors of the prelinguistic transition from gestures to word use can be found in the early pragmatic usage of auditory and visual signals across contexts. This study examined whether 6-month-old infants are capable of attention-sensitive communication with their mother, that is, adjusting the sensory modality of their communicative signals to their mother's attention. Proxies of maternal attention implemented in experimental conditions were the mother's visual attention (attentive/inattentive), interaction directed at the infant (interactive/non-interactive), and distance (far/close). The infants' signals were coded as either visual or auditory, following an ethological coding. Infants adjusted the sensory modality of their communicative signals mostly to maternal interaction. More auditory signals were produced when the mother was non-interactive than when she was interactive. Interactive conditions were characterized by higher rates of visual signaling and of gaze-coordinated non-vocal oral sounds. The more time infants spent looking at their attentive mother, the more they produced auditory signals, specifically non-vocal oral sounds. These findings are discussed within the articulated frameworks of evolutionary developmental psychology and early pragmatics.
Collapse
Affiliation(s)
- Marie Bourjade
- Cognition Langues Langage Ergonomie (CLLE), Université de Toulouse, Centre National de la Recherche Scientifique (CNRS), 31058 Toulouse, France; Institut Universitaire de France, 75005 Paris, France.
| | - Mawa Dafreville
- Cognition Langues Langage Ergonomie (CLLE), Université de Toulouse, Centre National de la Recherche Scientifique (CNRS), 31058 Toulouse, France
| | - Céline Scola
- Aix Marseille Université, Centre de Recherche en Psychologie de la Connaissance, du Langage et de l'Émotion (PSYCLE), EA 3273, 13621 Aix-en-Provence, France
| | - Marianne Jover
- Aix Marseille Université, Centre de Recherche en Psychologie de la Connaissance, du Langage et de l'Émotion (PSYCLE), EA 3273, 13621 Aix-en-Provence, France
| |
Collapse
|
8
|
Raimondi T, Di Panfilo G, Pasquali M, Zarantonello M, Favaro L, Savini T, Gamba M, Ravignani A. Isochrony and rhythmic interaction in ape duetting. Proc Biol Sci 2023; 290:20222244. [PMID: 36629119 PMCID: PMC9832542 DOI: 10.1098/rspb.2022.2244] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Accepted: 12/08/2022] [Indexed: 01/12/2023] Open
Abstract
How did rhythm originate in humans, and other species? One cross-cultural universal, frequently found in human music, is isochrony: when note onsets repeat regularly like the ticking of a clock. Another universal consists in synchrony (e.g. when individuals coordinate their notes so that they are sung at the same time). An approach to biomusicology focuses on similarities and differences across species, trying to build phylogenies of musical traits. Here we test for the presence of, and a link between, isochrony and synchrony in a non-human animal. We focus on the songs of one of the few singing primates, the lar gibbon (Hylobates lar), extracting temporal features from their solo songs and duets. We show that another ape exhibits one rhythmic feature at the core of human musicality: isochrony. We show that an enhanced call rate overall boosts isochrony, suggesting that respiratory physiological constraints play a role in determining the song's rhythmic structure. However, call rate alone cannot explain the flexible isochrony we witness. Isochrony is plastic and modulated depending on the context of emission: gibbons are more isochronous when duetting than singing solo. We present evidence for rhythmic interaction: we find statistical causality between one individual's note onsets and the co-singer's onsets, and a higher than chance degree of synchrony in the duets. Finally, we find a sex-specific trade-off between individual isochrony and synchrony. Gibbon's plasticity for isochrony and rhythmic overlap may suggest a potential shared selective pressure for interactive vocal displays in singing primates. This pressure may have convergently shaped human and gibbon musicality while acting on a common neural primate substrate. Beyond humans, singing primates are promising models to understand how music and, specifically, a sense of rhythm originated in the primate phylogeny.
Collapse
Affiliation(s)
- Teresa Raimondi
- Department of Life Sciences and Systems Biology, University of Turin, Turin, Italy
| | - Giovanni Di Panfilo
- Department of Life Sciences and Systems Biology, University of Turin, Turin, Italy
| | - Matteo Pasquali
- Department of Life Sciences and Systems Biology, University of Turin, Turin, Italy
| | - Martina Zarantonello
- Department of Life Sciences and Systems Biology, University of Turin, Turin, Italy
| | - Livio Favaro
- Department of Life Sciences and Systems Biology, University of Turin, Turin, Italy
| | - Tommaso Savini
- Conservation Ecology Program, King Mongkut University of Technology Thonburi, School of Bioresources and Technology, Bangkok, Thailand
| | - Marco Gamba
- Department of Life Sciences and Systems Biology, University of Turin, Turin, Italy
| | - Andrea Ravignani
- Comparative Bioacoustics Group, Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands
- Center for Music in the Brain, Department of Clinical Medicine, Aarhus University and The Royal Academy of Music Aarhus/Aalborg, Denmark
| |
Collapse
|
9
|
Salmi R, Szczupider M, Carrigan J. A novel attention-getting vocalization in zoo-housed western gorillas. PLoS One 2022; 17:e0271871. [PMID: 35947550 PMCID: PMC9365142 DOI: 10.1371/journal.pone.0271871] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2022] [Accepted: 07/09/2022] [Indexed: 11/18/2022] Open
Abstract
As a critical aspect of language, vocal learning is extremely rare in animals, having only been described in a few distantly related species. New evidence, however, extends vocal learning/innovation to the primate order, with zoo-housed chimpanzees and orangutans producing novel vocal signals to attract the attention of familiar human caregivers. If the ability to produce novel vocalizations as a means of navigating evolutionarily novel circumstances spans the Hominidae family, then we can expect to find evidence for it in the family’s third genus, Gorilla. To explore this possibility, we conduct an experiment with eight gorillas from Zoo Atlanta to examine whether they use species-atypical vocalizations to get the attention of humans across three different conditions: just a human, just food, or a human holding food. Additionally, we survey gorilla keepers from other AZA-member zoos to compile a list of common attention-getting signals used by the gorillas in their care. Our experiment results indicated that Zoo Atlanta gorillas vocalized most often during the human-food condition, with the most frequently used vocal signal being a species-atypical sound somewhere between a sneeze and a cough (n = 28). This previously undescribed sound is acoustically different from other calls commonly produced during feeding (i.e., single grunts and food-associated calls). Our survey and analyses of recordings from other zoos confirmed that this novel attention-getting sound is not unique to Zoo Atlanta, although further work should be done to better determine the extent and patterns of transmission and/or potential independent innovation of this sound across captive gorilla populations. These findings represent one of the few pieces of evidence of spontaneous novel vocal production in non-enculturated individuals of this species, supporting the inclusion of great apes as moderate vocal learners and perhaps demonstrating an evolutionary function to a flexible vocal repertoire.
Collapse
Affiliation(s)
- Roberta Salmi
- Department of Anthropology, University of Georgia, Athens, GA, United States of America
- * E-mail:
| | - Monica Szczupider
- Department of Anthropology, University of Georgia, Athens, GA, United States of America
- Intergrative Conservation Graduate Program, University of Georgia, Athens, GA, United States of America
| | - Jodi Carrigan
- Zoo Atlanta, Atlanta, Georgia, United States of America
| |
Collapse
|
10
|
Narayanan DZ, Takahashi DY, Kelly LM, Hlavaty SI, Huang J, Ghazanfar AA. Prenatal development of neonatal vocalizations. eLife 2022; 11:78485. [PMID: 35880740 PMCID: PMC9391037 DOI: 10.7554/elife.78485] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2022] [Accepted: 07/11/2022] [Indexed: 11/13/2022] Open
Abstract
Human and non-human primates produce rhythmical sounds as soon as they are born. These early vocalizations are important for soliciting the attention of caregivers. How they develop, remains a mystery. The orofacial movements necessary for producing these vocalizations have distinct spatiotemporal signatures. Therefore, their development could potentially be tracked over the course of prenatal life. We densely and longitudinally sampled fetal head and orofacial movements in marmoset monkeys using ultrasound imaging. We show that orofacial movements necessary for producing rhythmical vocalizations differentiate from a larger movement pattern that includes the entire head. We also show that signature features of marmoset infant contact calls emerge prenatally as a distinct pattern of orofacial movements. Our results establish that aspects of the sensorimotor development necessary for vocalizing occur prenatally, even before the production of sound.
Collapse
Affiliation(s)
- Darshana Z Narayanan
- Princeton Neuroscience Institute, Princeton University, Princeton, United States
| | - Daniel Y Takahashi
- Princeton Neuroscience Institute, Princeton University, Princeton, United States
| | - Lauren M Kelly
- Princeton Neuroscience Institute, Princeton University, Princeton, United States
| | - Sabina I Hlavaty
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, United States
| | - Junzhou Huang
- Department of Computer Science and Engineering, The University of Texas at Arlington, Arlington, United States
| | - Asif A Ghazanfar
- Princeton Neuroscience Institute, Princeton University, Princeton, United States
| |
Collapse
|
11
|
Lameira AR, Santamaría-Bonfil G, Galeone D, Gamba M, Hardus ME, Knott CD, Morrogh-Bernard H, Nowak MG, Campbell-Smith G, Wich SA. Sociality predicts orangutan vocal phenotype. Nat Ecol Evol 2022; 6:644-652. [PMID: 35314786 PMCID: PMC9085614 DOI: 10.1038/s41559-022-01689-z] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2021] [Accepted: 02/02/2022] [Indexed: 11/09/2022]
Abstract
In humans, individuals' social setting determines which and how language is acquired. Social seclusion experiments show that sociality also guides vocal development in songbirds and marmoset monkeys, but absence of similar great ape data has been interpreted as support to saltational notions for language origin, even if such laboratorial protocols are unethical with great apes. Here we characterize the repertoire entropy of orangutan individuals and show that in the wild, different degrees of sociality across populations are associated with different 'vocal personalities' in the form of distinct regimes of alarm call variants. In high-density populations, individuals are vocally more original and acoustically unpredictable but new call variants are short lived, whereas individuals in low-density populations are more conformative and acoustically consistent but also exhibit more complex call repertoires. Findings provide non-invasive evidence that sociality predicts vocal phenotype in a wild great ape. They prove false hypotheses that discredit great apes as having hardwired vocal development programmes and non-plastic vocal behaviour. Social settings mould vocal output in hominids besides humans.
Collapse
Affiliation(s)
- Adriano R Lameira
- Department of Psychology, University of Warwick, Coventry, UK.
- School of Psychology and Neuroscience, University of St Andrews, St Andrews, UK.
| | - Guillermo Santamaría-Bonfil
- Instituto Nacional de Electricidad y Energías Limpias, Gerencia de Tecnologías de la Información, Cuernavaca, México
| | - Deborah Galeone
- Department of Life Sciences and Systems Biology, University of Torino, Turin, Italy
| | - Marco Gamba
- Department of Life Sciences and Systems Biology, University of Torino, Turin, Italy
| | | | - Cheryl D Knott
- Department of Anthropology, Boston University, Boston, MA, USA
| | - Helen Morrogh-Bernard
- Borneo Nature Foundation, Palangka Raya, Indonesia
- College of Life and Environmental Sciences, University of Exeter, Penryn, UK
| | - Matthew G Nowak
- The PanEco Foundation-Sumatran Orangutan Conservation Programme, Berg am Irchel, Switzerland
- Department of Anthropology, Southern Illinois University, Carbondale, IL, USA
| | - Gail Campbell-Smith
- Yayasan Inisiasi Alam Rehabilitasi Indonesia, International Animal Rescue, Ketapang, Indonesia
| | - Serge A Wich
- School of Natural Sciences and Psychology, Liverpool John Moores University, Liverpool, UK
- Faculty of Science, University of Amsterdam, Amsterdam, Netherlands
| |
Collapse
|
12
|
Grawunder S, Uomini N, Samuni L, Bortolato T, Girard-Buttoz C, Wittig RM, Crockford C. Chimpanzee vowel-like sounds and voice quality suggest formant space expansion through the hominoid lineage. Philos Trans R Soc Lond B Biol Sci 2022; 377:20200455. [PMID: 34775819 PMCID: PMC8591386 DOI: 10.1098/rstb.2020.0455] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2021] [Accepted: 09/20/2021] [Indexed: 11/12/2022] Open
Abstract
The origins of human speech are obscure; it is still unclear what aspects are unique to our species or shared with our evolutionary cousins, in part due to a lack of a common framework for comparison. We asked what chimpanzee and human vocal production acoustics have in common. We examined visible supra-laryngeal articulators of four major chimpanzee vocalizations (hoos, grunts, barks, screams) and their associated acoustic structures, using techniques from human phonetic and animal communication analysis. Data were collected from wild adult chimpanzees, Taï National Park, Ivory Coast. Both discriminant and principal component classification procedures revealed classification of call types. Discriminating acoustic features include voice quality and formant structure, mirroring phonetic features in human speech. Chimpanzee lip and jaw articulation variables also offered similar discrimination of call types. Formant maps distinguished call types with different vowel-like sounds. Comparing our results with published primate data, humans show less F1-F2 correlation and further expansion of the vowel space, particularly for [i] sounds. Unlike recent studies suggesting monkeys achieve human vowel space, we conclude from our results that supra-laryngeal articulatory capacities show moderate evolutionary change, with vowel space expansion continuing through hominoid evolution. Studies on more primate species will be required to substantiate this. This article is part of the theme issue 'Voice modulation: from origin and mechanism to social impact (Part II)'.
Collapse
Affiliation(s)
- Sven Grawunder
- Department of Human Behavior, Ecology and Culture, Max Planck Institute for Evolutionary Anthropology, Deutscher Platz 6, 04103 Leipzig, Germany
- Department of Empirical Linguistics, Goethe University, Frankfurt am Main, Germany
| | - Natalie Uomini
- Department of Linguistic and Cultural Evolution, Max Planck Institute for Evolutionary Anthropology, Deutscher Platz 6, 04103 Leipzig, Germany
| | - Liran Samuni
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
- Tai Chimpanzee Project, Centre Suisse de Recherches Scientifiques, 01 BP 1303, Ivory Coast
| | - Tatiana Bortolato
- Department of Human Behavior, Ecology and Culture, Max Planck Institute for Evolutionary Anthropology, Deutscher Platz 6, 04103 Leipzig, Germany
- The Ape Social Mind Lab, Institut des Sciences Cognitives, CNRS, 67 Boulevard Pinel, 69675 Bron, Lyon, France
- Tai Chimpanzee Project, Centre Suisse de Recherches Scientifiques, 01 BP 1303, Ivory Coast
| | - Cédric Girard-Buttoz
- Department of Human Behavior, Ecology and Culture, Max Planck Institute for Evolutionary Anthropology, Deutscher Platz 6, 04103 Leipzig, Germany
- The Ape Social Mind Lab, Institut des Sciences Cognitives, CNRS, 67 Boulevard Pinel, 69675 Bron, Lyon, France
- Tai Chimpanzee Project, Centre Suisse de Recherches Scientifiques, 01 BP 1303, Ivory Coast
| | - Roman M. Wittig
- Department of Human Behavior, Ecology and Culture, Max Planck Institute for Evolutionary Anthropology, Deutscher Platz 6, 04103 Leipzig, Germany
- The Ape Social Mind Lab, Institut des Sciences Cognitives, CNRS, 67 Boulevard Pinel, 69675 Bron, Lyon, France
- Tai Chimpanzee Project, Centre Suisse de Recherches Scientifiques, 01 BP 1303, Ivory Coast
| | - Catherine Crockford
- Department of Human Behavior, Ecology and Culture, Max Planck Institute for Evolutionary Anthropology, Deutscher Platz 6, 04103 Leipzig, Germany
- The Ape Social Mind Lab, Institut des Sciences Cognitives, CNRS, 67 Boulevard Pinel, 69675 Bron, Lyon, France
- Tai Chimpanzee Project, Centre Suisse de Recherches Scientifiques, 01 BP 1303, Ivory Coast
| |
Collapse
|
13
|
Lameira AR, Alexandre A, Gamba M, Nowak MG, Vicente R, Wich S. Orangutan information broadcast via consonant-like and vowel-like calls breaches mathematical models of linguistic evolution. Biol Lett 2021; 17:20210302. [PMID: 34582737 PMCID: PMC8478518 DOI: 10.1098/rsbl.2021.0302] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Abstract
The origin of language is one of the most significant evolutionary milestones of life on Earth, but one of the most persevering scientific unknowns. Two decades ago, game theorists and mathematicians predicted that the first words and grammar emerged as a response to transmission errors and information loss in language's precursor system, however, empirical proof is lacking. Here, we assessed information loss in proto-consonants and proto-vowels in human pre-linguistic ancestors as proxied by orangutan consonant-like and vowel-like calls that compose syllable-like combinations. We played back and re-recorded calls at increasing distances across a structurally complex habitat (i.e. adverse to sound transmission). Consonant-like and vowel-like calls degraded acoustically over distance, but no information loss was detected regarding three distinct classes of information (viz. individual ID, context and population ID). Our results refute prevailing mathematical predictions and herald a turning point in language evolution theory and heuristics. Namely, explaining how the vocal–verbal continuum was crossed in the hominid family will benefit from future mathematical and computational models that, in order to enjoy empirical validity and superior explanatory power, will be informed by great ape behaviour and repertoire.
Collapse
Affiliation(s)
- Adriano R Lameira
- Department of Psychology, University of Warwick, Coventry, UK.,School of Psychology and Neuroscience, University of St Andrews, Scotland, UK
| | | | - Marco Gamba
- Department of Life Sciences and Systems Biology, University of Turin, Turin, Italy
| | - Matthew G Nowak
- Sumatran Orangutan Research Programme, PanEco-YEL, North Sumatra, Indonesia.,Department of Anthropology, Southern Illinois University, Carbondale, IL, USA
| | - Raquel Vicente
- Independent researcher, University of Turin, Turin, Italy
| | - Serge Wich
- School of Natural Sciences and Psychology, Liverpool John Moores University, Liverpool, UK.,Institute for Biodiversity and Ecosystem Dynamics, University of Amsterdam, Amsterdam, The Netherlands
| |
Collapse
|
14
|
Fischer J, Wegdell F, Trede F, Dal Pesco F, Hammerschmidt K. Vocal convergence in a multi-level primate society: insights into the evolution of vocal learning. Proc Biol Sci 2020; 287:20202531. [PMID: 33323082 PMCID: PMC7779498 DOI: 10.1098/rspb.2020.2531] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2020] [Accepted: 11/25/2020] [Indexed: 12/17/2022] Open
Abstract
The extent to which nonhuman primate vocalizations are amenable to modification through experience is relevant for understanding the substrate from which human speech evolved. We examined the vocal behaviour of Guinea baboons, Papio papio, ranging in the Niokolo Koba National Park in Senegal. Guinea baboons live in a multi-level society, with units nested within parties nested within gangs. We investigated whether the acoustic structure of grunts of 27 male baboons of two gangs varied with party/gang membership and genetic relatedness. Males in this species are philopatric, resulting in increased male relatedness within gangs and parties. Grunts of males that were members of the same social levels were more similar than those of males in different social levels (N = 351 dyads for comparison within and between gangs, and N = 169 dyads within and between parties), but the effect sizes were small. Yet, acoustic similarity did not correlate with genetic relatedness, suggesting that higher amounts of social interactions rather than genetic relatedness promote the observed vocal convergence. We consider this convergence a result of sensory-motor integration and suggest this to be an implicit form of vocal learning shared with humans, in contrast to the goal-directed and intentional explicit form of vocal learning unique to human speech acquisition.
Collapse
Affiliation(s)
- Julia Fischer
- Cognitive Ethology Laboratory, German Primate Center, Kellnerweg 4, 37077 Göttingen, Germany
- Department of Primate Cognition, Georg August University Göttingen, Göttingen, Germany
- Leibniz ScienceCampus Primate Cognition, Göttingen, Germany
| | - Franziska Wegdell
- Cognitive Ethology Laboratory, German Primate Center, Kellnerweg 4, 37077 Göttingen, Germany
- Leibniz ScienceCampus Primate Cognition, Göttingen, Germany
| | - Franziska Trede
- Cognitive Ethology Laboratory, German Primate Center, Kellnerweg 4, 37077 Göttingen, Germany
- Primate Genetics Laboratory, German Primate Center, Kellnerweg 4, 37077 Göttingen, Germany
| | - Federica Dal Pesco
- Cognitive Ethology Laboratory, German Primate Center, Kellnerweg 4, 37077 Göttingen, Germany
- Leibniz ScienceCampus Primate Cognition, Göttingen, Germany
| | - Kurt Hammerschmidt
- Cognitive Ethology Laboratory, German Primate Center, Kellnerweg 4, 37077 Göttingen, Germany
- Leibniz ScienceCampus Primate Cognition, Göttingen, Germany
| |
Collapse
|
15
|
Risueno-Segovia C, Hage SR. Theta Synchronization of Phonatory and Articulatory Systems in Marmoset Monkey Vocal Production. Curr Biol 2020; 30:4276-4283.e3. [PMID: 32888481 DOI: 10.1016/j.cub.2020.08.019] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2020] [Revised: 07/20/2020] [Accepted: 08/05/2020] [Indexed: 11/27/2022]
Abstract
Human speech shares a 3-8-Hz theta rhythm across all languages [1-3]. According to the frame/content theory of speech evolution, this rhythm corresponds to syllabic rates derived from natural mandibular-associated oscillations [4]. The underlying pattern originates from oscillatory movements of articulatory muscles [4, 5] tightly linked to periodic vocal fold vibrations [4, 6, 7]. Such phono-articulatory rhythms have been proposed as one of the crucial preadaptations for human speech evolution [3, 8, 9]. However, the evolutionary link in phono-articulatory rhythmicity between vertebrate vocalization and human speech remains unclear. From the phonatory perspective, theta oscillations might be phylogenetically preserved throughout all vertebrate clades [10-12]. From the articulatory perspective, theta oscillations are present in non-vocal lip smacking [1, 13, 14], teeth chattering [15], vocal lip smacking [16], and clicks and faux-speech [17] in non-human primates, potential evolutionary precursors for speech rhythmicity [1, 13]. Notably, a universal phono-articulatory rhythmicity similar to that in human speech is considered to be absent in non-human primate vocalizations, typically produced with sound modulations lacking concomitant articulatory movements [1, 9, 18]. Here, we challenge this view by investigating the coupling of phonatory and articulatory systems in marmoset vocalizations. Using quantitative measures of acoustic call structure, e.g., amplitude envelope, and call-associated articulatory movements, i.e., inter-lip distance, we show that marmosets display speech-like bi-motor rhythmicity. These oscillations are synchronized and phase locked at theta rhythms. Our findings suggest that oscillatory rhythms underlying speech production evolved early in the primate lineage, identifying marmosets as a suitable animal model to decipher the evolutionary and neural basis of coupled phono-articulatory movements.
Collapse
Affiliation(s)
- Cristina Risueno-Segovia
- Neurobiology of Social Communication, Department of Otolaryngology, Head and Neck Surgery, Hearing Research Centre, University of Tübingen Medical Center, Elfriede-Aulhorn-Str. 5, 72076 Tübingen, Germany; Werner Reichardt Centre for Integrative Neuroscience, University of Tübingen, Otfried-Müller-Str. 25, 72076 Tübingen, Germany; Graduate School of Neural & Behavioural Sciences - International Max Planck Research School, University of Tübingen, Österberg-Str. 3, 72074 Tübingen, Germany
| | - Steffen R Hage
- Neurobiology of Social Communication, Department of Otolaryngology, Head and Neck Surgery, Hearing Research Centre, University of Tübingen Medical Center, Elfriede-Aulhorn-Str. 5, 72076 Tübingen, Germany; Werner Reichardt Centre for Integrative Neuroscience, University of Tübingen, Otfried-Müller-Str. 25, 72076 Tübingen, Germany.
| |
Collapse
|
16
|
Emotional expressions in human and non-human great apes. Neurosci Biobehav Rev 2020; 115:378-395. [DOI: 10.1016/j.neubiorev.2020.01.027] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2019] [Revised: 01/17/2020] [Accepted: 01/22/2020] [Indexed: 11/23/2022]
|
17
|
Pereira AS, Kavanagh E, Hobaiter C, Slocombe KE, Lameira AR. Chimpanzee lip-smacks confirm primate continuity for speech-rhythm evolution. Biol Lett 2020; 16:20200232. [PMID: 32453963 DOI: 10.1098/rsbl.2020.0232] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Speech is a human hallmark, but its evolutionary origins continue to defy scientific explanation. Recently, the open-close mouth rhythm of 2-7 Hz (cycles/second) characteristic of all spoken languages has been identified in the orofacial signals of several nonhuman primate genera, including orangutans, but evidence from any of the African apes remained missing. Evolutionary continuity for the emergence of speech is, thus, still inconclusive. To address this empirical gap, we investigated the rhythm of chimpanzee lip-smacks across four populations (two captive and two wild). We found that lip-smacks exhibit a speech-like rhythm at approximately 4 Hz, closing a gap in the evidence for the evolution of speech-rhythm within the primate order. We observed sizeable rhythmic variation within and between chimpanzee populations, with differences of over 2 Hz at each level. This variation did not result, however, in systematic group differences within our sample. To further explore the phylogenetic and evolutionary perspective on this variability, inter-individual and inter-population analyses will be necessary across primate species producing mouth signals at speech-like rhythm. Our findings support the hypothesis that speech recruited ancient primate rhythmic signals and suggest that multi-site studies may still reveal new windows of understanding about these signals' use and production along the evolutionary timeline of speech.
Collapse
Affiliation(s)
- André S Pereira
- School of Psychology and Neuroscience, University of St Andrews, St Mary's Quad, South Street, St Andrews, KY16 9JP, UK.,School of Biological Sciences, University of Aberdeen, Zoology Building, Tillydrone Avenue, Aberdeen, AB24 2TZ, UK
| | - Eithne Kavanagh
- Department of Psychology, University of York, Heslington, York, YO10 5DD, UK
| | - Catherine Hobaiter
- School of Psychology and Neuroscience, University of St Andrews, St Mary's Quad, South Street, St Andrews, KY16 9JP, UK
| | - Katie E Slocombe
- Department of Psychology, University of York, Heslington, York, YO10 5DD, UK
| | - Adriano R Lameira
- School of Psychology and Neuroscience, University of St Andrews, St Mary's Quad, South Street, St Andrews, KY16 9JP, UK.,Department of Psychology, University of Warwick, University Road, Coventry, CV4 7AL, UK
| |
Collapse
|
18
|
Abstract
Vocal learning is the ability to modify vocal output on the basis of experience. Traditionally, species have been classified as either displaying or lacking this ability. A recent proposal, the vocal learning continuum, recognizes the need to have a more nuanced view of this phenotype and abandon the yes–no dichotomy. However, it also limits vocal learning to production of novel calls through imitation, moreover subserved by a forebrain-to-phonatory-muscles circuit. We discuss its limitations regarding the characterization of vocal learning across species and argue for a more permissive view. Vocal learning is the capacity to modify vocal output on the basis of experience, crucial for human speech and several animal communication systems. This Essay maintains that the existing evidence supports a more nuanced view of this phenotype, broadening the set of species, behaviors, and factors that can help us understand it.
Collapse
Affiliation(s)
- Pedro Tiago Martins
- Section of General Linguistics, Universitat de Barcelona, Barcelona, Spain
- University of Barcelona Institute of Complex Systems (UBICS), Barcelona, Spain
- * E-mail:
| | - Cedric Boeckx
- University of Barcelona Institute of Complex Systems (UBICS), Barcelona, Spain
- Catalan Institute for Research and Advanced Studies (ICREA), Barcelona, Spain
| |
Collapse
|
19
|
Zhang YS, Ghazanfar AA. A Hierarchy of Autonomous Systems for Vocal Production. Trends Neurosci 2020; 43:115-126. [PMID: 31955902 PMCID: PMC7213988 DOI: 10.1016/j.tins.2019.12.006] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2019] [Revised: 12/01/2019] [Accepted: 12/12/2019] [Indexed: 10/25/2022]
Abstract
Vocal production is hierarchical in the time domain. These hierarchies build upon biomechanical and neural dynamics across various timescales. We review studies in marmoset monkeys, songbirds, and other vertebrates. To organize these data in an accessible and across-species framework, we interpret the different timescales of vocal production as belonging to different levels of an autonomous systems hierarchy. The first level accounts for vocal acoustics produced on short timescales; subsequent levels account for longer timescales of vocal output. The hierarchy of autonomous systems that we put forth accounts for vocal patterning, sequence generation, dyadic interactions, and context dependence by sequentially incorporating central pattern generators, intrinsic drives, and sensory signals from the environment. We then show the framework's utility by providing an integrative explanation of infant vocal production learning in which social feedback modulates infant vocal acoustics through the tuning of a drive signal.
Collapse
Affiliation(s)
- Yisi S Zhang
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ 08544, USA.
| | - Asif A Ghazanfar
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ 08544, USA; Department of Psychology, Princeton University, Princeton, NJ 08544, USA; Department of Ecology & Evolutionary Biology, Princeton University, Princeton, NJ 08544, USA.
| |
Collapse
|
20
|
Lameira AR, Call J. Understanding Language Evolution: Beyond Pan-Centrism. Bioessays 2020; 42:e1900102. [PMID: 31994246 DOI: 10.1002/bies.201900102] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2019] [Revised: 12/18/2019] [Indexed: 12/20/2022]
Abstract
Language does not fossilize but this does not mean that the language's evolutionary timeline is lost forever. Great apes provide a window back in time on our last prelinguistic ancestor's communication and cognition. Phylogeny and cladistics implicitly conjure Pan (chimpanzees, bonobos) as a superior (often the only) model for language evolution compared with earlier diverging lineages, Gorilla and Pongo (orangutans). Here, in reviewing the literature, it is shown that Pan do not surpass other great apes along genetic, cognitive, ecologic, or vocal traits that are putatively paramount for language onset and evolution. Instead, revived herein is the idea that only by abandoning single-species models and learning about the variation among great apes, there might be a chance to retrieve lost fragments of the evolutionary timeline of language.
Collapse
Affiliation(s)
- Adriano R Lameira
- School of Psychology and Neuroscience, University of St. Andrews, South Street, KY16 9JP, St Andrews, UK.,Deparment of Psychology, University of Warwick, University Road, CV4 7AL, Coventry, UK
| | - Josep Call
- School of Psychology and Neuroscience, University of St. Andrews, South Street, KY16 9JP, St Andrews, UK
| |
Collapse
|
21
|
Prieur J, Barbu S, Blois‐Heulin C, Lemasson A. The origins of gestures and language: history, current advances and proposed theories. Biol Rev Camb Philos Soc 2019; 95:531-554. [DOI: 10.1111/brv.12576] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2019] [Revised: 11/30/2019] [Accepted: 12/03/2019] [Indexed: 12/16/2022]
Affiliation(s)
- Jacques Prieur
- Department of Education and PsychologyComparative Developmental Psychology, Freie Universität Berlin Berlin Germany
- Univ Rennes, Normandie Univ, CNRS, EthoS (Ethologie animale et humaine) – UMR 6552 F‐35380 Paimpont France
| | - Stéphanie Barbu
- Univ Rennes, Normandie Univ, CNRS, EthoS (Ethologie animale et humaine) – UMR 6552 F‐35380 Paimpont France
| | - Catherine Blois‐Heulin
- Univ Rennes, Normandie Univ, CNRS, EthoS (Ethologie animale et humaine) – UMR 6552 F‐35380 Paimpont France
| | - Alban Lemasson
- Univ Rennes, Normandie Univ, CNRS, EthoS (Ethologie animale et humaine) – UMR 6552 F‐35380 Paimpont France
| |
Collapse
|
22
|
Coupled whole-body rhythmic entrainment between two chimpanzees. Sci Rep 2019; 9:18914. [PMID: 31831862 PMCID: PMC6908706 DOI: 10.1038/s41598-019-55360-y] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2019] [Accepted: 11/08/2019] [Indexed: 11/16/2022] Open
Abstract
Dance is an icon of human expression. Despite astounding diversity around the world’s cultures and dazzling abundance of reminiscent animal systems, the evolution of dance in the human clade remains obscure. Dance requires individuals to interactively synchronize their whole-body tempo to their partner’s, with near-perfect precision. This capacity is motorically-heavy, engaging multiple neural circuitries, but also dependent on an acute socio-emotional bond between partners. Hitherto, these factors helped explain why no dance forms were present amongst nonhuman primates. Critically, evidence for conjoined full-body rhythmic entrainment in great apes that could help reconstruct possible proto-stages of human dance is still lacking. Here, we report an endogenously-effected case of ritualized dance-like behaviour between two captive chimpanzees – synchronized bipedalism. We submitted video recordings to rigorous time-series analysis and circular statistics. We found that individual step tempo was within the genus’ range of “solo” bipedalism. Between-individual analyses, however, revealed that synchronisation between individuals was non-random, predictable, phase concordant, maintained with instantaneous centi-second precision and jointly regulated, with individuals also taking turns as “pace-makers”. No function was apparent besides the behaviour’s putative positive social affiliation. Our analyses show a first case of spontaneous whole-body entrainment between two ape peers, thus providing tentative empirical evidence for phylogenies of human dance. Human proto-dance, we argue, may have been rooted in mechanisms of social cohesion among small groups that might have granted stress-releasing benefits via gait-synchrony and mutual-touch. An external sound/musical beat may have been initially uninvolved. We discuss dance evolution as driven by ecologically-, socially- and/or culturally-imposed “captivity”.
Collapse
|
23
|
Ravignani A, Dalla Bella S, Falk S, Kello CT, Noriega F, Kotz SA. Rhythm in speech and animal vocalizations: a cross-species perspective. Ann N Y Acad Sci 2019; 1453:79-98. [PMID: 31237365 PMCID: PMC6851814 DOI: 10.1111/nyas.14166] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2019] [Revised: 05/14/2019] [Accepted: 05/24/2019] [Indexed: 12/31/2022]
Abstract
Why does human speech have rhythm? As we cannot travel back in time to witness how speech developed its rhythmic properties and why humans have the cognitive skills to process them, we rely on alternative methods to find out. One powerful tool is the comparative approach: studying the presence or absence of cognitive/behavioral traits in other species to determine which traits are shared between species and which are recent human inventions. Vocalizations of many species exhibit temporal structure, but little is known about how these rhythmic structures evolved, are perceived and produced, their biological and developmental bases, and communicative functions. We review the literature on rhythm in speech and animal vocalizations as a first step toward understanding similarities and differences across species. We extend this review to quantitative techniques that are useful for computing rhythmic structure in acoustic sequences and hence facilitate cross-species research. We report links between vocal perception and motor coordination and the differentiation of rhythm based on hierarchical temporal structure. While still far from a complete cross-species perspective of speech rhythm, our review puts some pieces of the puzzle together.
Collapse
Affiliation(s)
- Andrea Ravignani
- Artificial Intelligence LaboratoryVrije Universiteit BrusselBrusselsBelgium
- Institute for Advanced StudyUniversity of AmsterdamAmsterdamthe Netherlands
| | - Simone Dalla Bella
- International Laboratory for BrainMusic and Sound Research (BRAMS)MontréalQuebecCanada
- Department of PsychologyUniversity of MontrealMontréalQuebecCanada
- Department of Cognitive PsychologyWarsawPoland
| | - Simone Falk
- International Laboratory for BrainMusic and Sound Research (BRAMS)MontréalQuebecCanada
- Laboratoire de Phonétique et Phonologie, UMR 7018, CNRS/Université Sorbonne Nouvelle Paris‐3Institut de Linguistique et Phonétique générales et appliquéesParisFrance
| | | | - Florencia Noriega
- Chair for Network DynamicsCenter for Advancing Electronics Dresden (CFAED), TU DresdenDresdenGermany
- CODE University of Applied SciencesBerlinGermany
| | - Sonja A. Kotz
- International Laboratory for BrainMusic and Sound Research (BRAMS)MontréalQuebecCanada
- Basic and Applied NeuroDynamics Laboratory, Faculty of Psychology and Neuroscience, Department of Neuropsychology and PsychopharmacologyMaastricht UniversityMaastrichtthe Netherlands
- Department of NeuropsychologyMax‐Planck Institute for Human Cognitive and Brain SciencesLeipzigGermany
| |
Collapse
|
24
|
Lameira AR, Shumaker RW. Orangutans show active voicing through a membranophone. Sci Rep 2019; 9:12289. [PMID: 31444387 PMCID: PMC6707206 DOI: 10.1038/s41598-019-48760-7] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2019] [Accepted: 07/31/2019] [Indexed: 11/09/2022] Open
Abstract
Active voicing - voluntary control over vocal fold oscillation - is essential for speech. Nonhuman great apes can learn new consonant- and vowel-like calls, but active voicing by our closest relatives has historically been the hardest evidence to concede to. To resolve this controversy, a diagnostic test for active voicing is reached here through the use of a membranophone: a musical instrument where a player's voice flares a membrane's vibration through oscillating air pressure. We gave the opportunity to use a membranophone to six orangutans (with no effective training), three of whom produced a priori novel (species-atypical) individual-specific vocalizations. After 11 and 34 min, two subjects were successful by producing their novel vocalizations into the instrument, hence, confirming active voicing. Beyond expectation, however, within <1 hour, both subjects found opposite strategies to significantly alter their voice duration and frequency to better activate the membranophone, further demonstrating plastic voice control as a result of experience with the instrument. Results highlight how individual differences in vocal proficiency between great apes may affect performance in experimental tests. Failing to adjust a test's difficulty level to individuals' vocal skill may lead to false negatives, which may have largely been the case in past studies now used as "textbook fact" for great ape "missing" vocal capacities. Results qualitatively differ from small changes that can be caused in innate monkey calls by intensive months-long conditional training. Our findings verify that active voicing beyond the typical range of the species' repertoire, which in our species underpins the acquisition of new voiced speech sounds, is not uniquely human among great apes.
Collapse
Affiliation(s)
- Adriano R Lameira
- School of Psychology and Neuroscience, University of St. Andrews, St Andrews, UK. .,Department of Anthropology, Durham University, Durham, UK.
| | - Robert W Shumaker
- Indianapolis Zoo, Indianapolis, USA.,Krasnow Institute for Advanced Studies, George Mason University, Fairfax, USA.,Anthropology Department, Indiana University, Bloomington, USA
| |
Collapse
|
25
|
Burunat E. Love is a physiological motivation (like hunger, thirst, sleep or sex). Med Hypotheses 2019; 129:109225. [PMID: 31371074 DOI: 10.1016/j.mehy.2019.05.011] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2018] [Revised: 04/17/2019] [Accepted: 05/12/2019] [Indexed: 10/26/2022]
Abstract
The multitude of terms associated with love has given rise to a false perception of love. In this paper, only maternal and romantic love are considered. Love is usually regarded as a feeling, motivation, addiction, passion, and, above all, an emotion. This confusion has consequences in the lives of human beings, leading not only to divorces, suicides, femicides but possibly also to a number of mental illnesses and suffering. Therefore, it is crucial to first clarify what is meant by emotion, motivation and love. This work aims to finally place love within the category of physiological motivations, such as hunger, thirst, sleep, or sex, on the basis that love is also essential for human survival, especially in childhood. Love is presented from an evolutionary perspective. Some other similarities between love and other physiological motivations are pointed out, such as its importance for appropriate human development, both its ontogeny and its permanence, and the long-lasting consequences of abuse and neglect. There are summarized reasons that account for this, such as the fact that physiological motivations are essential for survival and that love is an essential motivation for the survival of human offspring. Other reasons are that minimum changes in the quantity and quality of love alters development, that there can be a variety of neurophysiological and behavioural states within a motivation, and that motivations (also love) appear and change throughout development. Also, motivations and love sometimes may lead to an addictive behaviour. Finally, it is recognized that once physiological motivations (and love) appear, they become permanent. In a third section, some potential social, cultural, clinical and scientific consequences of the proposed consideration of love as a motivation are discussed. Accordingly, love's recognition as a motivation in the clinical field would imply a better understanding of its disorders and its inclusion in classifications manuals such as The Diagnostic and Statistical Manual of Mental Disorders (DSM), or in the International Classification of Diseases (ICD). Considering love as a motivation rather than an emotion could also impact the results of scientific research (an example is included). A comprehensive understanding of these questions could potentially allow for a new therapeutic approach in the treatment of mental illness, while offering an all-inclusive evolutionary explanation of cultural phenomena such as the origin and diffusion of both language and art. Love should be understood as a physiological motivation, like hunger, sleep or sex, and not as an emotion as it is commonly considered.
Collapse
Affiliation(s)
- Enrique Burunat
- School of Health Sciences/School of Psychology, Department of Clinical Psychology, Psychobiology and Methodology, University of La Laguna, P.O. Box 456, 38200 Santa Cruz de Tenerife, Canary Islands, Spain.
| |
Collapse
|
26
|
Gesture Use in Communication between Mothers and Offspring in Wild Orang-Utans (Pongo pygmaeus wurmbii) from the Sabangau Peat-Swamp Forest, Borneo. INT J PRIMATOL 2019. [DOI: 10.1007/s10764-019-00095-w] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
|
27
|
|
28
|
|
29
|
Tamariz M. Replication and emergence in cultural transmission. Phys Life Rev 2019; 30:47-71. [PMID: 31005570 DOI: 10.1016/j.plrev.2019.04.004] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2019] [Accepted: 04/09/2019] [Indexed: 12/31/2022]
Abstract
Humans are fundamentally defined by our socially transmitted, often long-lived, sophisticated cultural traits. The nature of cultural transmission is the subject of ongoing debate: while some emphasize that it is a biased, transformational process, others point out that high-fidelity transmission is required to explain the quintessentially cumulative nature of human culture. This paper integrates both views into a model that has two main components: First, actions - observable motor-behavioural patterns - are inherited with high fidelity, or replicated, when they are copied, largely independently of their normal, effective or conventional function, by naive learners. Replicative action copying is the unbiased transmission process that ensures the continuity of cultural traditions. Second, mental culture - knowledge, skills, attitudes and values - is not inherited directly or faithfully, but instead emerges, or develops, during usage, when individuals learn the associations between actions and their contexts and outcomes. Mental cultural traits remain stable over generations to the extent that they are informed by similar (replicated) motor patterns unfolding in similar environments. The arguments in support of this model rest on clear distinctions between inheritance and usage; between public-behavioural and private-mental culture; and between selection for fidelity and selection for function.
Collapse
Affiliation(s)
- Monica Tamariz
- Psychology, Heriot-Watt University, Riccarton Campus, Edinburgh EH14 4AS, UK.
| |
Collapse
|
30
|
Raine J, Pisanski K, Bond R, Simner J, Reby D. Human roars communicate upper-body strength more effectively than do screams or aggressive and distressed speech. PLoS One 2019; 14:e0213034. [PMID: 30830931 PMCID: PMC6398857 DOI: 10.1371/journal.pone.0213034] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2018] [Accepted: 02/13/2019] [Indexed: 11/27/2022] Open
Abstract
Despite widespread evidence that nonverbal components of human speech (e.g., voice pitch) communicate information about physical attributes of vocalizers and that listeners can judge traits such as strength and body size from speech, few studies have examined the communicative functions of human nonverbal vocalizations (such as roars, screams, grunts and laughs). Critically, no previous study has yet to examine the acoustic correlates of strength in nonverbal vocalisations, including roars, nor identified reliable vocal cues to strength in human speech. In addition to being less acoustically constrained than articulated speech, agonistic nonverbal vocalizations function primarily to express motivation and emotion, such as threat, and may therefore communicate strength and body size more effectively than speech. Here, we investigated acoustic cues to strength and size in roars compared to screams and speech sentences produced in both aggressive and distress contexts. Using playback experiments, we then tested whether listeners can reliably infer a vocalizer's actual strength and height from roars, screams, and valenced speech equivalents, and which acoustic features predicted listeners' judgments. While there were no consistent acoustic cues to strength in any vocal stimuli, listeners accurately judged inter-individual differences in strength, and did so most effectively from aggressive voice stimuli (roars and aggressive speech). In addition, listeners more accurately judged strength from roars than from aggressive speech. In contrast, listeners' judgments of height were most accurate for speech stimuli. These results support the prediction that vocalizers maximize impressions of physical strength in aggressive compared to distress contexts, and that inter-individual variation in strength may only be honestly communicated in vocalizations that function to communicate threat, particularly roars. Thus, in continuity with nonhuman mammals, the acoustic structure of human aggressive roars may have been selected to communicate, and to some extent exaggerate, functional cues to physical formidability.
Collapse
Affiliation(s)
- Jordan Raine
- Mammal Vocal Communication and Cognition Research Group, University of Sussex, Brighton, United Kingdom
| | - Katarzyna Pisanski
- Mammal Vocal Communication and Cognition Research Group, University of Sussex, Brighton, United Kingdom
- Equipe Neuro-Ethologie Sensorielle, ENES/Neuro-PSI CNRS UMR 9197, Bioacoustics Team, University of Lyon/Saint-Etienne, Saint-Etienne, France
| | - Rod Bond
- Mammal Vocal Communication and Cognition Research Group, University of Sussex, Brighton, United Kingdom
| | - Julia Simner
- MULTISENSE Research Lab, University of Sussex, Brighton, United Kingdom
| | - David Reby
- Mammal Vocal Communication and Cognition Research Group, University of Sussex, Brighton, United Kingdom
| |
Collapse
|
31
|
Lameira AR, Call J. Time-space-displaced responses in the orangutan vocal system. SCIENCE ADVANCES 2018; 4:eaau3401. [PMID: 30443595 PMCID: PMC6235548 DOI: 10.1126/sciadv.aau3401] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/31/2018] [Accepted: 10/18/2018] [Indexed: 06/09/2023]
Abstract
One of the defining features of language is displaced reference-the capacity to transmit information about something that is not present or about a past or future event. It is very rare in nature and has not been shown in any nonhuman primate, confounding, as such, any understanding of its precursors and evolution in the human lineage. Here, we describe a vocal phenomenon in a wild great ape with unparalleled affinities with displaced reference. When exposed to predator models, Sumatran orangutan mothers temporarily suppressed alarm calls up to 20 min until the model was out of sight. Subjects delayed their vocal responses in function of perceived danger for themselves, but four major predictions for stress-based mechanisms were not met. Conversely, vocal delay was also a function of perceived danger for another-an infant-suggesting high-order cognition. Our findings suggest that displaced reference in language is likely to have originally piggybacked on akin behaviors in an ancestral hominid.
Collapse
|
32
|
Piel AK. Temporal patterns of chimpanzee loud calls in the Issa Valley, Tanzania: Evidence of nocturnal acoustic behavior in wild chimpanzees. AMERICAN JOURNAL OF PHYSICAL ANTHROPOLOGY 2018; 166:530-540. [DOI: 10.1002/ajpa.23609] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/04/2017] [Revised: 04/06/2018] [Accepted: 04/28/2018] [Indexed: 11/08/2022]
Affiliation(s)
- Alex K. Piel
- School of Natural Sciences and PsychologyLiverpool John Moores UniversityLiverpool United Kingdom
- Greater Mahale Ecosystem Research and Conservation (Project) Tanzania
| |
Collapse
|
33
|
|
34
|
Deshpande A, Gupta S, Sinha A. Intentional communication between wild bonnet macaques and humans. Sci Rep 2018; 8:5147. [PMID: 29650972 PMCID: PMC5897542 DOI: 10.1038/s41598-018-22928-z] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2016] [Accepted: 03/05/2018] [Indexed: 11/23/2022] Open
Abstract
Comparative studies of nonhuman communication systems could provide insights into the origins and evolution of a distinct dimension of human language: intentionality. Recent studies have provided evidence for intentional communication in different species but generally in captive settings. We report here a novel behaviour of food requesting from humans displayed by wild bonnet macaques Macaca radiata, an Old World cercopithecine primate, in the Bandipur National Park of southern India. Using both natural observations and field experiments, we examined four different behavioural components—coo-calls, hand-extension gesture, orientation, and monitoring behaviour—of food requesting for their conformity with the established criteria of intentional communication. Our results suggest that food requesting by bonnet macaques is potentially an intentionally produced behavioural strategy as all the food requesting behaviours except coo-calls qualify the criteria for intentionality. We comment on plausible hypotheses for the origin and spread of this novel behavioural strategy in the study macaque population and speculate that the cognitive precursors for language production may be manifest in the usage of combination of signals of different modalities in communication, which could have emerged in simians earlier than in the anthropoid apes.
Collapse
Affiliation(s)
- Adwait Deshpande
- Consciousness Studies Programme, National Institute of Advanced Studies, Bangalore, India.
| | - Shreejata Gupta
- Animal Behaviour and Cognition Programme, National Institute of Advanced Studies, Bangalore, India
| | - Anindya Sinha
- Consciousness Studies Programme, National Institute of Advanced Studies, Bangalore, India.,Animal Behaviour and Cognition Programme, National Institute of Advanced Studies, Bangalore, India.,Primate Programme, Nature Conservation Foundation, Mysore, India.,Centre for Neuroscience, Indian Institute of Science, Bangalore, India.,Dhole's Den Research Foundation, Bandipur National Park, Karnataka, India
| |
Collapse
|
35
|
FOXP2 variation in great ape populations offers insight into the evolution of communication skills. Sci Rep 2017; 7:16866. [PMID: 29203828 PMCID: PMC5715162 DOI: 10.1038/s41598-017-16844-x] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2017] [Accepted: 11/17/2017] [Indexed: 12/13/2022] Open
Abstract
The gene coding for the forkhead box protein P2 (FOXP2) is associated with human language disorders. Evolutionary changes in this gene are hypothesized to have contributed to the emergence of speech and language in the human lineage. Although FOXP2 is highly conserved across most mammals, humans differ at two functional amino acid substitutions from chimpanzees, bonobos and gorillas, with an additional fixed substitution found in orangutans. However, FOXP2 has been characterized in only a small number of apes and no publication to date has examined the degree of natural variation in large samples of unrelated great apes. Here, we analyzed the genetic variation in the FOXP2 coding sequence in 63 chimpanzees, 11 bonobos, 48 gorillas, 37 orangutans and 2 gibbons and observed undescribed variation in great apes. We identified two variable polyglutamine microsatellites in chimpanzees and orangutans and found three nonsynonymous single nucleotide polymorphisms, one in chimpanzees, one in gorillas and one in orangutans with derived allele frequencies of 0.01, 0.26 and 0.29, respectively. Structural and functional protein modeling indicate a biochemical effect of the substitution in orangutans, and because of its presence solely in the Sumatran orangutan species, the mutation may be associated with reported population differences in vocalizations.
Collapse
|
36
|
Bidding evidence for primate vocal learning and the cultural substrates for speech evolution. Neurosci Biobehav Rev 2017; 83:429-439. [DOI: 10.1016/j.neubiorev.2017.09.021] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2017] [Revised: 09/19/2017] [Accepted: 09/21/2017] [Indexed: 10/18/2022]
|
37
|
Loh KK, Petrides M, Hopkins WD, Procyk E, Amiez C. Cognitive control of vocalizations in the primate ventrolateral-dorsomedial frontal (VLF-DMF) brain network. Neurosci Biobehav Rev 2017; 82:32-44. [DOI: 10.1016/j.neubiorev.2016.12.001] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2016] [Revised: 12/01/2016] [Accepted: 12/02/2016] [Indexed: 11/30/2022]
|
38
|
Ravignani A, Honing H, Kotz SA. Editorial: The Evolution of Rhythm Cognition: Timing in Music and Speech. Front Hum Neurosci 2017; 11:303. [PMID: 28659775 PMCID: PMC5468413 DOI: 10.3389/fnhum.2017.00303] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2017] [Accepted: 05/26/2017] [Indexed: 01/12/2023] Open
Affiliation(s)
- Andrea Ravignani
- Veterinary and Research Department, Sealcentre PieterburenPieterburen, Netherlands.,Language and Cognition Department, Max Planck Institute for PsycholinguisticsNijmegen, Netherlands.,Artificial Intelligence Lab, Vrije Universiteit BrusselBrussels, Belgium
| | - Henkjan Honing
- Music Cognition Group, Amsterdam Brain and Cognition, Institute for Logic, Language, and Computation, University of AmsterdamAmsterdam, Netherlands
| | - Sonja A Kotz
- Basic and Applied NeuroDynamics Lab, Faculty of Psychology and Neuroscience, Department of Neuropsychology and Psychopharmacology, Maastricht UniversityMaastricht, Netherlands.,Department of Neuropsychology, Max-Planck Institute for Human Cognitive and Brain SciencesLeipzig, Germany
| |
Collapse
|
39
|
The origins of the vocal brain in humans. Neurosci Biobehav Rev 2017; 77:177-193. [DOI: 10.1016/j.neubiorev.2017.03.014] [Citation(s) in RCA: 76] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2016] [Revised: 02/15/2017] [Accepted: 03/22/2017] [Indexed: 01/13/2023]
|
40
|
Gustison ML, Bergman TJ. Divergent acoustic properties of gelada and baboon vocalizations and their implications for the evolution of human speech. JOURNAL OF LANGUAGE EVOLUTION 2017; 2:20-36. [PMID: 31402984 PMCID: PMC6681840 DOI: 10.1093/jole/lzx015] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Human speech has many complex spectral and temporal features traditionally thought to be absent in the vocalizations of other primates. Recent explorations of the vocal capabilities of non-human primates are challenging this view. Here, we continue this trend by exploring the spectro-temporal properties of gelada (Theropithecus gelada) vocalizations. First, we made cross-species comparisons of geladas, chacma baboons, and human vowel space area. We found that adult male and female gelada exhaled grunts-a call type shared with baboons-have formant profiles that overlap more with human vowel space than do baboon grunts. These gelada grunts also contained more modulation of fundamental and formant frequencies than did baboon grunts. Second, we compared formant profiles and modulation of exhaled grunts to the derived call types (those not shared with baboons) produced by gelada males. These derived calls contained divergent formant profiles, and a subset of them, notably wobbles and vocalized yawns, were more modulated than grunts. Third, we investigated the rhythmic patterns of wobbles, a call type shown previously to contain cycles that match the 3-8 Hz tempo of speech. We use a larger dataset to show that the wobble rhythm overlaps more with speech rhythm than previously thought. We also found that variation in cycle duration depends on the production modality; specifically, exhaled wobbles were produced at a slower tempo than inhaled wobbles. Moreover, the variability in cycle duration within wobbles aligns with a linguistic property known as 'Menzerath's law' in that there was a negative association between cycle duration and wobble size (i.e. the number of cycles). Taken together, our results add to growing evidence that non-human primates are anatomically capable of producing modulated sounds. Our results also support and expand on current hypotheses of speech evolution, including the 'neural hypothesis' and the 'bimodal speech rhythm hypothesis'.
Collapse
Affiliation(s)
| | - Thore J. Bergman
- Department of Psychology, University of Michigan
- Department of Ecology and Evolutionary Biology, University of Michigan
| |
Collapse
|
41
|
Volitional exaggeration of body size through fundamental and formant frequency modulation in humans. Sci Rep 2016; 6:34389. [PMID: 27687571 PMCID: PMC5043380 DOI: 10.1038/srep34389] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2016] [Accepted: 09/09/2016] [Indexed: 11/09/2022] Open
Abstract
Several mammalian species scale their voice fundamental frequency (F0) and formant frequencies in competitive and mating contexts, reducing vocal tract and laryngeal allometry thereby exaggerating apparent body size. Although humans’ rare capacity to volitionally modulate these same frequencies is thought to subserve articulated speech, the potential function of voice frequency modulation in human nonverbal communication remains largely unexplored. Here, the voices of 167 men and women from Canada, Cuba, and Poland were recorded in a baseline condition and while volitionally imitating a physically small and large body size. Modulation of F0, formant spacing (∆F), and apparent vocal tract length (VTL) were measured using Praat. Our results indicate that men and women spontaneously and systemically increased VTL and decreased F0 to imitate a large body size, and reduced VTL and increased F0 to imitate small size. These voice modulations did not differ substantially across cultures, indicating potentially universal sound-size correspondences or anatomical and biomechanical constraints on voice modulation. In each culture, men generally modulated their voices (particularly formants) more than did women. This latter finding could help to explain sexual dimorphism in F0 and formants that is currently unaccounted for by sexual dimorphism in human vocal anatomy and body size.
Collapse
|
42
|
Vocal fold control beyond the species-specific repertoire in an orang-utan. Sci Rep 2016; 6:30315. [PMID: 27461756 PMCID: PMC4962094 DOI: 10.1038/srep30315] [Citation(s) in RCA: 56] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2016] [Accepted: 06/29/2016] [Indexed: 11/16/2022] Open
Abstract
Vocal fold control was critical to the evolution of spoken language, much as it today allows us to learn vowel systems. It has, however, never been demonstrated directly in a non-human primate, leading to the suggestion that it evolved in the human lineage after divergence from great apes. Here, we provide the first evidence for real-time, dynamic and interactive vocal fold control in a great ape during an imitation “do-as-I-do” game with a human demonstrator. Notably, the orang-utan subject skilfully produced “wookies” – an idiosyncratic vocalization exhibiting a unique spectral profile among the orang-utan vocal repertoire. The subject instantaneously matched human-produced wookies as they were randomly modulated in pitch, adjusting his voice frequency up or down when the human demonstrator did so, readily generating distinct low vs. high frequency sub-variants. These sub-variants were significantly different from spontaneous ones (not produced in matching trials). Results indicate a latent capacity for vocal fold exercise in a great ape (i) in real-time, (ii) up and down the frequency spectrum, (iii) across a register range beyond the species-repertoire and, (iv) in a co-operative turn-taking social setup. Such ancestral capacity likely provided the neuro-behavioural basis of the more fine-tuned vocal fold control that is a human hallmark.
Collapse
|
43
|
Pisanski K, Cartei V, McGettigan C, Raine J, Reby D. Voice Modulation: A Window into the Origins of Human Vocal Control? Trends Cogn Sci 2016; 20:304-318. [PMID: 26857619 DOI: 10.1016/j.tics.2016.01.002] [Citation(s) in RCA: 103] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2015] [Revised: 01/05/2016] [Accepted: 01/07/2016] [Indexed: 11/17/2022]
Abstract
An unresolved issue in comparative approaches to speech evolution is the apparent absence of an intermediate vocal communication system between human speech and the less flexible vocal repertoires of other primates. We argue that humans' ability to modulate nonverbal vocal features evolutionarily linked to expression of body size and sex (fundamental and formant frequencies) provides a largely overlooked window into the nature of this intermediate system. Recent behavioral and neural evidence indicates that humans' vocal control abilities, commonly assumed to subserve speech, extend to these nonverbal dimensions. This capacity appears in continuity with context-dependent frequency modulations recently identified in other mammals, including primates, and may represent a living relic of early vocal control abilities that led to articulated human speech.
Collapse
Affiliation(s)
- Katarzyna Pisanski
- Mammal Vocal Communication and Cognition Research Group, School of Psychology, University of Sussex, Brighton, UK; Institute of Psychology, University of Wrocław, Wrocław, Poland
| | - Valentina Cartei
- Mammal Vocal Communication and Cognition Research Group, School of Psychology, University of Sussex, Brighton, UK
| | - Carolyn McGettigan
- Royal Holloway Vocal Communication Laboratory, Department of Psychology, Royal Holloway, University of London, Egham, UK
| | - Jordan Raine
- Mammal Vocal Communication and Cognition Research Group, School of Psychology, University of Sussex, Brighton, UK
| | - David Reby
- Mammal Vocal Communication and Cognition Research Group, School of Psychology, University of Sussex, Brighton, UK.
| |
Collapse
|
44
|
Clay Z, Archbold J, Zuberbühler K. Functional flexibility in wild bonobo vocal behaviour. PeerJ 2015; 3:e1124. [PMID: 26290789 PMCID: PMC4540007 DOI: 10.7717/peerj.1124] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2015] [Accepted: 07/03/2015] [Indexed: 11/23/2022] Open
Abstract
A shared principle in the evolution of language and the development of speech is the emergence of functional flexibility, the capacity of vocal signals to express a range of emotional states independently of context and biological function. Functional flexibility has recently been demonstrated in the vocalisations of pre-linguistic human infants, which has been contrasted to the functionally fixed vocal behaviour of non-human primates. Here, we revisited the presumed chasm in functional flexibility between human and non-human primate vocal behaviour, with a study on our closest living primate relatives, the bonobo (Pan paniscus). We found that wild bonobos use a specific call type (the "peep") across a range of contexts that cover the full valence range (positive-neutral-negative) in much of their daily activities, including feeding, travel, rest, aggression, alarm, nesting and grooming. Peeps were produced in functionally flexible ways in some contexts, but not others. Crucially, calls did not vary acoustically between neutral and positive contexts, suggesting that recipients take pragmatic information into account to make inferences about call meaning. In comparison, peeps during negative contexts were acoustically distinct. Our data suggest that the capacity for functional flexibility has evolutionary roots that predate the evolution of human speech. We interpret this evidence as an example of an evolutionary early transition away from fixed vocal signalling towards functional flexibility.
Collapse
Affiliation(s)
- Zanna Clay
- School of Psychology, University of Birmingham, Birmingham, UK
- Department of Comparative Cognition, Institute of Biology, University of Neuchatel, Neuchatel, Switzerland
| | - Jahmaira Archbold
- Department of Comparative Cognition, Institute of Biology, University of Neuchatel, Neuchatel, Switzerland
| | - Klaus Zuberbühler
- Department of Comparative Cognition, Institute of Biology, University of Neuchatel, Neuchatel, Switzerland
- School of Psychology & Neurosciences, University of St Andrews, St Andrews, Fife, UK
| |
Collapse
|
45
|
Poliva O. From where to what: a neuroanatomically based evolutionary model of the emergence of speech in humans. F1000Res 2015; 4:67. [PMID: 28928931 PMCID: PMC5600004 DOI: 10.12688/f1000research.6175.1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 03/03/2015] [Indexed: 03/28/2024] Open
Abstract
In the brain of primates, the auditory cortex connects with the frontal lobe via the temporal pole (auditory ventral stream; AVS) and via the inferior parietal lobule (auditory dorsal stream; ADS). The AVS is responsible for sound recognition, and the ADS for sound-localization, voice detection and audio-visual integration. I propose that the primary role of the ADS in monkeys/apes is the perception and response to contact calls. These calls are exchanged between tribe members (e.g., mother-offspring) and are used for monitoring location. Perception of contact calls occurs by the ADS detecting a voice, localizing it, and verifying that the corresponding face is out of sight. The auditory cortex then projects to parieto-frontal visuospatial regions (visual dorsal stream) for searching the caller, and via a series of frontal lobe-brainstem connections, a contact call is produced in return. Because the human ADS processes also speech production and repetition, I further describe a course for the development of speech in humans. I propose that, due to duplication of a parietal region and its frontal projections, and strengthening of direct frontal-brainstem connections, the ADS converted auditory input directly to vocal regions in the frontal lobe, which endowed early Hominans with partial vocal control. This enabled offspring to modify their contact calls with intonations for signaling different distress levels to their mother. Vocal control could then enable question-answer conversations, by offspring emitting a low-level distress call for inquiring about the safety of objects, and mothers responding with high- or low-level distress calls. Gradually, the ADS and the direct frontal-brainstem connections became more robust and vocal control became more volitional. Eventually, individuals were capable of inventing new words and offspring were capable of inquiring about objects in their environment and learning their names via mimicry.
Collapse
|
46
|
Poliva O. From where to what: a neuroanatomically based evolutionary model of the emergence of speech in humans. F1000Res 2015; 4:67. [PMID: 28928931 PMCID: PMC5600004 DOI: 10.12688/f1000research.6175.3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 09/21/2017] [Indexed: 12/28/2022] Open
Abstract
In the brain of primates, the auditory cortex connects with the frontal lobe via the temporal pole (auditory ventral stream; AVS) and via the inferior parietal lobe (auditory dorsal stream; ADS). The AVS is responsible for sound recognition, and the ADS for sound-localization, voice detection and integration of calls with faces. I propose that the primary role of the ADS in non-human primates is the detection and response to contact calls. These calls are exchanged between tribe members (e.g., mother-offspring) and are used for monitoring location. Detection of contact calls occurs by the ADS identifying a voice, localizing it, and verifying that the corresponding face is out of sight. Once a contact call is detected, the primate produces a contact call in return via descending connections from the frontal lobe to a network of limbic and brainstem regions. Because the ADS of present day humans also performs speech production, I further propose an evolutionary course for the transition from contact call exchange to an early form of speech. In accordance with this model, structural changes to the ADS endowed early members of the genus Homo with partial vocal control. This development was beneficial as it enabled offspring to modify their contact calls with intonations for signaling high or low levels of distress to their mother. Eventually, individuals were capable of participating in yes-no question-answer conversations. In these conversations the offspring emitted a low-level distress call for inquiring about the safety of objects (e.g., food), and his/her mother responded with a high- or low-level distress call to signal approval or disapproval of the interaction. Gradually, the ADS and its connections with brainstem motor regions became more robust and vocal control became more volitional. Speech emerged once vocal control was sufficient for inventing novel calls.
Collapse
|
47
|
Poliva O. From where to what: a neuroanatomically based evolutionary model of the emergence of speech in humans. F1000Res 2015; 4:67. [PMID: 28928931 PMCID: PMC5600004.2 DOI: 10.12688/f1000research.6175.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 01/12/2016] [Indexed: 03/28/2024] Open
Abstract
In the brain of primates, the auditory cortex connects with the frontal lobe via the temporal pole (auditory ventral stream; AVS) and via the inferior parietal lobe (auditory dorsal stream; ADS). The AVS is responsible for sound recognition, and the ADS for sound-localization, voice detection and integration of calls with faces. I propose that the primary role of the ADS in non-human primates is the detection and response to contact calls. These calls are exchanged between tribe members (e.g., mother-offspring) and are used for monitoring location. Detection of contact calls occurs by the ADS identifying a voice, localizing it, and verifying that the corresponding face is out of sight. Once a contact call is detected, the primate produces a contact call in return via descending connections from the frontal lobe to a network of limbic and brainstem regions. Because the ADS of present day humans also performs speech production, I further propose an evolutionary course for the transition from contact call exchange to an early form of speech. In accordance with this model, structural changes to the ADS endowed early members of the genus Homo with partial vocal control. This development was beneficial as it enabled offspring to modify their contact calls with intonations for signaling high or low levels of distress to their mother. Eventually, individuals were capable of participating in yes-no question-answer conversations. In these conversations the offspring emitted a low-level distress call for inquiring about the safety of objects (e.g., food), and his/her mother responded with a high- or low-level distress call to signal approval or disapproval of the interaction. Gradually, the ADS and its connections with brainstem motor regions became more robust and vocal control became more volitional. Speech emerged once vocal control was sufficient for inventing novel calls.
Collapse
|