1
|
Ter Bekke M, Drijvers L, Holler J. Hand Gestures Have Predictive Potential During Conversation: An Investigation of the Timing of Gestures in Relation to Speech. Cogn Sci 2024; 48:e13407. [PMID: 38279899 DOI: 10.1111/cogs.13407] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2022] [Revised: 07/09/2023] [Accepted: 01/10/2024] [Indexed: 01/29/2024]
Abstract
During face-to-face conversation, transitions between speaker turns are incredibly fast. These fast turn exchanges seem to involve next speakers predicting upcoming semantic information, such that next turn planning can begin before a current turn is complete. Given that face-to-face conversation also involves the use of communicative bodily signals, an important question is how bodily signals such as co-speech hand gestures play into these processes of prediction and fast responding. In this corpus study, we found that hand gestures that depict or refer to semantic information started before the corresponding information in speech, which held both for the onset of the gesture as a whole, as well as the onset of the stroke (the most meaningful part of the gesture). This early timing potentially allows listeners to use the gestural information to predict the corresponding semantic information to be conveyed in speech. Moreover, we provided further evidence that questions with gestures got faster responses than questions without gestures. However, we found no evidence for the idea that how much a gesture precedes its lexical affiliate (i.e., its predictive potential) relates to how fast responses were given. The findings presented here highlight the importance of the temporal relation between speech and gesture and help to illuminate the potential mechanisms underpinning multimodal language processing during face-to-face conversation.
Collapse
Affiliation(s)
- Marlijn Ter Bekke
- Donders Institute for Brain, Cognition and Behaviour, Radboud University
- Max Planck Institute for Psycholinguistics
| | - Linda Drijvers
- Donders Institute for Brain, Cognition and Behaviour, Radboud University
- Max Planck Institute for Psycholinguistics
| | - Judith Holler
- Donders Institute for Brain, Cognition and Behaviour, Radboud University
- Max Planck Institute for Psycholinguistics
| |
Collapse
|
2
|
Bögels S, Levinson SC. Ultrasound measurements of interactive turn-taking in question-answer sequences: Articulatory preparation is delayed but not tied to the response. PLoS One 2023; 18:e0276470. [PMID: 37405982 DOI: 10.1371/journal.pone.0276470] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2022] [Accepted: 06/16/2023] [Indexed: 07/07/2023] Open
Abstract
We know that speech planning in conversational turn-taking can happen in overlap with the previous turn and research suggests that it starts as early as possible, that is, as soon as the gist of the previous turn becomes clear. The present study aimed to investigate whether planning proceeds all the way up to the last stage of articulatory preparation (i.e., putting the articulators in place for the first phoneme of the response) and what the timing of this process is. Participants answered pre-recorded quiz questions (being under the illusion that they were asked live), while their tongue movements were measured using ultrasound. Planning could start early for some quiz questions (i.e., midway during the question), but late for others (i.e., only at the end of the question). The results showed no evidence for a difference between tongue movements in these two types of questions for at least two seconds after planning could start in early-planning questions, suggesting that speech planning in overlap with the current turn proceeds more slowly than in the clear. On the other hand, when time-locking to speech onset, tongue movements differed between the two conditions from up to two seconds before this point. This suggests that articulatory preparation can occur in advance and is not fully tied to the overt response itself.
Collapse
Affiliation(s)
- Sara Bögels
- Department of Communication and Cognition, Tilburg University, Tilburg, The Netherlands
- Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands
- Donders Institute for Brain, Cognition, and Behaviour, Radboud University, Nijmegen, The Netherlands
| | | |
Collapse
|
3
|
Kondaurova MV, Zheng Q, Donaldson CW, Betts A, Smith AF, Fagan MK. The effect of telepractice on vocal turn-taking between a provider, children with cochlear implants, and caregivers: A preliminary report. Cochlear Implants Int 2023; 24:155-166. [PMID: 36624981 DOI: 10.1080/14670100.2022.2159131] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
OBJECTIVES To examine the effect of telepractice on vocal turn-taking between one clinical provider and children with cochlear implants and their caregivers during child-centered auditory rehabilitation intervention. METHODS Seven dyads of children with cochlear implants (mean age 4:11 years) and their hearing mothers and one speech-language pathologist participated together in a telepractice session and an in-person intervention session. Dependent variables were vocalization rate, turn taking rate, rate of speech overlap per second, and between-speaker pause duration. RESULTS The speech-language pathologist and children had lower rates of vocalization in the telepractice session than the in-person session. However, maternal vocalization rate was higher in the telepractice than in-person session. The rate of turn-taking between the provider and children was lower in telepractice than in-person sessions but the rate of turn taking between mothers and children was higher in telepractice than in-person sessions. Between-speaker pause duration between children and the provider and between mothers and children was longer in telepractice than in-person sessions. Rate of speech overlap did not vary significantly by session type. DISCUSSION The quantity and temporal characteristics of vocal turn-taking were impacted by remote communication during tele-intervention suggesting a potential increase in the cognitive effort required of participants.
Collapse
Affiliation(s)
- Maria V Kondaurova
- Department of Psychological & Brain Sciences, University of Louisville, Louisville, KY, USA
| | - Qi Zheng
- Department of Biostatistics, University of Louisville, Louisville, KY, USA
| | | | - Abigail Betts
- Department of Otolaryngology-Head/Neck Surgery & Communicative Disorders, University of Louisville, Louisville, KY, USA
| | - Alan F Smith
- Department of Otolaryngology-Head/Neck Surgery & Communicative Disorders, University of Louisville, Louisville, KY, USA
| | | |
Collapse
|
4
|
Desmons C, Lavault S, Mazel A, Niérat M, Tadiello S, Khamassi M, Pelachaud C, Similowski T. Influence d’une activité pseudo-ventilatoire chez un robot humanoïde sur les interactions humain-machine. Rev Mal Respir 2023. [DOI: 10.1016/j.rmr.2022.11.076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/18/2023]
|
5
|
Listeners are sensitive to the speech breathing time series: Evidence from a gap detection task. Cognition 2022; 225:105171. [DOI: 10.1016/j.cognition.2022.105171] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2021] [Revised: 04/29/2022] [Accepted: 05/11/2022] [Indexed: 11/23/2022]
|
6
|
Haiduk F, Fitch WT. Understanding Design Features of Music and Language: The Choric/Dialogic Distinction. Front Psychol 2022; 13:786899. [PMID: 35529579 PMCID: PMC9075586 DOI: 10.3389/fpsyg.2022.786899] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Accepted: 02/22/2022] [Indexed: 12/03/2022] Open
Abstract
Music and spoken language share certain characteristics: both consist of sequences of acoustic elements that are combinatorically combined, and these elements partition the same continuous acoustic dimensions (frequency, formant space and duration). However, the resulting categories differ sharply: scale tones and note durations of small integer ratios appear in music, while speech uses phonemes, lexical tone, and non-isochronous durations. Why did music and language diverge into the two systems we have today, differing in these specific features? We propose a framework based on information theory and a reverse-engineering perspective, suggesting that design features of music and language are a response to their differential deployment along three different continuous dimensions. These include the familiar propositional-aesthetic ('goal') and repetitive-novel ('novelty') dimensions, and a dialogic-choric ('interactivity') dimension that is our focus here. Specifically, we hypothesize that music exhibits specializations enhancing coherent production by several individuals concurrently-the 'choric' context. In contrast, language is specialized for exchange in tightly coordinated turn-taking-'dialogic' contexts. We examine the evidence for our framework, both from humans and non-human animals, and conclude that many proposed design features of music and language follow naturally from their use in distinct dialogic and choric communicative contexts. Furthermore, the hybrid nature of intermediate systems like poetry, chant, or solo lament follows from their deployment in the less typical interactive context.
Collapse
Affiliation(s)
- Felix Haiduk
- Department of Behavioral and Cognitive Biology, University of Vienna, Vienna, Austria
| | - W. Tecumseh Fitch
- Department of Behavioral and Cognitive Biology, University of Vienna, Vienna, Austria
- Vienna Cognitive Science Hub, University of Vienna, Vienna, Austria
| |
Collapse
|
7
|
Wilschut T, Sense F, van der Velde M, Fountas Z, Maaß SC, van Rijn H. Benefits of Adaptive Learning Transfer From Typing-Based Learning to Speech-Based Learning. Front Artif Intell 2021; 4:780131. [PMID: 34950869 PMCID: PMC8689065 DOI: 10.3389/frai.2021.780131] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2021] [Accepted: 10/27/2021] [Indexed: 11/22/2022] Open
Abstract
Memorising vocabulary is an important aspect of formal foreign-language learning. Advances in cognitive psychology have led to the development of adaptive learning systems that make vocabulary learning more efficient. One way these computer-based systems optimize learning is by measuring learning performance in real time to create optimal repetition schedules for individual learners. While such adaptive learning systems have been successfully applied to word learning using keyboard-based input, they have thus far seen little application in word learning where spoken instead of typed input is used. Here we present a framework for speech-based word learning using an adaptive model that was developed for and tested with typing-based word learning. We show that typing- and speech-based learning result in similar behavioral patterns that can be used to reliably estimate individual memory processes. We extend earlier findings demonstrating that a response-time based adaptive learning approach outperforms an accuracy-based, Leitner flashcard approach in learning efficiency (demonstrated by higher average accuracy and lower response times after a learning session). In short, we show that adaptive learning benefits transfer from typing-based learning, to speech based learning. Our work provides a basis for the development of language learning applications that use real-time pronunciation assessment software to score the accuracy of the learner’s pronunciations. We discuss the implications for our approach for the development of educationally relevant, adaptive speech-based learning applications.
Collapse
Affiliation(s)
- Thomas Wilschut
- Department of Experimental Psychology, University of Groningen, Groningen, Netherlands.,Department of Behavioral and Cognitive Neurosciences, University of Groningen, Groningen, Netherlands
| | - Florian Sense
- Department of Experimental Psychology, University of Groningen, Groningen, Netherlands
| | - Maarten van der Velde
- Department of Experimental Psychology, University of Groningen, Groningen, Netherlands.,Department of Behavioral and Cognitive Neurosciences, University of Groningen, Groningen, Netherlands
| | | | - Sarah C Maaß
- Department of Experimental Psychology, University of Groningen, Groningen, Netherlands.,Department of Behavioral and Cognitive Neurosciences, University of Groningen, Groningen, Netherlands.,Aging and Cognition Research Group, DZNE, Magdeburg, Germany
| | - Hedderik van Rijn
- Department of Experimental Psychology, University of Groningen, Groningen, Netherlands.,Department of Behavioral and Cognitive Neurosciences, University of Groningen, Groningen, Netherlands
| |
Collapse
|
8
|
Bögels S, Torreira F. Turn-end Estimation in Conversational Turn-taking: The Roles of Context and Prosody. DISCOURSE PROCESSES 2021. [DOI: 10.1080/0163853x.2021.1986664] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Affiliation(s)
- Sara Bögels
- Department of Communication and Cognition, Tilburg University
- Language and Cognition Department, Max Planck Institute for Psycholinguistics
- Donders Institute for Brain, Cognition, and Behaviour, Radboud University
| | - Francisco Torreira
- Language and Cognition Department, Max Planck Institute for Psycholinguistics
- Department of Linguistics, McGill University
| |
Collapse
|
9
|
Krause PA, Kawamoto AH. Predicting One's Turn With Both Body and Mind: Anticipatory Speech Postures During Dyadic Conversation. Front Psychol 2021; 12:684248. [PMID: 34326798 PMCID: PMC8315268 DOI: 10.3389/fpsyg.2021.684248] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2021] [Accepted: 06/18/2021] [Indexed: 11/17/2022] Open
Abstract
In natural conversation, turns are handed off quickly, with the mean downtime commonly ranging from 7 to 423 ms. To achieve this, speakers plan their upcoming speech as their partner’s turn unfolds, holding the audible utterance in abeyance until socially appropriate. The role played by prediction is debated, with some researchers claiming that speakers predict upcoming speech opportunities, and others claiming that speakers wait for detection of turn-final cues. The dynamics of articulatory triggering may speak to this debate. It is often assumed that the prepared utterance is held in a response buffer and then initiated all at once. This assumption is consistent with standard phonetic models in which articulatory actions must follow tightly prescribed patterns of coordination. This assumption has recently been challenged by single-word production experiments in which participants partly positioned their articulators to anticipate upcoming utterances, long before starting the acoustic response. The present study considered whether similar anticipatory postures arise when speakers in conversation await their next opportunity to speak. We analyzed a pre-existing audiovisual database of dyads engaging in unstructured conversation. Video motion tracking was used to determine speakers’ lip areas over time. When utterance-initial syllables began with labial consonants or included rounded vowels, speakers produced distinctly smaller lip areas (compared to other utterances), prior to audible speech. This effect was moderated by the number of words in the upcoming utterance; postures arose up to 3,000 ms before acoustic onset for short utterances of 1–3 words. We discuss the implications for models of conversation and phonetic control.
Collapse
Affiliation(s)
- Peter A Krause
- Department of Psychology, California State University Channel Islands, Camarillo, CA, United States.,Department of Psychology, University of California, Santa Cruz, Santa Cruz, CA, United States
| | - Alan H Kawamoto
- Department of Psychology, University of California, Santa Cruz, Santa Cruz, CA, United States
| |
Collapse
|
10
|
Sørensen AJM, Fereczkowski M, MacDonald EN. Effects of Noise and Second Language on Conversational Dynamics in Task Dialogue. Trends Hear 2021; 25:23312165211024482. [PMID: 34189999 PMCID: PMC8256248 DOI: 10.1177/23312165211024482] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
This study provides a framework for measuring conversational dynamics between
conversational partners (interlocutors). Conversations from 20 pairs of young,
normal-hearing, native-Danish talkers were recorded when speaking in both quiet
and noise (70 dBA sound pressure level [SPL]) and in Danish and English.
Previous studies investigating the intervals from when one talker stops talking
to when the next one starts, termed floor-transfer offsets (FTOs), suggest that
typical turn-taking requires interlocutors to predict when the current talker
will finish their turn. We hypothesized that adding noise and/or speaking in a
second language (L2) would increase the communication difficulty and result in
longer and more variable FTOs. The median and interquartile range of FTOs
increased slightly in noise, and in L2, there was a small increase in
interquartile range but a small decrease in the median of FTO durations. It took
the participants longer to complete the task in both L2 and noise, indicating
increased communication difficulty. The average duration of interpausal units,
that is, units of connected speech surrounded by silences of 180 ms or more,
increased by 18% in noise and 8% in L2. These findings suggest that talkers held
their turn for longer, allowing more time for speech understanding and planning.
In L2, participants spoke slower, and in both L2 and noise, they took fewer
turns. These changes in behavior may have offset some of the increased
difficulty when communicating in noise or L2. We speculate that talkers
prioritize the maintenance of turn-taking timing over other speech measures.
Collapse
Affiliation(s)
- A Josefine Munch Sørensen
- Hearing Systems Group, Department of Health Technology, Technical University of Denmark, Kongens Lyngby, Denmark
| | - Michal Fereczkowski
- Hearing Systems Group, Department of Health Technology, Technical University of Denmark, Kongens Lyngby, Denmark.,Institute of Clinical Research, Faculty of Health Sciences, University of Southern Denmark, Odense, Denmark.,Research Unit for ORL - Head & Neck Surgery and Audiology, Odense University Hospital, Odense, Denmark; University of Southern Denmark, Odense, Denmark
| | - Ewen N MacDonald
- Hearing Systems Group, Department of Health Technology, Technical University of Denmark, Kongens Lyngby, Denmark.,Department of Systems Design Engineering, University of Waterloo, Waterloo, Ontario, Canada
| |
Collapse
|
11
|
Pekarek Doehler S. Word Order Affects Response Latency: Action Projection and the Timing of Responses to Question-word Questions. DISCOURSE PROCESSES 2021. [DOI: 10.1080/0163853x.2020.1824443] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
12
|
|
13
|
De Stefani E. Embodied Responses to Questions-in-Progress: Silent Nods as Affirmative Answers. DISCOURSE PROCESSES 2021. [DOI: 10.1080/0163853x.2020.1836916] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
14
|
Affiliation(s)
- Arnulf Deppermann
- Pragmatics Department, Leibniz-Institut für Deutsche Sprache, Mannheim, Germany
| | - Lorenza Mondada
- Philosophisch-Historische Fakultät, Departement Sprach- und Literaturwissenschaften, Universität Basel, Basel, Switzerland
| | - Simona Pekarek Doehler
- Institut des Sciences du Langage et de la Communication, Université de Neuchâtel, Neuchâtel, Switzerland
| |
Collapse
|
15
|
Abstract
This work revisits the problem of breathing cues used for management of speaking turns in multiparty casual conversation. We propose a new categorization of turn-taking events which combines the criterion of speaker change with whether the original speaker inhales before producing the next talkspurt. We demonstrate that the latter criterion could be potentially used as a good proxy for pragmatic completeness of the previous utterance (and, by extension, of the interruptive character of the incoming speech). We also present evidence that breath holds are used in reaction to incoming talk rather than as a turn-holding cue. In addition to analysing dimensions which are routinely omitted in studies of interactional functions of breathing (exhalations, presence of overlapping speech, breath holds), the present study also looks at patterns of breath holds in silent breathing and shows that breath holds are sometimes produced toward the beginning (and toward the top) of silent exhalations, potentially indicating an abandoned intention to take the turn. We claim that the breathing signal can thus be successfully used for uncovering hidden turn-taking events, which are otherwise obscured by silence-based representations of interaction.
Collapse
Affiliation(s)
| | - Mattias Heldner
- Department of Linguistics, Stockholm University, Stockholm, Sweden
| |
Collapse
|
16
|
Aruffo C. Reading Scripted Dialogue: Pretending to Take Turns. DISCOURSE PROCESSES 2020. [DOI: 10.1080/0163853x.2019.1651588] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
17
|
Schwering SC, MacDonald MC. Verbal Working Memory as Emergent from Language Comprehension and Production. Front Hum Neurosci 2020; 14:68. [PMID: 32226368 PMCID: PMC7081770 DOI: 10.3389/fnhum.2020.00068] [Citation(s) in RCA: 40] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2019] [Accepted: 02/13/2020] [Indexed: 12/31/2022] Open
Abstract
This article reviews current models of verbal working memory and considers the role of language comprehension and long-term memory in the ability to maintain and order verbal information for short periods of time. While all models of verbal working memory posit some interaction with long-term memory, few have considered the character of these long-term representations or how they might affect performance on verbal working memory tasks. Similarly, few models have considered how comprehension processes and production processes might affect performance in verbal working memory tasks. Modern theories of comprehension emphasize that people learn a vast web of correlated information about the language and the world and must activate that information from long-term memory to cope with the demands of language input. To date, there has been little consideration in theories of verbal working memory for how this rich input from comprehension would affect the nature of temporary memory. There has also been relatively little attention to the degree to which language production processes naturally manage serial order of verbal information. The authors argue for an emergent model of verbal working memory supported by a rich, distributed long-term memory for language. On this view, comprehension processes provide encoding in verbal working memory tasks, and production processes maintenance, serial ordering, and recall. Moreover, the computational capacity to maintain and order information varies with language experience. Implications for theories of working memory, comprehension, and production are considered.
Collapse
|
18
|
Pause for effect: A 10-s interviewer wait time gives children time to respond to open-ended prompts. J Exp Child Psychol 2020; 194:104824. [PMID: 32127193 DOI: 10.1016/j.jecp.2020.104824] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2019] [Revised: 01/30/2020] [Accepted: 01/30/2020] [Indexed: 11/22/2022]
Abstract
When researchers and helping professionals interview children about a target event, how long should they tolerate silence before delivering another prompt? In other words, at what point are children so unlikely to begin talking again that continued silence would likely be unproductive? To test the reasonableness of a 10-s wait time guideline during open-ended prompting, we analyzed the wait times of research assistants (N = 7) who interviewed with a 10-s guideline, timed how quickly children responded to prompts, and also timed pauses within children's event narratives. In our sample (105 conversations with children aged 4-8 years), interviewers complied fully with the 10-s rule in the majority of interviews, children often paused for longer than 5 s before beginning to talk about the event or continuing a narrative, and more than 96% of children's pauses that were followed by event information fell within the 10-s window. These findings show that the 10-s wait time was a practical guideline that gave children time to respond without peppering interviews with uncomfortably long pauses. We conclude that adding wait time guidelines to protocols for interviewing children, and augmenting guidelines with wait time training for research assistants and helping professionals, could improve the quality of information obtained from children and advance our understanding of age differences in event memory.
Collapse
|
19
|
Sjerps MJ, Decuyper C, Meyer AS. Initiation of utterance planning in response to pre-recorded and "live" utterances. Q J Exp Psychol (Hove) 2019; 73:357-374. [PMID: 31544625 DOI: 10.1177/1747021819881265] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
In everyday conversation, interlocutors often plan their utterances while listening to their conversational partners, thereby achieving short gaps between their turns. Important issues for current psycholinguistics are how interlocutors distribute their attention between listening and speech planning and how speech planning is timed relative to listening. Laboratory studies addressing these issues have used a variety of paradigms, some of which have involved using recorded speech to which participants responded, whereas others have involved interactions with confederates. This study investigated how this variation in the speech input affected the participants' timing of speech planning. In Experiment 1, participants responded to utterances produced by a confederate, who sat next to them and looked at the same screen. In Experiment 2, they responded to recorded utterances of the same confederate. Analyses of the participants' speech, their eye movements, and their performance in a concurrent tapping task showed that, compared with recorded speech, the presence of the confederate increased the processing load for the participants, but did not alter their global sentence planning strategy. These results have implications for the design of psycholinguistic experiments and theories of listening and speaking in dyadic settings.
Collapse
Affiliation(s)
- Matthias J Sjerps
- Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, The Netherlands
| | - Caitlin Decuyper
- Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands
| | - Antje S Meyer
- Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, The Netherlands.,Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands
| |
Collapse
|
20
|
Demartsev V, Strandburg-Peshkin A, Ruffner M, Manser M. Vocal Turn-Taking in Meerkat Group Calling Sessions. Curr Biol 2018; 28:3661-3666.e3. [DOI: 10.1016/j.cub.2018.09.065] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2018] [Revised: 08/07/2018] [Accepted: 09/26/2018] [Indexed: 01/14/2023]
|
21
|
Corps RE, Gambi C, Pickering MJ. Coordinating Utterances During Turn-Taking: The Role of Prediction, Response Preparation, and Articulation. DISCOURSE PROCESSES 2017. [DOI: 10.1080/0163853x.2017.1330031] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Affiliation(s)
- Ruth E. Corps
- Department of Psychology University of Edinburgh, Edinburgh, UK
| | - Chiara Gambi
- Department of Psychology University of Edinburgh, Edinburgh, UK
| | | |
Collapse
|
22
|
Włodarczak M, Heldner M. Respiratory Constraints in Verbal and Non-verbal Communication. Front Psychol 2017; 8:708. [PMID: 28567023 PMCID: PMC5434352 DOI: 10.3389/fpsyg.2017.00708] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2017] [Accepted: 04/21/2017] [Indexed: 11/24/2022] Open
Abstract
In the present paper we address the old question of respiratory planning in speech production. We recast the problem in terms of speakers' communicative goals and propose that speakers try to minimize respiratory effort in line with the H&H theory. We analyze respiratory cycles coinciding with no speech (i.e., silence), short verbal feedback expressions (SFE's) as well as longer vocalizations in terms of parameters of the respiratory cycle and find little evidence for respiratory planning in feedback production. We also investigate timing of speech and SFEs in the exhalation and contrast it with nods. We find that while speech is strongly tied to the exhalation onset, SFEs are distributed much more uniformly throughout the exhalation and are often produced on residual air. Given that nods, which do not have any respiratory constraints, tend to be more frequent toward the end of an exhalation, we propose a mechanism whereby respiratory patterns are determined by the trade-off between speakers' communicative goals and respiratory constraints.
Collapse
Affiliation(s)
| | - Mattias Heldner
- Department of Linguistics, Stockholm UniversityStockholm, Sweden
| |
Collapse
|
23
|
Goregliad Fjaellingsdal T, Ruigendijk E, Scherbaum S, Bleichner MG. The N400 Effect during Speaker-Switch-Towards a Conversational Approach of Measuring Neural Correlates of Language. Front Psychol 2016; 7:1854. [PMID: 27965604 PMCID: PMC5124707 DOI: 10.3389/fpsyg.2016.01854] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2016] [Accepted: 11/09/2016] [Indexed: 11/21/2022] Open
Abstract
Language occurs naturally in conversations. However, the study of the neural underpinnings of language has mainly taken place in single individuals using controlled language material. The interactive elements of a conversation (e.g., turn-taking) are often not part of neurolinguistic setups. The prime reason is the difficulty to combine open unrestricted conversations with the requirements of neuroimaging. It is necessary to find a trade-off between the naturalness of a conversation and the restrictions imposed by neuroscientific methods to allow for ecologically more valid studies. Here, we make an attempt to study the effects of a conversational element, namely turn-taking, on linguistic neural correlates, specifically the N400 effect. We focus on the physiological aspect of turn-taking, the speaker-switch, and its effect on the detectability of the N400 effect. The N400 event-related potential reflects expectation violations in a semantic context; the N400 effect describes the difference of the N400 amplitude between semantically expected and unexpected items. Sentences with semantically congruent and incongruent final words were presented in two turn-taking modes: (1) reading aloud first part of the sentence and listening to speaker-switch for the final word, and (2) listening to first part of the sentence and speaker-switch for the final word. A significant N400 effect was found for both turn-taking modes, which was not influenced by the mode itself. However, the mode significantly affected the P200, which was increased for the reading aloud mode compared to the listening mode. Our results show that an N400 effect can be detected during a speaker-switch. Speech articulation (reading aloud) before the analyzed sentence fragment did also not impede the N400 effect detection for the final word. The speaker-switch, however, seems to influence earlier components of the electroencephalogram, related to processing of salient stimuli. We conclude that the N400 can effectively be used to study neural correlates of language in conversational approaches including speaker-switches.
Collapse
Affiliation(s)
| | - Esther Ruigendijk
- Cluster of Excellence Hearing4all, University of OldenburgOldenburg, Germany; Department of Dutch, University of OldenburgOldenburg, Germany
| | - Stefan Scherbaum
- Department of Psychology, Technische Universität Dresden Dresden, Germany
| | - Martin G Bleichner
- Department of Psychology, European Medical School, University of OldenburgOldenburg, Germany; Cluster of Excellence Hearing4all, University of OldenburgOldenburg, Germany
| |
Collapse
|
24
|
Levinson SC. Turn-taking in Human Communication--Origins and Implications for Language Processing. Trends Cogn Sci 2015; 20:6-14. [PMID: 26651245 DOI: 10.1016/j.tics.2015.10.010] [Citation(s) in RCA: 235] [Impact Index Per Article: 26.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2015] [Revised: 10/26/2015] [Accepted: 10/28/2015] [Indexed: 11/16/2022]
Abstract
Most language usage is interactive, involving rapid turn-taking. The turn-taking system has a number of striking properties: turns are short and responses are remarkably rapid, but turns are of varying length and often of very complex construction such that the underlying cognitive processing is highly compressed. Although neglected in cognitive science, the system has deep implications for language processing and acquisition that are only now becoming clear. Appearing earlier in ontogeny than linguistic competence, it is also found across all the major primate clades. This suggests a possible phylogenetic continuity, which may provide key insights into language evolution.
Collapse
Affiliation(s)
- Stephen C Levinson
- Max Planck Institute for Psycholinguistics, Wundtlaan 1, NL-6525 XD Nijmegen, The Netherlands; Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, The Netherlands.
| |
Collapse
|
25
|
Piai V, Roelofs A, Rommers J, Dahlslätt K, Maris E. Withholding planned speech is reflected in synchronized beta-band oscillations. Front Hum Neurosci 2015; 9:549. [PMID: 26528164 PMCID: PMC4601260 DOI: 10.3389/fnhum.2015.00549] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2015] [Accepted: 09/18/2015] [Indexed: 11/13/2022] Open
Abstract
When engaged in a conversation, speakers sometimes have to withhold a planned response, for example, before it is their turn to speak. In the present study, using magnetoencephalography (MEG) outside of a conversational setting, we investigate the oscillatory brain mechanisms involved in the process of withholding a planned verbal response until it is time to speak. Our participants viewed a sequence of four random consonant strings and one pseudoword, which they had to pronounce when the fifth string (the imperative stimulus) was presented. The pseudoword appeared either as the fourth or fifth stimulus in the sequence, creating two conditions. In the withhold condition, the pseudoword was the fourth string and the verbal response was withheld until the imperative stimulus was presented. In the control condition, the fifth string was the pseudoword, so no response was withheld. We compared oscillatory responses to the withhold relative to the control condition in the time period preceding speech. Alpha-beta power (8–30 Hz) decreased over occipital sensors in the withhold condition relative to the control condition. Source-level analysis indicated a posterior source (i.e., occipital cortex) associated with the alpha-beta power decreases. This occipital alpha-beta desynchronization likely reflects attentional allocation to the upcoming imperative stimulus. Moreover, beta (12–20 Hz) power increased over frontal sensors. Source-level analysis indicated a frontal source (i.e., middle and superior frontal gyri) associated with the beta-power increases. We interpret the frontal beta synchronization to reflect a mechanism aiding the maintenance of the current motor or cognitive state. Our results provide a window into a possible oscillatory mechanism implementing the ability of speakers to withhold a planned verbal response until they have to speak.
Collapse
Affiliation(s)
- Vitória Piai
- Donders Institute for Brain, Cognition and Behaviour, Radboud University Nijmegen, Netherlands ; Knight Lab, Department of Psychology and Helen Wills Neuroscience Institute, University of California, Berkeley Berkeley, CA, USA
| | - Ardi Roelofs
- Donders Institute for Brain, Cognition and Behaviour, Radboud University Nijmegen, Netherlands
| | - Joost Rommers
- Department of Psychology and The Beckman Institute for Advanced Science and Technology, University of Illinois Urbana, IL, USA
| | | | - Eric Maris
- Donders Institute for Brain, Cognition and Behaviour, Radboud University Nijmegen, Netherlands
| |
Collapse
|
26
|
Bögels S, Magyari L, Levinson SC. Neural signatures of response planning occur midway through an incoming question in conversation. Sci Rep 2015; 5:12881. [PMID: 26242909 PMCID: PMC4525376 DOI: 10.1038/srep12881] [Citation(s) in RCA: 70] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2015] [Accepted: 07/13/2015] [Indexed: 11/22/2022] Open
Abstract
A striking puzzle about language use in everyday conversation is that turn-taking latencies are usually very short, whereas planning language production takes much longer. This implies overlap between language comprehension and production processes, but the nature and extent of such overlap has never been studied directly. Combining an interactive quiz paradigm with EEG measurements in an innovative way, we show that production planning processes start as soon as possible, that is, within half a second after the answer to a question can be retrieved (up to several seconds before the end of the question). Localization of ERP data shows early activation even of brain areas related to late stages of production planning (e.g., syllabification). Finally, oscillation results suggest an attention switch from comprehension to production around the same time frame. This perspective from interactive language use throws new light on the performance characteristics that language competence involves.
Collapse
Affiliation(s)
- Sara Bögels
- Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands
| | - Lilla Magyari
- 1] Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands [2] Pazmany Peter Catholic University, Faculty of Humanities and Social Sciences, Department of General Psychology, Budapest, Hungary
| | - Stephen C Levinson
- 1] Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands [2] Donders Institute for Brain, Cognition and Behaviour, The Netherlands
| |
Collapse
|
27
|
Levinson SC, Torreira F. Timing in turn-taking and its implications for processing models of language. Front Psychol 2015; 6:731. [PMID: 26124727 PMCID: PMC4464110 DOI: 10.3389/fpsyg.2015.00731] [Citation(s) in RCA: 130] [Impact Index Per Article: 14.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2015] [Accepted: 05/16/2015] [Indexed: 12/03/2022] Open
Abstract
The core niche for language use is in verbal interaction, involving the rapid exchange of turns at talking. This paper reviews the extensive literature about this system, adding new statistical analyses of behavioral data where they have been missing, demonstrating that turn-taking has the systematic properties originally noted by Sacks et al. (1974; hereafter SSJ). This system poses some significant puzzles for current theories of language processing: the gaps between turns are short (of the order of 200 ms), but the latencies involved in language production are much longer (over 600 ms). This seems to imply that participants in conversation must predict (or 'project' as SSJ have it) the end of the current speaker's turn in order to prepare their response in advance. This in turn implies some overlap between production and comprehension despite their use of common processing resources. Collecting together what is known behaviorally and experimentally about the system, the space for systematic explanations of language processing for conversation can be significantly narrowed, and we sketch some first model of the mental processes involved for the participant preparing to speak next.
Collapse
Affiliation(s)
- Stephen C. Levinson
- Language and Cognition Department, Max Planck Institute for PsycholinguisticsNijmegen, Netherlands
- Donders Institute for Brain, Cognition and Behaviour, Radboud UniversityNijmegen, Netherlands
| | - Francisco Torreira
- Language and Cognition Department, Max Planck Institute for PsycholinguisticsNijmegen, Netherlands
| |
Collapse
|
28
|
Roberts SG, Torreira F, Levinson SC. The effects of processing and sequence organization on the timing of turn taking: a corpus study. Front Psychol 2015; 6:509. [PMID: 26029125 PMCID: PMC4429583 DOI: 10.3389/fpsyg.2015.00509] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2015] [Accepted: 04/10/2015] [Indexed: 11/13/2022] Open
Abstract
The timing of turn taking in conversation is extremely rapid given the cognitive demands on speakers to comprehend, plan and execute turns in real time. Findings from psycholinguistics predict that the timing of turn taking is influenced by demands on processing, such as word frequency or syntactic complexity. An alternative view comes from the field of conversation analysis, which predicts that the rules of turn-taking and sequence organization may dictate the variation in gap durations (e.g., the functional role of each turn in communication). In this paper, we estimate the role of these two different kinds of factors in determining the speed of turn-taking in conversation. We use the Switchboard corpus of English telephone conversation, already richly annotated for syntactic structure speech act sequences, and segmental alignment. To this we add further information including Floor Transfer Offset (the amount of time between the end of one turn and the beginning of the next), word frequency, concreteness, and surprisal values. We then apply a novel statistical framework ("random forests") to show that these two dimensions are interwoven together with indexical properties of the speakers as explanatory factors determining the speed of response. We conclude that an explanation of the of the timing of turn taking will require insights from both processing and sequence organization.
Collapse
Affiliation(s)
- Seán G. Roberts
- Language and Cognition Department, Max Planck Institute for PsycholinguisticsNijmegen, Netherlands
| | | | | |
Collapse
|
29
|
de Vos C, Torreira F, Levinson SC. Turn-timing in signed conversations: coordinating stroke-to-stroke turn boundaries. Front Psychol 2015; 6:268. [PMID: 25852593 PMCID: PMC4371657 DOI: 10.3389/fpsyg.2015.00268] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2014] [Accepted: 02/23/2015] [Indexed: 11/13/2022] Open
Abstract
In spoken interactions, interlocutors carefully plan, and time their utterances, minimizing gaps and overlaps between consecutive turns. Cross-linguistic comparison has indicated that spoken languages vary only minimally in terms of turn-timing, and language acquisition research has shown pre-linguistic vocal turn-taking in the first half year of life. These observations suggest that the turn-taking system may provide a fundamental basis for our linguistic capacities. The question remains, however, to what extent our capacity for rapid turn-taking is determined by modality constraints. The avoidance of overlapping turns could be motivated by the difficulty of hearing and speaking at the same time. If so, turn-taking in sign might show greater toleration for overlap. Alternatively, signed conversations may show a similar distribution of turn-timing as spoken languages, thus avoiding both gaps and overlaps. To address this question we look at turn-timing in question-answer sequences in spontaneous conversations of Sign Language of the Netherlands. The findings indicate that although there is considerable overlap in two or more signers' articulators in conversation, when proper allowance is made for onset preparation, post-utterance retraction and the intentional holding of signs for response, turn-taking latencies in sign look remarkably like those reported for spoken language. This is consistent with the possibility that, at least with regard to responses to questions, speakers and signers follow similar time courses in planning and producing their utterances in on-going conversation. This suggests that turn-taking systems may well be a shared cognitive infrastructure underlying all modern human languages, both spoken and signed.
Collapse
Affiliation(s)
- Connie de Vos
- Language and Cognition Department, Max Planck Institute for Psycholinguistics, Nijmegen Netherlands
| | - Francisco Torreira
- Language and Cognition Department, Max Planck Institute for Psycholinguistics, Nijmegen Netherlands
| | - Stephen C Levinson
- Language and Cognition Department, Max Planck Institute for Psycholinguistics, Nijmegen Netherlands ; Radboud University, Nijmegen Netherlands
| |
Collapse
|