1
|
Ajuwon V, Ojeda A, Murphy RA, Monteiro T, Kacelnik A. Paradoxical choice and the reinforcing value of information. Anim Cogn 2023; 26:623-637. [PMID: 36306041 PMCID: PMC9950180 DOI: 10.1007/s10071-022-01698-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2022] [Revised: 09/07/2022] [Accepted: 10/01/2022] [Indexed: 11/01/2022]
Abstract
Signals that reduce uncertainty can be valuable because well-informed decision-makers can better align their preferences to opportunities. However, some birds and mammals display an appetite for informative signals that cannot be used to increase returns. We explore the role that reward-predictive stimuli have in fostering such preferences, aiming at distinguishing between two putative underlying mechanisms. The 'information hypothesis' proposes that reducing uncertainty is reinforcing per se, somewhat consistently with the concept of curiosity: a motivation to know in the absence of tractable extrinsic benefits. In contrast, the 'conditioned reinforcement hypothesis', an associative account, proposes asymmetries in secondarily acquired reinforcement: post-choice stimuli announcing forthcoming rewards (S+) reinforce responses more than stimuli signalling no rewards (S-) inhibit responses. In three treatments, rats faced two equally profitable options delivering food probabilistically after a fixed delay. In the informative option (Info), food or no food was signalled immediately after choice, whereas in the non-informative option (NoInfo) outcomes were uncertain until the delay lapsed. Subjects preferred Info when (1) both outcomes were explicitly signalled by salient auditory cues, (2) only forthcoming food delivery was explicitly signalled, and (3) only the absence of forthcoming reward was explicitly signalled. Acquisition was slower in (3), when food was not explicitly signalled, showing that signals for positive outcomes have a greater influence on the development of preference than signals for negative ones. Our results are consistent with an elaborated conditioned reinforcement account, and with the conjecture that both uncertainty reduction and conditioned reinforcement jointly act to generate preference.
Collapse
Affiliation(s)
- Victor Ajuwon
- Department of Biology, University of Oxford, Oxford, UK.
| | - Andrés Ojeda
- grid.4991.50000 0004 1936 8948Department of Biology, University of Oxford, Oxford, UK
| | - Robin A. Murphy
- grid.4991.50000 0004 1936 8948Department of Experimental Psychology, University of Oxford, Oxford, UK
| | - Tiago Monteiro
- grid.4991.50000 0004 1936 8948Department of Biology, University of Oxford, Oxford, UK ,grid.6583.80000 0000 9686 6466Domestication Lab, Department of Interdisciplinary Life Sciences, Konrad Lorenz Institute of Ethology, University of Veterinary Medicine Vienna, Vienna, Austria
| | - Alex Kacelnik
- Department of Biology, University of Oxford, Oxford, UK.
| |
Collapse
|
2
|
Gomes-Ng S, Elliffe D, Cowie S. Environment tracking and signal following in a reinforcer-ratio reversal procedure. Behav Processes 2018; 157:208-224. [PMID: 30315866 DOI: 10.1016/j.beproc.2018.10.001] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2018] [Revised: 09/27/2018] [Accepted: 10/04/2018] [Indexed: 01/05/2023]
Abstract
Several studies suggest that the degree of control by reinforcer ratios (environment tracking) and by exteroceptive stimuli that signal future reinforcer availability (signal following) depends on environmental certainty: As reinforcers become more likely at one location, environmental contingencies exert stronger control and exteroceptive stimuli exert weaker control. This research has not yet been extended to environments in which reinforcer availability changes across time, even though such changes are present in most natural environments. Thus, in the present experiment, we examined environment tracking and signal following in a concurrent schedule in which the reinforcer ratio reversed to its reciprocal 30 s after a reinforcer delivery and keylight-color stimuli signaled the likely or definite time or location of the next reinforcer. Across conditions, we manipulated environmental certainty by varying the probability of reinforcer deliveries on the locally richer key. This made the location of future reinforcers at a particular time more or less certain, but did not change the overall reinforcer ratio. Changes in local environmental certainty had little to no effect on environment tracking and signal following; in all conditions, keylight-color stimuli strongly controlled choice and reinforcer ratios exerted weak control. The present findings suggest that the extent of environment tracking and signal following is primarily determined by global, not local, environmental certainty.
Collapse
Affiliation(s)
| | - Douglas Elliffe
- School of Psychology, The University of Auckland, New Zealand
| | - Sarah Cowie
- School of Psychology, The University of Auckland, New Zealand
| |
Collapse
|
3
|
Bland VJ, Bai JY, Fullerton JA, Podlesnik CA. Signaled alternative reinforcement and the persistence of operant behavior. J Exp Anal Behav 2016; 106:22-33. [DOI: 10.1002/jeab.212] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2015] [Accepted: 05/14/2016] [Indexed: 11/08/2022]
Affiliation(s)
| | | | | | - Christopher A. Podlesnik
- The University of Auckland
- Florida Institute of Technology and The Scott Center for Autism Treatment
| |
Collapse
|
4
|
Meyer PJ, Cogan ES, Robinson TE. The form of a conditioned stimulus can influence the degree to which it acquires incentive motivational properties. PLoS One 2014; 9:e98163. [PMID: 24905195 PMCID: PMC4048203 DOI: 10.1371/journal.pone.0098163] [Citation(s) in RCA: 57] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2014] [Accepted: 04/29/2014] [Indexed: 11/19/2022] Open
Abstract
There is considerable individual variation in the extent to which food- and drug-associated cues (conditioned stimuli, CSs) acquire incentive salience, as indicated by whether they elicit approach towards them, and/or act as conditioned reinforcers. Here we asked whether this variation is influenced by properties of the CS itself. In rats, we assessed both the attractiveness and conditioned reinforcing properties of two CSs: a manipulable lever CS versus an auditory (tone) CS. There was considerable individual variation in the extent to which a lever CS acquired incentive motivational properties, as indicated by whether it became attractive (evoked a sign-tracking or goal-tracking conditioned response) or acted as a conditioned reinforcer. However, with a tone CS all rats learned a goal-tracking response, and the tone CS was an equally effective conditioned reinforcer in sign-trackers and goal-trackers. Even when presented in compound (a lever-tone CS), the two elements of the compound differentially acquired motivational properties. In contrast, amphetamine and stress potentiated the conditioned reinforcing properties of both visual and auditory CSs similarly in rats that primarily sign-tracked or goal-tracked. We conclude that variation in the to the ability of CSs to acquire incentive salience, and thus their ability to act as incentive stimuli capable of motivating behavior, is determined in part by properties of the CS itself.
Collapse
Affiliation(s)
- Paul J. Meyer
- Department of Psychology, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Elizabeth S. Cogan
- Department of Psychology, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Terry E. Robinson
- Department of Psychology, University of Michigan, Ann Arbor, Michigan, United States of America
| |
Collapse
|
5
|
Fantino E. Judgment and decision making: Behavioral approaches. THE BEHAVIOR ANALYST 2012; 21:203-18. [PMID: 22478308 DOI: 10.1007/bf03391964] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
The area of judgment and decision making has given rise to the study of many interesting phenomena, including reasoning fallacies, which are also of interest to behavior analysts. Indeed, techniques and principles of behavior analysis may be applied to study these fallacies. This article reviews research from a behavioral perspective that suggests that humans are not the information-seekers we sometimes suppose ourselves to be. Nor do we utilize information effectively when it is presented. This is shown from the results of research utilizing matching to sample and other behavioral tools (monetary reward, feedback, instructional control) to study phenomena such as the conjunction fallacy, base-rate neglect, and probability matching. Research from a behavioral perspective can complement research from other perspectives in furthering our understanding of judgment and decision making.
Collapse
|
6
|
Models of trace decay, eligibility for reinforcement, and delay of reinforcement gradients, from exponential to hyperboloid. Behav Processes 2011; 87:57-63. [PMID: 21215304 DOI: 10.1016/j.beproc.2010.12.016] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2010] [Revised: 12/24/2010] [Accepted: 12/27/2010] [Indexed: 11/24/2022]
Abstract
Behavior such as depression of a lever or perception of a stimulus may be strengthened by consequent behaviorally significant events (BSEs), such as reinforcers. This is the Law of Effect. As time passes since its emission, the ability for the behavior to be reinforced decreases. This is trace decay. It is upon decayed traces that subsequent BSEs operate. If the trace comes from a response, it constitutes primary reinforcement; if from perception of an extended stimulus, it is classical conditioning. This paper develops simple models of these processes. It premises exponentially decaying traces related to the richness of the environment, and conditioned reinforcement as the average of such traces over the extended stimulus, yielding an almost-hyperbolic function of duration. The models account for some data, and reinforce the theories of other analysts by providing a sufficient account of the provenance of these effects. It leads to a linear relation between sooner and later isopreference delays whose slope depends on sensitivity to reinforcement, and intercept on that and the steepness of the delay gradient. Unlike human prospective judgments, all control is vested in either primary or secondary reinforcement processes; therefore the use of the term discounting, appropriate for humans, may be less descriptive of the behavior of nonverbal organisms.
Collapse
|
7
|
Allen KD, Lattal KA. On conditioned reinforcing effects of negative discriminative stimuli. J Exp Anal Behav 2010; 52:335-9. [PMID: 16812600 PMCID: PMC1339185 DOI: 10.1901/jeab.1989.52-335] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
Observing responses by pigeons were studied during sessions in which a food key and an observing key were available continuously. A variable-interval schedule and extinction alternated randomly on the food key. In one condition, food-key pecking during extinction decreased reinforcement frequency during the next variable-interval component, and in the other condition such pecking did not affect reinforcement frequency. Observing responses either changed both keylight colors from white to green (S+) or to red (S-) depending on the condition on the food key, or the observing responses never produced the S+ but produced the S- when extinction was in effect on the food key. Observing responses that produced only S- were maintained only when food-key pecking during extinction decreased reinforcement frequency in the subsequent variable-interval component. The red light conformed to conventional definitions of a negative discriminative stimulus, rendering results counter to previous findings that production of S- alone does not maintain observing. Rather than offering support for an informational account of conditioned reinforcement, the results are discussed in terms of a molar analysis to account for how stimuli acquire response-maintaining properties.
Collapse
|
8
|
Mulvaney DE, Dinsmoor JA, Jwaideh AR, Hughes LH. Punishment of observing by the negative discriminative stimulus. J Exp Anal Behav 2010; 21:37-44. [PMID: 16811733 PMCID: PMC1333168 DOI: 10.1901/jeab.1974.21-37] [Citation(s) in RCA: 104] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
To determine the effect of a negative discriminative stimulus on the response producing it, two pigeons were each studied in a three-key conditioning chamber. During alternating periods of unpredictable duration, pecking the center (food) key either was reinforced with grain on a variable-interval schedule or was never reinforced. On equal but independent variable-interval schedules, pecking either of the side (observing) keys changed the color of all keys for 30 sec from yellow to either green or red. When the schedule on the center key was variable-interval reinforcement, the color was green (positive discriminative stimulus); when no reinforcements were scheduled, the color was red (negative discriminative stimulus). Since pecking the side keys did not affect grain deliveries, changes in the rate of pecking could not be ascribed to changes in the frequency of primary reinforcement. In subsequent sessions, red was withheld as one of the possible consequences of pecking a given side key. When red was omitted, the rate on that key increased, and when red was restored, the rate decreased. It was concluded that red illumination of the keys, the negative discriminative stimulus, had a suppressive effect on the response that produced it.
Collapse
|
9
|
Green L, Rachlin H. Pigeons' preferences for stimulus information: effects of amount of information. J Exp Anal Behav 2010; 27:255-63. [PMID: 16811988 PMCID: PMC1333589 DOI: 10.1901/jeab.1977.27-255] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
A concurrent-chain procedure was used to study pigeons' preferences as a function of amount of information. Pigeons chose between two terminal links. Both terminal links ended in food reinforcement with probability (p) and in blackout with probability (1-p). One terminal link (noninformative link) was signalled by a stimulus uncorrelated with either food or blackout. The other terminal link (informative link) was signalled by stimuli correlated with these outcomes. Amount of information conveyed by these stimuli was varied across conditions by changing the probability of reinforcement (p) and blackout (1-p). The pigeons strongly preferred the informative link, and preferences were greater at p values above 0.50 than for their complements. The pigeons engaged in different behaviors during the stimulus periods, suggesting that the value of informative stimuli may be in their function as discriminative stimuli for interim activities and terminal responses.
Collapse
|
10
|
Abstract
In a concurrent-chains procedure, pigeons chose between equivalent mixed and multiple fixed-interval schedules of reinforcement. In the first experiment, preference for the multiple schedule was higher when the probability of the shorter fixed interval was less than .50 than for complementary points, an outcome consistent with the delay-reduction hypothesis of conditioned reinforcement and observing, but inconsistent with the uncertainty-reduction hypothesis which requires symmetrical preferences with a maximum when the two intervals are equiprobable. A second experiment assessed preference for equivalent mixed and multiple schedules when each choice outcome resulted in two reinforcements, one on the longer and one on the shorter fixed interval. The order of the two fixed intervals was determined probabilistically. Pigeons again preferred multiple to mixed schedules, although multiple-schedule preference did not vary systematically with the likelihood of the shorter fixed interval occurring first. The results from these choice procedures are consistent with those from the observing-response literature in suggesting that the strength of a stimulus cannot be well described as a function of the degree of uncertainty reduction the stimulus provides about reinforcement.
Collapse
|
11
|
Perone M, Baron A. Reinforcement of human observing behavior by a stimulue correlated with extinction or increased effort. J Exp Anal Behav 2010; 34:239-61. [PMID: 16812189 PMCID: PMC1333004 DOI: 10.1901/jeab.1980.34-239] [Citation(s) in RCA: 66] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
Young men pulled a plunger on mixed and multiple schedules in which periods of variable-interval monetary reinforcement alternated irregularly with periods of extinction (Experiment 1), or in which reinforcement was contingent on different degrees of effort in the two alternating components (Experiment 2). In the baseline conditions, the pair of stimuli correlated with the schedule components could be obtained intermittently by pressing either of two observing keys. In the main conditions, pressing one of the keys continued to produce both discriminative stimuli as appropriate. Pressing the other key produced only the stimulus correlated with variable-interval reinforcement or reduced effort; presses on this key were ineffective during periods of extinction or increased effort. In both experiments, key presses producing both stimuli occurred at higher rates than key presses producing only one, demonstrating enhancement of observing behavior by a stimulus correlated with the less favorable of two contingencies. A control experiment showed that stimulus change alone was not an important factor in the maintenance of the behavior. These findings suggest that negative as well as positive stimuli may play a role in the conditioned reinforcement of human behavior.
Collapse
|
12
|
Perone M, Kaminski BJ. Conditioned reinforcement of human observing behavior by descriptive and arbitrary verbal stimuli. J Exp Anal Behav 2010; 58:557-75. [PMID: 16812679 PMCID: PMC1322102 DOI: 10.1901/jeab.1992.58-557] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
College students earned monetary reinforcers by pressing a key according to a compound schedule with variable-interval and extinction components. Pressing additional keys occasionally produced displays of either of two verbal stimuli; one was uncorrelated with the schedule components, and the other was correlated with the extinction component. In Experiments 1 and 2, the display area of the apparatus was blank unless an observing key was pressed, whereupon a descriptive message appeared. Most students preferred an uncorrelated stimulus stating that "Some of this time scores are TWICE AS LIKELY as normal, and some of this time NO SCORES can be earned" over a stimulus stating that "At this time NO SCORES can be earned." In Experiment 3, the display area indicated that "The Current Status of the Program is: NOT SHOWN." Presses on the observing keys replaced this message with stimuli that provided arbitrary labels for the schedule conditions. All of the students preferred a stimulus stating that "The Current Status of the Program is: B" over an uncorrelated stimulus stating that "The Current Status of the Program is: either A or B." Thus, under some circumstances, observing was maintained by a stimulus correlated with extinction-a finding that poses a challenge for Pavolvian accounts of conditioned reinforcement. Differences in the maintenance of observing by the descriptive and arbitrary stimuli may be attributed to differences in either the strength or nature of the instructional control exerted by the verbal stimuli.
Collapse
|
13
|
Catania AC. Freedom and knowledge: an experimental analysis of preference in pigeons. J Exp Anal Behav 2010; 24:89-106. [PMID: 16811866 PMCID: PMC1333385 DOI: 10.1901/jeab.1975.24-89] [Citation(s) in RCA: 64] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Relative responding in initial links of concurrent-chain schedules showed that pigeons preferred free to forced choices and informative to uninformative stimuli. Variable-interval initial links on two lower keys (white) of a six-key chamber produced terminal links on either two upper-left keys (blue and/or amber) or two upper-right keys (green and/or red). Terminal.links in which pecks on either of two lit keys produced fixed-interval reinforcement (free choice) were preferred to links with only one lit fixed-interval key available (forced choice). Terminal links with different key colors correlated with concurrent fixed-interval reinforcement and extinction (informative stimuli) were preferred to links with these schedules operating on same-color keys (uninformative stimuli). Scheduling extinction for one of the two free-choice keys assessed preference for two lit keys over one lit key, but confounded number with whether stimuli were informative. Fixed-interval reinforcement for both keys in each terminal link, but with different-color keys in one link and same-color keys in the other, showed that preference for informative stimuli did not depend on stimulus variety. Preferences were independent of relative responses per reinforcement and other properties of terminal-link performance.
Collapse
|
14
|
Dinsmoor JA, Mulvaney DE, Jwaideh AR. Conditioned reinforcement as a function of duration of stimulus. J Exp Anal Behav 2010; 36:41-9. [PMID: 16812230 PMCID: PMC1333051 DOI: 10.1901/jeab.1981.36-41] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Pigeons were provided with three keys. Pecking the center key produced grain on a schedule that alternated at unpredictable times between a variable-interval component and extinction. On concurrent variable-interval schedules, pecking either side key produced a stimulus associated with the variable-interval component on the center key provided that said schedule was currently in effect. The independent variable was the length of time this stimulus remained on the keys. Pecking one side key produced the stimulus for 27 seconds, whereas the duration produced by pecking the other key varied for successive blocks of sessions. For the first four birds, the values tested were 3, 9, 27, and 81 seconds. For the second group, numbering three birds, the values tested were 1, 3, 9, and 27 seconds. The dependent variable was the proportion of total side key pecks that occurred on the variable key. For all birds, the function was positive in slope and negative in acceleration. This finding supports a formulation that ascribes the maintenance of observing responses in a normal setting to the fact that the subject exposes itself to the positive discriminative stimulus for a longer mean duration than it does to the negative stimulus.
Collapse
|
15
|
Wasserman EA, Anderson PA. Differential autoshaping to common and distinctive elements of positive and negative discriminative stimuli. J Exp Anal Behav 2010; 22:491-6. [PMID: 16811812 PMCID: PMC1333297 DOI: 10.1901/jeab.1974.22-491] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
The learning by hungry pigeons of a discrimination between two successively presented compound visual stimuli was investigated using a two-key autoshaping procedure. Common and distinctive stimulus elements were simultaneously presented on separate keys and either followed by food delivery, S+, or not, S-. The subjects acquired both between-trial and within-trial discriminations. On S+ trials, pigeons pecked the distinctive stimulus more than the common stimulus; before responding ceased on S- trials, they pecked the common stimulus more than the distinctive one. Mastery of the within-display discrimination during S+ trials preceded mastery of the between-trials discrimination. These findings extend the Jenkins-Sainsbury analysis of discriminations based upon a single distinguishing feature to discriminations in which common and distinctive elements are associated with both the positive and negative discriminative stimuli. The similarity of these findings to other effects found in autoshaping-approach to signals that forecast reinforcement and withdrawal from signals that forecast nonreinforcement-is also discussed.
Collapse
|
16
|
Bowe CA, Dinsmoor JA. Spatial and temporal relations in conditioned reinforcement and observing behavior. J Exp Anal Behav 2010; 39:227-40. [PMID: 16812316 PMCID: PMC1347916 DOI: 10.1901/jeab.1983.39-227] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
In Experiment 1, depressing one perch produced stimuli indicating which of two keys, if pecked, could produce food (spatial information) and depressing the other perch produced stimuli indicating whether a variable-interval or an extinction schedule was operating (temporal information). The pigeons increased the time they spent depressing the perch that produced the temporal information but did not increase the time they spent depressing the perch that produced the spatial information. In Experiment 2, pigeons that were allowed to produce combined spatial and temporal information did not acquire the perch pressing any faster or maintain it at a higher level than pigeons allowed to produce only temporal information. Later, when perching produced only spatial information, the time spent depressing the perch eventually declined. The results are not those implied by the statement that information concerning biologically important events is reinforcing but are consistent with an interpretation in terms of the acquisition of reinforcing properties by a stimulus associated with a higher density of primary reinforcement.
Collapse
|
17
|
Abstract
Pigeons made observing responses for stimuli signalling either a fixed-interval 30-sec schedule or a fixed-ratio x schedule, where x was either 20, 30, 100, 140, or 200 and the schedules alternated at random after reinforcement. If observing responses did not occur, food-producing responses occurred to a stimulus common to both reinforcement schedules. When the fixed-interval schedule was paired with a low-value fixed ratio, i.e., 20 or 30, the presentation of the stimulus reliably signalling the fixed-ratio schedule reinforced observing behavior, but the presentation of the stimulus reliably signalling the fixed-interval schedule did not. The converse was the case when the fixed-interval schedule was paired with a large-valued fixed ratio, i.e., 100, 140, or 200. The results demonstrated that the occasional presentation of the stimulus signalling the shorter interreinforcement interval was necessary for the maintenance of observing behavior. The reinforcement relationship was a function of the schedule context and was reversed by changing the context. Taken together, the results show that the establishment and measurement of conditioned reinforcement is dependent upon the context or environment in which stimuli reliably correlated with differential events occur.
Collapse
|
18
|
Abstract
In a series of three experiments, rats were exposed to successive schedule components arranged on two levers, in which lever pressing produced a light, and nose-key pressing produced water in 50% of the light periods. When one auditory signal was presented only during those light periods correlated with water on one lever, and a different signal was presented only during those light periods correlated with nonreinforcement on the other lever, the former lever was preferred in choice trials, and higher rates of responding were maintained on the former lever in nonchoice (forced) trials. Thus, the rats preferred a schedule component that included a conditioned reinforcer over one that did not, with the schedules of primary reinforcement and the information value of the signals equated. Preferences were maintained when one or the other of the auditory signals was deleted, but were not established in naive subjects when training began with either the positive or negative signal only. Discriminative control of nose-key pressing by the auditory signals was highly variable across subjects and was not correlated with choice.
Collapse
|
19
|
|
20
|
The multiple determinants of observing behavior. Behav Brain Sci 2010. [DOI: 10.1017/s0140525x00018045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
21
|
|
22
|
|
23
|
|
24
|
|
25
|
|
26
|
Secondary reinforcement: Still alive? Behav Brain Sci 2010. [DOI: 10.1017/s0140525x00018033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
27
|
|
28
|
|
29
|
|
30
|
|
31
|
|
32
|
Some more information on observing and some more observations on information. Behav Brain Sci 2010. [DOI: 10.1017/s0140525x00057976] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
33
|
|
34
|
Dinsmoor JA. Stimuli inevitably generated by behavior that avoids electric shock are inherently reinforcing. J Exp Anal Behav 2001; 75:311-33. [PMID: 11453621 PMCID: PMC1284820 DOI: 10.1901/jeab.2001.75-311] [Citation(s) in RCA: 124] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
A molecular analysis based on the termination of stimuli that are positively correlated with shock and the production of stimuli that are negatively correlated with shock provides a parsimonious count for both traditional discrete-trial avoidance behavior and the data derived from more recent free-operant procedures. The necessary stimuli are provided by the intrinsic feedback generated by the subject's behavior, in addition to those presented by the experimenter. Moreover, all data compatible with the molar principle of shock-frequency reduction as reinforcement are also compatible with a delay-of-shock gradient, but some data compatible with the delay gradient are not compatible with frequency reduction. The delay gradient corresponds to functions relating magnitude of behavioral effect to the time between conditional and unconditional stimuli, the time between conditioned and primary reinforcers, and the time between responses and positive reinforcers.
Collapse
Affiliation(s)
- J A Dinsmoor
- Department of Psychology, Indiana University, Bloomington 47405-7007, USA.
| |
Collapse
|
35
|
Jäger R. Lateral forebrain lesions affect pecking accuracy in the pigeon. Behav Processes 1993; 28:181-8. [DOI: 10.1016/0376-6357(93)90091-5] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/28/1992] [Indexed: 11/26/2022]
|
36
|
|
37
|
Abstract
A concurrent-chains schedule was used to examine how a delay to conditional discriminative stimuli affects conditioned reinforcement strength. Pigeons' key-peck responses in the initial link produced either of two terminal links according to independent variable-interval 30-s schedules. Each terminal link involved an identical successive conditional discrimination and was segmented into three links: a delay interval (green), a color conditional discriminative stimulus (blue or red), and a line conditional discriminative stimulus (vertical or horizontal lines). Food delivery occurred 45 s after entering the terminal link with a probability of .5, but its conditional probability (1.0 or 0) depended on the combination of the color and the line stimuli. One of the color stimuli occurred independently of further responding, 5 s after entry into the right terminal link, but it occurred 35 s after entry into the left terminal link. One of the line stimuli occurred independently of responding 40 s after entry into either terminal link, synchronized with the offset of the color stimulus. The initial-link relative response rate for the right was consistently higher in comparison with a control condition in which the color stimuli occurred 20 s after entry into either terminal link. The preference for the short delay to the color conditional discriminative stimuli suggests the possibility of conditioned reinforcement by information about the relation between the line conditional discriminative stimuli and the outcomes.
Collapse
Affiliation(s)
- A Ohta
- Department of Psychology, Faculty of Letters, Kyoto University, Japan
| |
Collapse
|
38
|
Grasping in the pigeon (Columba livid): Stimulus control during conditioned and consummatory responses. ACTA ACUST UNITED AC 1984. [DOI: 10.3758/bf03213146] [Citation(s) in RCA: 33] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
39
|
A conditioned reinforcement theory of observing responses is not a refutation of cognitive psychology. Behav Brain Sci 1983. [DOI: 10.1017/s0140525x00018112] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
40
|
Consummatory response latency and the stimulus-reinforcer relation in autoshaping. ACTA ACUST UNITED AC 1983. [DOI: 10.3758/bf03199801] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
41
|
Can reinforcement by information be reconciled with a Pavlovian account of conditioned reinforcement? Behav Brain Sci 1983. [DOI: 10.1017/s0140525x00018082] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
42
|
|
43
|
Fantino E, Case DA, Altus D. Observing reward-informative and -uninformative stimuli by normal children of different ages. J Exp Child Psychol 1983. [DOI: 10.1016/0022-0965(83)90045-0] [Citation(s) in RCA: 25] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
44
|
|
45
|
Abstract
The goal of this review is to compare two divergent lines of research on signal-centered behavior: the orienting reflex (OR) and autoshaping. A review of conditioning experiments in animals and humans suggests that the novelty hypothesis of the OR is no longer tenable. Only stimuli that represent biological "relevance" elicit ORs. A stimulus may be relevant a priori (i.e., unconditioned) or as a result of conditioning. Exposure to a conditioned stimulus (CS) that predicts a positive reinforcer causes the animal to orient to it throughout conditioning. Within the CS-US interval, the initial CS-directed orienting response is followed by US-directed tendencies. Experimental evidence is shown that the development and maintenance of the conditioned OR occur in a similar fashion both in response-independent (classical) and response-dependent (instrumental) paradigms. It is proposed that the conditioned OR and the signal-directed autoshaped response are identical. Signals predicting aversive events repel the subject from the source of the CS. It is suggested that the function of the CS is not only to signal the probability of US occurrence, but also to serve as a spatial cue to guide the animal in the environment.
Collapse
|
46
|
Green L. Preference as a function of the correlation between stimuli and reinforcement outcomes. LEARNING AND MOTIVATION 1980. [DOI: 10.1016/0023-9690(80)90015-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
47
|
Marlin NA, Sullivan JM, Berk AM, Miller RR. Preference for information about intensity of signaled tailshock. LEARNING AND MOTIVATION 1979. [DOI: 10.1016/0023-9690(79)90052-3] [Citation(s) in RCA: 25] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
48
|
|
49
|
Katz HN. A test of the reinforcing properties of stimuli correlated with nonreinforcement. J Exp Anal Behav 1976; 26:45-56. [PMID: 16811930 PMCID: PMC1333489 DOI: 10.1901/jeab.1976.26-45] [Citation(s) in RCA: 24] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The information hypothesis of conditioned reinforcement predicts that a stimulus that "reduces uncertainty" about the outcome of a trial will acquire reinforcing properties, even when the stimulus reliably predicts nonreinforcement. Four pigeons' key pecks produced one of two 5-sec stimuli with 0.50 probability according to a discriminated variable-interval schedule. One stimulus was followed by reinforcement; a second stimulus was followed by blackout. To the same extent, therefore, both stimuli reduced uncertainty about the possibility that food would arrive at the termination of the schedule interval. When a second key in the chamber was lighted, each peck on it could produce the stimulus preceding reinforcement, the stimulus preceding nonreinforcement, a novel stimulus, or no stimulus, across separate conditions. The stimulus preceding food maintained responding at substantial levels on the second, stimulus-producing, key. Such responding was not maintained by other stimuli. These data, replicated when the stimuli were reversed on the variable-interval schedule, do not support the prediction that uncertainty-reducing stimuli are necessarily conditioned reinforcers.
Collapse
|
50
|
Jwaideh AR, Mulvaney DE. Punishment of observing by a stimulus associated with the lower of two reinforcement frequencies. LEARNING AND MOTIVATION 1976. [DOI: 10.1016/0023-9690(76)90029-1] [Citation(s) in RCA: 25] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|