Operant Conditioning B.F. SKINNER
Reinforcement in Operant Conditioning Behavior Consequence Patronize Elmo s Diner It s all a matter of consequences. Rewarding Stimulus Presented Tendency to tell jokes increases
Operant conditioning A form of learning in which responses come to be controlled by their consequences. Stimulus After the fact. Operant responses are said to be emitted rather than elicited.
Primary reinforcers Events that are inherently reinforcing because they satisfy biological needs. Examples include food, water, warmth, sex Secondary (conditioned) reinforcers Events that acquire reinforcing qualities by being associated with primary reinforcers. Examples include paycheck, attention, praise, grades
Skinner Box Speaker Signal lights Lever To food dispenser Food pellet Electric grid To shock generator A small enclosure in which an animal can make a specific response that is recorded while the consequences of the response are systematically controlled.
Cumulative Recorder Drum One response Series of rapid responses Pen Not responding Reinforcement slash Creates a graphic record of responding and reinforcement in a Skinner box as a function of time.
Acquisition and Shaping Acquisition The initial stages of learning a new response. Shaping Process of repeatedly reinforcing closer and closer approximations to desired response.
Extinction Cumulative s Rate decreasing High rate extinguished Resistance to Extinction When an organism continues to make a response after delivery of the reinforce has been terminated. Rate increasing Time
Schedules of Reinforcement Continuous Reinforcement Intermittent Reinforcement Reinforcer Reinforcer Reinforcer Reinforcer Reinforcer Reinforcer Reinforcer Reinforcer Reinforcer Reinforcer All of the time Some of the time
Pattern of Reinforcement Cumulative s Fixed-ratio (FR) Lower resistance to extinction Rapid responding Short pause after reinforcement Note: Higher ratios generate higher response rates Why would this reinforcement pattern result in a lower resistance to extinction? Time
Pattern of Reinforcement Cumulative s Variable-ratio (VR) Higher resistance to extinction High, steady rate without pauses Note: Higher ratios generate higher response rates Why would this reinforcement pattern result in a higher resistance to extinction? Time
Pattern of Reinforcement Cumulative s Fixed-interval (FI) Lower resistance to extinction Long pause after reinforcement yields scalloping effect Note: Short intervals generate higher rates overall Time
Pattern of Reinforcement Cumulative s Variable-interval (VI) Higher resistance to extinction Low, steady rate without pauses Note: Short intervals generate higher rates overall Time
Positive Reinforcement When a response is strengthened because it is followed by the presentation of a rewarding stimulus. Behavior Consequence Tendency to press lever increases Press lever Rewarding Stimulus Presented Food delivered
Negative Reinforcement When a response is strengthened because it is followed by the removal of an aversive (unpleasant) stimulus. Behavior Consequence Tendency to press lever increases Press lever Aversive Stimulus Removed Shock turned off
Positive or Negative Reinforcement? A toddler begins to cry in a toy store. The toddler s mother gives her a toy to quiet her down. Next time they go to the store, the child starts crying again. Positive Reinforcement
Positive or Negative Reinforcement? A teacher has a policy that requires all late students to attended detention. The class watches as a student is assigned detention. The next day, the student is early to class. Negative Reinforcement
Aversive Conditioning Escape & Avoidance Avoidance learning an organism acquires a response that prevents some aversive stimulation from occurring. Escape (CS) Light (US) Shock (CR) (UR) Fear
Conditioned Taste Aversion Unconditioned Stimulus (Illness) (CR) (UR) Nausea Conditioned Stimulus (Taste of Berries)
Punishment An event following a response weakens the tendency to make that response. Consequence If the mouse gets shocked when it presses the lever, the punishment will reduce the likeliness that the mouse will press the lever again. Press lever