Learning: Relatively permanent change in behavior due to experience Does NOT include temporary changes due to disease, fatigue, injury, maturation, or drugs, since these do NOT qualify as learning, even though they can alter behavior
Reinforcement: Any event that increases the probability that a response will recur Response: Any identifiable behavior Internal: Faster heartbeat Observable: Eating, scratching
Antecedents: Events that precede a response Consequences: Effects that follow a response
Russian physiologist who studied digestion Used dogs to study salivation when dogs were presented with meat powder Also known as Pavlovian or respondent conditioning Reflex: Automatic, non-learned response
Fig. 7-2, p. 220
How does this relate to Pavlov? Before Conditioning The dog gets food put in front of him and it is making him drool. The bell is not being used
Step by step approach: Before Conditioning Neutral Stimulus- a stimulus that does not immediately bring about a response Example: a bell doesn t get the dog to drool at first Unconditioned Stimulus- a stimulus that consistently brings about a response Example: food causes the dog to drool Unconditioned Response- an automatic response to an unconditioned stimulus. Example: the food (u.s) drooling (u.r) Everything is neutral and unconditioned because the dog hasn t learned the association yet because the food is not being paired with bell.
During Conditioning Now the bell is brought in Conditioned Stimulus- the once neutral stimulus becomes conditioned because you are pairing it. Now the bell is conditioned because it is being paired with the food. The neutral stimulus always becomes the conditioned stimuli Unconditioned Stimulus- the food. This remains unconditioned because the dog isn t trained or conditioned to drool every time it sees the food. Unconditioned Response- the salivation. Salivation is unconditioned because the dog isn t trained or conditioned to do it yet consistently
After Conditioning Everything now becomes conditioned and you don t need to you the unconditioned stimulus (food). The neutral stimulus now becomes conditioned. At first the bell was neutral because the dog didn t drool but now from conditioning the bell has meaning so it becomes conditioned. Conditioned Stimulus- Bell Conditioned Response- Drool
Fig. 7-3, p. 221
Would the following pictures work as an unconditioned stimulus for you? Remember and unconditioned stimulus evokes an automatic response.
To review Cartoon Pavlov- 3 minutes
Journal Write out what Jim does in your classical conditioning template. Before Conditioning: During Conditioning: After Conditioning: Office Clip One Minute
Weakening of a conditioned response through removal of reinforcement
Reappearance of a learned response following apparent extinction
Fig. 7-5, p. 222
Stimulus generalization: A tendency to respond to stimuli that are similar, but not identical, to a conditioned stimulus (e.g., responding to a buzzer when the conditioning stimulus was a bell) Stimulus discrimination: The learned ability to respond differently to similar stimuli (e.g., Anya will respond differently to various bells: alarms, school, timer)
Fig. 7-6b, p. 223
Phobia: Fear that persists even when no realistic danger exists (e.g., arachnophobia; fear of spiders) Conditioned emotional response (CER): Learned emotional reaction to a previously neutral stimulus
Desensitization: Decreasing fear or anxiety by exposing phobic people gradually to feared stimuli while they stay calm and relaxed Vicarious classical conditioning: Learning to respond emotionally to a stimulus by observing another s emotional reactions
Fig. 7-7, p. 224
Little Albert- six minutes Now let s figure out how this fits into classical conditioning. Chart out: Before Conditioning: During Conditioning: After Conditioning:
First let s check out the list The real way to treat a phobia. If you don t like clowns please don t look Real treatment of clown phobia- 4 minutes How not to treat a phobia. The media likes to take advantage of this balloon phobia maury Peach Phobia Maury- 3 minutes
Learning is based on the consequences of responding; we associate responses with their consequences. Everything is based in rewards and punishments. Thinking is secondary Law of effect (Thorndike): The probability of a response is altered by the effect it has: responses that lead to desired effects are repeated; those that lead to undesired effects are not
Any event that follows a response and increases its likelihood of recurring
Conditioning chamber (Skinner box): Apparatus designed to study operant conditioning in animals Response-contingent reinforcement: Reinforcement given after a desired response occurs
Fig. 7-9, p. 226
Operant Conditioning in Action Skinner Box- One minute Pigeons playing ping pong- 40 sec
Operant reinforcement most effective when given immediately after a correct response Response chain: A linked series of actions that leads to reinforcement Superstitious behaviors: Behaviors that are repeated because they appear to produce reinforcement, even though they are not necessary
Molding responses gradually in a step-bystep fashion to a desired pattern Successive approximations: Ever-closer matches Shaping a pigeon- one minute
When learned responses that are NOT reinforced gradually fade away Negative attention seeking: Using misbehavior to gain attention
Positive reinforcement: When a response is followed by a reward or other positive event Negative reinforcement: When a response is followed by the removal of an unpleasant event ie. You are more likely to use drugs and alcohol if they remove stress or anxiety
Stimuli that consistently precede a rewarded response tend to influence when and where the response will occur Operant stimulus generalization: Tendency to respond to stimuli similar to those that preceded operant reinforcement Operant stimulus discrimination: Occurs when one learns to differentiate between antecedent stimuli that signal either an upcoming reward or a nonreward condition
Job? Relationships? School? Substance abuse? Divorce? Connect the concepts you learned to each
Primary reinforcer: Non-learned and natural; satisfies physiological needs (e.g., food, water, sex) Secondary reinforcer: Learned reinforcer (e.g., money, grades, approval, praise); Gains reinforcing properties by associating with a primary reinforcer What are some examples of this in the real world? For starters look at the concept of money
Token reinforcer: Tangible secondary reinforcer (e.g., money, gold stars, poker chips) Only have meaning if paired with a primary reinforcer You should be able to use less of the token economy and eventually form a habit in the person. strange cartoon example: 1 min Example: Grades, tokens at an arcade etc. What are some other examples you can think of?
So easy even a monkey can do it!
Any event that follows a response and decreases the likelihood of it recurring (e.g., a spanking) Not negative reinforcement!!!!!! Response cost: Removal of a positive reinforcer after a response is made (e.g., Bob losing Xbox360 privileges)
Avoidance Learning-Learning to make a response in order to postpone or prevent discomfort. Example: Avoiding somebody you find annoying. Agreeing with someone instead of beginning an argument Aggression- punishment sets up an evnorment to learn aggression. Example: Spanking a child and then they become hostile towards others. They learned that through punishment
Schedules of reinforcement: Plans for determining which responses will be reinforced Continuous reinforcement: A reinforcer follows every correct response Partial reinforcement: Reinforcers do NOT follow every response Partial reinforcement effect: Responses acquired with partial reinforcement are more resistant to extinction
A set number of correct responses must be made to obtain a reinforcer Example: Receive five stickers for a prize, Sell so many products for a bonus What are some other examples?
Varied number of correct responses must be made to get a reinforcer Example: Slot machine, door to door sales What are some other examples?
A reinforcer is given only when a correct response is made after a set amount of time has passed since the last reinforced response. This promotes laziness sometimes because the person knows when the reward will occur. ie Examples: Paycheck every two weeks, Test in two weeks What are some other examples?
Reinforcement is given for the first correct response made after a varied amount of time has passed since the last reinforced response Example: Pop Quiz What are some other examples?
Model: Someone who serves as an example in observational learning Occurs by watching and imitating actions of another person or by noting consequences of a person s actions Occurs before direct practice is allowed
Pay attention to model Remember what was done Observer must be able to reproduce modeled behavior If a model is successful or his/her behavior is rewarded, behavior more likely to recur Bandura created modeling theory with classic Bo-Bo doll (inflatable clown) experiments
Bobo Doll Experiment- 6 minutes
Fig. 7-24, p. 245