Chapter 4

Chapter 5: Learning
 Learning – Any relatively permanent change in behavior brought about by

experience or practice. When people learn anything, part of the brain is
physically changed to record what was learned. Any change in the way an
organism behaves is learning. Most of our behavior is learned, while other
animals have more instincts. Humans are predisposed to things like learning a
language or walking, but they still must be learned first.
o A person born deaf might not ever develop language, even though the
hardware is there.
 Classical conditioning – The original theory of learning proposed by Ivan Pavlov,
who stumbled upon it by accident.
o Objects/situations become associated or linked with other kinds of
situations. Learning through association.
o Once the association is made, similar objects/situations can cause the
same response.
o Pavlov was in the lab studying the digestive system of dogs.
o Stimulus leads to response.
 Classical conditioning: Stimulus and response.
o Stimulus: An object, event, or experience that causes a response.
o Response: Reaction of an organism.
o Reflex response: Involuntary or uncontrolled.
o Stimulus and response is like cause and effect.
o If you put food in front of a dog, he will start drooling. Can be manipulated
as a way to teach a dog something.
 Classical conditioning: more specific definition:
o Learning to make a reflex response to a stimulus other than the original,
natural stimulus that normally produces the reflex.
o Association of ringing of bell with food for dogs.
 Elements of Classical Conditioning:
o Unconditioned Stimulus (UCS): Naturally occurring stimulus that leads to
an involuntary response. Unconditioned = unlearned.
o Unconditioned Response (UCR): Involuntary response to a naturally
occurring or unconditioned stimulus. Genetically “wired” into the nervous
system.
o UCS leads to UCR.
o Food (UCS) leads to salivating (UCR).
o Neutral Stimulus – Something that has no effect on the desired outcome.
In Pavlov’s case, it’s the bell that has no meaning to the dog.
o Conditioned Stimulus (CS) – Stimulus that becomes able to produce a
learned reflex response by being paired with the original unconditioned
stimulus. Conditioned is the same thing as learned.
o Conditioned Response (CR) – The learned reflex response to a
conditioned stimulus.
o A conditioned stimulus leads to a conditioned response.
o CS and CR – Natural response to an unnatural stimulus.
 Classical conditioning in humans
o Hate math because of bad experience in elementary school.
o Eat food that makes you sick, never eat that type of food again.
NS+UCSUCR
-BELL- -FOOD- -SALIVATING-
CSCR
-BELL- -SALIVATING-
 Classical Conditioning Concepts:

o #1: Conditioned stimulus must come before the unconditioned stimulus.
o #2: Conditioned stimulus and unconditioned stimulus must be close
together in time (seconds apart).
o #3: Neutral stimulus must be paired with the unconditioned stimulus
several times before conditioning occurs.
 Exception: Food especially if it has a negative effect like if it makes
you sick. Then you only have to learn it once.
o #4: Conditioned stimulus is usually distinctive or stands out from other
stimuli.
 Stimulus Generalization – The tendency to respond to stimuli similar to the
conditioned stimulus. Carries over to similar things.
o Trained dog to respond to dinner bell for food. Next time, a similar bell is
rung and the dog also responds to that because it is a similar stimulus.
o Little Albert – Initially, he would be scared of the white rat, but eventually
that would transform into him being scared of all white, fluffy things.
 Stimulus Discrimination:
o Tendency to distinguish between a conditioned stimulus and a similar
stimulus because the similar stimulus is never paired with the
unconditioned stimulus. The ability to separate different things.
 Dogs have a great sense of smell. They can be trained to
discriminate over time. Different dogs trained for different purposes.
Drug-sniffing dog, bomb-sniffing dog, arson investigation dog, etc.
 The dog knows the difference between the bell for the food, and the
bell for something else.
 Extinction (Classical Conditioning):
o Disappearance or weakening of a learned response following the removal
or absence of the UCS. Not reacting to a previously powerful stimulus is
extinction.
o You associate a power drill with the dentist. Then you haven’t been to the
dentist for a while. Then you are in a new environment such as a
woodworking shop and hear the drill again. Now the drill may be
associated with something different than before. The learned response will
disappear.
 Applications of Classical Conditioning:
o Conditioned Emotional Response – When the emotional response is
classically conditioned to occur to learned stimuli.
 Phobias (irrational fears) such as of elevators, spiders, flying,
clowns, fish, etc.
 Advertising
o John B. Watson – known for his “Little Albert” experiments in the 1920s,
back when science was free for all because there were no ethical
regulations. Unethical experimentation on an infant named Albert at Johns
Hopkins University. He was conditioned to be afraid of rats and by
association other white, fluffy things.
o Advertising application – associate a product with a celebrity, athlete, or
model.
o If you eat in front of the TV always and want to lose weight, stop watching
TV because it will be difficult for you to not feel hungry in front of the TV
because of the classical conditioning.
 Learning to Fear:
o Person can learn to fear almost anything if paired with something that
causes pain, surprise or embarrassment.
o Humans are biologically predisposed to acquire some fears easier than
others (heights, snakes, spiders, water). These four are inherently
dangerous. This comes from evolutionary traits.
o Public speaking is often the number one fear and death is number two.
People would rather die than speak in front of a crowd. Probably due to
embarrassment, especially for people are who reserved and introverted.
 Taste Aversion:
o Biological predisposition to associate sickness with taste more readily than
with sights or sound.
o Can develop after one pairing of food with illness even after time delay.
o Has survival value back from the hunting and gathering days with
“unknown” fruit.
o Taste-aversion is learning is how rats avoid poison.
o Taste-aversion is usually one-time learning.
 Classical vs. Operant Conditioning:
o Classical Conditioning – Forms associations between stimuli.
o Operant Conditioning – Forms association between voluntary behaviors
and the consequences that follow.
o Operant conditioning – If you want to train a dog to do something. As a
reward for successful behavior, you give the dog a biscuit. As a result, the
dog’s behavior will strengthen.
 Thorndike’s Experiments:
o Law of Effect – Behaviors followed by positive consequences are
strengthened, those followed by negative consequences are weakened.
o Cats escaping from the puzzle box. Over time, they took shorter times to
escape the box because getting out would be a positive consequence.
 Skinner’s Experiments:
o Behaviorist; wanted to study only observable, measurable behavior.
o Developed the Skinner Box to study operant conditioning.
o Used to manipulate a rat’s environment and therefore, manipulate his
behavior.
 Consequences:
o Reinforcement – Increases the probability that the behavior will occur
again.
 Alcoholics – Get dopamine and a feeling of escape from reality by
drinking.
o Punishment – Decreases the probability that the behavior will occur again.
 Lots of negatives go along with a drug/alcohol problem: DUI,
hangover, relationship/work issues. However, the reinforcement
has more validity for the person than the punishment. They would
rather lose their job than not get an escape from their miserable life.
o Reinforcement vs punishment: Reinforcement is much stronger than the
punishment, which is why the American prison system has a high
recidivism rate.
 Primary Reinforcer:
o An event or stimulus that is naturally reinforcing by meeting a basic
biological need.
 Food
 Drink
 Sex
 Oxygen
 Secondary Reinforcer:
o An event or stimulus that becomes reinforcing after being paired with a
primary reinforcer.
 Money
 Positive Reinforcement:
o Reinforcement of behavior by the addition or experiencing of something
pleasurable.
o Robbing a bank: The “free money”.
 Negative Reinforcement:
o Reinforcement of behavior by the removal, escape from, or avoidance of
something unpleasant.
 Positive vs Negative Reinforcement for Increasing Behavior:
o Positive reinforcement adds a desirable stimulus. Examples could include
getting a hug; receiving a paycheck.
o Negative reinforcement removes an aversive stimulus. Examples could
include fastening a seatbelt to turn off beeping.
 Punishment:
o Punishment by Application – Presenting something unpleasant following
unwanted behavior (e.g. spanking, speeding ticket).
o Punishment by Removal – Removing a reinforcing stimulus (something
desirable) -- (e.g. time-out).
 Negative Reinforcement versus Punishment by Removal:
o Examples of negative reinforcement: Stopping at a red light to avoid
getting in an accident. Mailing an income tax return by April 15 to avoid
paying a penalty. Obeying a parent before the parent reaches the count of
“three” to avoid getting a scolding.
o Examples of punishment by removal: Losing the privilege of driving
because you got into too many accidents. Having to lose some of your
money to pay the penalty for late tax filing. Being “grounded” (losing your
freedom) because of disobedience.
 Effective Punishment should be:
o Immediate
o Consistent
o Paired with reinforcement of the desired behavior.
 Continuous Reinforcement:
o Every response results in the delivery of the reinforcer.
 If you want the dog to acquire the behavior of “sit on command”,
every time he does it, he gets the biscuit.
 Faster acquisition, but prone to extinction.
 Partial Reinforcement:
o Reinforces a response only a part of the time.
o Results in slower acquisition in the beginning but shows greater resistance
to extinction later on.
o Only some of the responses are reinforced.
o It takes longer for the dog to learn the behavior, but it is less likely he will
forget it.
 Partial Reinforcement Schedules:
o Fixed-Ratio Schedule: Reinforces a response only after a specified
number of responses. Piecework pay (paid for a certain number of items
made). For a farm worker – every bushel of oranges you pick, you get a
dollar. For factory worker—for every article of clothing you make, you get
5 cents.
o Variable-Ratio Schedule: Reinforces a response after an unpredictable
number of responses. Hard to extinguish because of unpredictability. A
classical example of this is gambling.
o Fixed-Interval Schedule: Reinforces a response only after a specified time
has elapsed. Giving students a break every hour if they are working hard.
A paycheck that comes every two weeks.
o Variable-Interval Schedule: Reinforces a response at unpredictable time
intervals. Produces slow, steady responses. Catching fish after 2 minutes
then having to wait 20 minutes to catch the next one.
 Cognitive Learning:
o In the early days, the focus was on observable behavior.
o In the 1950s & 1960s, psychologists realized that thoughts, feelings &
expectations can influence behavior & are important.
o Cognition: Mental events that take place inside a person’s mind while
behaving.
o You can’t see a thought, but you can see the behavior resulting from a
thought.
 Latent Learning:
o Edward Tolman (1930)
o Learning remains hidden until an application is useful.
o Latent - existing but not yet developed or manifest; hidden; concealed.
o Three groups of rats – Once the rats learned they would be rewarded for
running the maze, they suddenly started demonstrating their knowledge of
how to run the maze.
o The learning was there, but there was no reinforcement for them to
demonstrate it.
 Learned Helplessness:
o Martin Seligman (1975)
o Not trying to escape from a situation due to a history of repeated failures
in the past.
o The dogs who were unable to avoid shock in the past did not avoid shock
when the opportunity became available because they repeatedly failed.
o Elizabeth Smart
 Observational Learning:
o Bandura (1961)
o Learning through imitation & modeling.
o May not involve external rewards.
o Learning/performance distinction.
o Bobo Doll Studies – little kids watch adults who were strangers interact
with inflatable dolls. Some would beat up the dolls, and others would play
nice. Some adults got punished for beating up the doll. Others got
reinforced. Those kids who saw the adults get reprimanded would not be
likely to do it.
o Example: Child imitates adult’s speech.
o The original terminology for this was “social learning”.
o You are learning through watching others instead of receiving
reinforcement.
o Watching someone get away with something or watching someone else
get punished for something can influence our own behavior.
o “Monkey see, monkey do”.
o Animals watch a lot by watching other animals.
 Four Elements of Observational Learning:
o Attention – the learner must first pay attention to the model.
o Memory – the learner must retain the memory of what was done.
o Imitation – the learner must be capable of reproducing the actions of the
model.
o Motivation – the learner must have the desire to perform the action.

Chapter 4

Загружено:

Сведения о документе

Авторское право

Доступные форматы

Поделиться этим документом

Поделиться или встроить документ

Параметры публикации

Этот документ был вам полезен?

Это неприемлемый материал?

Авторское право:

Доступные форматы

Chapter 4

Загружено:

Авторское право:

Доступные форматы

Chapter 5: Learning

 Learning – Any relatively permanent change in behavior brought about by

 Classical Conditioning Concepts:

Вам также может понравиться