|
|
Animal Training by Operant ConditioningUsing Positive Reinforcement to Train Animal SubjectsTwo of the basic phenomena by which we, and animals, learn are classical conditioning and operant conditioning.
Classical conditioning is associative learning, also known as Pavlovian learning after the most famous example by Ivan Pavlov (1849-1936) whereby dogs were ‘conditioned’ to salivate when a bell was rung, after the ringing of a bell was paired with the arrival of food. It is ‘involuntary’ behavioral responses that are the cause of classical conditioning. Operant conditioning is the way in which behavior is shaped by consequences (both positive and negative). Operant conditioning was discovered by Burrhus Frederic Skinner (1904 – 1990). Skinner studied the way in which behavior was shaped when some responses were reinforced, some were punished, and others had no effect i.e. were ignored. He found that responses which were reinforced would become more frequent, whilst those that were punished, or had no effect, would become less frequent and eventually disappear. Which Technique?
Negative reinforcement is often confused with punishment but they are very different:
Reinforcement techniques shape behaviors as they occur. Punishment stops a behavior at that particular moment with no further guidance, and as a result other undesirable behaviors may occur in its place. Although different, both punishment and negative reinforcement are considered aversive approaches to animal training. As well as the problem of undesirable behaviors taking the place of the one being stopped with punishment, another problem with punishment is that the subject(s) being trained may start to avoid the ‘punisher’. With respect to these problems and regard for the welfare of animals, positive reinforcement should be used as the prominent method of training animals. Schedules of ReinforcementOnce a behavior has been established not every response has to be reinforced. There are different schedules that can be used for reinforcing a behavior once established:
Behavior that is rewarded on variable schedules is sometimes more resistant to extinction as the subject will keep performing the behaviour to try and get the reward/reinforcement. Effective ReinforcementReinforcement must be immediate and consistent for to be effective. If it is delivered too late the subject may already have moved on to a different behavior and be reinforced for that behavior instead. It is good to use secondary reinforcers to bridge the gap between the behavior occurring, and the subject receiving the stimulus, or reward. Secondary reinforcers commonly used are clickers, whistles and praise. These are delivered immediately after the behavior has been performed and then the reward (food, temperature change etc.) can be delivered once the target behavior has been marked. Some final pointers when training:
The copyright of the article Animal Training by Operant Conditioning in Pet Training is owned by Tamara McGaw. Permission to republish Animal Training by Operant Conditioning in print or online must be granted by the author in writing.
|
|
|
|
|
|
|
|