Week 6 — Lecture Outline · Learning
Course: Introduction to Psychology (PSYC 1) · Silver Oak University (fictional sample) · Prof. Bennett
Objectives covered: Objective 5 — Compare the major models of learning and memory and apply them to everyday behavior (this week: learning).
SLOs touched: A (apply concepts to real-world behavior) · B (reason scientifically about claims regarding mind and behavior)
Meeting pattern: 2 sessions × 75 min = 150 min. Segment minutes below total ~150; scale to your own pattern.
Week at a Glance
| The week's big question | "How does experience rewire what we do — and why isn't 'negative reinforcement' just a fancy word for punishment?" |
| By the end of the week, students can… | (1) define learning and label a classical-conditioning scenario with UCS / UCR / CS / CR; (2) tell the four operant consequences apart — positive/negative reinforcement (both increase behavior) and positive/negative punishment (both decrease it) — and stop confusing negative reinforcement with punishment; (3) name the schedules of reinforcement and explain why variable-ratio drives the most persistent behavior; (4) explain observational learning (Bandura's Bobo doll) and tell classical from operant conditioning. |
| Key vocabulary | learning, associative learning, classical conditioning, unconditioned stimulus (UCS), unconditioned response (UCR), neutral stimulus, conditioned stimulus (CS), conditioned response (CR), acquisition, extinction, spontaneous recovery, generalization, discrimination, operant conditioning, law of effect, reinforcement, punishment, positive/negative reinforcement, positive/negative punishment, primary/secondary reinforcer, fixed-ratio, variable-ratio, fixed-interval, variable-interval, shaping, observational learning, model, vicarious reinforcement, mirror neurons |
| Materials | slides (Deck 6), the week's readings + video links, one approved chatbot (Gemini / Claude / ChatGPT) for the AI-critique moment and the tutorial |
| Timing note | 8 segments, ~150 min total. Session 1 = Segments 1–4 (~75). Session 2 = Segments 5–8 (~75). |
Segment 1 — Hook & the Promise (8 min) · Session 1 opens
Hook. Ask the room one question and let hands fly: "Why do you check your phone the instant it buzzes — even mid-sentence, even when you swore you wouldn't?" Take three answers, then reframe: "You weren't born doing that. Something taught you — the same machinery that taught Pavlov's dogs to drool at a bell and a gambler to keep pulling a lever. That machinery is learning, and by Friday you'll be able to name exactly which kind is running every time your phone wins."
The promise (write it on the board): "By Friday you'll label any classical-conditioning story with UCS / UCR / CS / CR, tell the four operant consequences apart cold, and never again call negative reinforcement 'punishment' — the single most common mistake in this entire unit."
Why it matters line (memory hook): "Learning is experience leaving a mark on behavior — and almost everything you do all day is a mark left by something."
Segment 2 — Classical Conditioning: Pavlov's Accident (22 min)
Plain language first.
- Learning = a relatively permanent change in behavior or knowledge that comes from experience (not from maturation, fatigue, or injury). Most of it is associative learning — we learn that two things go together.
- Classical conditioning is learning an association between two stimuli, so a signal that predicts something important starts triggering the response that the important thing causes. Pavlov found it by accident — his dogs salivated at his assistant's footsteps before any food appeared.
The four-letter vocabulary (put it on a slide and keep it up all class):
- Unconditioned stimulus (UCS) — something that triggers a response automatically, no learning required (food in the mouth).
- Unconditioned response (UCR) — the automatic, unlearned reaction (salivating to the food).
- Neutral stimulus — a signal that, at first, means nothing (a bell).
- Conditioned stimulus (CS) — that once-neutral signal after it's been paired with the UCS enough times (the bell now predicts food).
- Conditioned response (CR) — the learned reaction to the CS (salivating to the bell alone).
Memory hook (put it on a slide):
"Unconditioned = unlearned (it's automatic). Conditioned = learned. The CR is the UCR's learned echo."
The processes (one line each):
- Acquisition — the CS and UCS get paired and the link forms.
- Extinction — present the CS without the UCS enough times and the CR fades.
- Spontaneous recovery — after a rest, the faded CR briefly pops back.
- Generalization — stimuli similar to the CS also trigger the CR (a buzzer, not just the bell).
- Discrimination — learning to respond to the CS but not to similar-but-different signals.
Brief, factual note on "Little Albert": Watson and Rayner (1920) showed fear can be classically conditioned in a human — they paired a white rat (neutral) with a loud, startling noise (UCS → fear UCR), and the infant came to fear the rat (CS → fear CR), then similar furry things (generalization). We reference it as historical evidence that emotions are learnable; by today's ethics rules it could not be run.
Segment 3 — The Signature Worked Example: Label It (20 min)
Set it up: "Watch me take an everyday story and label every piece. This is the move I want you doing by Friday — and it's exactly what the AI gets wrong."
One fully worked example (do every label out loud):
The story: Someone has had a painful dental visit. The drill causes pain.
- UCS = the dental drill (it causes pain automatically — no learning needed).
- UCR = pain / tensing up (the automatic reaction to the drill).
- During visits, the smell and sound of the dentist's office are present every time the drill is used — a once-neutral stimulus repeatedly paired with the UCS.
- CS = the smell/sound of the office — now, on its own, it predicts the drill.
- CR = anxiety / tension felt just walking in, before anything touches a tooth.
Land the labels with a check: "If the office got remodeled and smelled completely different, the anxiety would drop at first — that's the CS changing. If you went many times for a cleaning with no drill, the anxiety would fade — that's extinction."
Quick second pass (let the class call the labels): advertising — a soft drink (neutral) is paired again and again with happy music, attractive people, and summer fun (UCS → good feeling UCR), until the brand logo (CS) triggers a warm feeling (CR). Same five labels, new story.
Segment 4 — Operant Conditioning + the 2×2 + Quick Interaction (25 min) · Session 1 closes (~75)
Plain language first. Classical conditioning is about involuntary, reflexive responses to signals. Operant conditioning is about voluntary behavior and its consequences: behaviors followed by good outcomes happen more; behaviors followed by bad outcomes happen less. Thorndike called it the law of effect; Skinner mapped it in detail.
The one distinction that carries the week — say it twice:
Reinforcement always INCREASES a behavior. Punishment always DECREASES a behavior. "Positive" means you ADD a stimulus; "negative" means you REMOVE one. Positive/negative is about add vs. remove — NOT good vs. bad.
The fully worked 2×2 (build it on the board, one everyday example per cell):
| ADD a stimulus (positive) | REMOVE a stimulus (negative) | |
|---|---|---|
| INCREASE behavior (reinforcement) | Positive reinforcement: you praise a student → they participate more (added something pleasant). | Negative reinforcement: you take aspirin → the headache goes away → you take aspirin sooner next time (removed something aversive). |
| DECREASE behavior (punishment) | Positive punishment: you touch a hot stove → pain → you don't touch it again (added something aversive). | Negative punishment: a teen breaks curfew → phone taken away → they break curfew less (removed something pleasant). |
More machinery (one line each):
- Primary reinforcer = satisfies a built-in need (food, water, warmth). Secondary (conditioned) reinforcer = gains its power by association (money, grades, points).
- Schedules of reinforcement (full detail Segment 5) — reinforcing every time vs. some of the time changes how fast and how stubbornly a behavior is learned.
- Shaping = reinforcing successive approximations — small steps toward a target behavior (how you'd train a dog to roll over, or coach a beginner).
Interaction — Think-Pair-Share (rapid-fire, ~8 min): put five everyday scenarios on a slide; students decide which of the four cells each one is, solo (30 sec), compare with a neighbor (1 min), then vote by fingers (1 = pos. reinforcement … 4 = neg. punishment). Seed at least one that is negative reinforcement so the trap surfaces (e.g., "You buckle your seatbelt to make the annoying beeping stop, so you buckle up faster every time" → negative reinforcement, not punishment). Debrief the trap directly.
Segment 5 — Schedules of Reinforcement (22 min) · Session 2 opens
Hook back in: "Last session: reinforcement increases behavior. Today: how often you reinforce changes everything — and one schedule explains slot machines, your phone, and why some habits are so hard to break."
Plain language first — two questions define every schedule:
- Is reinforcement based on the number of responses (a ratio) or the passage of time (an interval)?
- Is the rule predictable (fixed) or unpredictable (variable)?
Cross those and you get four schedules:
| Ratio (count of responses) | Interval (time passing) | |
|---|---|---|
| Fixed (predictable) | Fixed-ratio: reward every Nth response — buy 10 coffees, get one free. High output, short pause after each reward. | Fixed-interval: first response after a set time — a weekly quiz every Monday. Effort spikes as the deadline nears (the "scalloped" pattern). |
| Variable (unpredictable) | Variable-ratio: reward after an unpredictable number of responses — a slot machine; phone notifications. Highest, steadiest, most extinction-resistant responding. | Variable-interval: first response after an unpredictable time — checking for a text that could arrive any minute. Slow, steady responding. |
Land the headline (memory hook):
"Variable-ratio is the jackpot schedule — unpredictable payoff after unpredictable effort. It produces the highest, most persistent behavior, which is exactly why slot machines and the little red notification dot are so hard to put down."
One more idea: behavior on a partial (intermittent) schedule is harder to extinguish than behavior rewarded every time — the gambler keeps pulling because the next one might pay.
Segment 6 — Observational Learning: The Bobo Doll (18 min)
Set it up: "You don't have to be reinforced yourself to learn. Sometimes you just watch someone else — and copy."
Plain language first.
- Observational learning = learning by watching a model and imitating, without direct reinforcement to you. Bandura showed this is a third route, alongside classical and operant conditioning.
- Bandura's Bobo doll study (1961, factual): preschoolers who watched an adult punch and kick an inflatable "Bobo" doll later imitated that aggression — including novel acts — far more than children who saw a calm adult. Watching was enough. It challenged the strict behaviorist claim that behavior must be directly reinforced to be learned.
- Vicarious reinforcement / punishment: we also watch what happens to the model. In Bandura's follow-up, children who saw the aggressive adult rewarded imitated more; those who saw the adult punished imitated less — even though all had learned the behavior. We copy what we see pay off.
- Mirror neurons (brief): neurons that fire both when we do an action and when we watch someone else do it; often discussed as a possible neural basis for imitation and empathy. (Keep it light — the details are still debated.)
Land it: "Models matter. Kids, employees, and the rest of us learn from what we see others do — and from whether it works out for them. That's why 'do as I say, not as I do' rarely wins."
Segment 7 — Misconceptions + Cures (20 min)
Name the misconceptions out loud, then cure each:
- ❌ "Negative reinforcement is just another word for punishment." (THE big one.)
✅ Cure: opposite jobs. Reinforcement — including negative reinforcement — INCREASES a behavior; punishment DECREASES it. Negative reinforcement removes something unpleasant so the behavior happens more (take aspirin → headache stops → you take aspirin sooner next time; buckle up → beeping stops → you buckle faster). Punishment makes a behavior happen less. "If the behavior went UP, it was reinforcement — full stop." - ❌ "Positive means good and negative means bad."
✅ Cure: in operant terms, positive = add a stimulus, negative = remove a stimulus. Positive punishment (adding pain) is "positive" and unpleasant; negative reinforcement (removing pain) is "negative" and pleasant. Read them as + / − math signs, not value judgments. - ❌ "Punishment is the best way to change behavior."
✅ Cure: reinforcement is usually more effective and has fewer side effects. Punishment can suppress a behavior temporarily but often breeds fear, avoidance, and resentment, and it teaches what not to do without teaching what to do. Reinforcing the desired behavior (or negatively reinforcing an alternative) tends to work better and last longer. - ❌ "Classical and operant conditioning are basically the same thing."
✅ Cure: classical = involuntary/reflexive responses to a signal (you don't choose to salivate); operant = voluntary behavior shaped by its consequences (you choose to press the lever because of what follows). Reflex vs. choice; signal vs. consequence.
Segment 8 — Technology Workflow + AI-Critique, Callback & Hand-off (15 min) · Session 2 closes (~75)
Technology workflow — the label-and-classify habit, on demand:
1. Pick any everyday behavior (a craving, a habit, a pet's trick, your phone).
2. Ask: is this classical (an automatic response to a signal) or operant (a voluntary behavior with a consequence)?
3. If classical, label UCS / UCR / CS / CR. If operant, find the cell of the 2×2 (add/remove × increase/decrease) and name the schedule if there is one.
4. Notice which trap you slid toward — calling negative reinforcement "punishment" is the one to catch.
AI-critique moment (students verify, not consume):
Paste this to an approved chatbot: "In this scenario, is the behavior being shaped by negative reinforcement or punishment? 'A driver buckles their seatbelt to stop the car's annoying warning chime; over time they buckle up faster and faster.' Label the UCS/UCR/CS/CR if it's classical, or the operant term if it's operant."
Then check its work against today's 2×2. Models frequently miscall negative reinforcement "punishment," or swap the CS and UCS in a classical scenario. The behavior here goes up (buckling faster) and an aversive stimulus (the chime) is removed → negative reinforcement, and it's operant, not classical. Your job all semester: the tool drafts, you judge.
Callback + tease:
- Callback: "Conditioning is everywhere we looked today — your phone, ads, a habit, a trained pet. Naming the mechanism is the skill."
- Tease next week: "We learned how experience changes behavior. Next week: how experience gets stored — memory, why you blank on a name, and how a confident memory can be flat-out wrong."
Hand-off (the week's graded work):
- Lecture Tutorial 6 (AI tutor, share-link submission) — classical labels, the operant 2×2, schedules, and observational learning.
- Quiz 6 (end of week) and Discussion 6 ("Conditioning in the Wild" — spot conditioning in everyday life and analyze the mechanism).
- Assignment 6 — label classical scenarios, classify operant ones (mind the negative-reinforcement trap), identify schedules/processes, and design a habit plan.
Instructor FAQ — Common Stumbles
| Student says / does | Quick cure |
|---|---|
| "Negative reinforcement = punishment, right?" | No — opposite jobs. Reinforcement increases a behavior; punishment decreases it. Negative reinforcement removes something unpleasant to make a behavior happen more (aspirin stops the headache → you take it sooner). |
| Reads "positive/negative" as "good/bad." | It's add vs. remove a stimulus, like + and − signs. Positive punishment adds something unpleasant; negative reinforcement removes something unpleasant. |
| Swaps UCS and CS. | The UCS works automatically with no learning (food, the drill). The CS is the once-neutral signal that earned its power by pairing (the bell, the office smell). Ask: which one needed training? |
| Confuses UCR and CR. | Same response, different trigger. UCR = the automatic reaction to the UCS; CR = the learned reaction to the CS. The CR is the UCR's learned echo. |
| Calls shaping the same as reinforcement. | Shaping reinforces successive approximations — small steps toward a goal — not the finished behavior all at once. |
| Thinks variable-interval drives the most responding. | It's variable-ratio (unpredictable count) that produces the highest, most persistent responding — slot machines, notifications. |
| Treats observational learning as just operant conditioning. | In observational learning the learner is not directly reinforced — watching a model is enough (Bobo doll). Vicarious reinforcement is watching the model get rewarded. |
| Mixes up classical and operant. | Classical = involuntary response to a signal (salivate to a bell). Operant = voluntary behavior shaped by its consequence (press a lever for a reward). Reflex vs. choice. |
Scope flag
This outline stays within Objective 5's learning half — classical conditioning (UCS/UCR/CS/CR and its processes), operant conditioning (reinforcement vs. punishment, schedules, shaping), and observational learning. Memory — encoding/storage/retrieval, forgetting, and the misinformation effect — is Week 7 and is only teased here. The historical figures and studies named (Pavlov, Watson & Rayner's "Little Albert," Thorndike, Skinner, Bandura's Bobo doll) are referenced factually as part of the discipline's real history; the instructor and institution remain fictional. "Little Albert" is noted briefly and non-sensationally as historical evidence that fear can be conditioned.
~ Prof. Bennett's edition · Fall 2026 · built with thecoursemaker.com