Week 1 — Lecture Outline · Foundations & Types of Data
Course: Introduction to Statistics (MATH 11) · Silver Oak University (fictional sample) · Prof. Rivera
Objectives covered: Objective 1 — Distinguish populations from samples and identify appropriate sampling and study designs.
SLOs touched: A (reason quantitatively from data) · B (communicate results to a non-technical audience)
Meeting pattern: 2 sessions × 75 min = 150 min. Segment minutes below total ~150; scale to your own pattern.
Week at a Glance
| The week's big question | "Where do data come from, and when can the numbers be trusted to speak for more people than we actually measured?" |
| By the end of the week, students can… | (1) tell a population from a sample, and a parameter from a statistic; (2) classify a variable by its level of measurement (nominal / ordinal / interval / ratio); (3) name a sampling method and say whether it's likely to be biased; (4) tell an observational study from an experiment and explain why correlation isn't causation. |
| Key vocabulary | population, sample, parameter, statistic, census, variable, categorical vs. quantitative, nominal/ordinal/interval/ratio, simple random sample (SRS), stratified, cluster, systematic, convenience/voluntary-response, bias (sampling, undercoverage, nonresponse, response), observational study, experiment, confounding variable |
| Materials | slides (Deck 1), the week's readings + video links, a spreadsheet (Google Sheets or Excel), one approved chatbot (Gemini / Claude / ChatGPT) for the AI-critique moment and the tutorial |
| Timing note | 8 segments, ~150 min total. Session 1 = Segments 1–4 (~75). Session 2 = Segments 5–8 (~75). |
Segment 1 — Hook & the Promise (8 min) · Session 1 opens
Hook. "Raise your hand if, in the last 24 hours, you generated data." Wait. Then: every hand should be up.
- Your phone counted your steps. Spotify logged every song. A delivery app timed your driver. A professor's LMS logged when you opened this page.
- "You are walking around producing data constantly. This course is about the other side of that: how anyone turns numbers like yours into a claim about people they never met — and how to tell the honest claims from the garbage."
The promise (write it on the board): "By the end of this week you can look at any statistic in the wild — a poll, a 'studies show,' a campus survey — and ask the three questions that decide whether it deserves your trust: Who was measured? How were they picked? What was actually recorded?"
Why it matters line (memory hook): "Statistics is not about the math. It's about trusting a number that describes people you didn't count."
Segment 2 — Population vs. Sample, Parameter vs. Statistic (20 min)
Plain language first.
- A population is everyone (or everything) we want to know about. A sample is the part we actually measured. We measure the sample because measuring the whole population is usually impossible, too slow, or too expensive.
- A census is the rare case where we do measure the whole population.
- Two more words, same split: a number that describes the population is a parameter; the matching number from the sample is a statistic.
Memory hook (put it on a slide):
Population → Parameter. Sample → Statistic. "The letters line up."
One fully worked example (do every step out loud).
Claim: "62% of Silver Oak undergraduates feel stressed about money."
A pollster emailed all 18,000 undergrads (the population they care about). 900 replied (the sample). Of those 900, 558 said yes.
- Sample statistic: 558 / 900 = 0.62 → 62%. This is a statistic (it came from the sample).
- The true campus-wide percentage — what we'd get if all 18,000 answered honestly — is the parameter. We never see it directly; the statistic is our best estimate of it.
- Notation preview (notation comes after the idea): we write the population proportion as p (parameter) and the sample proportion as p̂ ("p-hat," statistic). "The hat means 'measured,' not 'true.'"
Land the key idea: the statistic (62%) is what we have; the parameter is what we want. The whole rest of the course is the bridge from one to the other.
Segment 3 — Levels of Measurement (25 min)
Plain language first. Before you can summarize a variable, you have to know what kind of thing it is. There are two big families and four levels.
- Categorical (qualitative) — labels or groups. Two levels:
- Nominal — names with no order. (Major, eye color, yes/no, jersey number.)
- Ordinal — ordered categories, but the gaps aren't equal/measurable. (Letter grade, "small/medium/large," 1–5 satisfaction.)
- Quantitative (numerical) — actual amounts you can do arithmetic on. Two levels:
- Interval — ordered, equal gaps, but no true zero (zero doesn't mean "none"). (Temperature in °F/°C, calendar year.)
- Ratio — ordered, equal gaps, and a true zero so ratios make sense. (Height, weight, age, income, count of anything.)
Memory hook: N–O–I–R (like the French for "black," noir) — Nominal, Ordinal, Interval, Ratio, in order of how much math they permit.
One fully worked example (classify and justify each):
A campus survey records: student ID, major, class standing (Fr/So/Jr/Sr), GPA, Fahrenheit temp of their dorm, number of cups of coffee yesterday.
- Student ID → nominal (it's a number, but it's a name — averaging IDs is nonsense).
- Major → nominal.
- Class standing → ordinal (ordered, but "Fr→So" isn't a measurable, equal jump).
- GPA → ratio-like, treated quantitative (0.0 means "no grade points" — a true zero).
- Dorm temperature (°F) → interval (0°F isn't "no heat").
- Cups of coffee → ratio (0 means none; 4 cups is twice 2).
The "test" to give students: Does zero mean 'none'? If yes → ratio. Are the gaps equal but zero is arbitrary? → interval. Ordered labels, fuzzy gaps? → ordinal. Just names? → nominal.
Segment 4 — Misconceptions + Quick Interaction (22 min) · Session 1 closes (~75)
Name the misconceptions out loud, then cure each:
- ❌ "If it's a number, it's quantitative."
✅ Cure: zip codes, phone numbers, jersey numbers, and student IDs are numbers that label. The test isn't "is it a number," it's "does arithmetic mean anything." You can't average area codes. - ❌ "A bigger sample is automatically a better sample."
✅ Cure: size doesn't fix bias. A huge sample drawn the wrong way is confidently wrong. (We'll see the Literary Digest disaster next session — 2.4 million responses, dead wrong.) "How you pick beats how many you pick." - ❌ "Population means a lot of people."
✅ Cure: the population is whoever the question is about — it could be 30 students in one section, or every transaction at one store. It's defined by the claim, not by being large. - ❌ "Sample and population are fixed words."
✅ Cure: they're roles. This year's graduates can be a sample of all graduates ever, or the whole population of this year's graduates. Same people, different question.
Interaction — Think-Pair-Share (rapid-fire classification, ~12 min):
Put 6 variables on a slide; students decide level of measurement solo (30 sec), compare with a neighbor (1 min), then class votes by fingers (1=nominal … 4=ratio). Suggested items: number on a soccer jersey · ranking in a marathon (1st/2nd/3rd) · temperature in °C · annual income · blood type · year a car was made.
(Answers: nominal · ordinal · interval · ratio · nominal · interval.)
Debrief the two that always split the room: jersey number (nominal, not ratio) and marathon ranking (ordinal — 1st-to-2nd isn't the same time gap as 2nd-to-3rd).
Segment 5 — How We Pick: Sampling Methods (25 min) · Session 2 opens
Hook back in: "Last session: a statistic is only as good as the sample behind it. Today: how do you pick a sample you can actually trust?"
Plain language first — the gold standard.
- Simple Random Sample (SRS): every individual has an equal chance, and every group of that size is equally likely. Like names in a hat. This is the benchmark all other methods are judged against.
The four probability methods (each with a one-line picture):
- Simple random — names in a hat. Fair, but can miss small groups by luck.
- Stratified — split the population into meaningful groups (strata) first, then random-sample within each. Use when you want every subgroup represented (e.g., sample within each class standing).
- Cluster — split into natural groups (clusters), randomly pick whole clusters, measure everyone in them. Cheaper when the population is spread out (e.g., pick 5 dorm buildings, survey everyone in them).
- Systematic — order the list, pick every k-th person after a random start. Easy when you have a line or a list.
Memory hook: Stratified = sample within every group. Cluster = sample whole groups. (The classic mix-up; say it twice.)
The methods to distrust (name them as traps):
- Convenience sample — whoever's easy to reach (your friends, the front row). Cheap and almost always biased.
- Voluntary response — people opt in (online polls, "text YES to…"). The angry and the passionate over-reply.
Worked example:
Goal: estimate average study hours for all 18,000 undergrads.
- Stand outside the library at 9 p.m. and ask passers-by → convenience, and worse, biased toward studiers (library at 9 p.m.!).
- Email a random 800 pulled from the registrar's full list → SRS, the trustworthy move.
- Want freshmen–seniors all represented → random sample within each class standing → stratified.
Segment 6 — Bias, and a Famous Failure (18 min)
Plain language: bias is error baked into the method — it pushes results the same wrong direction no matter how big the sample gets. Four kinds to recognize:
- Sampling/undercoverage bias — some of the population can't be reached or is left out of the frame.
- Nonresponse bias — the people who don't answer differ from those who do.
- Response bias — the question or setting pushes answers (leading wording, sensitive topics, interviewer effects).
- Voluntary-response bias — the opt-in crowd isn't typical.
The famous failure (the story that sticks):
1936, Literary Digest presidential poll. They mailed 10 million ballots and got 2.4 million back — a colossal sample — and confidently predicted Landon would beat Roosevelt. Roosevelt won in a landslide.
What went wrong? Their mailing list came from car registrations, telephone directories, and magazine subscribers — in the Depression, that skewed wealthy (undercoverage), and only motivated people mailed ballots back (nonresponse). A young George Gallup polled a few thousand people chosen well and got it right.
The lesson, in one line: 2.4 million wrongly-chosen people lost to a few thousand well-chosen ones. Method beats size.
Callback: this is the cure to the Segment-4 misconception "bigger is better." Point back to it explicitly.
Segment 7 — Observational Study vs. Experiment; Correlation ≠ Causation (20 min)
Plain language first.
- In an observational study, you watch and record — you don't change anything. (Survey coffee drinkers; track who gets sick.)
- In an experiment, you deliberately impose a treatment and compare. (Randomly assign some students to a new tutoring tool, some to none, compare scores.)
- Only an experiment with random assignment can support a cause-and-effect claim. Observational studies can show a link, never prove the arrow.
The reason — confounding (worked example):
Headline: "Students who drink coffee get higher grades." (Observational.)
A confounding variable is a third thing tangled with both: maybe hours studying drives both the coffee and the grades. The data can't separate "coffee → grades" from "studying → grades AND coffee."
Picture: coffee — grades looks like a straight arrow, but a hidden "study hours" pulls both strings.
Memory hook: "Correlation is a handshake, not a push." Two things moving together ≠ one shoving the other.
Misconception + cure:
- ❌ "They found a strong correlation, so X causes Y."
✅ Cure: ask "could a third variable explain both?" and "was anything randomly assigned?" If nothing was assigned, you have a link, not a cause.
Quick mini-debate (genuinely arguable, ~4 min): "A study finds people who use a budgeting app save more money. Should the app advertise 'our app makes you save more'?" Have students argue both sides; surface the confounder (people who download budgeting apps may already be savers).
Segment 8 — Technology Workflow + AI-Critique, Callback & Hand-off (12 min) · Session 2 closes (~75)
Technology workflow — draw an SRS in a spreadsheet (exact steps):
1. Put your list of names/IDs in column A (say A2:A101 for 100 people).
2. In B2 type =RAND() and fill down to B101 — a random number beside each person.
3. Select A:B → Data ▸ Sort range → sort by column B.
4. The top 10 rows are your simple random sample. (Re-sorting reshuffles — that's the randomness working.)
- Google Sheets: =RAND(); Excel identical. For a quick reproducible pick, =RANDBETWEEN(1,100) also works.
AI-critique moment (students verify, not consume):
Paste this to an approved chatbot: "Classify these variables by level of measurement: zip code, marathon finishing place, temperature in °C, annual income."
Then check its work against NOIR. Chatbots often call zip code "ratio" (it's nominal) and stumble on °C ("ratio"? — it's interval, no true zero). Your job all semester: the tool drafts, you judge. This is exactly how the weekly Lecture Tutorial works — you'll catch the model, not trust it.
Callback + tease:
- Callback: "Every trustworthy number this term rides on this week — who was measured and how they were picked."
- Tease next week: "Now that we can get good data, Week 2 is the first thing we do with it: turn a pile of numbers into a picture — histograms, shapes, and the outliers that lie to you."
Hand-off (the week's graded work):
- Lecture Tutorial 1 (AI tutor, share-link submission) — population/sample, NOIR, sampling & bias.
- Quiz 1 (end of week) and Discussion 1 ("Spot the biased sample" — post one real-world example).
Instructor FAQ — Common Stumbles
| Student says / does | Quick cure |
|---|---|
| "Is a student ID quantitative? It's a number." | Apply the test — does averaging it mean anything? No. It's a nominal label that happens to look numeric. |
| "What's the difference between interval and ratio again?" | One question: does zero mean 'none'? Yes → ratio (income, age, counts). No, zero is just a mark → interval (°F, °C, calendar year). |
| Confuses stratified and cluster. | Stratified = sample within every group; cluster = sample whole groups and measure all of them. Stratified wants representation; cluster wants cheapness. |
| "The sample was 2 million people, so it's reliable." | Size never fixes bias. Literary Digest, 1936: 2.4M responses, wrong winner. Method beats size. |
| Calls a strong correlation "proof" of cause. | Ask: was anything randomly assigned? If no, it's observational → a link, not a cause. Hunt the confounder. |
| "Population = a big group." | Population is whoever the question is about — could be 25 people. It's a role, not a size. |
| Thinks a census is a kind of sample. | A census measures the whole population — the opposite of sampling. Rare because it's usually too costly/slow. |
| Picks convenience sample because it's "random enough." | Convenience ≠ random. "Whoever's nearby" has a hidden pattern (library at 9 p.m. = studiers). Only a chance-based method earns the word random. |
Scope flag
This outline stays within Objective 1. The °F/°C interval-vs-ratio nuance and the Literary Digest case are added context (not strictly required by the objective) — kept because they make the misconceptions stick; cut them if you want a leaner 60-minute version.
~ Prof. Rivera's edition · Fall 2026 · built with thecoursemaker.com