Back to the Introduction to Statistics outline The Course Maker
Introduction to Statistics outline
Week 1 · Lecture outline

Week 1 — Lecture Outline · Foundations & Types of Data

Introduction to Statistics · MATH 11 Fall 2026 · Prof. Rivera Fictional sample

Course: Introduction to Statistics (MATH 11) · Silver Oak University (fictional sample) · Prof. Rivera
Objectives covered: Objective 1 — Distinguish populations from samples and identify appropriate sampling and study designs.
SLOs touched: A (reason quantitatively from data) · B (communicate results to a non-technical audience)
Meeting pattern: 2 sessions × 75 min = 150 min. Segment minutes below total ~150; scale to your own pattern.


Week at a Glance

The week's big question "Where do data come from, and when can the numbers be trusted to speak for more people than we actually measured?"
By the end of the week, students can… (1) tell a population from a sample, and a parameter from a statistic; (2) classify a variable by its level of measurement (nominal / ordinal / interval / ratio); (3) name a sampling method and say whether it's likely to be biased; (4) tell an observational study from an experiment and explain why correlation isn't causation.
Key vocabulary population, sample, parameter, statistic, census, variable, categorical vs. quantitative, nominal/ordinal/interval/ratio, simple random sample (SRS), stratified, cluster, systematic, convenience/voluntary-response, bias (sampling, undercoverage, nonresponse, response), observational study, experiment, confounding variable
Materials slides (Deck 1), the week's readings + video links, a spreadsheet (Google Sheets or Excel), one approved chatbot (Gemini / Claude / ChatGPT) for the AI-critique moment and the tutorial
Timing note 8 segments, ~150 min total. Session 1 = Segments 1–4 (~75). Session 2 = Segments 5–8 (~75).

Segment 1 — Hook & the Promise (8 min) · Session 1 opens

Hook. "Raise your hand if, in the last 24 hours, you generated data." Wait. Then: every hand should be up.
- Your phone counted your steps. Spotify logged every song. A delivery app timed your driver. A professor's LMS logged when you opened this page.
- "You are walking around producing data constantly. This course is about the other side of that: how anyone turns numbers like yours into a claim about people they never met — and how to tell the honest claims from the garbage."

The promise (write it on the board): "By the end of this week you can look at any statistic in the wild — a poll, a 'studies show,' a campus survey — and ask the three questions that decide whether it deserves your trust: Who was measured? How were they picked? What was actually recorded?"

Why it matters line (memory hook): "Statistics is not about the math. It's about trusting a number that describes people you didn't count."


Segment 2 — Population vs. Sample, Parameter vs. Statistic (20 min)

Plain language first.
- A population is everyone (or everything) we want to know about. A sample is the part we actually measured. We measure the sample because measuring the whole population is usually impossible, too slow, or too expensive.
- A census is the rare case where we do measure the whole population.
- Two more words, same split: a number that describes the population is a parameter; the matching number from the sample is a statistic.

Memory hook (put it on a slide):

Population → Parameter. Sample → Statistic. "The letters line up."

One fully worked example (do every step out loud).

Claim: "62% of Silver Oak undergraduates feel stressed about money."
A pollster emailed all 18,000 undergrads (the population they care about). 900 replied (the sample). Of those 900, 558 said yes.
- Sample statistic: 558 / 900 = 0.62 → 62%. This is a statistic (it came from the sample).
- The true campus-wide percentage — what we'd get if all 18,000 answered honestly — is the parameter. We never see it directly; the statistic is our best estimate of it.
- Notation preview (notation comes after the idea): we write the population proportion as p (parameter) and the sample proportion as ("p-hat," statistic). "The hat means 'measured,' not 'true.'"

Land the key idea: the statistic (62%) is what we have; the parameter is what we want. The whole rest of the course is the bridge from one to the other.


Segment 3 — Levels of Measurement (25 min)

Plain language first. Before you can summarize a variable, you have to know what kind of thing it is. There are two big families and four levels.

  • Categorical (qualitative) — labels or groups. Two levels:
  • Nominal — names with no order. (Major, eye color, yes/no, jersey number.)
  • Ordinal — ordered categories, but the gaps aren't equal/measurable. (Letter grade, "small/medium/large," 1–5 satisfaction.)
  • Quantitative (numerical) — actual amounts you can do arithmetic on. Two levels:
  • Interval — ordered, equal gaps, but no true zero (zero doesn't mean "none"). (Temperature in °F/°C, calendar year.)
  • Ratio — ordered, equal gaps, and a true zero so ratios make sense. (Height, weight, age, income, count of anything.)

Memory hook: N–O–I–R (like the French for "black," noir) — Nominal, Ordinal, Interval, Ratio, in order of how much math they permit.

One fully worked example (classify and justify each):

A campus survey records: student ID, major, class standing (Fr/So/Jr/Sr), GPA, Fahrenheit temp of their dorm, number of cups of coffee yesterday.
- Student ID → nominal (it's a number, but it's a name — averaging IDs is nonsense).
- Major → nominal.
- Class standing → ordinal (ordered, but "Fr→So" isn't a measurable, equal jump).
- GPA → ratio-like, treated quantitative (0.0 means "no grade points" — a true zero).
- Dorm temperature (°F) → interval (0°F isn't "no heat").
- Cups of coffee → ratio (0 means none; 4 cups is twice 2).

The "test" to give students: Does zero mean 'none'? If yes → ratio. Are the gaps equal but zero is arbitrary? → interval. Ordered labels, fuzzy gaps? → ordinal. Just names? → nominal.


Segment 4 — Misconceptions + Quick Interaction (22 min) · Session 1 closes (~75)

Name the misconceptions out loud, then cure each:

  • "If it's a number, it's quantitative."
    Cure: zip codes, phone numbers, jersey numbers, and student IDs are numbers that label. The test isn't "is it a number," it's "does arithmetic mean anything." You can't average area codes.
  • "A bigger sample is automatically a better sample."
    Cure: size doesn't fix bias. A huge sample drawn the wrong way is confidently wrong. (We'll see the Literary Digest disaster next session — 2.4 million responses, dead wrong.) "How you pick beats how many you pick."
  • "Population means a lot of people."
    Cure: the population is whoever the question is about — it could be 30 students in one section, or every transaction at one store. It's defined by the claim, not by being large.
  • "Sample and population are fixed words."
    Cure: they're roles. This year's graduates can be a sample of all graduates ever, or the whole population of this year's graduates. Same people, different question.

Interaction — Think-Pair-Share (rapid-fire classification, ~12 min):
Put 6 variables on a slide; students decide level of measurement solo (30 sec), compare with a neighbor (1 min), then class votes by fingers (1=nominal … 4=ratio). Suggested items: number on a soccer jersey · ranking in a marathon (1st/2nd/3rd) · temperature in °C · annual income · blood type · year a car was made.
(Answers: nominal · ordinal · interval · ratio · nominal · interval.)
Debrief the two that always split the room: jersey number (nominal, not ratio) and marathon ranking (ordinal — 1st-to-2nd isn't the same time gap as 2nd-to-3rd).


Segment 5 — How We Pick: Sampling Methods (25 min) · Session 2 opens

Hook back in: "Last session: a statistic is only as good as the sample behind it. Today: how do you pick a sample you can actually trust?"

Plain language first — the gold standard.
- Simple Random Sample (SRS): every individual has an equal chance, and every group of that size is equally likely. Like names in a hat. This is the benchmark all other methods are judged against.

The four probability methods (each with a one-line picture):
- Simple random — names in a hat. Fair, but can miss small groups by luck.
- Stratified — split the population into meaningful groups (strata) first, then random-sample within each. Use when you want every subgroup represented (e.g., sample within each class standing).
- Cluster — split into natural groups (clusters), randomly pick whole clusters, measure everyone in them. Cheaper when the population is spread out (e.g., pick 5 dorm buildings, survey everyone in them).
- Systematic — order the list, pick every k-th person after a random start. Easy when you have a line or a list.

Memory hook: Stratified = sample within every group. Cluster = sample whole groups. (The classic mix-up; say it twice.)

The methods to distrust (name them as traps):
- Convenience sample — whoever's easy to reach (your friends, the front row). Cheap and almost always biased.
- Voluntary response — people opt in (online polls, "text YES to…"). The angry and the passionate over-reply.

Worked example:

Goal: estimate average study hours for all 18,000 undergrads.
- Stand outside the library at 9 p.m. and ask passers-by → convenience, and worse, biased toward studiers (library at 9 p.m.!).
- Email a random 800 pulled from the registrar's full list → SRS, the trustworthy move.
- Want freshmen–seniors all represented → random sample within each class standingstratified.


Segment 6 — Bias, and a Famous Failure (18 min)

Plain language: bias is error baked into the method — it pushes results the same wrong direction no matter how big the sample gets. Four kinds to recognize:
- Sampling/undercoverage bias — some of the population can't be reached or is left out of the frame.
- Nonresponse bias — the people who don't answer differ from those who do.
- Response bias — the question or setting pushes answers (leading wording, sensitive topics, interviewer effects).
- Voluntary-response bias — the opt-in crowd isn't typical.

The famous failure (the story that sticks):

1936, Literary Digest presidential poll. They mailed 10 million ballots and got 2.4 million back — a colossal sample — and confidently predicted Landon would beat Roosevelt. Roosevelt won in a landslide.
What went wrong? Their mailing list came from car registrations, telephone directories, and magazine subscribers — in the Depression, that skewed wealthy (undercoverage), and only motivated people mailed ballots back (nonresponse). A young George Gallup polled a few thousand people chosen well and got it right.
The lesson, in one line: 2.4 million wrongly-chosen people lost to a few thousand well-chosen ones. Method beats size.

Callback: this is the cure to the Segment-4 misconception "bigger is better." Point back to it explicitly.


Segment 7 — Observational Study vs. Experiment; Correlation ≠ Causation (20 min)

Plain language first.
- In an observational study, you watch and record — you don't change anything. (Survey coffee drinkers; track who gets sick.)
- In an experiment, you deliberately impose a treatment and compare. (Randomly assign some students to a new tutoring tool, some to none, compare scores.)
- Only an experiment with random assignment can support a cause-and-effect claim. Observational studies can show a link, never prove the arrow.

The reason — confounding (worked example):

Headline: "Students who drink coffee get higher grades." (Observational.)
A confounding variable is a third thing tangled with both: maybe hours studying drives both the coffee and the grades. The data can't separate "coffee → grades" from "studying → grades AND coffee."
Picture: coffee — grades looks like a straight arrow, but a hidden "study hours" pulls both strings.

Memory hook: "Correlation is a handshake, not a push." Two things moving together ≠ one shoving the other.

Misconception + cure:
- ❌ "They found a strong correlation, so X causes Y."
Cure: ask "could a third variable explain both?" and "was anything randomly assigned?" If nothing was assigned, you have a link, not a cause.

Quick mini-debate (genuinely arguable, ~4 min): "A study finds people who use a budgeting app save more money. Should the app advertise 'our app makes you save more'?" Have students argue both sides; surface the confounder (people who download budgeting apps may already be savers).


Segment 8 — Technology Workflow + AI-Critique, Callback & Hand-off (12 min) · Session 2 closes (~75)

Technology workflow — draw an SRS in a spreadsheet (exact steps):
1. Put your list of names/IDs in column A (say A2:A101 for 100 people).
2. In B2 type =RAND() and fill down to B101 — a random number beside each person.
3. Select A:B → Data ▸ Sort range → sort by column B.
4. The top 10 rows are your simple random sample. (Re-sorting reshuffles — that's the randomness working.)
- Google Sheets: =RAND(); Excel identical. For a quick reproducible pick, =RANDBETWEEN(1,100) also works.

AI-critique moment (students verify, not consume):

Paste this to an approved chatbot: "Classify these variables by level of measurement: zip code, marathon finishing place, temperature in °C, annual income."
Then check its work against NOIR. Chatbots often call zip code "ratio" (it's nominal) and stumble on °C ("ratio"? — it's interval, no true zero). Your job all semester: the tool drafts, you judge. This is exactly how the weekly Lecture Tutorial works — you'll catch the model, not trust it.

Callback + tease:
- Callback: "Every trustworthy number this term rides on this week — who was measured and how they were picked."
- Tease next week: "Now that we can get good data, Week 2 is the first thing we do with it: turn a pile of numbers into a picture — histograms, shapes, and the outliers that lie to you."

Hand-off (the week's graded work):
- Lecture Tutorial 1 (AI tutor, share-link submission) — population/sample, NOIR, sampling & bias.
- Quiz 1 (end of week) and Discussion 1 ("Spot the biased sample" — post one real-world example).


Instructor FAQ — Common Stumbles

Student says / does Quick cure
"Is a student ID quantitative? It's a number." Apply the test — does averaging it mean anything? No. It's a nominal label that happens to look numeric.
"What's the difference between interval and ratio again?" One question: does zero mean 'none'? Yes → ratio (income, age, counts). No, zero is just a mark → interval (°F, °C, calendar year).
Confuses stratified and cluster. Stratified = sample within every group; cluster = sample whole groups and measure all of them. Stratified wants representation; cluster wants cheapness.
"The sample was 2 million people, so it's reliable." Size never fixes bias. Literary Digest, 1936: 2.4M responses, wrong winner. Method beats size.
Calls a strong correlation "proof" of cause. Ask: was anything randomly assigned? If no, it's observational → a link, not a cause. Hunt the confounder.
"Population = a big group." Population is whoever the question is about — could be 25 people. It's a role, not a size.
Thinks a census is a kind of sample. A census measures the whole population — the opposite of sampling. Rare because it's usually too costly/slow.
Picks convenience sample because it's "random enough." Convenience ≠ random. "Whoever's nearby" has a hidden pattern (library at 9 p.m. = studiers). Only a chance-based method earns the word random.

Scope flag

This outline stays within Objective 1. The °F/°C interval-vs-ratio nuance and the Literary Digest case are added context (not strictly required by the objective) — kept because they make the misconceptions stick; cut them if you want a leaner 60-minute version.

~ Prof. Rivera's edition · Fall 2026 · built with thecoursemaker.com