Inside the Models

This walkthrough follows the published presidential-approval model, which gives a single likely-to-approve probability. The simulator also offers an ordered 4-point view of the same question that works a little differently — here we stay with the published top-2-box model.

What is a model?

A model is a simplified street map. A real map of a city leaves out most of what's actually there — the trees, the traffic lights, the side streets that don't matter for your trip. What it keeps is enough to get you from one place to another.

Most streets are left out. The map keeps just the route that gets you there.

A statistical model works the same way. It's a simplified picture of how an outcome moves — what nudges it up, what pulls it down — kept simple enough that a person can read it. The simplification isn't a flaw; it's the whole point. A perfect 1:1 reproduction of reality wouldn't help anyone navigate.

The variables shown below aren't the only things that could matter. They're the optimal combination drawn from every candidate variable in the dataset — selected because, together, they predict the outcome better than other combinations would. A different set might do almost as well; a much smaller set usually does worse; adding more variables past a certain point stops helping.

Loading model and microdata…

One platform, four related models

The shape of the outcome decides which model fits it. The platform fits four related families and reports them all in the same headline units. This page walks through one of them, one respondent at a time.

—

Every dot above is one prediction. To see where those numbers come from, pull one case out of the cloud — pick the highest, the lowest, a specific one, or leave it on a random draw:

Featured respondent

—

respondent #

We've pulled our featured one out of the cloud. Here is what they told the interviewer:

Their answers

These six answers are everything the model knows about this respondent.

How the model thinks about this respondent

The model runs those inputs through an equation. Each non-reference input has a coefficient — a number, learned when the model was fit on all — respondents, that says how much this input pushes the predicted probability up or down. Reference inputs contribute zero; the constant absorbs them. Continuous predictors contribute their coefficient times the measured value.

The calculation for our featured one, row by row:

The total at the bottom is on the log-odds scale. The logistic function σ(x) = 1 / (1 + e^-x) squashes it back into a probability between 0 and 1:

Log-odds total:

—

Predicted probability for our featured one:

—

What actually happened?

We know our featured one's actual outcome. The prediction above was computed from the recorded inputs and fitted coefficients, not from this case's observed outcome. So we can put the model-implied probability and the actual result side by side:

Model predicted

—

before knowing the outcome

Actually happened

—

What if this one had different inputs?

Purple = changes this one respondent only

The dropdowns and sliders below start at this one's actual values. Change any to a different value, then click Apply. The σ panel below recomputes the prediction for this one only — the rest of the sample stays untouched. The locked baseline at the top of the page stays where it is, so you can compare side by side.

Your what-if prediction

New log-odds total:

—

New predicted probability:

—

They're one of —

Our featured one's prediction is one number. The model runs the same calculation for every respondent in the sample, producing — predicted probabilities. Plotted together:

Each bar is a 5-percentage-point bin. The vertical line marks the weighted mean of all those probabilities — the headline:

—

Our featured one sits at —. Plenty of respondents sit far above or far below the headline — the spread on either side is what the headline hides.

How well does the model sort people like our featured one?

That last section compared our featured one's prediction against what actually happened for that one respondent. We can do the same comparison across the whole sample — we know the actual outcome for every respondent. The model assigns each case a predicted probability from its recorded inputs and the fitted coefficients. So we can ask, after the fact: did the model give higher predicted probabilities to actual approvers and lower ones to actual non-approvers?

The same cloud, split by what actually happened:

Actual approvers got an average predicted probability of —; actual non-approvers averaged —. The gap between those means is Tjur's R², in probability units (shown here as percentage points):

—

0 pp means no separation — the predicted probability tells you nothing about which group the respondent belongs to. 100 pp means perfect separation. Real-world models land in between.

This number measures how well the published model separates the two groups on the data it was fit on, and it stays fixed through the scenarios below. (An all-or-nothing scenario homogenizes one predictor across every respondent, which mechanically shrinks the gap and falsely suggests the model "got worse" under intervention — but nothing about the model has changed.)

And what if everyone had answered differently?

Blue = changes every respondent

Earlier you watched the featured case's prediction shift when you changed one of its inputs. Only that one moved — the cloud and the headline stayed put because the rest of the sample didn't change. The simulator below applies the same kind of swap to everyone: pin one or more inputs to a single value, and every case in the sample gets that value. The headline tiles and the cloud below redraw to show the shift.

This is a teaching version of the population simulator. For the full release-day scenario tool, use the All-or-Nothing page.

No scenario applied — showing baseline.

Baseline headline

—

Under scenario

—

95% CI: —

Change

—

percentage points