What data does NoPunt use to make picks?

Every pick starts with NFL play-by-play data from nflfastR/nfl_data_py, covering every play of every game since 1999. From that we compute Expected Points Added (EPA), success rate, win probability, personnel groupings, and down/distance context. We don't use power rankings from elsewhere or last week's box score -- we use the play-by-play.

How does the NoPunt prediction model work?

Three logistic regression classifiers, each trained on a different rolling window (24, 33, and 64 games) of every NFL game since 2002. They vote. Majority wins. Features include rolling EPA splits, home/road deltas, rest days, divisional and primetime indicators, QB-level adjustments, and the Vegas spread as a calibration anchor.

What do the confidence tiers (S, A+, A, B, C) mean?

Not every pick is equal. Tiers come from the ensemble's own conviction -- the three models' vote and our predicted margin -- not the gap to the Vegas line. S-tier needs a unanimous 3-0 vote plus a 10+ point predicted margin -- our strongest conviction. A+ is unanimous with a 7+ point margin, A is unanimous with a 65%+ win probability, and B is any other unanimous pick. C is a split 2-1 decision -- a lean only. Tier hit rates are computed live from every completed pick.

When are picks published each week?

Tuesday morning: results sync scores last week's games, then stats sync pulls fresh play-by-play. Wednesday 8am ET: the model runs against the upcoming slate, emits tier-graded picks, and they go live on the site. As games complete Sunday through Tuesday, the results sync attaches final scores and recomputes season ROI.

2025 SEASON FINAL · PICKS RETURN SEPTEMBER 2026

METHODOLOGY · THE FULL SHOW YOUR WORK

HOW NOPUNT PICKS GAMES.

Q: What is the Edge?

Beyond the straight-up call on every game, the model flags a curated set — the Edge — the plays it would actually bet. That flagged set, and the betting strategy behind it, is available to Pro members. The full straight-up prediction record stays public.

Q: How can I verify NoPunt's track record?

Every pick ever published is on the results page. We don't delete losses. We don't edit screenshots. The full ledger is 1,615 games on file at 66.5% accuracy. You can also pull the data programmatically via our public JSON API.

We’re a model, not a take artist. Every pick comes from the same pipeline: play-by-play data goes in, a tier-graded pick comes out, the result gets logged whether we won or lost. Here’s every step.

THE DATA

Every pick starts with NFL play-by-play from the same public source the analytics community uses (nflfastR / nfl_data_py). That gets us, for every play of every game since 1999:

Expected Points Added (EPA) -- the change in expected points a play caused.
Success rate -- did the play meaningfully advance the chain?
Win probability before and after.
Personnel groupings, formation, down/distance, field position.

From that raw stream we compute weekly team-strength estimates: offensive EPA per play, defensive EPA per play, success rates split by pass and rush. We don’t use power rankings from elsewhere. We don’t use last week’s box score. We use the play-by-play.
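Here's a minimal sketch of that roll-up, not our production code. It assumes play rows shaped like nflfastR's: the `posteam`, `defteam`, `play_type`, and `epa` names follow that dataset. The sample plays and the `team_strength` helper are made up for illustration, and counting a play as a success when its EPA is positive is an assumption about the success definition.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical play rows; real play-by-play rows carry many more columns.
plays = [
    {"posteam": "KC", "defteam": "BUF", "play_type": "pass", "epa": 0.8},
    {"posteam": "KC", "defteam": "BUF", "play_type": "run", "epa": -0.3},
    {"posteam": "BUF", "defteam": "KC", "play_type": "pass", "epa": 1.2},
    {"posteam": "BUF", "defteam": "KC", "play_type": "run", "epa": 0.1},
    {"posteam": "BUF", "defteam": "KC", "play_type": "pass", "epa": -0.6},
]


def team_strength(plays):
    """Roll plays up into per-team EPA per play and pass/rush success rates."""
    offense = defaultdict(list)   # EPA generated while a team has the ball
    defense = defaultdict(list)   # EPA allowed while a team is defending
    success = defaultdict(list)   # 1/0 flags keyed by (team, play_type)
    for play in plays:
        offense[play["posteam"]].append(play["epa"])
        defense[play["defteam"]].append(play["epa"])
        # Assumption: a play "succeeds" when it adds expected points.
        flag = 1 if play["epa"] > 0 else 0
        success[(play["posteam"], play["play_type"])].append(flag)
    return {
        "off_epa_per_play": {team: mean(v) for team, v in offense.items()},
        # Lower is better here: this is EPA the defense gave up.
        "def_epa_per_play": {team: mean(v) for team, v in defense.items()},
        "success_rate": {key: mean(v) for key, v in success.items()},
    }


strength = team_strength(plays)
print(strength["off_epa_per_play"])
print(strength["success_rate"])
```

A weekly refresh would run the same roll-up over only the plays inside each rolling window.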

THE MODEL

Three logistic regression classifiers, each trained on a different rolling window (24, 33, and 64 games) of every NFL game since 2002. They vote. Majority wins. Features include:

Rolling EPA splits for each team (8-game and season-long).
Home / road performance deltas.
Rest days, divisional indicator, primetime indicator.
QB-level adjustments via 4-week QB EPA above replacement.
Vegas spread as a calibration anchor.

Each model outputs a win probability. The ensemble takes the majority vote and averages the confidence. That gets converted to a fair-line price and compared against the live market to flag value. The tier, though, comes from the ensemble itself: how unanimously the three models vote and how large a margin they project — not the size of the gap to the Vegas line.
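The vote-then-average step can be sketched like this. It's an illustration under stated assumptions, not the production ensemble: `ensemble_pick` and `fair_american_odds` are hypothetical names, the confidence is taken as the mean probability for the picked side, and the fair line is the standard no-vig conversion from probability to American odds.

```python
def ensemble_pick(home_probs):
    """Majority vote over each model's home-win probability.

    Returns the picked side, how many models backed it, and the mean
    probability the models assign to that side.
    """
    n_models = len(home_probs)
    home_votes = sum(p > 0.5 for p in home_probs)
    pick_home = home_votes * 2 > n_models
    mean_home = sum(home_probs) / n_models
    votes_for_pick = home_votes if pick_home else n_models - home_votes
    return {
        "side": "home" if pick_home else "away",
        "votes_for_pick": votes_for_pick,
        "unanimous": votes_for_pick == n_models,
        # The mean can land under 0.5 on a 2-1 split even though the vote wins.
        "confidence": mean_home if pick_home else 1.0 - mean_home,
    }


def fair_american_odds(probability):
    """Convert a win probability to a no-vig American moneyline."""
    if not 0.0 < probability < 1.0:
        raise ValueError("probability must be strictly between 0 and 1")
    if probability >= 0.5:
        return -100.0 * probability / (1.0 - probability)
    return 100.0 * (1.0 - probability) / probability


pick = ensemble_pick([0.62, 0.58, 0.66])
print(pick["side"], pick["votes_for_pick"], round(pick["confidence"], 3))
print(round(fair_american_odds(pick["confidence"])))
```

Comparing that fair price with the live market price is what flags value. The tier never looks at that comparison.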

THE TIERS

Not every pick is created equal. We grade confidence on a 5-tier scale so you know when to lean in vs. lay off:

TIER	CRITERIA	HIT RATE
S	Unanimous 3-0 vote, 10+ pt predicted margin	83.7% (76-91%)
A+	Unanimous 3-0 vote, 7+ pt predicted margin	75.1% (69-81%)
A	Unanimous 3-0 vote, 65%+ win probability	68.0% (59-77%)
B	Unanimous 3-0 vote	66.3% (63-70%)
C	Split 2-1 vote — lean only	58.3% (53-63%)

Hit rates show the point estimate with a 95% bootstrap confidence interval, computed by the validation engine over every completed pick. The bands update as games go final.
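The criteria in the table translate into a short rule chain. This is a sketch: the function name and argument names are hypothetical, and checking the tiers from strongest to weakest is an assumption about how overlapping criteria are resolved.

```python
def assign_tier(votes_for_pick, predicted_margin, win_probability, n_models=3):
    """Map the ensemble's vote, projected margin, and win probability to a tier."""
    if votes_for_pick < n_models:
        return "C"   # split 2-1 vote: lean only
    if predicted_margin >= 10:
        return "S"   # unanimous, 10+ point predicted margin
    if predicted_margin >= 7:
        return "A+"  # unanimous, 7+ point predicted margin
    if win_probability >= 0.65:
        return "A"   # unanimous, 65%+ win probability
    return "B"       # any other unanimous pick


print(assign_tier(3, 11.5, 0.78))  # S
print(assign_tier(3, 4.0, 0.66))   # A
print(assign_tier(2, 12.0, 0.70))  # C
```

Note that a split vote is capped at C no matter how large the projected margin is.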

THE EDGE

On top of the straight-up call for every game, the model flags a curated set — the Edge — the plays it would actually bet. That flagged set, and the betting strategy behind it, is reserved for Pro members. The full straight-up prediction record stays public.

THE PIPELINE

Tuesday morning -- results sync scores last week's games, then stats sync pulls fresh PBP and refreshes team-strength estimates.
Wednesday 8am ET -- model runs against the upcoming slate, emits tier-graded picks. Picks are written to Supabase and go live on the site.
Sunday → Monday → Tuesday -- as games complete, the results sync attaches final scores, marks W/L, and recomputes season ROI.
Every step runs in GitHub Actions. The cron logs are public.

THE RECEIPTS

Every pick we’ve ever published is on the results page. We don’t delete losses. We don’t edit screenshots. The full ledger is 1,615 games on file at 66.5% accuracy.

You can also pull the data programmatically:

GET /api/picks.json -- this week’s slate
GET /api/picks/[season]/[week].json -- any historical week with results
/rss.xml -- RSS of the last 50 completed games
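A minimal client sketch for those endpoints, using only the Python standard library. The paths come from the list above; `BASE_URL` is a placeholder to replace with the site's real origin, the helper names are hypothetical, and the exact season and week formats the API expects are assumptions. The response schema isn't described here, so the sketch returns the parsed JSON untouched.

```python
import json
from urllib.request import urlopen

BASE_URL = "https://example.com"  # placeholder: substitute the site's origin


def picks_url(season=None, week=None, base_url=BASE_URL):
    """Build the URL for this week's slate or for one historical week."""
    if season is None and week is None:
        return f"{base_url}/api/picks.json"
    if season is None or week is None:
        raise ValueError("season and week must be given together")
    return f"{base_url}/api/picks/{season}/{week}.json"


def fetch_picks(season=None, week=None, base_url=BASE_URL):
    """Download and parse one picks document (performs a network request)."""
    with urlopen(picks_url(season, week, base_url), timeout=10) as response:
        return json.load(response)


print(picks_url())
print(picks_url(2024, 7))
```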


LIVE VALIDATION

Theory is nice. Numbers are better. Here’s the model’s performance, with 95% bootstrap confidence intervals on the validated metrics:

METRIC	RATE	RECORD / INTERVAL
ACCURACY	66.5%	95% CI 64-69%
UNANIMOUS (3-0)	68.8%	841-381
SPLIT VOTE (2-1)	58.3%	229-164
ATS RATE	51.5%	814-768

When all three sub-models agree, accuracy rises to 68.8%. When they split 2-1, we still pick the majority side, but accuracy drops to 58.3%. That 10.5-point gap supports the ensemble design: agreement among the models is itself a signal.
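For readers who want to reproduce an interval like the ones quoted here, this is a sketch of a percentile bootstrap over a win-loss record. The validation engine's exact procedure isn't spelled out on this page, so the resampling scheme, the resample count, and the `bootstrap_ci` name are all assumptions.

```python
import random


def bootstrap_ci(wins, losses, n_resamples=2000, alpha=0.05, seed=0):
    """Percentile-bootstrap confidence interval for a hit rate."""
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    outcomes = [1] * wins + [0] * losses
    n = len(outcomes)
    # Resample the picks with replacement and record each resample's hit rate.
    rates = sorted(
        sum(rng.choices(outcomes, k=n)) / n for _ in range(n_resamples)
    )
    lower = rates[int((alpha / 2) * n_resamples)]
    upper = rates[int((1 - alpha / 2) * n_resamples) - 1]
    return wins / n, lower, upper


# Split-vote record from this page: 229 wins, 164 losses.
point, lower, upper = bootstrap_ci(229, 164)
print(f"{point:.1%} ({lower:.1%} to {upper:.1%})")
```

Smaller samples give wider bands, which is why the tiers with fewer picks carry the widest intervals.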

SEASON-BY-SEASON ACCURACY

SEASON	ACCURACY	RECORD
2020	67.2%	172-84
2021	62.5%	170-102
2022	65.7%	178-93
2023	64.7%	176-96
2024	69.5%	189-83
2025	68.0%	185-87

CALIBRATION

A well-calibrated model’s 70% confidence picks should win about 70% of the time. Below: predicted confidence (x) vs actual win rate (y) for the raw ensemble. Points near the diagonal mean the model knows what it knows.

EXPECTED CALIBRATION ERROR: 6.5%

Each dot represents a confidence bucket from the validated calibration engine. The dashed line is perfect calibration. Points above the line mean the model was underconfident (winning more than predicted); below means overconfident. Expected Calibration Error (ECE) is the average gap between predicted and actual across buckets — lower is better.
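As a rough check on the definition, here is a sketch that computes ECE as a pick-weighted average gap. The bucket counts and win rates are the figures from this page's confidence distribution. Using each bucket's midpoint as its mean predicted confidence is an assumption, as is the weighting by bucket size, and the function name is hypothetical.

```python
def expected_calibration_error(buckets):
    """Pick-weighted mean of |actual win rate - predicted confidence|.

    buckets: list of (mean_predicted, actual_win_rate, n_picks) tuples.
    """
    total_picks = sum(n for _, _, n in buckets)
    weighted_gap = sum(
        abs(actual - predicted) * n for predicted, actual, n in buckets
    )
    return weighted_gap / total_picks


# (bucket midpoint, observed win rate, number of picks)
buckets = [
    (0.45, 0.55, 64),
    (0.55, 0.64, 842),
    (0.65, 0.69, 528),
    (0.75, 0.76, 161),
    (0.85, 0.87, 15),
]

ece = expected_calibration_error(buckets)
print(f"{ece:.3f}")
```

With midpoints standing in for the true bucket means, the result lands near the 6.5% figure reported for the raw ensemble.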

The raw ensemble runs a few points underconfident in the middle buckets: the pick is a hard 3-model vote while the shown probability is the mean of three probabilities, and the vote carries information the mean doesn’t. So the confidence we display is corrected with a per-season Platt calibration (fit only on prior seasons, frozen within a season, and clamped so it can never flip which side we picked). The displayed number is the honest one; this chart shows the raw model it’s built from.
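A sketch of what a correction like that can look like. It is a Platt-style logistic recalibration on the logit of the raw confidence, fit by plain gradient descent. The training data is synthetic, the function names are hypothetical, and the 0.501 floor used for the clamp is an assumed value; the production fit may differ on all of these.

```python
import math


def logit(p):
    return math.log(p / (1.0 - p))


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def fit_platt(raw_confidences, outcomes, learning_rate=0.5, epochs=2000):
    """Fit a, b in sigmoid(a * logit(p) + b) by minimizing log loss."""
    a, b = 1.0, 0.0  # start from the identity mapping
    xs = [logit(p) for p in raw_confidences]
    n = len(xs)
    for _ in range(epochs):
        grad_a = grad_b = 0.0
        for x, y in zip(xs, outcomes):
            error = sigmoid(a * x + b) - y
            grad_a += error * x
            grad_b += error
        a -= learning_rate * grad_a / n
        b -= learning_rate * grad_b / n
    return a, b


def calibrated_confidence(raw_confidence, a, b, floor=0.501):
    """Apply the frozen fit; the floor keeps the picked side the favorite."""
    return max(floor, sigmoid(a * logit(raw_confidence) + b))


# Synthetic "prior seasons": raw 60% picks won 7 of 10, raw 70% picks won 8 of 10.
prior_raw = [0.6] * 10 + [0.7] * 10
prior_won = [1] * 7 + [0] * 3 + [1] * 8 + [0] * 2

a, b = fit_platt(prior_raw, prior_won)       # fit once, before the season
print(round(calibrated_confidence(0.6, a, b), 3))
print(calibrated_confidence(0.45, 1.0, 0.0))  # clamped to the floor
```

Because the fit is frozen before the season starts, a mid-season hot or cold streak cannot leak into the displayed number.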

ENSEMBLE AGREEMENT

NoPunt runs three independent logistic regression models. Each casts a vote. Unanimous (3-0) picks historically hit at a higher rate than split (2-1) decisions.

VOTE	HIT RATE	RECORD	PICKS
UNANIMOUS (3-0)	68.8%	841-381	1222
MAJORITY (2-1)	58.3%	229-164	393

CONFIDENCE DISTRIBUTION

How often does the model output high-confidence picks vs close calls? A top-heavy distribution means the model is decisive; a flat one means most games look like coin flips.

CONFIDENCE	PICKS	WIN RATE
40-50%	64	55%
50-60%	842	64%
60-70%	528	69%
70-80%	161	76%
80-90%	15	87%

Each row is one confidence bucket: the number of picks in it (the bar height in the chart) and the win rate for those picks. The model concentrates picks in the 55-65% range with selective high-confidence calls at higher tiers.

NO BLACK BOX.

That’s the methodology. If something here doesn’t add up, tell us -- every pick is publicly verifiable.

