Best Films 2016-2026 — Critic Consensus vs. Quilty Score | Quilty
Benchmark · Updated 2026-05-23

Critic Consensus vs. Quilty Score

How our greenlight model relates to films critics loved — the 50 top-consensus features of 2016–2026.

Critic averages tell us how a finished film was received. The Quilty Score asks a different question: would the underlying material have been worth greenlighting in the first place? This page puts both lenses side by side so the disagreements are useful, not embarrassing.

A note on sources

The IMDb, Rotten Tomatoes, and Metacritic values and the consensus average shown on this page were transcribed from a public "Best Movies of 2016–2026" graphic. They have not been verified against the source APIs; treat them as directional. Quilty scores come from our production database and are calculated by the V5 ensemble.

Coverage

8 / 50

films currently scored by Quilty

Average Quilty

74

across the scored subset

Largest Positive Delta

Nope

+2.9 vs. consensus

#1 · Drama · Parasite (2019)
Quilty 76.1 · IMDb 8.5 · RT 99% · MC 97 · Avg 94.8 · Quilty − Consensus -18.7

#2 · Drama · Moonlight (2016)
Quilty 76.8 · IMDb 7.4 · RT 98% · MC 99 · Avg 91.5 · Quilty − Consensus -14.7

#3 · Romance · Portrait of a Lady on Fire (2019)
Not scored · IMDb 8.1 · RT 97% · MC 95 · Avg 91.4

#4 · Drama · The Father (2020)
Not scored · IMDb 8.2 · RT 98% · MC 88 · Avg 91.3

#5 · Action · Mad Max: Fury Road (2015)
Not scored · IMDb 8.1 · RT 97% · MC 90 · Avg 90.3

#6 · Animation · Spider-Man: Across the Spider-Verse (2023)
Not scored · IMDb 8.6 · RT 95% · MC 86 · Avg 89.9

#7 · Drama · Aftersun (2022)
Not scored · IMDb 7.6 · RT 96% · MC 95 · Avg 89.5

#8 · Biopic · Oppenheimer (2023)
Not scored · IMDb 8.3 · RT 93% · MC 90 · Avg 89.4

#9 · Drama · The Zone of Interest (2023)
Not scored · IMDb 7.5 · RT 93% · MC 98 · Avg 89.3

#10 · Romance · Past Lives (2023)
Quilty 80.4 · IMDb 7.8 · RT 95% · MC 94 · Avg 89.3 · Quilty − Consensus -8.9

#11 · Drama · Minari (2020)
Not scored · IMDb 7.4 · RT 98% · MC 89 · Avg 88.1

#12 · Crime · The Irishman (2019)
Not scored · IMDb 7.8 · RT 95% · MC 94 · Avg 88.0

#13 · Action · Godzilla Minus One (2023)
Not scored · IMDb 8.3 · RT 98% · MC 81 · Avg 87.8

#14 · Musical · La La Land (2016)
Quilty 70.8 · IMDb 8.0 · RT 91% · MC 94 · Avg 87.7 · Quilty − Consensus -16.9

#15 · Drama · The Worst Person in the World (2021)
Not scored · IMDb 7.7 · RT 96% · MC 90 · Avg 87.6

#16 · Drama · Anatomy of a Fall (2023)
Not scored · IMDb 7.7 · RT 96% · MC 90 · Avg 87.6

#17 · Drama · Drive My Car (2021)
Not scored · IMDb 7.5 · RT 97% · MC 91 · Avg 87.5

#18 · Drama · Nomadland (2020)
Not scored · IMDb 7.3 · RT 93% · MC 93 · Avg 87.1

#19 · Animation · The Boy and the Heron (2023)
Not scored · IMDb 7.4 · RT 93% · MC 91 · Avg 87.1

#20 · Drama · The Banshees of Inisherin (2022)
Not scored · IMDb 7.7 · RT 96% · MC 87 · Avg 86.9

#21 · Animation · Soul (2020)
Not scored · IMDb 8.0 · RT 95% · MC 83 · Avg 86.0

#22 · Drama · TÁR (2022)
Not scored · IMDb 7.4 · RT 91% · MC 92 · Avg 86.0

#23 · Drama · Killers of the Flower Moon (2023)
Not scored · IMDb 7.6 · RT 93% · MC 89 · Avg 86.0

#24 · Comedy · Poor Things (2023)
Not scored · IMDb 7.8 · RT 92% · MC 88 · Avg 85.9

#25 · Sci-Fi · Dune: Part Two (2024)
Not scored · IMDb 8.5 · RT 92% · MC 79 · Avg 85.8

#26 · Drama · Sound of Metal (2020)
Not scored · IMDb 7.7 · RT 97% · MC 82 · Avg 85.6

#27 · Action · Top Gun: Maverick (2022)
Not scored · IMDb 8.2 · RT 96% · MC 78 · Avg 85.4

#28 · Comedy · The Holdovers (2023)
Quilty 78.1 · IMDb 7.9 · RT 97% · MC 82 · Avg 85.3 · Quilty − Consensus -7.2

#29 · Drama · A Hero (2021)
Not scored · IMDb 7.5 · RT 97% · MC 82 · Avg 85.2

#30 · Thriller · Decision to Leave (2022)
Not scored · IMDb 7.3 · RT 94% · MC 88 · Avg 84.4

#31 · Sci-Fi · Everything Everywhere All at Once (2022)
Quilty 67.5 · IMDb 7.8 · RT 94% · MC 81 · Avg 84.3 · Quilty − Consensus -16.8

#32 · Drama · Perfect Days (2023)
Not scored · IMDb 7.9 · RT 96% · MC 89 · Avg 84.3

#33 · Action · Mission: Impossible – Dead Reckoning (2023)
Not scored · IMDb 7.7 · RT 96% · MC 81 · Avg 84.2

#34 · Comedy · Palm Springs (2020)
Not scored · IMDb 7.4 · RT 94% · MC 83 · Avg 84.1

#35 · Comedy · Licorice Pizza (2021)
Not scored · IMDb 7.1 · RT 91% · MC 90 · Avg 84.0

#36 · Animation · The Mitchells vs. the Machines (2021)
Not scored · IMDb 7.6 · RT 97% · MC 81 · Avg 83.5

#37 · Drama · Another Round (2020)
Not scored · IMDb 7.7 · RT 93% · MC 81 · Avg 83.2

#38 · Drama · CODA (2021)
Not scored · IMDb 8.0 · RT 94% · MC 75 · Avg 82.3

#39 · Action · John Wick: Chapter 4 (2023)
Not scored · IMDb 7.7 · RT 94% · MC 78 · Avg 82.1

#40 · Animation · Suzume (2022)
Not scored · IMDb 7.6 · RT 96% · MC 77 · Avg 81.9

#41 · Drama · The Fabelmans (2022)
Not scored · IMDb 7.5 · RT 92% · MC 84 · Avg 81.2

#42 · Action · Furiosa: A Mad Max Saga (2024)
Not scored · IMDb 7.8 · RT 90% · MC 79 · Avg 80.9

#43 · War · All Quiet on the Western Front (2022)
Not scored · IMDb 7.8 · RT 90% · MC 76 · Avg 80.6

#44 · Drama · Challengers (2024)
Not scored · IMDb 7.4 · RT 88% · MC 82 · Avg 79.1

#45 · Drama · Society of the Snow (2023)
Not scored · IMDb 7.8 · RT 90% · MC 72 · Avg 79.0

#46 · Action · The Northman (2022)
Quilty 63.8 · IMDb 7.0 · RT 90% · MC 82 · Avg 79.0 · Quilty − Consensus -15.2

#47 · Sci-Fi · Blade Runner 2049 (2017)
Not scored · IMDb 8.0 · RT 88% · MC 81 · Avg 79.0

#48 · Action · The Batman (2022)
Not scored · IMDb 7.8 · RT 85% · MC 72 · Avg 76.9

#49 · Action · Civil War (2024)
Not scored · IMDb 7.1 · RT 81% · MC 75 · Avg 76.4

#50 · Sci-Fi · Nope (2022)
Quilty 78.5 · IMDb 6.8 · RT 83% · MC 77 · Avg 75.6 · Quilty − Consensus +2.9

What the deltas tell us

Where Quilty and the critic consensus disagree, the breakdown almost always points at a single pillar. That is where the model is doing useful work — and also where it is most testable.

Highest deltas (Quilty above or closest to consensus)

  • Nope (2022)

    Quilty 78.5 · Consensus 75.6

    +2.9
  • The Holdovers (2023)

    Quilty 78.1 · Consensus 85.3

    -7.2
  • Past Lives (2023)

    Quilty 80.4 · Consensus 89.3

    -8.9

Lowest deltas (Quilty furthest below consensus)

  • Parasite (2019)

    Quilty 76.1 · Consensus 94.8

    -18.7
  • La La Land (2016)

    Quilty 70.8 · Consensus 87.7

    -16.9
  • Everything Everywhere All at Once (2022)

    Quilty 67.5 · Consensus 84.3

    -16.8
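
The deltas in the two lists above, and the headline figures near the top of the page, are plain arithmetic over the eight scored films. A minimal Python sketch that reproduces them from the values shown on this page follows; the names `SCORED`, `TOTAL_FILMS`, and `delta` are illustrative and are not taken from the production code.

```python
# Scored subset as shown on this page: title -> (Quilty score, consensus average).
SCORED = {
    "Parasite": (76.1, 94.8),
    "Moonlight": (76.8, 91.5),
    "Past Lives": (80.4, 89.3),
    "La La Land": (70.8, 87.7),
    "The Holdovers": (78.1, 85.3),
    "Everything Everywhere All at Once": (67.5, 84.3),
    "The Northman": (63.8, 79.0),
    "Nope": (78.5, 75.6),
}
TOTAL_FILMS = 50  # size of the full cohort listed on the page


def delta(quilty: float, consensus: float) -> float:
    """Quilty minus consensus, rounded to one decimal place like the page."""
    return round(quilty - consensus, 1)


# Per-film delta, then a ranking from highest delta to lowest.
deltas = {title: delta(q, c) for title, (q, c) in SCORED.items()}
ranked = sorted(deltas.items(), key=lambda item: item[1], reverse=True)

coverage = f"{len(SCORED)} / {TOTAL_FILMS}"
average_quilty = round(sum(q for q, _ in SCORED.values()) / len(SCORED), 1)

print("Coverage:", coverage)
print("Average Quilty:", average_quilty)
print("Highest deltas:", ranked[:3])
print("Lowest deltas:", ranked[-3:])
```

Only Nope has a positive delta, so the highest three also include the two films that sit closest below consensus.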

How to read a gap

A negative delta does not mean Quilty disagreed with critics on quality. Story & Craft scores are usually high on this list — the gap typically comes from Commercial Viability or Production Reality, which the critic score does not measure. Expand any row to see which pillar moved.

How we use this dataset internally

This page is also a calibration asset. The scored subset gives the product team a fixed cohort to track when prompts, models, or weights change.

Prompt regression

Re-score the cohort after a prompt change. Material movement on these high-consensus films is a flag worth reviewing.
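
A prompt-regression check of this kind could be sketched as follows. The baseline scores are the ones listed on this page. The re-scored values and the 3.0-point review threshold are invented for illustration, since the page does not define what counts as material movement, and `flag_regressions` is a hypothetical helper rather than part of the production tooling.

```python
# Baseline Quilty scores for the scored cohort, as listed on this page.
BASELINE = {
    "Parasite": 76.1,
    "Moonlight": 76.8,
    "Past Lives": 80.4,
    "La La Land": 70.8,
    "The Holdovers": 78.1,
    "Everything Everywhere All at Once": 67.5,
    "The Northman": 63.8,
    "Nope": 78.5,
}

# Assumption: a move of 3.0 points or more is worth a manual review.
REVIEW_THRESHOLD = 3.0


def flag_regressions(baseline, rescored, threshold=REVIEW_THRESHOLD):
    """Return (title, before, after, change) rows whose score moved by at least
    `threshold` points, largest absolute movement first."""
    flagged = []
    for title, before in baseline.items():
        after = rescored.get(title)
        if after is None:
            continue  # film missing from the new run; handle separately
        change = round(after - before, 1)
        if abs(change) >= threshold:
            flagged.append((title, before, after, change))
    return sorted(flagged, key=lambda row: abs(row[3]), reverse=True)


# Hypothetical re-score after a prompt change (invented numbers).
RESCORED = {**BASELINE, "Parasite": 80.2, "Nope": 77.9, "The Northman": 59.1}

flagged = flag_regressions(BASELINE, RESCORED)
for title, before, after, change in flagged:
    print(f"{title}: {before} -> {after} ({change:+.1f})")
```

With these made-up inputs, The Northman and Parasite are flagged and Nope's 0.6-point move is not.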

Weight tuning

Compare pillar deltas before and after weight changes. The cohort is small enough to inspect by hand and large enough to be useful.
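
To make the before-and-after comparison concrete, here is a small sketch that assumes the overall score is a weighted average of three pillars. The pillar names come from this page; the weighted-average formula, the weights, and the pillar scores are all assumptions for illustration and do not describe how the V5 ensemble actually combines pillars.

```python
PILLARS = ("Story & Craft", "Commercial Viability", "Production Reality")


def composite(pillar_scores, weights):
    """Weighted average of pillar scores (assumed formula).
    Weights are normalized, so they do not need to sum to 1."""
    total_weight = sum(weights[p] for p in PILLARS)
    return sum(pillar_scores[p] * weights[p] for p in PILLARS) / total_weight


def contribution_shift(pillar_scores, old_weights, new_weights):
    """Per-pillar change in contribution to the composite between two weight sets."""
    old_total = sum(old_weights[p] for p in PILLARS)
    new_total = sum(new_weights[p] for p in PILLARS)
    return {
        p: round(
            pillar_scores[p] * (new_weights[p] / new_total - old_weights[p] / old_total),
            2,
        )
        for p in PILLARS
    }


# Hypothetical film: strong craft, weaker commercial outlook (invented numbers).
film = {"Story & Craft": 90.0, "Commercial Viability": 60.0, "Production Reality": 70.0}
old_w = {"Story & Craft": 0.5, "Commercial Viability": 0.3, "Production Reality": 0.2}
new_w = {"Story & Craft": 0.4, "Commercial Viability": 0.4, "Production Reality": 0.2}

before = round(composite(film, old_w), 1)
after = round(composite(film, new_w), 1)
shift = contribution_shift(film, old_w, new_w)

print("Composite before:", before)
print("Composite after:", after)
print("Contribution shift by pillar:", shift)
```

The per-pillar shifts sum to the change in the composite, which makes it easy to see which pillar drove a move after a weight change.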

Red-team divergences

Films where consensus and Quilty disagree the most are the cases we use to pressure-test the model and the caveat copy.

Yearly refresh

New top-consensus films get added each year. The script regenerates the data file from production scores so the page stays current.
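
The regeneration step might look roughly like the sketch below: join the consensus list with whatever Quilty scores exist, leave unscored films as null, and serialize the result. The function name `build_rows`, the field names, and the JSON layout are assumptions; the page does not describe the real script or file format. The three sample films and their values are taken from this page.

```python
import json


def build_rows(consensus, quilty_scores):
    """Join consensus averages with available Quilty scores.
    Films without a score keep quilty=None and delta=None."""
    ordered = sorted(consensus.items(), key=lambda item: item[1], reverse=True)
    rows = []
    for rank, (title, avg) in enumerate(ordered, start=1):
        quilty = quilty_scores.get(title)
        rows.append(
            {
                "rank": rank,
                "title": title,
                "consensus": avg,
                "quilty": quilty,
                "delta": None if quilty is None else round(quilty - avg, 1),
            }
        )
    return rows


# Three films from this page: two scored, one not yet scored.
consensus = {
    "Parasite (2019)": 94.8,
    "Portrait of a Lady on Fire (2019)": 91.4,
    "Nope (2022)": 75.6,
}
quilty_scores = {"Parasite (2019)": 76.1, "Nope (2022)": 78.5}

rows = build_rows(consensus, quilty_scores)
payload = json.dumps(rows, ensure_ascii=False, indent=2)
print(payload)
```

Keeping unscored films in the output with null fields lets the page show "Not scored" without a separate list.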

42 films still need a Quilty score

We will run a controlled benchmark-scoring pass using the V5 orchestrator in historical mode (per release year) and update this page once the cohort is complete. No critic data is fed into the model — this dataset is for evaluation, not training.

Last regenerated: 2026-05-23. Source consensus values: user-supplied graphic (unverified). Quilty scores: production quilty_scores_v5.