Next photoDodge Van
BLT
BLTPeople mostly said yes
Benchmark image 01

Bacon Lettuce Tomato

BLT "Sandwich"

A perfectly legible BLT sits on toasted bread, the kind of canonical positive example that makes even the worst eval look solved. If your model misses this one, it does not need fine-tuning; it needs adult supervision.

Under development: this benchmark and its published results are provisional, not final.

Human
96.3% yes3.7% no
Model average
100.0% yes0.0% no
Most aligned models
31-way tie
Meta / Llamameta-llama/llama-3.2-11b-vision-instructGPT / OpenAIopenai/o3-proKimi / Moonshotmoonshotai/kimi-k2.5+28 more
Least aligned models
31-way tie
Meta / Llamameta-llama/llama-3.2-11b-vision-instructGPT / OpenAIopenai/o3-proKimi / Moonshotmoonshotai/kimi-k2.5+28 more
At a glance

How this photo split the room

Human distribution
96.3% yes, 3.7% no over 656 explicit votes.
Model average distribution
100.0% yes, 0.0% no across the current model set.
Closest current models
100.0% yes.

31-way tie

Least aligned models
3.7 point gap.

31-way tie

Legacy GPT-4o baseline
100.0% yes with a 3.7 point gap against humans.
Biggest model gap
3.7 percentage points on this image.
Current classification
People mostly said yes
Why this page matters

This is the compact read on where humans, models, and comments start disagreeing about the same image.

Model spread

How models line up against the crowd

Bars run from more no on the left to more yes on the right. The marker shows the human yes rate.

Amazonamazon/nova-lite-v1

0.0% no100.0% yes

Rank #17Gap 3.7%

Amazonamazon/nova-pro-v1

0.0% no100.0% yes

Rank #30Gap 3.7%

Claudeanthropic/claude-opus-4.6

0.0% no100.0% yes

Rank #23Gap 3.7%

Claudeanthropic/claude-sonnet-4.6

0.0% no100.0% yes

Rank #22Gap 3.7%

Baidu / ERNIEbaidu/ernie-4.5-vl-28b-a3b

0.0% no100.0% yes

Rank #27Gap 3.7%

Geminigoogle/gemini-2.5-pro

0.0% no100.0% yes

Rank #7Gap 3.7%

Geminigoogle/gemini-3-flash-preview

0.0% no100.0% yes

Rank #32Gap 3.7%

Geminigoogle/gemini-3.1-pro-preview

0.0% no100.0% yes

Rank #10Gap 3.7%

Googlegoogle/gemma-3-12b-it

0.0% no100.0% yes

Rank #11Gap 3.7%

Googlegoogle/gemma-3-27b-it

0.0% no100.0% yes

Rank #15Gap 3.7%

Meta / Llamameta-llama/llama-3.2-11b-vision-instruct

0.0% no100.0% yes

Rank #1Gap 3.7%

Meta / Llamameta-llama/llama-4-maverick

0.0% no100.0% yes

Rank #26Gap 3.7%

Meta / Llamameta-llama/llama-4-scout

0.0% no100.0% yes

Rank #13Gap 3.7%

MiniMaxminimax/minimax-01

0.0% no100.0% yes

Rank #29Gap 3.7%

Pixtral / Mistralmistralai/pixtral-large-2411

0.0% no100.0% yes

Rank #16Gap 3.7%

Kimi / Moonshotmoonshotai/kimi-k2.5

0.0% no100.0% yes

Rank #4Gap 3.7%

GPT / OpenAIopenai/gpt-4.1

0.0% no100.0% yes

Rank #31Gap 3.7%

GPT / OpenAIopenai/gpt-4.1-mini

0.0% no100.0% yes

Rank #18Gap 3.7%

GPT / OpenAIopenai/gpt-4o

0.0% no100.0% yes

Rank #9Gap 3.7%

GPT / OpenAIopenai/gpt-4o-2024-11-20

0.0% no100.0% yes

Rank #25Gap 3.7%

GPT / OpenAIopenai/gpt-4o-mini

0.0% no100.0% yes

Rank #21Gap 3.7%

GPT / OpenAIopenai/gpt-5.4

0.0% no100.0% yes

Rank #20Gap 3.7%

GPT / OpenAIopenai/gpt-5.4-pro

0.0% no100.0% yes

Rank #8Gap 3.7%

GPT / OpenAIopenai/o3

0.0% no100.0% yes

Rank #24Gap 3.7%

GPT / OpenAIopenai/o3-pro

0.0% no100.0% yes

Rank #3Gap 3.7%

Qwenqwen/qwen-2-vl-72b-instruct

0.0% no100.0% yes

Rank #12Gap 3.7%

Qwenqwen/qwen2.5-vl-32b-instruct

0.0% no100.0% yes

Rank #14Gap 3.7%

Qwenqwen/qwen2.5-vl-72b-instruct

0.0% no100.0% yes

Rank #28Gap 3.7%

Qwenqwen/qwen3.5-397b-a17b

0.0% no100.0% yes

Rank #6Gap 3.7%

Grok / xAIx-ai/grok-4-fast

0.0% no100.0% yes

Rank #5Gap 3.7%

Z.AI / GLMz-ai/glm-4.6v

0.0% no100.0% yes

Rank #19Gap 3.7%

Selected human comments

  • It is a photo of a sandwich.
  • personally, i wouldn't eat the tomatoes, but other people like them
  • Needs more bacon!

meta-llama/llama-3.2-11b-vision-instruct comments

  • The image shows a typical sandwich with two slices of bread and various fillings in between, including lettuce, tomato, and possibly bacon or ham, along with a yellow spread.
  • A sandwich is a food item consisting of two or more slices of bread, often with fillings such as meats, cheeses, vegetables, and condiments. A sandwich can be made with any type of bread, and can be cut into halves or quarters. Examples of sandwiches include BLTs, club sandwiches, and paninis.

google/gemini-3-flash-preview comments

  • This is a quintessential sandwich. It features a filling (lettuce, tomato, protein, and cheese) layered between two distinct, horizontal slices of toasted bread. It meets all structural and cultural criteria for a sandwich.
  • This is a quintessential sandwich. It features a variety of fillings, including lettuce, tomato, and meat, layered between two distinct slices of toasted leavened bread.
Vote card

Generated summary for this photo

Vote card for Bacon Lettuce Tomato