People dressed as a sandwich
PPL · Human knife-edge
Benchmark image 04

Sandwich Costume

Human "Sandwich"

A parade line of humans dressed as bread, cheese, meat, and tomato forms a structurally convincing sandwich that still fails the crucial requirement of being lunch. It is the kind of edge case that makes literalists sound insane and compositionalists sound worse.

Under development: this benchmark and its published results are provisional, not final.

Human
40.9% yes, 59.1% no
Model average
8.7% yes, 91.3% no
Most aligned model
Meta / Llama: meta-llama/llama-3.2-11b-vision-instruct
Least aligned model
Claude: anthropic/claude-sonnet-4.6
At a glance

How this photo split the room

Human distribution
40.9% yes, 59.1% no over 656 explicit votes.
Model average distribution
8.7% yes, 91.3% no across the current model set.
Closest current model
31.4% yes.

meta-llama/llama-3.2-11b-vision-instruct

Least aligned model
59.1-point gap against humans.

anthropic/claude-sonnet-4.6

Legacy GPT-4o baseline
0.0% yes, a 40.9-point gap against humans.
Biggest model gap
59.1 percentage points on this image.
Current classification
Human knife-edge
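
The gap figures above are consistent with a simple absolute difference between a model's yes rate and the human yes rate, measured in percentage points. A minimal Python sketch under that assumption follows. The function name `gap_points` is hypothetical, and the page does not say how its gaps are actually computed.

```python
# Human yes rate for this image, in percent (taken from the page).
HUMAN_YES = 40.9


def gap_points(model_yes: float, human_yes: float = HUMAN_YES) -> float:
    """Absolute difference between two yes rates, in percentage points.

    Assumption: the published gap is |model yes - human yes|.
    """
    return round(abs(model_yes - human_yes), 1)


print(gap_points(0.0))    # openai/gpt-4o, listed gap 40.9
print(gap_points(100.0))  # anthropic/claude-sonnet-4.6, listed gap 59.1
```

Applied to the rounded 31.4% yes rate of meta-llama/llama-3.2-11b-vision-instruct, the same function gives 9.5 rather than the listed 9.4, which suggests the published gaps are derived from unrounded vote shares.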
Why this page matters

This is the compact read on where humans, models, and comments start disagreeing about the same image.

Model spread

How models line up against the crowd

Bars run from more no on the left to more yes on the right. The marker shows the human yes rate.

  • amazon/nova-lite-v1 (Amazon): 100.0% no, 0.0% yes; rank #17, gap 40.9 points
  • amazon/nova-pro-v1 (Amazon): 100.0% no, 0.0% yes; rank #30, gap 40.9 points
  • baidu/ernie-4.5-vl-28b-a3b (Baidu / ERNIE): 100.0% no, 0.0% yes; rank #27, gap 40.9 points
  • google/gemini-3-flash-preview (Gemini): 100.0% no, 0.0% yes; rank #32, gap 40.9 points
  • google/gemini-3.1-pro-preview (Gemini): 100.0% no, 0.0% yes; rank #10, gap 40.9 points
  • google/gemma-3-12b-it (Google): 100.0% no, 0.0% yes; rank #11, gap 40.9 points
  • google/gemma-3-27b-it (Google): 100.0% no, 0.0% yes; rank #15, gap 40.9 points
  • meta-llama/llama-4-maverick (Meta / Llama): 100.0% no, 0.0% yes; rank #26, gap 40.9 points
  • meta-llama/llama-4-scout (Meta / Llama): 100.0% no, 0.0% yes; rank #13, gap 40.9 points
  • minimax/minimax-01 (MiniMax): 100.0% no, 0.0% yes; rank #29, gap 40.9 points
  • openai/gpt-4.1 (GPT / OpenAI): 100.0% no, 0.0% yes; rank #31, gap 40.9 points
  • openai/gpt-4.1-mini (GPT / OpenAI): 100.0% no, 0.0% yes; rank #18, gap 40.9 points
  • openai/gpt-4o (GPT / OpenAI): 100.0% no, 0.0% yes; rank #9, gap 40.9 points
  • openai/gpt-4o-2024-11-20 (GPT / OpenAI): 100.0% no, 0.0% yes; rank #25, gap 40.9 points
  • openai/gpt-4o-mini (GPT / OpenAI): 100.0% no, 0.0% yes; rank #21, gap 40.9 points
  • openai/gpt-5.4 (GPT / OpenAI): 100.0% no, 0.0% yes; rank #20, gap 40.9 points
  • openai/gpt-5.4-pro (GPT / OpenAI): 100.0% no, 0.0% yes; rank #8, gap 40.9 points
  • openai/o3 (GPT / OpenAI): 100.0% no, 0.0% yes; rank #24, gap 40.9 points
  • openai/o3-pro (GPT / OpenAI): 100.0% no, 0.0% yes; rank #3, gap 40.9 points
  • qwen/qwen2.5-vl-32b-instruct (Qwen): 100.0% no, 0.0% yes; rank #14, gap 40.9 points
  • z-ai/glm-4.6v (Z.AI / GLM): 100.0% no, 0.0% yes; rank #19, gap 40.9 points
  • moonshotai/kimi-k2.5 (Kimi / Moonshot): 98.4% no, 1.6% yes; rank #4, gap 39.2 points
  • qwen/qwen2.5-vl-72b-instruct (Qwen): 97.4% no, 2.6% yes; rank #28, gap 38.3 points
  • qwen/qwen-2-vl-72b-instruct (Qwen): 96.0% no, 4.0% yes; rank #12, gap 36.9 points
  • mistralai/pixtral-large-2411 (Pixtral / Mistral): 94.0% no, 6.0% yes; rank #16, gap 34.9 points
  • qwen/qwen3.5-397b-a17b (Qwen): 92.7% no, 7.3% yes; rank #6, gap 33.6 points
  • x-ai/grok-4-fast (Grok / xAI): 91.9% no, 8.1% yes; rank #5, gap 32.8 points
  • google/gemini-2.5-pro (Gemini): 90.1% no, 9.9% yes; rank #7, gap 31.0 points
  • meta-llama/llama-3.2-11b-vision-instruct (Meta / Llama): 68.6% no, 31.4% yes; rank #1, gap 9.4 points
  • anthropic/claude-opus-4.6 (Claude): 2.0% no, 98.0% yes; rank #23, gap 57.1 points
  • anthropic/claude-sonnet-4.6 (Claude): 0.0% no, 100.0% yes; rank #22, gap 59.1 points
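
The 8.7% model average reported earlier can be cross-checked against the per-model yes rates listed above. The sketch below assumes the average is an unweighted mean over the 31 models shown, which the page does not state explicitly. The variable names are illustrative only.

```python
from statistics import mean

# Yes rates in percent, copied from the model list above.
NONZERO_YES = [1.6, 2.6, 4.0, 6.0, 7.3, 8.1, 9.9, 31.4, 98.0, 100.0]
ZERO_YES_MODELS = 21  # number of models listed at 0.0% yes

# Assumption: every model counts equally toward the average.
yes_rates = [0.0] * ZERO_YES_MODELS + NONZERO_YES
model_average = round(mean(yes_rates), 1)

print(len(yes_rates), model_average)  # prints: 31 8.7
```

Under that assumption the listed rates reproduce the published 8.7% yes, and therefore 91.3% no.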

Selected human comments

  • No, it is a photo of a human sandwich.
  • only if they were all smooshed together! but separated? nope, that's ingredients
  • Is that Ranger Igor wearing the tomato? This is clearly a flock of Rangers, not a sandwich!

meta-llama/llama-3.2-11b-vision-instruct comments

  • A sandwich typically consists of two slices of bread with fillings in between. In this image, the bread is replaced by a slice of meat, cheese, and tomato. Therefore, it is not a sandwich.
  • A sandwich is defined as two or more slices of bread with fillings in between. These individuals are wearing costumes that resemble slices of cheese, meat, and tomato, and are not consuming them as a meal.

anthropic/claude-sonnet-4.6 comments

  • Five people costumed as sandwich components (bread, cheese, meat, pickle, tomato, bread) marching in parade formation. Collectively, they ARE a sandwich. Arguably the most structurally accurate sandwich ever documented.
  • Five people dressed as sandwich components (bread, cheese, meat, onion, tomato, bread) in a parade. Collectively, they form a sandwich. Technically assembled, this BLT-ish creation qualifies, even if the ingredients are human.