Human distribution: 7.0% yes, 93.0% no over 656 explicit votes.
Model average distribution: 0.4% yes, 99.6% no across the current model set.
Most aligned model: meta-llama/llama-3.2-11b-vision-instruct
Closest current model: 11.4% yes.
Least aligned models: 29-way tie (openai/o3-pro, moonshotai/kimi-k2.5, x-ai/grok-4-fast, and 26 more), 7.0 point gap.
Legacy GPT-4o baseline: 0.0% yes, with a 7.0 point gap against humans.
Biggest model gap: 7.0 percentage points on this image.
Current classification: People mostly said no
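The gap figures above appear to be plain differences in yes rate, measured in percentage points. That reading is an assumption, but it matches the one case the page spells out: the legacy GPT-4o baseline at 0.0% yes sits 7.0 points from the 7.0% human yes rate. A minimal sketch under that assumption, where the function name `yes_rate_gap` and the dictionary layout are illustrative and not part of the benchmark:

```python
def yes_rate_gap(model_yes_pct: float, human_yes_pct: float) -> float:
    """Absolute difference between two yes rates, in percentage points.

    Assumption: the page's "point gap" is |model yes % - human yes %|.
    """
    return round(abs(model_yes_pct - human_yes_pct), 1)


HUMAN_YES = 7.0  # human yes rate quoted on this page

# Yes rates quoted on this page; the keys are labels for this sketch only.
model_yes = {
    "closest current model": 11.4,
    "legacy GPT-4o baseline": 0.0,
}

gaps = {name: yes_rate_gap(rate, HUMAN_YES) for name, rate in model_yes.items()}

for name, gap in gaps.items():
    print(f"{name}: {gap} point gap")
```

Under this reading, any model that answers 0.0% yes lands exactly 7.0 points from the crowd, which would explain why so many models tie for least aligned on this image.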

Benchmark image 02
Dodge Van
1979 Dodge RAM van "Sandwich"
A late-70s Dodge van is parked here like someone tried to jailbreak the ontology with Detroit sheet metal. It is the purest negative control in the set: all sandwich discourse, zero mayo.
Under development: this benchmark and its published results are provisional, not final.
At a glance
How this photo split the room
Most aligned model: meta-llama/llama-3.2-11b-vision-instruct
Least aligned models: 29-way tie
Why this page matters
This is the compact read on where humans, models, and comments start disagreeing about the same image.
Model spread
How models line up against the crowd
Bars run from more no on the left to more yes on the right. The marker shows the human yes rate.
amazon/nova-lite-v1
amazon/nova-pro-v1
anthropic/claude-opus-4.6
anthropic/claude-sonnet-4.6
baidu/ernie-4.5-vl-28b-a3b
google/gemini-2.5-pro
google/gemini-3-flash-preview
google/gemini-3.1-pro-preview
google/gemma-3-12b-it
google/gemma-3-27b-it
meta-llama/llama-4-maverick
meta-llama/llama-4-scout
minimax/minimax-01
moonshotai/kimi-k2.5
openai/gpt-4.1
openai/gpt-4.1-mini
openai/gpt-4o
openai/gpt-4o-2024-11-20
openai/gpt-4o-mini
openai/gpt-5.4
openai/gpt-5.4-pro
openai/o3
openai/o3-pro
qwen/qwen-2-vl-72b-instruct
qwen/qwen2.5-vl-32b-instruct
qwen/qwen2.5-vl-72b-instruct
qwen/qwen3.5-397b-a17b
x-ai/grok-4-fast
z-ai/glm-4.6v
mistralai/pixtral-large-2411
meta-llama/llama-3.2-11b-vision-instruct
Generated summary for this photo



Selected human comments
meta-llama/llama-3.2-11b-vision-instruct comments
google/gemini-3-flash-preview comments