Human: 79.4% yes, 20.6% no (654 explicit votes)
Model average: 99.0% yes, 1.1% no across the current model set
Most aligned model: meta-llama/llama-3.2-11b-vision-instruct
Closest current model: 68.6% yes
Least aligned models: 29-way tie (openai/o3-pro, moonshotai/kimi-k2.5, x-ai/grok-4-fast, +26 more), 20.6 point gap
Legacy GPT-4o baseline: 100.0% yes, a 20.6 point gap against humans
Biggest model gap: 20.6 percentage points on this image
Current classification: Split concept

Benchmark image 17
Sloppy Joe
Sloppy joe "Sandwich"
A sloppy joe leaks seasoned meat out of a bun with the chaotic confidence of legacy code that somehow still pays revenue. It is clearly sandwich-shaped, even if the change-management story is grim.
Under development: this benchmark and its published results are provisional, not final.
At a glance
How this photo split the room
Most aligned model: meta-llama/llama-3.2-11b-vision-instruct
Least aligned models: 29-way tie
Why this page matters
This is the compact read on where humans, models, and comments start disagreeing about the same image.
Model spread
How models line up against the crowd
Bars run from more no on the left to more yes on the right. The marker shows the human yes rate.
meta-llama/llama-3.2-11b-vision-instruct
baidu/ernie-4.5-vl-28b-a3b
amazon/nova-lite-v1
amazon/nova-pro-v1
anthropic/claude-opus-4.6
anthropic/claude-sonnet-4.6
google/gemini-2.5-pro
google/gemini-3-flash-preview
google/gemini-3.1-pro-preview
google/gemma-3-12b-it
google/gemma-3-27b-it
meta-llama/llama-4-maverick
meta-llama/llama-4-scout
minimax/minimax-01
mistralai/pixtral-large-2411
moonshotai/kimi-k2.5
openai/gpt-4.1
openai/gpt-4.1-mini
openai/gpt-4o
openai/gpt-4o-2024-11-20
openai/gpt-4o-mini
openai/gpt-5.4
openai/gpt-5.4-pro
openai/o3
openai/o3-pro
qwen/qwen-2-vl-72b-instruct
qwen/qwen2.5-vl-32b-instruct
qwen/qwen2.5-vl-72b-instruct
qwen/qwen3.5-397b-a17b
x-ai/grok-4-fast
z-ai/glm-4.6v
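The gap figures on this page appear to be the absolute difference between a model's yes rate and the human yes rate, in percentage points. That definition is an assumption, though it matches the reported 20.6 point gap for a model at 100.0% yes against the 79.4% human yes rate. A minimal sketch of the arithmetic, with `yes_gap` as a hypothetical helper name:

```python
def yes_gap(human_yes: float, model_yes: float) -> float:
    """Absolute difference in yes rates, in percentage points.

    Assumed definition: the page does not state its formula, but this
    reproduces the reported 20.6 point gap.
    """
    return round(abs(model_yes - human_yes), 1)


human_yes = 79.4  # human yes rate reported for this image

# A model answering yes 100.0% of the time (e.g. the legacy GPT-4o baseline).
print(yes_gap(human_yes, 100.0))  # 20.6

# The closest current model, reported at 68.6% yes.
print(yes_gap(human_yes, 68.6))   # 10.8
```

Under this assumption the closest model would sit 10.8 points below the crowd, while every model in the 29-way tie sits 20.6 points above it.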