Previous photoPaniniNext photoChicken Wrap
Human 51.5% yes48.5% no Model average 33.3% yes66.7% no Most aligned model qwen/qwen3.5-397b-a17b qwen/qwen3.5-397b-a17b Least aligned models 10-way tie qwen/qwen-2-vl-72b-instructmeta-llama/llama-4-scoutqwen/qwen2.5-vl-32b-instruct+7 more Human distribution 51.5% yes, 48.5% no over 656 explicit votes. Model average distribution 33.3% yes, 66.7% no across the current model set. Closest current model 43.6% yes. Least aligned models 51.5 point gap. Legacy GPT-4o baseline 46.0% yes with a 5.5 point gap against humans. Biggest model gap 51.5 percentage points on this image. Current classification Human knife-edge 

CPBHuman knife-edge
Benchmark image 14
Cookie PB
Cookie and peanut butter "Sandwich"
Two cookies with peanut-butter filling are stacked into a dessert sandwich that feels like it was greenlit by a startup with no adult in finance. It breaks the bread prior while preserving the sandwich geometry almost too cleanly.
Under development: this benchmark and its published results are provisional, not final.
At a glance
How this photo split the room
qwen/qwen3.5-397b-a17b
10-way tie
Why this page matters
This is the compact read on where humans, models, and comments start disagreeing about the same image.
Model spread
How models line up against the crowd
Bars run from more no on the left to more yes on the right. The marker shows the human yes rate.
amazon/nova-lite-v1
baidu/ernie-4.5-vl-28b-a3b
meta-llama/llama-4-maverick
meta-llama/llama-4-scout
mistralai/pixtral-large-2411
openai/gpt-4.1-mini
openai/gpt-4o-mini
qwen/qwen-2-vl-72b-instruct
qwen/qwen2.5-vl-32b-instruct
qwen/qwen2.5-vl-72b-instruct
amazon/nova-pro-v1
z-ai/glm-4.6v
meta-llama/llama-3.2-11b-vision-instruct
minimax/minimax-01
anthropic/claude-opus-4.6
openai/gpt-5.4
openai/gpt-4o-2024-11-20
google/gemma-3-27b-it
moonshotai/kimi-k2.5
google/gemma-3-12b-it
google/gemini-3.1-pro-preview
qwen/qwen3.5-397b-a17b
x-ai/grok-4-fast
openai/gpt-4o
openai/o3-pro
openai/o3
openai/gpt-5.4-pro
anthropic/claude-sonnet-4.6
google/gemini-2.5-pro
google/gemini-3-flash-preview
openai/gpt-4.1
Vote card
Generated summary for this photo



Selected human comments
qwen/qwen3.5-397b-a17b comments
qwen/qwen2.5-vl-72b-instruct comments