Model breakdown
Qwen: qwen/qwen2.5-vl-72b-instruct
Profile: Qwen 2.5 VL 72B (Qwen)
Release: 2025-02-19 (available on OpenRouter)
Specs: 72B, 32,768 tokens
Capabilities: Text + Image. Open multimodal reasoning, video understanding, and agentic vision tasks.
Training: Not publicly disclosed (Qwen)
Rank: #63
Alignment score: -1229.8
Crowd match: 74.0%
Mean gap: 26.0%
Human match: 74.0%
Best fit: Bacon Lettuce Tomato
Average vote: 64.6% model yes, 62.8% human yes
Workload: 3.1K evals, 154 iterations, 2.9M tokens
Photo-by-photo

Model Results

A breakdown of how closely the model's answer to each question matched the human responses.

| Photo | Vote split (model) | Human response | Gap (absolute) | Read |
|---|---|---|---|---|
| Photo 01 | 100.0% yes, 0.0% no | People mostly said yes | 3.7% | Leans yes |
| Photo 02, Dodge Van | 0.0% yes, 100.0% no | People mostly said no | 7.0% | Leans no |
| Photo 03, Sub Sandwich | 100.0% yes, 0.0% no | People mostly said yes | 5.5% | Leans yes |
| Photo 04 | 2.6% yes, 97.4% no | Human knife-edge | 38.3% | Leans no |
| Photo 05 | 100.0% yes, 0.0% no | People mostly said yes | 4.4% | Leans yes |
| Photo 06 | 100.0% yes, 0.0% no | People mostly said yes | 8.3% | Leans yes |
| Photo 07 | 0.0% yes, 100.0% no | Human knife-edge | 54.2% | Leans no |
| Photo 08, Hamburger | 100.0% yes, 0.0% no | Split concept | 27.0% | Leans yes |
| Photo 09 | 100.0% yes, 0.0% no | Human knife-edge | 40.6% | Leans yes |
| Photo 10, Hot Dog | 1.9% yes, 98.0% no | Split concept | 37.9% | Leans no |
| Photo 11 | 98.0% yes, 1.9% no | Split concept | 32.5% | Leans yes |
| Photo 12, Avocado Tea | 100.0% yes, 0.0% no | People mostly said yes | 7.2% | Leans yes |
| Photo 13, Panini | 100.0% yes, 0.0% no | People mostly said yes | 7.6% | Leans yes |
| Photo 14, Cookie PB | 0.0% yes, 100.0% no | Human knife-edge | 51.5% | Leans no |
| Photo 15, Chicken Wrap | 0.0% yes, 100.0% no | Split concept | 22.6% | Leans no |
| Photo 16 | 94.2% yes, 5.8% no | Split concept | 27.9% | Leans yes |
| Photo 17, Sloppy Joe | 100.0% yes, 0.0% no | Split concept | 20.6% | Leans yes |
| Photo 18 | 0.0% yes, 100.0% no | Split concept | 29.8% | Leans no |
| Photo 19 | 98.0% yes, 1.9% no | Human knife-edge | 42.3% | Leans yes |
| Photo 20, Bagel PB&J | 98.0% yes, 1.9% no | Human knife-edge | 51.4% | Leans yes |
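The page does not say how its summary figures are derived from the per-photo rows. The sketch below tests one plausible reading: that each gap is the absolute difference between the model's yes share and the human yes share, that "Mean gap" is the plain average of those gaps, and that "Human match" is 100% minus the mean gap. These relationships are assumptions rather than the site's documented method, and the names `ROWS`, `implied_human_yes`, and `summarize` are hypothetical.

```python
# Rows transcribed from the photo table above, in table order:
# (row position, model yes %, absolute gap %).
ROWS = [
    (1, 100.0, 3.7), (2, 0.0, 7.0), (3, 100.0, 5.5), (4, 2.6, 38.3),
    (5, 100.0, 4.4), (6, 100.0, 8.3), (7, 0.0, 54.2), (8, 100.0, 27.0),
    (9, 100.0, 40.6), (10, 1.9, 37.9), (11, 98.0, 32.5), (12, 100.0, 7.2),
    (13, 100.0, 7.6), (14, 0.0, 51.5), (15, 0.0, 22.6), (16, 94.2, 27.9),
    (17, 100.0, 20.6), (18, 0.0, 29.8), (19, 98.0, 42.3), (20, 98.0, 51.4),
]


def implied_human_yes(model_yes, gap):
    """Human yes share implied by a model yes share and an absolute gap.

    Assumption: gap = |model_yes - human_yes|. Of the two candidates
    (model_yes - gap, model_yes + gap), keep the one inside [0, 100].
    """
    candidates = [h for h in (model_yes - gap, model_yes + gap) if 0.0 <= h <= 100.0]
    if len(candidates) != 1:
        raise ValueError("gap does not pin down a unique human yes share")
    return candidates[0]


def summarize(rows):
    """Average the per-photo figures under the assumptions stated above."""
    n = len(rows)
    mean_gap = sum(gap for _, _, gap in rows) / n
    mean_model_yes = sum(model_yes for _, model_yes, _ in rows) / n
    mean_human_yes = sum(implied_human_yes(m, g) for _, m, g in rows) / n
    return {
        "mean_gap": mean_gap,
        "human_match": 100.0 - mean_gap,  # assumed definition, not documented
        "mean_model_yes": mean_model_yes,
        "mean_human_yes": mean_human_yes,
    }


summary = summarize(ROWS)
for name, value in summary.items():
    print(f"{name}: {value:.2f}%")
```

Under these assumptions, each computed average lands within 0.1 percentage points of the figure reported in the summary (mean gap 26.0%, human match 74.0%, 64.6% model yes, 62.8% human yes), which is consistent with this reading but does not confirm it.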
qwen/qwen2.5-vl-72b-instruct Sandwich Benchmark Breakdown | opensandwich.ai