

Alifatisk 31 days ago | on: Qwen3.6-Plus: Towards real world agents

Why are we so quick to call it deception? Their figure is quite clear. They aren't fiddling with the graph or hiding the labels, they are clearly stating which models it compares against. But I agree on the sentiment that the standard practice should be to bench against the latest SOTA models.

patates 31 days ago

Even if openly stated, why would they be comparing to a previous generation if not for deception?

Laziness? Lack of time? It's not like the latest generation of the SOTA models was released yesterday.
