something kinda cute that new Qwen does a lot is initially hallucinate, then say it doesn’t know something, then eventually realizes it actually does, and corrects itself major progress on mitigating hallucinations, this is hard for non-reasoning models
feedsImage
4o, 4.1, and V3 barrel ahead with their hallucinations. Sonnet (non-thinking) just knows it
7,02K