Qwen3.7-Max: The Agent Frontier
- goldenarm - 32652 seconds ago: The non-hallucination rate in AA-omniscience is SOTA, better than Opus 4.7, Gemini 3.1 Pro and GPT5.5! Congrats to the team.
- briga - 26625 seconds ago: I was getting dangerously close to my weekly Claude Code limit last night, so I had Claude set up Qwen3.6 with llama.cpp and OpenCode. Honestly it's a great (free!) alternative to Claude Code, certainly more than good enough for a lot of smaller, less complex tasks. I'm excited to try this new version. The fact that open-source models are so close to the frontier is very impressive.
- tekacs - 35452 seconds ago: As they start to release more proprietary models, I so wish that they partnered with one of the major US hyperscalers to allow using these models through something US-domiciled.
Totally understand why it may not be reasonable or in their best interest (and that the US is _absolutely_ not doing the same reflexively). But it would be lovely to be able to try these out on production workloads in earnest.
- maxdo - 8822 seconds ago: No Opus 4.7, GPT5.5, or Gemini Flash 3.5 in the benchmarks.
- goyozi - 41826 seconds ago: These are very good numbers. I still don’t get why they don’t compare against the latest competitor versions in these posts; it’s not like we’re not all going to notice.
- tarruda - 37356 seconds ago: Looking forward to more open-weight releases from Qwen, especially 122B and 397B.
- flakiness - 24585 seconds ago: I'm using pi agent and would love to try Qwen models (hosted). What are the good options? The official provider list doesn't include Alibaba. Is OpenRouter etc. fast enough?
(As a reference, DeepSeek v4 is severely throttled on these proxy services.)
- ndom91 - 30447 seconds ago: Is this one of those cases where they'll drop the Hugging Face release a week later? Or do we know for sure that this is staying proprietary?
- eddyaipt - 31021 seconds ago: The pattern I trust most is adding a small verification artifact after every external action. Agents usually fail from silent state drift faster than from lack of reasoning depth.
- jdw64 - 28677 seconds ago: Qwen really hits the sweet spot: it's cheap, fast, and actually good.
- eleventen - 11082 seconds ago: Checking OpenRouter (it's not available yet) and, uh, what's up with the spike in Qwen usage from early April here? https://openrouter.ai/qwen
Is this normal humans kicking the tires on a new model, or a few whales doing serious benchmarks?
- LAC-Tech - 1795 seconds ago: Trying to buy Qwen credits and get an API key is a challenge all in itself. So many site redirects.
- bratao - 37989 seconds ago: It is super strange that in all of the last (3?) releases they keep comparing against older models such as Opus-4.6.
- bsenftner - 37057 seconds ago: Any reports from people using their coding agent(s)?
- aliljet - 23142 seconds ago: Where can a user reasonably host this in an affordable way to access the local LLM revolution?
- XCSme - 35260 seconds ago: Any info on pricing and latency?
- xiaoluolyg - 23686 seconds ago: Congrats to the Qwen teams, remarkable.
- hmaddipatla - 27390 seconds ago: The tokenomics and value for capability, context and latency look like they could deliver a super competitive offer. What would it take for you to switch?
- cft - 22492 seconds ago: Downloading this and cancelling Google Antigravity Pro at the same time:
I had a Google Pro account that I inherited from buying a Pixel 9 XL - it's free for a year after a flagship Pixel phone purchase. After a year they started charging for it, and I tolerated it, because Flash was usable in Antigravity for dumb auxiliary tasks that I did not want to waste GPT/Opus on. It had a separate, generous quota from Gemini 3.1 Pro. Now with Flash 3.5 they combined the quotas with Pro, such that on a Google Pro account you can work 4-5 hours per week in Flash. And by the way, 3.1 Pro is useless for programming compared to Codex/Opus.
- joshjob42 - 21262 seconds ago: I really like what Qwen and a lot of these Chinese labs are doing, but until I can ask their models what happened during the student protests in 1989, or why human rights groups are upset about the Uighurs, and get a straight answer, I'm just not able to trust these models with anything of substance.
- indigodaddy - 23951 seconds ago: Is it multimodal/vision?
- esafak - 33472 seconds ago: Does anyone have experience with the Alibaba Cloud Model Studio that serves these Qwen models?
- howmayiannoyyou - 35313 seconds ago: I can't bring myself to use any model that trains or sends telemetry back to my country's primary competitor/adversary. I don't care how much money is saved.
- dfansteel - 35318 seconds ago: Can anyone check its knowledge base for me? I’m honestly not able to run it, and the Qwen models I can run censor information critical of the Chinese government.
Tiananmen Square is the first place to start.