Discussion about this post

User's avatar
JP's avatar

Spot on. Ran into this exact problem with the Qwen 3.5 launch. A viral post claimed the new 4B model "matches an 80B model" and people took it at face value. The 80B model is MoE with only 3B active parameters. So it's 4B dense vs 3B active, not 4B vs 80B. Architecture matters more than the headline number. Broke it down properly here: https://reading.sh/your-laptop-is-an-ai-server-now-370bad238461?sk=1cf7a4391e614720ecbd6e9bc3f076a2

No posts

Ready for more?