MoE routing: global rules lose to learned heuristics
A four-Qwen tool-use audit: the dense 27B sits at 5.6% errors, all three routed MoEs cluster at 10-12% regardless of per-expert capacity, fine-tune target, or quantization. Routing costs a roughly fixed reliability budget; post-training redistributes where it gets spent.