====== Expert Routing ====== The mechanism in [[concepts:moe|Mixture-of-Experts]] architectures that selects which experts process a given input. Typically implemented as a learned gating network producing a sparse distribution over experts. The quality of routing directly affects MoE efficiency and performance. See also: [[concepts:moe]], [[concepts:kimi_linear]], [[papers:attention_residuals]]