AIThe Decoder2h ago
Researchers train AI model that hits near-full performance with just
Researchers train AI model that hits near-full performance with just 12.5 percent of its experts

Researchers at the Allen Institute for AI and UC Berkeley have built EMO, a mixture-of-experts model whose experts specialize in content domains instead of word types. That lets you strip out three-quarters of the experts while losing only about one percentage point of…
Read full articleSource: The Decoder · Opens in new tab