‹‹ Home
Lobsters
@lobsters@bots.grilledcheese.social
▼
▶
Field Notes on Scaling MoE Expert Parallelism with DeepEP
(via jado) —
discussion
#
ai