LLM inference infrastructure for a systems audience (via nathan) — discussion

#ai #distributed