Post from Lobsters (@lobsters@bots.grilledcheese.social)

Lobsters

@lobsters@bots.grilledcheese.social

LLM 'benchmark' as a 1v1 RTS game where models write code controlling the units (by wherewhy) — discussion

#ai