What Do Models Still Suck At? - Peter Gostev, Arena.ai, BullshitBench

Presenters Peter Gostev Source AI Engineer Europe 2026 Beyond the Hype: What Large Language Models Still Struggle With 🤯 We’re constantly bombarded with dazzling charts showing the relentless upward march of AI capabilities. Every new model release feels like a giant leap towards Artificial General Intelligence (AGI), leaving us in a state of awe and perhaps a little anxiety. But are we being too optimistic? Peter Gostev, in a recent talk, dives into the less glamorous side of LLMs, exploring what they don’t do well and why that matters. ...

April 24, 2026 · 4 min