Could Frontier Labs’ Internal Agents Already Go Rogue?photo: meetup
Past event · talks-ideas

Could Frontier Labs’ Internal Agents Already Go Rogue?

When
EDT
Where
30 Adelaide East, Industrious Office 12th Floor Common Area30 Adelaide East, 12th Floor, Toronto, ON
This event has already happened.

Why we picked it

This is a ticketed event. You can register [here](https://luma.com/trajec-2gru). ​ Could an AI company’s internal coding agents create a “rogue deployment”, a set of agents running without human knowledge or permission? In February and March 2026, [METR](https://metr.org/?utm_source=luma), the organization behind the [time horizons graph](https://metr.org/time-horizons/), conducted a pilot of a process to assess just that. Anthropic, Google DeepMind, Meta, and OpenAI gave us access to their most capable internal LLMs and a wide range of non-public information. We concluded that, while internal agents plausibly had the means, motive, and opportunity to start small rogue deployments, they didn’t have the means to avoid human detection indefinitely. METR researcher Thomas Broadley explains the process, the six key facts that informed our conclusion, and how we expect risk to evolve over the next few months. ​You can watch a livestream of the talk [here](https://www.youtube.com/@Trajecto

Last verified · Sourced from meetup

Share

You found us through this show.
Let us find the next one for you.

Every Thursday: five picks like this one, chosen by a human who lives in Toronto. Skip the scrolling.

Get Thursday's picks →