Parallax Labs

Open science and tools for white-box auditing of frontier AI agents.

Black-box and white-box methods offer complementary views of agents' behaviour. Both are essential for frontier AI audits.

Built with frontier labs and institutions

Our research and open infrastructure are co-developed with organisations at the forefront of agentic evaluations.

From first principles to frontier-scale audits

We study how agents encode latent beliefs, goals, and plans, and how these shape model behaviour at scale.

2026 · Shapira et al.

Agents of Chaos

A technical report on failure cases of stateful agentic systems built on the OpenClaw harness, observed as they interact in a complex environment that includes productivity and messaging tools, other agents, and multiple human red-teamers.

Tech Report · Agentic eval · Long-horizon

Meet our Team

An experienced team working on interpretability and frontier AI evaluations.

Mario Giulianelli
Co-founder · Research Lead
Associate Professor, UCL

Gabriele Sarti
Co-founder · R&D Lead
Postdoc, Northeastern

David Bau
Trustee
Assistant Professor, Northeastern

Yonatan Belinkov
Trustee
Associate Professor, Technion

Seraphina Goldfarb-Tarrant
Trustee
Head of AI Safety, Cohere