Parallax Labs

Open science and tools for white-box auditing of frontier AI agents.

Black-box and white-box methods offer complementary views of agents' behaviour. Both are essential for frontier AI audits.

Built with frontier labs and institutions

Our research and open infrastructure are co-developed with organisations at the forefront of agentic evaluations.

Technion — Israel Institute of Technology

From first principles to frontier-scale audits

We study how agents encode latent beliefs, goals, and plans, and how these representations shape model behaviour at scale.

2026 · Arghal et al.

A Behavioural and Representational Evaluation of Goal-directedness in Language Model Agents

We develop cognitive map probes that recover an agent's approximate beliefs about its environment, and use them to prototype white-box evaluations that explain suboptimal actions in simple 2D gridworlds in light of the agent's imperfect beliefs.

ICML 2026 · Goal-directedness · Belief probing

CODE | DATA | DEMO

2026 · Shapira et al.

Agents of Chaos

A technical report on failure cases of stateful agentic systems built on the OpenClaw harness, observed as they interact in a complex environment that includes productivity and messaging tools, other agents, and multiple human red-teamers.

Tech Report · Agentic eval · Long-horizon

2025 · Summerfield et al.

Lessons from a Chimp: AI ‘Scheming’ and the Quest for Ape Language

In this critical review of recent claims about misaligned goals in LM agents, we argue for a research programme grounded in robust theoretical frameworks and better reporting standards.

In collaboration with