Our work

We conduct technical AI safety research to improve how frontier AI systems reason about and represent nonhuman animal welfare, building the benchmarks, evaluations, and open-source tools the field needs.

Projects

Research · Benchmark

MANTA: Do LLMs Hold Their Values?

A multi-turn adversarial benchmark of 1,088 five-turn conversations that escalate from implicit scenarios into sustained social, cultural, economic, pragmatic, and epistemic pressure. It measures what single-turn tests miss: four of seven frontier models changed position in the rankings once their animal-welfare values were placed under pressure.

Figure: MANTA benchmark visualization