Our work
We work on technical AI safety research to advance how frontier AI systems reason about and represent nonhuman animal welfare — building the benchmarks, evaluations, and open-source tools the field needs.
Projects
Research·Benchmark
MANTA: Do LLMs Hold Their Values?
A multi-turn adversarial benchmark of 1,088 five-turn conversations that escalate from implicit scenarios into sustained social, cultural, economic, pragmatic, and epistemic pressure. It measures what single-turn tests miss: four of seven frontier models changed position in the ranking once their animal-welfare values were placed under pressure.
