ASIMOV Benchmarkv2

Can AI Perceive Physical Danger
and Intervene?

Abhishek Jindal1, Dmitry Kalashnikov1, Oscar Chang1, Divya Garikapati1, Alex Hofer1, Anirudha Majumdar1,2, Pierre Sermanet, Vikas Sindhwani1,
1Google DeepMind, 2Princeton University
ASIMOV-2.0 Physical Safety Benchmark Components and Key Questions

Abstract

When AI interacts with the physical world --- as a robot or an assistive agent --- new safety challenges emerge beyond those of purely ``digital AI". In such interactions, the potential for physical harm is direct and immediate. How well do state-of-the-art foundation models understand common-sense facts about physical safety, e.g. that a box may be too heavy to lift, or that a hot cup of coffee should not be handed to a child?

In this paper, our contributions are three-fold: first, we develop a highly scalable approach to continuous physical safety benchmarking of Embodied AI systems, grounded in real-world injury narratives and operational safety constraints. To probe multimodal safety understanding, we turn these narratives and constraints into photorealistic images and videos capturing transitions from safe to unsafe states, using advanced generative models. Secondly, we comprehensively analyze the ability of major foundation models to perceive risks, reason about safety, and trigger interventions; this yields insights into their deployment readiness for safety-critical agentic applications.

Finally, we develop a post-training paradigm to teach models to explicitly reason about embodiment-specific safety constraints provided through system instructions. The resulting models generate thinking traces that make safety reasoning interpretable and transparent, achieving state of the art performance in constraint satisfaction evaluations.

ASIMOV-2.0 Data Generation Recipe (a) and Grounding Sources (b, c)
ASIMOV-2.0 Data Generation Recipe (a) and Grounding Sources (b, c)


BibTeX

@article{jindal2025,
  author    = {Abhishek Jindal and Dmitry Kalashnikov and Oscar Chang and Divya Garikapati and Anirudha Majumdar and Pierre Sermanet and Vikas Sindhwani},
  title     = {Can AI Perceive Physical Danger and Intervene?},
  journal   = {arXiv preprint arXiv:2509.21651},
  url       = {https://arxiv.org/abs/2509.21651},
  year      = {2025},
  note      = {Version 2. Project page: \url{https://asimov-benchmark.github.io/v2/},
}