G2DP: Diffusion Planning with Spatio-Temporal Grid Guidance

2026-06-24 • Robotics

Robotics

AI summaryⓘ

The authors developed a new method called G2DP to help self-driving cars plan safer and more accurate paths in busy traffic. Their approach uses detailed maps that predict where other cars might be and how the route should progress, guiding the planner to avoid collisions and keep moving forward smoothly. They tested G2DP on several benchmarks and found it outperformed previous methods, especially in avoiding accidents. This shows that using detailed safety information during planning improves autonomous driving decision-making.

Autonomous drivingDiffusion-based plannersMotion planningDenoisingSpatio-temporal cost volumeFuture occupancy predictionRoute-progress mapClosed-loop evaluationCollision avoidance

Authors

Hang Yu, Ye Jin, Alessandro Canevaro, Julian Schmidt, Julian Jordan, Peizheng Li, Marc Kaufeld, Silvan Lindner, Johannes Betz, Wilhelm Stork

Abstract

In autonomous driving, diffusion-based planners have emerged as a promising paradigm for robust motion planning in dense and interactive traffic, as they can effectively model diverse driving behaviors. However, their inherent stochasticity often requires explicit guidance during denoising to ensure safety and route adherence for robust closed-loop execution. Existing guidance typically relies on sparse, entity-centric geometric queries or post-hoc refinement, yielding limited situational awareness and fragile performance in interactive scenes. To address this issue, we propose G2DP (Grid-Guided Diffusion Planning), a diffusion-based planner that directly enforces dense environmental constraints through inference-time guidance. Specifically, G2DP constructs a differentiable spatio-temporal cost volume by fusing probabilistic future occupancy distributions with a route-progress map. By formulating this volume as a continuous safety energy functional, it injects dense gradients directly into the denoising loop, actively steering trajectory generation toward collision-free and progress-optimal regions. Extensive closed-loop evaluations show that G2DP achieves state-of-the-art performance on nuPlan, outperforming the strongest imitation-learning baseline by +7.2 points in reactive score. It further maintains top scores in zero-shot transfers to interPlan and DeepScenario benchmarks, with collision avoidance improving by +10.15 over the unguided approach on interPlan. These results demonstrate that spatio-temporal cost grids serve as an effective representation for robust guidance in diffusion-based planning.

View PDFOpen arXiv