Cambrian-S: Towards Spatial Supersensing in Video
-
Updated
Nov 10, 2025 - Python
Cambrian-S: Towards Spatial Supersensing in Video
G2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning
Visual Spatial Tuning
[ICCV 2025] Enhancing spatial understanding in text-to-Image diffusion models
Official release of "Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning"
[NeurIPS DB 2025] IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering
[arXiv preprint] STRIDE-QA: Visual Question Answering Dataset for Spatiotemporal Reasoning in Urban Driving Scenes
Add a description, image, and links to the spatial-understanding topic page so that developers can more easily learn about it.
To associate your repository with the spatial-understanding topic, visit your repo's landing page and select "manage topics."