Computer Vision for Spatial and Physical Intelligence

Detailed ICVSS 2026 Programme

LECTURES

Speakers	Syllabus	Titles & Abstracts
Sara Beery Massachusetts Institute of Technology, US	Datasets, Benchmarking, Metrics, Fair Comparisons, Active Testing, AI as a Judge	The Evolving Science of Evaluation
Serge Belongie University of Copenhagen, DK	Object Recognition, Fine-Grained Analysis, Subordinate Category Recognition	Challenges in Fine-Grained Image Analysis
Nicolas Heess Google DeepMind, UK	Robot Learning	Building general purpose robots: Robot learning in the era generative AI
Aleksander Hołyński Google DeepMind & Columbia University, US	3D, AGI	3D in Age of AGI
Dinesh Jayaraman University of Pennsylvania, US	Robot learning, vision-language-action models, resource efficiency	Stages of Robot Learning
Aniruddha Kembhavi Meta, UK	Multimodal AI, Data centric ML, Vision and Language models, Multimodal reasoning	From Molmo to Muse Spark: Data centric recipes for state of the art multimodal models
Ruoshi Liu University of Maryland, College Park and Amazon Frontier AI & Robotics (FAR), US	3D Vision, Generative Models, Robotics	What Can Robots Learn from Large-Scale Internet Visual Data?
Richard Newcombe Meta Reality Labs Research, USA	AI, Robotics, Human Behavior understanding, Wearable Devices	Rethinking Problems in AI, Robotics, and Human Behavior understanding at the Frontier of Wearable Devices
Marco Pavone Stanford University & NVIDIA, US	Physical AI, Reasoning Models, Vision–Language–Action (VLA) Architectures, Robotics, Autonomous Systems	Reasoning Models for Physical AI: From Fundamentals to Real-World Applications
Davide Scaramuzza University of Zurich, CH	physical intelligence, robotics, AI, computer vision, event cameras	Agile Robotics: from Frame Cameras to Neuromorphic Sensors
Cordelia Schmid INRIA, FR	Video Understanding, 3D Vision, World Models, Robotic Manipulation, Imitation Learning	Seeing, Understanding, Doing: How Video Models Are Reshaping Robotics
Carlo Sferrazza University of Texas at Austin & Amazon Frontier AI and Robotics, US	humanoid robots, robot learning, embodied AI, sim2real learning	Humanoid Robot Learning: From Foundations to Visual Perception
Antonio Torralba Massachusetts Institute of Technology, US	Reflections on Computer Vision	What Just Happened? Reflections on Computer Vision since ICVSS 2007
Silvia Zuffi IMATI-CNR, IT	3D Reconstruction and Understanding	Modeling, Capturing, and Understanding Animals in 3D

READING GROUP

Speakers	Syllabus	Rules of Engagement
	Reading Group: Meeting with Mentors	Reading Group: Meeting with Mentors

ESSAY COMPETITION (WITH PRIZE!)

Speakers	Syllabus	Rules of Engagement
Fabio Galasso Sapienza University of Rome, Italy	Essay Competition	Essay Competition

INDUSTRY MEETS STUDENTS

More industries are coming soon!

POSTER SESSION (WITH PRIZE!)

POSTERS SUBMITTED TO ICVSS 2026

Detailed ICVSS 2026 Programme

Web statistics