Computer Vision for Spatial Intelligence

Detailed ICVSS 2025 Programme

LECTURES

Speakers	Syllabus	Titles & Abstracts
Daniel Cremers Technical University of Munich, DE	3D Computer Vision	3D Computer Vision in the Age of Deep Learning
Dima Damen University of Bristol, UK	Video Understanding, Multi-modal Learning, Egocentric Videos, Hand-Object Interactions	Video Understanding Out of the Frame - an Egocentric Perspective
Andrew Davison Imperial College, UK	Representations for real-time reconstruction and mapping; new computing architectures and sensors; distributed computation	From SLAM to Spatial AI
Christoph Feichtenhofer FAIR, Meta, US	Visual recognition, instance segmentation, vision-language models, video understanding	Demystifying the impact of data for image and video understanding
Vittorio Ferrari Meta Reality Labs, CH	generative models, diffusion, transformers, controllable generation	Generating images and videos with diffusion models
Leonidas Guibas Stanford University, US	Foundation Models, Scene Understanding	Foundation Models for 3D / 4D Scene Understanding and Content Creation
Phillip Isola Massachusetts Institute of Technology, US	Representational Alignment	Understanding Representational Alignment in Vision and Beyond
Ishan Misra GenAI, Meta, US	video generation, flow matching, diffusion, foundation models, multimodal learning	Foundation models for video generation, editing, and personalization.
Matthias Niessner Technical University of Munich, DE	AI Avatars	Photo-realistic AI Avatars
Gerard Pons-Moll University of Tübingen, DE	3D humans, neural implicit fields, 3D Gaussian Splats, generative models, embodiment, LLMs, multi-view diffusion, human-object interaction, humans and 3D scenes.	Real Virtual Humans: The Path from Statistical Models to Neural Avatars that Act and Behave
Fatih Porikli Australian National University, Qualcomm, AUS & US	Generative AI on the Edge	Onboarding Generative AI on the Edge
Stefano Soatto Amazon and University of California Los Angeles, US	Reasoning, LLM, World Models	Emergence of Reasoning in Language and World Models
Gul Varol École des Ponts ParisTech, FR	humans, generative models, 3D and language, human motion, hands, sign language	Dynamic Humans: Generating 3D Human Motion with Language
Andrea Vedaldi University of Oxford, UK	Spatial Intelligence, Visual Geometry, 3D Generative AI	Spatial Intelligence: The New Frontier of Computer Vision

READING GROUP

Speakers	Syllabus	Rules of Engagement
Stefano Soatto Amazon and University of California Los Angeles, US	Reading Group: Meeting with Mentors	Reading Group: Meeting with Mentors

ESSAY COMPETITION (WITH PRIZE!)

Speakers	Syllabus	Rules of Engagement
Fabio Galasso Sapienza University of Rome, Italy	Essay Competition	Essay Competition

INDUSTRY MEETS STUDENTS

Industrial Panel

POSTER SESSION (WITH PRIZE!)

POSTERS SUBMITTED TO ICVSS 2025

Detailed ICVSS 2025 Programme

Web statistics