LECTURES
| Speakers | Syllabus | Titles & Abstracts |
|---|---|---|
| Sara Beery, Massachusetts Institute of Technology, US | Datasets, Benchmarking, Metrics, Fair Comparisons, Active Testing, AI as a Judge | The Evolving Science of Evaluation |
| Serge Belongie, University of Copenhagen, DK | Object Recognition, Fine-Grained Analysis, Subordinate Category Recognition | Challenges in Fine-Grained Image Analysis |
| Nicolas Heess, Google DeepMind, UK | TBA | TBA |
| Aleksander Hołyński, Google DeepMind & Columbia University, US | TBA | TBA |
| Dinesh Jayaraman, University of Pennsylvania, US | Robot learning, vision-language-action models, resource efficiency | Stages of Robot Learning |
| Aniruddha Kembhavi, Meta, UK | Multimodal AI, Data centric ML, Vision and Language models, Multimodal reasoning | From Molmo to Muse Spark: Data centric recipes for state of the art multimodal models |
| Ruoshi Liu, University of Maryland, College Park and Amazon Frontier AI & Robotics (FAR), US | 3D Vision, Generative Models, Robotics | What Can Robots Learn from Large-Scale Internet Visual Data? |
| Richard Newcombe, Meta Reality Labs Research, US | TBA | TBA |
| Marco Pavone, Stanford University & NVIDIA, US | Physical AI, Reasoning Models, Vision–Language–Action (VLA) Architectures, Robotics, Autonomous Systems | Reasoning Models for Physical AI: From Fundamentals to Real-World Applications |
| Davide Scaramuzza, University of Zurich, CH | physical intelligence, robotics, AI, computer vision, event cameras | Agile Robotics: from Frame Cameras to Neuromorphic Sensors |
| Cordelia Schmid, INRIA, FR | Video Understanding, 3D Vision, World Models, Robotic Manipulation, Imitation Learning | Seeing, Understanding, Doing: How Video Models Are Reshaping Robotics |
| Carlo Sferrazza, University of Texas at Austin & Amazon Frontier AI & Robotics (FAR), US | humanoid robots, robot learning, embodied AI, sim2real learning | Humanoid Robot Learning: From Foundations to Visual Perception |
| Antonio Torralba, Massachusetts Institute of Technology, US | TBA | TBA |
| Silvia Zuffi, IMATI-CNR, IT | 3D Reconstruction and Understanding | Modeling, Capturing, and Understanding Animals in 3D |
READING GROUP
| Speakers | Syllabus | Rules of Engagement |
|---|---|---|
| | Reading Group: Meeting with Mentors | Reading Group: Meeting with Mentors |
ESSAY COMPETITION (WITH PRIZE!)
| Speakers | Syllabus | Rules of Engagement |
|---|---|---|
| Fabio Galasso, Sapienza University of Rome, IT | Essay Competition | Essay Competition |
INDUSTRY MEETS STUDENTS
- Ambarella-VISLAB, US
- Artificialy, CH
- Google, US
- Naver Labs, FR
- Onfido, UK
- Panasonic, US
- Qualcomm, US
- Meta - Reality Labs Research, US
- Siemens, DE
- Toshiba Europe, UK
- Toyota Motor Europe, BE
Computer Vision for Spatial and Physical Intelligence