Sharing recent progress from Physical Intelligence and why it is an exciting time to push the frontier in general purpose robotics

About Quan Vuong
Quan Vuong is co-founder at Physical Intelligence. His research focuses on generalist robotics and algorithms that enable intelligent behaviors through large scale learning.

About Jost Tobias Springenberg
Tobias is currently a research scientist at Physical Intelligence where he works on bringing AI into the real world and understanding the fundamentals of sequential decision making (e.g. imitation and reinforcement learning). He likes his machine learning models big and his data to be plentiful and focuses most of his research on engineering driven machine learning at scale for robotics.
Before joining Physical Intelligence Tobias was a research scientist Google Deepmind in London within the control team which generally focuses on applications of ML to control for science and robotics. Before that he was a researcher at the University of Freiburg working with the Machine Learning Group and Computer Vision Groups. Tobias holds a BSc. in Cognitive Science from the University of Osnabrueck – from which he still retains an interest in understanding human cognition – and a MSc. in Computer Science from the University of Freiburg.

Recorded at the AI Engineer World's Fair in San Francisco. Stay up to date on our upcoming events and content by joining our newsletter here: https://www.ai.engineer/newsletter

AI Engineer

Physical Intelligence presents their vision language action (VLA) models that enable robots to perform complex, dexterous tasks in real-world environments. Unlike traditional robotics limited to structured factory settings, their approach combines multimodal AI with robotics to create models that can generalize across different robots and environments. They've developed a comprehensive data collection pipeline using human teleoperation to gather training data, scaling from 3,800 hours to over 10,000 hours of successful robot demonstrations. Their latest model, PI-0.5, can perform long-horizon tasks up to 10 minutes in unseen homes, demonstrating true generalization capabilities. The company emphasizes that software intelligence, rather than hardware, is the main bottleneck in robotics scaling.

Robotics: why now? - Quan Vuong and Jost Tobias Springberg, Physical Intelligence