All Episodes
Embodied AI 101 — 44 episodes
Claw-Eval: Toward Trustworthy and Transparent Evaluation of Autonomous Agents
LIBERO-Para: Paraphrase Robustness in Robotic Manipulation
YOR: Your Own Mobile Manipulator for Generalizable Robotics
EgoSim: Egocentric World Simulator for Embodied Interaction Generation
Accelerating Video World Models: From Generative Videos to Real-Time Simulators
From Tokens to Thoughts: Continuous Latent Reasoning in Large Models and Robot Control
CaP-X: Coding Agents for Physical eXecution
DoRA: Weight-Decomposed Low-Rank Adaptation
AI Model Collapse: What Happens When AI Trains on Its Own Outputs
PhAIL: Benchmarking Vision-Language-Action Models on Real-World Bin-Picking
Co-training Large Behavior Models: Data Modalities and Training Strategies for Robot Manipulation
HyDRA: Hybrid Memory for Dynamic Video World Models
WildWorld: Dynamic World Modeling with Actions and Explicit State
Omni-WorldBench: Evaluating Interactive 4D World Models
SIMART: From Static Meshes to Sim-Ready Articulated Models
EgoSim: An Egocentric World Simulator for Embodied Interaction
Digit's New Motor Cortex: Sim-to-Real RL for Whole-Body Control
EgoNav: Diffusion-Based Humanoid Navigation from Human Egocentric Video
CaP-X: A Code-as-Policy Framework for Robot Manipulation
Embodied Intelligence Breakthrough: Generalist AI’s GEN-1 Robots
CaP-X: LMs' First Physical Exam
AI Model Collapse: The Danger of Training on AI-Generated Data
High-Level Automated Reasoning with Qwen2.5-7B
Co-Training Large Behavior Models: Multimodal Data for Robot Manipulation
HyDRA: Hybrid Memory for Dynamic Video World Models
DexWM: Leveraging Human Videos for Dexterous Robot World Models
World Models in Robotics
SIMART: Decomposing Monolithic Meshes into Sim-Ready Articulated Assets
LeWorldModel: A Stable JEPA World Model from Pixels
World Models for Robots: The Next Big Leap?
Harnessing Long-Running AI in Embodied Systems
HoMMI: Learning Whole-Body Mobile Manipulation from Human Demonstrations
TurboQuant: Redefining AI Efficiency with Extreme Compression
DexWM: Learning Dexterous Object Manipulation from Human Videos
FlashAttention-3: Fast & Accurate Attention with Asynchrony & Low-Precision
When AI Trains on Its Own Output: The Model Collapse Problem
MolmoBot: A Vision-Language Model for Zero-Shot Robot Manipulation
LeWorldModel: Stable End-to-End JEPA from Pixels
EgoVerse: An Egocentric Data Ecosystem for Scaling Robot Learning
HSImul3R: Physics-Driven Reconstruction of Human–Scene Interactions
MolmoSpaces: A Large-Scale Open Ecosystem for Robot Navigation and Manipulation
DreamZero: World Action Models Are Zero-Shot Policies
Kinema4D: A 4D Generative Simulator for Embodied AI
VEGA-3D: Teaching multimodal LLMs spatial reasoning through video generation