❤️ Check out Lambda here and sign up for their GPU Cloud: https://lambda.ai/papers

📝 The paper is available here:
https://d4rt-paper.github.io/

Our Gaussian Material Synthesis paper:
https://users.cg.tuwien.ac.at/zsolnai/gfx/gaussian-material-synthesis/

Tweet link: https://x.com/GoogleDeepMind/status/2014352808426807527

Our Patreon if you wish to support us: https://www.patreon.com/TwoMinutePapers

🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible:
Adam Bridges, Benji Rabhan, B Shang, Cameron Navor, Christian Ahlin, Eric T, Fred R, Gordon Child, Juan Benet, Michael Tedder, Owen Skarpness, Richard Sundvall, Ryan Stankye, Steef, Taras Bobrovytsky, Tazaur Sagenclaw, Tybie Fitzhugh, Ueli Gallizzi
 
My research: https://cg.tuwien.ac.at/~zsolnai/

Two Minute Papers's resource offers insights, tutorials, and resources for researchers and enthusiasts interested in computer science and artificial intelligence. Readers can learn about  research papers, breakthroughs, and trends in the field of AI. With concise summaries, analysis, and visualizations, Two Minute Papers provides  guidance and expertise for understanding complex research topics in a digestible format.

Two Minute Papers

Google DeepMind, UCL, and Oxford have released D4RT, a single-transformer model capable of 4D scene reconstruction (3D space + time) from video. Unlike previous approaches that chained multiple specialized models for depth, motion, and camera pose, D4RT handles all three simultaneously in one unified architecture. It outputs dynamic point clouds up to 300x faster than prior methods, can track objects through occlusion by leveraging temporal context, and achieves sub-pixel detail by feeding original high-resolution pixels back into the decoder. Trade-offs include lack of photorealistic rendering and the need for an extra meshing step for physics or 3D printing use cases.

DeepMind’s New AI Tracks Objects Faster Than Your Brain