Check out Inngest and let your AI agents wear a harness now!
https://www.inngest.com/docs?utm_source=youtube&utm_medium=video&utm_campaign=yt-bycl-3

In this video, we'll dive into the latest hype: Recursive Language Model, why it's actually pretty promising, and how it will change the way we use RAG. 

Check out my latest project: Intuitive AI Academy
We just wrote a new piece on MoE and Distillation!
https://intuitiveai.academy/
limited time code "EARLY" for 40% off yearly plan!


My Newsletter
https://mail.bycloud.ai/

my project: find, discover & explain AI research semantically
https://findmypapers.ai/

My Patreon
https://www.patreon.com/c/bycloud

Recursive Language Models
[Paper] https://arxiv.org/abs/2512.24601

Context Rot
[Blog] https://research.trychroma.com/context-rot 

ChatGPT doesn't use RAG
[Blog] https://manthanguptaa.in/posts/chatgpt_memory/



Try out my new fav place to learn how to code https://scrimba.com/?via=bycloudAI

This video is supported by the kind Patrons & YouTube Members: 
🙏Spam Maj, Alex, Chris LeDoux, DX Research Group, Poof N' Inu, Deagan, Robert Zawiasa, Ryszard Warzocha, Tobe2d, Louis Muk, Akkusativ, Kevin Tai, Mark Buckler, NO U, Tony Jimenez, Ângelo Fonseca, jiye, Anushka, Asad Dhamani, Binnie Yiu, Calvin Yan, Clayton Ford, Diego Silva, Etrotta, Gonzalo Fidalgo, Handenon, Hector, Jake Disco very, Michael Brenner, Nilly K, OlegWock, Daddy Wen, Shuhong Chen, Sid_Cipher, Stefan Lorenz, Sup, tantan assawade, Thipok Tham, Thomas Di Martino, Thomas Lin, Richárd Nagyfi, Paperboy, mika, Leo, Berhane-Meskel, Kadhai Pesalam, mayssam, Bill Mangrum, nyaa, Toru Mon, Lame Plane, Matej Macak, Len Mo, saylikhapekar


[Discord] https://discord.gg/NhJZGtH
[Twitter] https://twitter.com/bycloudai
[Patreon] https://www.patreon.com/bycloud
[Business Inquiries] bycloud@smoothmedia.co
[Profile & Banner Art] https://twitter.com/pygm7
[Video Editor] @Booga04 
[Ko-fi] https://ko-fi.com/bycloudai

ByCloud's resource offers insights, tutorials, and resources for cloud computing enthusiasts, developers, and IT professionals. Readers can learn about cloud architecture, DevOps practices, and cloud-native technologies. With articles, tutorials, and case studies, ByCloud provides  guidance and expertise for leveraging cloud computing to build scalable and resilient applications.

bycloud

Recursive Language Models (RLM) propose treating the LLM context window as an external environment rather than a place to stuff tokens. Instead of loading a full document into context, the root model receives only constant-size metadata and writes Python code to probe a persistent variable holding the content. Sub-calls handle isolated local reasoning over slices, returning only summaries to the root, which acts as an orchestrator. This avoids context rot (performance degradation with longer inputs) and outperforms RAG on dense reasoning benchmarks like ULong, where GPT-5 alone scores under 0.1% while RLM with GPT-5 reaches 58%. RLM is not a replacement for RAG in speed-sensitive workloads, but for high-value tasks requiring global reasoning over large artifacts, it offers a fundamentally better architecture. The paper benchmarks RLM at up to 10 million token inputs without loading them into the model context.