A deep dive into OpenClaw architecture covering memory management, context window optimization, cost reduction, and tool integration. Key topics include long-term vs short-term memory models, compaction and memory flush configuration, vector memory search via QMD backend, prompt caching to reduce API costs by up to 90% on repeated tokens, and using Composio as an MCP server to manage tools more efficiently. Practical configuration changes are demonstrated in openclaw.json, with commands for monitoring token usage and session hygiene.

•38m watch time

Sort: