A deep dive into OpenClaw architecture covering memory management, context window optimization, cost reduction, and tool integration. Key topics include long-term vs short-term memory models, compaction and memory flush configuration, vector memory search via QMD backend, prompt caching to reduce API costs by up to 90% on repeated tokens, and using Composio as an MCP server to manage tools more efficiently. Practical configuration changes are demonstrated in openclaw.json, with commands for monitoring token usage and session hygiene.
ā¢38m watch time
Sort: