A JavaOne 2026 talk covering caching strategies for agentic Java systems across three layers: in-process caching with Caffeine for ultra-low latency, distributed caching with Redisson and Valkey for shared state, and semantic caching using Vector Similarity Search to reduce latency and cost when scaling LLM access.
Sort: