A real-time product recommendation system combines Snowflake Online Feature Serving with Two-Tower Neural Networks to deliver low-latency personalization. The architecture uses batch-computed item embeddings and real-time user embeddings, with features served at sub-100ms latency via configurable target_lag settings. Delta

6m read timeFrom medium.com
Post cover image
Table of contents
Evolving Beyond Batch-Only ArchitecturesArchitecture OverviewWhat is Online Feature Serving?Implementation Deep DiveGet Sho Tanaka’s stories in your inboxReferences

Sort: