A comprehensive system design walkthrough for building a personalized video search system on AWS using machine learning. The architecture combines a text-based search engine (Elasticsearch/OpenSearch, built on inverted indexes) with a visual search engine that uses dual-encoder models: BERT for text queries and ViT for video frame
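The two retrieval paths named above can be illustrated with a small, self-contained sketch. It is a toy under stated assumptions, not the walkthrough's implementation: the function names (`build_inverted_index`, `text_search`, `visual_search`) are hypothetical, the three-dimensional vectors are hand-written stand-ins for embeddings that BERT and ViT encoders would produce, and the two encoders are assumed to map into one shared embedding space. The summary does not say how results from the two engines are merged, so the sketch keeps them separate.

```python
import math
from collections import defaultdict


def build_inverted_index(docs):
    """Map each lowercase token to the set of video ids whose text contains it."""
    index = defaultdict(set)
    for video_id, text in docs.items():
        for token in text.lower().split():
            index[token].add(video_id)
    return index


def text_search(index, query):
    """Return the video ids that contain every query token (boolean AND)."""
    tokens = query.lower().split()
    if not tokens:
        return set()
    result = set(index.get(tokens[0], set()))
    for token in tokens[1:]:
        result &= index.get(token, set())
    return result


def cosine(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def visual_search(query_vec, frame_vecs, k=2):
    """Rank videos by similarity between the query vector and each frame vector."""
    scored = sorted(
        ((cosine(query_vec, vec), vid) for vid, vec in frame_vecs.items()),
        reverse=True,
    )
    return [vid for _, vid in scored[:k]]


# Toy data (assumed): titles for the text path, made-up vectors for the visual path.
titles = {
    "v1": "cooking pasta at home",
    "v2": "home workout routine",
    "v3": "pasta sauce recipe",
}
frame_vecs = {
    "v1": [0.9, 0.1, 0.0],
    "v2": [0.0, 1.0, 0.2],
    "v3": [0.8, 0.0, 0.3],
}
query_vec = [1.0, 0.0, 0.1]  # stand-in for an encoded text query

index = build_inverted_index(titles)
print(sorted(text_search(index, "pasta")))
print(visual_search(query_vec, frame_vecs))
```

In a real deployment the inverted index would live in Elasticsearch/OpenSearch and the vectors would come from the trained encoders; the brute-force cosine ranking here would typically give way to an approximate nearest-neighbor index once the catalog grows.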
