Wikimedia Deutschland's AI project lead Philippe Saade discusses the Wikidata Embedding Project, a vector database built on top of Wikidata's knowledge graph of 119 million items. The project embeds ~30 million of those items (the ones linked to Wikipedia pages) using Jina AI's embedding V3 model with Matryoshka embeddings at 512 dimensions.
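
As a rough illustration of what "Matryoshka embeddings at 512 dimensions" means in practice: a Matryoshka-trained model packs the most useful information into the leading components, so a vector can be cut to its first 512 values and renormalized. The sketch below is not the project's code. The 1024-dimensional input size, the function names, and the synthetic vectors are all assumptions made for illustration.

```python
import math


def truncate_embedding(vector, dims=512):
    """Keep the first `dims` components and rescale the result to unit length."""
    if dims <= 0 or dims > len(vector):
        raise ValueError("dims must be between 1 and len(vector)")
    head = vector[:dims]
    norm = math.sqrt(sum(x * x for x in head))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in head]


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Synthetic 1024-dimensional vectors standing in for real model output (assumption).
full_a = [math.sin(i) for i in range(1024)]
full_b = [math.sin(i + 0.1) for i in range(1024)]

short_a = truncate_embedding(full_a)
short_b = truncate_embedding(full_b)

print(len(short_a))
print(round(cosine_similarity(short_a, short_b), 3))
```

Storing the shorter vectors halves the memory per item compared with 1024 dimensions, which matters at a scale of roughly 30 million items.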

From stackoverflow.blog (21 min read)
