Wikimedia Deutschland's AI project lead Philippe Saade discusses the Wikidata Embedding Project, a vector database built on top of Wikidata's knowledge graph of 119 million items. The project embeds about 30 million of those items (the ones linked to Wikipedia pages) using Jina AI's embedding V3 model with Matryoshka embeddings at 512 dimensions.
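
For readers unfamiliar with Matryoshka embeddings, the sketch below shows the general idea: a model trained this way packs the most useful information into the leading components, so a vector can be cut down to its first 512 values and rescaled to unit length. This is an illustration, not the project's actual pipeline. The 1024-dimension starting size, the random stand-in vectors, and the helper names `truncate_embedding` and `cosine_similarity` are all assumptions made for the example.

```python
import math
import random


def truncate_embedding(vector, dim=512):
    """Keep the first `dim` components and rescale the result to unit length."""
    if dim <= 0 or dim > len(vector):
        raise ValueError("dim must be between 1 and len(vector)")
    head = vector[:dim]
    norm = math.sqrt(sum(x * x for x in head))
    if norm == 0.0:
        # An all-zero prefix cannot be normalized; return it unchanged.
        return list(head)
    return [x / norm for x in head]


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Stand-in vectors: random numbers, NOT real model output.
# The 1024-dimension full size is an assumption for this example.
rng = random.Random(0)
full_a = [rng.gauss(0.0, 1.0) for _ in range(1024)]
full_b = [rng.gauss(0.0, 1.0) for _ in range(1024)]

short_a = truncate_embedding(full_a, dim=512)
short_b = truncate_embedding(full_b, dim=512)

# Similarity search then runs on the shorter vectors,
# which need half the storage of the full-size ones.
similarity = cosine_similarity(short_a, short_b)
```

With vectors from a real Matryoshka-trained model, the similarity computed on the 512-dimension prefixes is expected to stay close to the full-size result; the random vectors here only demonstrate the mechanics.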
TRANSCRIPT