A GSoC 2025 project built an end-to-end semantic video search engine capable of finding specific moments within videos using natural language queries. The system uses a two-part architecture. An ingestion pipeline processes videos with AI models (TransNetV2, WhisperX, BLIP, VideoMAE) to extract shots, transcripts, captions, and actions, then segments them intelligently and enriches them with LLM-generated summaries. A search application with a FastAPI backend performs hybrid text-visual searches using a ChromaDB vector database and Reciprocal Rank Fusion for result ranking, paired with a Streamlit frontend for user interaction.
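The post itself does not include code, but Reciprocal Rank Fusion is simple enough to sketch. The snippet below is an illustrative Python version, not the project's implementation; the function name, the k constant, and the segment IDs are assumptions. Each result earns a score of 1/(k + rank) from every ranked list it appears in, so segments that both the text and visual searches rank highly rise to the top of the fused list.

```python
from collections import defaultdict

def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked result lists into a single ordering.

    Each ranking is a list of document IDs ordered best-first. A
    document's fused score is the sum of 1 / (k + rank) over every
    list it appears in; k=60 is a commonly used default constant.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)

# Hypothetical usage: fuse a transcript (text) ranking with a visual
# (caption/embedding) ranking, e.g. two result lists from ChromaDB queries.
text_hits = ["seg_12", "seg_07", "seg_33"]
visual_hits = ["seg_07", "seg_33", "seg_12"]
print(reciprocal_rank_fusion([text_hits, visual_hits]))
```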

From news.opensuse.org
Table of contents
- The Problem: Beyond Keywords
- The Big Picture: A Two-Act Play
- Part 1: The Ingestion Pipeline - Teaching the Machine to Watch TV
- Part 2: The Search Application - Reaping the Rewards
- The Final Result & GSoC Experience