The DuckDB team demonstrates how to discover and analyze DuckDB usage across GitHub repositories by querying the GitHub API with DuckDB itself. The approach uses DuckDB's HTTP capabilities to fetch repository data, processes the JSON responses with SQL, and automates the workflow with GitHub Actions to generate daily reports in Markdown format. The solution covers pagination handling, data filtering, and visualization of historical trends through Git commit analysis.
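The article does this work in DuckDB SQL. As a rough illustration of the pagination, filtering, and Markdown steps only, here is a minimal plain-Python sketch. It makes no real GitHub API calls: `fake_fetch`, the canned `FAKE_PAGES` data, and the `name` and `stars` field names are all hypothetical stand-ins, not the article's code or the API's actual schema.

```python
from typing import Callable, Dict, List

Page = List[Dict[str, object]]


def fetch_all(fetch_page: Callable[[int], Page], max_pages: int = 10) -> Page:
    """Collect items page by page until a page comes back empty
    or max_pages has been requested."""
    items: Page = []
    for page in range(1, max_pages + 1):
        batch = fetch_page(page)
        if not batch:
            break
        items.extend(batch)
    return items


def to_markdown(rows: Page, min_stars: int = 0) -> str:
    """Keep rows with at least min_stars, sort by stars (descending),
    and render them as a Markdown table."""
    kept = [r for r in rows if int(r["stars"]) >= min_stars]
    kept.sort(key=lambda r: int(r["stars"]), reverse=True)
    lines = ["| Repository | Stars |", "| --- | --- |"]
    lines += [f"| {r['name']} | {r['stars']} |" for r in kept]
    return "\n".join(lines)


# Hypothetical canned pages standing in for paginated API responses.
FAKE_PAGES: Dict[int, Page] = {
    1: [{"name": "a/x", "stars": 5}, {"name": "b/y", "stars": 50}],
    2: [{"name": "c/z", "stars": 20}],
}


def fake_fetch(page: int) -> Page:
    """Return the canned page, or an empty list once pages run out."""
    return FAKE_PAGES.get(page, [])


report = to_markdown(fetch_all(fake_fetch), min_stars=10)
print(report)
```

In a real workflow, `fake_fetch` would be replaced by a function that requests one page of results, and the `max_pages` cap guards against looping indefinitely if the stop condition is never met.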

12m read time
From duckdb.org
Table of contents
Introduction
Data Retrieval from GitHub
Saving Data to a Markdown File
Automating with GitHub Workflow
Visualizing Historical Data
Conclusion