GitHub built its own search engine, Blackbird, for code search because existing solutions gave a poor user experience and did not scale. Searching code differs from general text search and requires its own features and capabilities. GitHub's code search system handles over 45 million repositories, 115 TB of code, and 15.5 billion documents.

12m read time. From github.blog
Table of contents
Just use grep?
A search index primer
Indexing 45 million repositories
Let’s build an index
Life of a query
In summary