GitHub built its own search engine, Blackbird, for code search because existing solutions gave a poor user experience and did not scale. Searching code differs from general text search and requires its own features and capabilities. GitHub's code search system covers more than 45 million repositories, 115 TB of code, and 15.5 billion documents.
Table of contents
Just use grep?
A search index primer
Indexing 45 million repositories
Let’s build an index
Life of a query
In summary