This project maps over 400,000 GitHub projects, clustering them by the stargazers they have in common. Similarity between repositories is measured with Jaccard similarity over data from GitHub's public activity events, with processing done on AWS, and the resulting map visualizes the connections between repositories. The layout was computed with ngraph.forcelayout and rendered with MapLibre. Clusters and labels were generated with the help of various tools, including ChatGPT. The project is open source under the MIT license, and contributions are welcome.
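
To make the similarity step concrete, here is a minimal sketch of Jaccard similarity applied to stargazer sets. It is an illustration under stated assumptions, not the project's actual pipeline: the repository names, user names, the `similarity_edges` helper, and the `threshold` parameter are all hypothetical, and a brute-force pairwise comparison like this would not scale to hundreds of thousands of repositories without further optimization.

```python
from itertools import combinations


def jaccard(a: set[str], b: set[str]) -> float:
    """Return |A ∩ B| / |A ∪ B|, or 0.0 when both sets are empty."""
    union = a | b
    if not union:
        return 0.0
    return len(a & b) / len(union)


# Hypothetical toy data: repository name -> set of users who starred it.
stargazers = {
    "repo/a": {"alice", "bob", "carol"},
    "repo/b": {"bob", "carol", "dave"},
    "repo/c": {"erin"},
}


def similarity_edges(stars: dict[str, set[str]], threshold: float = 0.0):
    """Compare every pair of repositories and keep pairs scoring above threshold.

    Assumption: a simple O(n^2) scan, fine for a toy example only.
    """
    edges = []
    for (r1, s1), (r2, s2) in combinations(stars.items(), 2):
        score = jaccard(s1, s2)
        if score > threshold:
            edges.append((r1, r2, score))
    return edges


edges = similarity_edges(stargazers)
print(edges)
```

Each surviving pair can be treated as a weighted edge between two repositories, which is the kind of graph a force-directed layout takes as input.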