"Lessons from building GitHub code search" by Luke Francl (Strange Loop 2023) - YouTube

In this talk, I'll share lessons we learned building a high-performance code search engine designed to operate at GitHub's scale. GitHub code search is the world's largest publicly available code search engine, with more than 60 million repositories and over 160 TB of content indexed. To build it, we had to turn the content-addressable nature of Git repositories to our advantage. I'll cover the key strategies we used: deduplication and repository similarity to reduce indexing workload, full index compaction to remove deleted documents, multiple levels of sharding, and load balancing. Come discover how we turned code search from a frustrating experience into a powerful feature for our users.
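As a rough illustration of why content addressing helps with deduplication and sharding, here is a minimal Python sketch. It is not GitHub's implementation. The blob ID calculation follows Git's standard object format (SHA-1 over a `blob <size>\0` header plus the content); everything else, including the names `shard_for` and `build_index_plan`, the modulo shard assignment, and the sample repositories, is an assumption made up for the example.

```python
import hashlib
from collections import defaultdict


def git_blob_id(content: bytes) -> str:
    """Return the Git blob object ID (SHA-1) for the given file content."""
    header = b"blob " + str(len(content)).encode("ascii") + b"\0"
    return hashlib.sha1(header + content).hexdigest()


def shard_for(blob_id: str, num_shards: int) -> int:
    """Hypothetical shard assignment: blob ID modulo the shard count."""
    return int(blob_id, 16) % num_shards


def build_index_plan(files, num_shards):
    """Group files by shard and blob ID.

    `files` is an iterable of (repo, path, content) tuples. Identical content
    gets the same blob ID, so it appears once per shard no matter how many
    repositories contain it; only the (repo, path) references accumulate.
    """
    shards = defaultdict(dict)  # shard number -> {blob_id: [(repo, path), ...]}
    for repo, path, content in files:
        blob_id = git_blob_id(content)
        shard = shards[shard_for(blob_id, num_shards)]
        shard.setdefault(blob_id, []).append((repo, path))
    return shards


if __name__ == "__main__":
    # Made-up sample data: a fork shares one file with its upstream repository.
    files = [
        ("octo/app", "README.md", b"hello world\n"),
        ("fork/app", "README.md", b"hello world\n"),
        ("fork/app", "main.rb", b"puts 'hi'\n"),
    ]
    plan = build_index_plan(files, num_shards=4)
    unique_blobs = sum(len(blobs) for blobs in plan.values())
    print(f"{len(files)} files, {unique_blobs} unique blobs to index")
```

In this sketch, three files collapse to two blobs to index, because the fork's README has the same content, and therefore the same blob ID, as the upstream copy.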
Nezir Zahirovic

Contractor: Ruby on Rails (8+ years); MCPD .NET, C#, ASP.NET, CSS, SQL (11 years)
