How I Search 10,000+ AWS GitHub Repos in 10 Seconds
How-To, Tools


via Dev.to, by Ajit

The Problem

Every AWS developer knows this pain: you need a reference architecture or sample code, and you end up with 47 browser tabs open across GitHub, AWS docs, Stack Overflow, and random blog posts from 2019.

GitHub search gives you 2,000 results with zero context. ChatGPT confidently hallucinates repos that don't exist. Stack Overflow answers are outdated.

What I Built

I built an AI-powered search engine that indexes 10,000+ repos from AWS's official GitHub organizations:

- aws-samples: 8,031 repos
- awslabs: 993 repos
- aws-solutions-library-samples: 315 repos
- aws-ia: 234 repos
- aws-solutions: 72 repos

How It Works

The search uses a hybrid approach:

- 70% BM25 (keyword matching): catches exact AWS service names
- 30% FAISS (semantic vector search): understands what you mean, not just what you type

Each repo is classified by Amazon Bedrock (Nova Pro) across 22 metadata fields: solution type, AWS services used, complexity, freshness, setup time estimate, and more.

Auto-indexed twice
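To make the 70/30 weighting concrete, here is a minimal sketch of how a keyword score and a semantic score could be blended into one ranking. It is not the author's implementation. Several things in it are assumptions: the function names are hypothetical, BM25 is written out by hand with common default parameters (`k1=1.5`, `b=0.75`), the FAISS index is replaced by a brute-force cosine similarity over tiny hand-made vectors so the example runs with the standard library only, and both score lists are min-max normalized before weighting (the excerpt does not say how the two scores are brought onto a common scale).

```python
import math
from collections import Counter


def tokenize(text):
    # Naive whitespace tokenizer; a real system would handle punctuation too.
    return text.lower().split()


def bm25_scores(query, docs, k1=1.5, b=0.75):
    """Okapi BM25 score of every document for the query (assumed defaults)."""
    tokenized = [tokenize(d) for d in docs]
    n = len(tokenized)
    avg_len = sum(len(t) for t in tokenized) / n
    # Document frequency: in how many documents each term appears.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    scores = []
    for tokens in tokenized:
        tf = Counter(tokens)
        score = 0.0
        for term in tokenize(query):
            if term not in tf:
                continue
            idf = math.log(1 + (n - df[term] + 0.5) / (df[term] + 0.5))
            denom = tf[term] + k1 * (1 - b + b * len(tokens) / avg_len)
            score += idf * tf[term] * (k1 + 1) / denom
        scores.append(score)
    return scores


def cosine_scores(query_vec, doc_vecs):
    """Brute-force cosine similarity; a stand-in for a FAISS vector index."""
    def cosine(a, b):
        dot = sum(x * y for x, y in zip(a, b))
        norm_a = math.sqrt(sum(x * x for x in a))
        norm_b = math.sqrt(sum(y * y for y in b))
        return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0
    return [cosine(query_vec, v) for v in doc_vecs]


def min_max(scores):
    # Rescale to [0, 1] so the two signals are comparable (an assumption).
    lo, hi = min(scores), max(scores)
    if hi == lo:
        return [0.0] * len(scores)
    return [(s - lo) / (hi - lo) for s in scores]


def hybrid_rank(query, query_vec, docs, doc_vecs, w_keyword=0.7, w_semantic=0.3):
    """Rank docs by 70% keyword score plus 30% semantic score."""
    keyword = min_max(bm25_scores(query, docs))
    semantic = min_max(cosine_scores(query_vec, doc_vecs))
    combined = [w_keyword * k + w_semantic * s for k, s in zip(keyword, semantic)]
    order = sorted(range(len(docs)), key=lambda i: combined[i], reverse=True)
    return [(docs[i], round(combined[i], 3)) for i in order]


# Toy repo descriptions and made-up 3-dimensional "embeddings".
docs = [
    "serverless image resizing with lambda and s3",
    "real time chat on ecs fargate",
    "lambda powertools logging examples",
]
doc_vecs = [[0.9, 0.1, 0.0], [0.1, 0.9, 0.1], [0.6, 0.2, 0.5]]

for doc, score in hybrid_rank("lambda s3", [1.0, 0.0, 0.0], docs, doc_vecs):
    print(score, doc)
```

In this toy run the description that matches both query terms and sits closest to the query vector ranks first, and the one that matches neither ranks last. The heavier keyword weight means an exact service name such as `s3` dominates the ordering, while the semantic share can still lift a result that uses different wording.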

Continue reading on Dev.to
