Back to projects
2026aiFull Stack AI Engineer● In Progress

Agentify.

AI agent discovery platform powered by a 5-agent LangGraph pipeline. Scrapes GitHub, Reddit, and HuggingFace; hybrid BM25 + pgvector retrieval; personalized ranked results with LLMOps observability.

Agentify
01Overview

Agentify is a production-grade discovery platform for developers seeking the best AI agents, Claude skills, MCP tools, and agentic development resources. A 5-agent LangGraph pipeline continuously scrapes GitHub repositories, Reddit threads, HuggingFace model pages, and developer blogs, then parses, embeds, evaluates, and surfaces the most relevant tools — personalized per developer based on their stack, goals, and history. The retrieval layer combines BM25 keyword search with pgvector semantic search, re-ranked by a HuggingFace Text Ranking model for p95 latency under 200ms. An automated eval harness measures Precision@5, NDCG, and MRR against a golden test set. LLMOps observability runs through LangSmith; scheduled ingestion pipelines run on Prefect — all on free-tier infrastructure. The system demonstrates every pattern expected of a mid-level AI engineer in 2026: RAG, multi-agent orchestration, hybrid retrieval, LLM evaluation, and LLMOps observability.

02Problem & Solution

Problem

The AI tooling ecosystem has exploded to 4.3M+ AI repositories on GitHub. There is no canonical, quality-scored, personalized index of agents, skills, and MCP tools — developers waste hours on Discord and Reddit finding what already exists.

Solution

5-agent LangGraph pipeline (scrape → parse → embed → evaluate → surface) with hybrid BM25 + pgvector retrieval and a HuggingFace re-ranker. Quality scoring (0–100) from an automated eval harness gives developers trust signals beyond GitHub star count.

03Highlights
  • 015-agent LangGraph pipeline: scrape → parse → embed → evaluate → surface
  • 02Hybrid BM25 + pgvector retrieval re-ranked with HuggingFace Text Ranking model; p95 latency < 200ms
  • 038 HuggingFace task types integrated across the production pipeline
  • 04LLM eval harness with Precision@5, NDCG, and MRR on a 50+ query golden test set
  • 05LLMOps observability via LangSmith; scheduled ingestion via Prefect — entirely free-tier infrastructure
04Metrics
  • 500+Agents Indexed
  • 5LangGraph Agents
  • 8HF Task Types
  • > 0.80Target P@5
08Stack

frontend

  • Next.js
  • TypeScript

backend

  • LangGraph
  • BM25
  • HuggingFace

database

  • pgvector
  • Supabase

deployment

  • Vercel

other

  • LangSmith
  • Prefect
09Links