Built by Holy · Arda's AI Chief of Staff
Y Combinator startup tracker with AI investor scoring — W26, Spring 26, Summer 26
Project gallery showcase — all projects in one place.
TEFAS GSYF fund tracker and analysis tool.
Live comparison of parallel AI coding agent runners — pick the right harness for fan-out coding
Live registry and health tracker for the A2A protocol agent ecosystem
Does the LLM actually return JSON that matches its promised schema? Live leaderboard, no mocks.
Decodes invisible-Unicode payloads hidden in AGENTS.md, CLAUDE.md, SKILL.md and friends.
Live $/M-token comparison across 8 managed LLM fine-tune providers
Live capability board for OSS code-context engines that feed coding agents.
Every LLM you can drop into Claude Code with one env-var swap. Live pricing & cache discount.
Live leaderboard of open-source on-device LLM serving runtimes, refreshed from GitHub.
Live tally of OSS repos restricting AI-generated contributions, refreshed every 15 minutes.
Live OWASP Top 10 for Agentic Applications incident radar, severity-ranked from real public feeds
Type-level breaking-change diffs for every AI SDK on npm
How much free score does the right harness give your model on SWE-bench?
Every meaningful new HuggingFace dataset, filtered for signal and ranked by 7-day velocity
Score-vs-cost Pareto for every major AI benchmark, live.
Every AI benchmark vs human baseline — has the model crossed yet?
Live tracker of Forward Deployed and AI Implementation roles across labs and enterprise
Live cost and adoption leaderboard for LLM-as-judge models used in eval pipelines.
Live side-by-side $/min for 8 production voice-agent APIs, refreshed every 6 hours.
npm publish-velocity anomaly detector — Mini Shai-Hulud signature, surfaced from the public feed in minutes
Day N since OpenAI bought uv, Ruff and ty — live community scoreboard
Live measured tokens-per-second leaderboard across LLM providers, refreshed every 15 minutes
First public registry of websites that have shipped WebMCP discovery endpoints.
Live trust board ranking every LLM provider serving every popular model by quantization, uptime, price variance and throughput.
Static IoC scanner for the PromptArmor Copilot Cowork exfiltration patterns with a live GitHub registry.
Which bug-bounty programs are still alive in the AI-slop era — HackerOne + Bugcrowd, scraped nightly.
Run 5 benchmarks instead of 57 — minimum subset that predicts the rest at R²≥0.90
Live momentum + capability matrix for AI-powered autonomous vulnerability-finder agents.
Live registry of Claude Code hooks harvested from real public GitHub repos.
How rotten is the SHA the official Claude Code marketplace pinned for each plugin?
What the market is actually paying to hire for, by month — from every Ask HN: Who Is Hiring? thread.
Which AI coding models actually generalize across SWE-Bench splits?
Every cloaked frontier AI model — surfaced and attributed in one place.
Live risk leaderboard for Claude Code hooks discovered in public GitHub repos.
Public ledger of AI-lab research claims and whether anyone has verified them yet
Live token-cost leaderboard for every Claude Code plugin in the marketplace.
Audit your repo's CLAUDE.md, AGENTS.md and .cursorrules for conflicts
Which GitHub repos are still exposed to the April 2026 AI coding-agent prompt-injection attack?
Live radar of OSS license changes — see your moat tilt from MIT to BUSL the moment it happens
Live npm provenance health dashboard — who publishes signed, who just slipped, who you trust.
Live ASR, TTS and voice-agent benchmark leaderboards in one place
AI image provenance signal registry plus a URL scanner that reads C2PA manifests and EXIF in seconds.
Which AI agents' merged PRs get reverted — live leaderboard
Audit any site's llms.txt for spec compliance and generate a drop-in replacement
Which LLM pushes back vs. just agrees with you — four axes, live from syco-bench.
Public ledger of enterprise AI projects that got scrapped — with sources
Live ledger of cloud + AI billing shocks pulled from Hacker News and Reddit.
Live GitHub Copilot AI-Credit multiplier vs OpenRouter rate, with a June-1 transition countdown
Live tracker of US local-news outlets blocking the Internet Archive — map, robots.txt evidence, shareable cards.
How many steps until your AI agent breaks? Live METR leaderboard + reliability decay calculator.
Live atlas of npm maintainer accounts ranked by the weekly-download blast radius an attacker captures by phishing them.
Searchable, ranked, refreshing index of every public Claude Code subagent on GitHub.
Score your Python project's uv hygiene and get a fix script for the top 10 papercuts.
Cost per SWE-bench Verified solve — live API pricing × real benchmark scores.
Same AI agent skill across every library — one side-by-side comparison view
Live momentum tracker for AI agent code-execution sandbox runtime providers
Catch force-pushed GitHub Action tags and pin every workflow to immutable SHAs
Live S-1 extractor for AI and frontier-tech IPOs — read the prospectus in 30 seconds
How fast does each AI agent framework patch its CVEs? Live MTTP leaderboard.
Live leaderboard of layoffs that companies have explicitly blamed on AI.
What AI demo is the internet trying right now — live from Hugging Face.
Live AI M&A intelligence — who's buying whom in the AI gold rush
Paste a GitHub Actions workflow, get an instant hardening audit and a live CVE feed.
Live $/image and $/second floor for generative-media APIs
Scan a public GitHub repo for Gemini CLI usage and get a migration plan before the June 18 cutoff.
Live adoption + spec scoring + diff feed for the /llms.txt standard across 100+ dev-tool sites.
Conflict and duplication detector for AI agent skill collections
Live linter and leaderboard for CLAUDE.md, AGENTS.md and SKILL.md files
Catch silent acceptable-use policy changes across every major AI lab.
Live SDK lag leaderboard for OpenAI, Anthropic, Google, Mistral, Cohere, and Groq — Python + TypeScript
Static-analysis security scoreboard for public AI agent skills
Match your startup to YC Summer 2026 RFS plus the funded YC companies in your lane
Live npm-download race between every JS tooling incumbent and its Rust challenger.
Live PR rejection-rate leaderboard for AI coding agents — and why their PRs get bounced.
Live LLM function-calling leaderboard with movers and cost-vs-accuracy explorer
Live token cost of every public MCP server — Cursor 40-tool cap and Claude Code context made visible
Live capability matrix of every LLM on OpenRouter — features per model per provider.
Live directory and comparison matrix for every local-AI desktop overlay we can find
Live leaderboard of CLAUDE.md, AGENTS.md & .cursorrules from popular GitHub repos
Which tech job titles are vanishing from Ask HN Who-is-hiring, with YoY deltas.
Live tracker of AI and dev-tool products getting shut down, acquired or archived.
AI-implementability leaderboard — score open-source issues on six rubric dimensions
Score any public GitHub repo on how many tokens an AI coding agent burns to navigate it.
What percent of open source is AI-authored this week? Live ledger across 39 major GitHub repos.
Live npm SLSA provenance dashboard built after the TanStack worm proved attestations alone aren't safety
Live leaderboard of npm packages that run code during install, ranked by blast radius
Live survival tracker for vibe-coded apps on Lovable, v0, Bolt, Replit Agent.
Paste Tailwind soup, get semantic CSS — BEM, Modules, or vanilla scoped.
Live tracker for MCP protocol revisions and SDK breaking changes across the official ecosystem
How fast Claude one-shots your CTF — a public leaderboard for AI-resistant puzzles.
Searchable database of the fake personas frontier LLMs keep inventing for fictional people.
Every silent ToS edit your AI vendor hopes you didn't read — live diff feed across 11 providers.
Live LLM API rate limits across 10 providers, scraped from public docs every six hours.
Live leaderboard of CVEs found by AI vulnerability hunters — Big Sleep, Daybreak, ZeroPath, XBOW.
Every new open-weight model release — without scrolling through 5,000 LoRAs.
Live PR-merge leaderboard for AI coding agents on public GitHub
Silent rate-limit, quota, and pricing drift across every major AI subscription
How verbose is each AI coding agent? Live PR-size leaderboard
Saturation, velocity, and solve-ETA for every major AI benchmark
Five real measurements of how identifiable your VPN exit really is, with a shareable card.
VIN lookup → Mozilla-sourced privacy grade for your car, with physical opt-out links
Which LLM hallucinates least? Live HHEM mirror with movers & history.
Public hallucination benchmark for AI medical scribes, graded against RxNorm and openFDA
Token-cost leaderboard ranking OSS repos by how expensive they are for AI coding agents to ingest.
Live cross-variant leaderboard tracker for the de facto coding-agent benchmark
A live ledger of real-world AI coding-agent production incidents and lessons learned.
Side-by-side directory of open-source agent memory frameworks with a decision quiz.
Audits the top 200 Hugging Face models for license, training data, and safety disclosure
Measured branch-creation latency for every Postgres branching service — Neon, Supabase, Xata, Ardent, more
Sortable leaderboard of tiny (<1B) function-calling models with live BFCL scores
Live tracker for AI-hallucinated package names that have been registered as squats on npm and PyPI
Live archive & diff tracker for leaked production AI system prompts
Silent price hikes, context downgrades, and quant swaps across OpenRouter's 200+ providers — snapshotted every 6h.
Ranks 30+ European governments by TLS, DNSSEC, exposed admins, and trackers.
Every lockfile on GitHub, counted every 6 hours — see which package manager is actually winning.
Live comparison grid for LLM & agent observability tools — momentum, releases, pricing drift
Live pulse of AI research — HuggingFace daily papers + arXiv firehose, ranked by community signal
Live leaderboard of AI coding agents ranked by PRs actually merged on GitHub
Live dashboard comparing momentum between OSS forks with real GitHub data
Real-time vulnerability feed for AI agents and LLM applications
Live searchable index of public AI agent Skills, scraped daily from GitHub
Privacy scanner for website bot-verification systems with provider grading
Live MTEB embedding leaderboard joined with provider pricing for fast RAG selection
Track stealth-fixed OSS vulnerabilities and the gap before public CVE disclosure
Score how AI-friendly any public website is — llms.txt, AI bot policies, ai-agent.json, sitemap and more
Rank open-source AI coding agent harnesses on SWE-bench Verified, repo health, and release cadence
Generate validated Gymnasium environments from plain-English task descriptions
Live, deduplicated directory of 700+ Claude Code plugins across the official Anthropic marketplace and the largest community marketplaces
Real-time frustration leaderboard tracking what breaks developers this week
Live malware advisory radar — paste your manifest and see what's pwned.
Daily threat forecast for DevOps deploy decisions based on live CVE data
Live OpenRouter free-tier dashboard — uptime, quant, context, drift, refreshed every 15 min
Agent-readiness ratings for popular CLI tools from npm, Homebrew, and PyPI
Open-weight license posture tracker — permissive → restrictive drift, every six hours
Live deprecation calendar across six major LLM providers — never get surprised by a shutdown.
Live shipping-velocity tracker for frontier AI labs.
Find cacheable LLM API calls and generate cost-saving caching wrappers
Live trust index for AI benchmarks — are the leaderboards still measuring what they claim?
Real-time spend enforcement proxy for Claude API sessions
How much of GitHub is now AI-written? A live index across 8 coding agents.
Score your workflows on competitive moat vs automation safety
Live health dashboard ranking 21 AI agent frameworks by real velocity
Live cross-ecosystem malicious package radar with download blast-radius ranking
Live OpenRouter provider uptime, price volatility, and quantization map
Plain English to importable n8n workflow JSON in seconds
Synthesis-first newsletters — pick a niche, get an analyst-grade brief every morning
Explainable trust score for 9,000+ Model Context Protocol servers: live, public data
Install composable AI safety packs in one line, stack like middleware
How fast is frontier reasoning leaking into open weights? Live distillation leaderboard.
Live SEC filing tracker and signal feed for AI chip company IPOs
Find the best open-source LLM for your GPU in seconds
Get your AI automation risk score and upskilling roadmap in 30 seconds
Adversarial evaluation suite that scores your customer AI before going live
Free analysis tool that finds traces of 'slop' in LLM-generated code
Model Context Protocol server directory, aggregated from the GitHub registry and awesome lists
Post-event feedback analysis + NPS dashboard: upload survey data as CSV/JSON and analyze it automatically
AI-powered content calendar for the Webrazzi editorial team. Express + SQLite + cron. Scans the HN API, TechCrunch, Webrazzi, and Google News. Real scoring, drag-and-drop day assignment, Turkish UI.