Strict GitHub advisory benchmarking with OpenRouter-backed finder models.

Hacker News is a community-driven platform for sharing and discussing technology news, startups, and programming-related topics. Through user submissions and comments, Hacker News offers insights into emerging technology trends, industry developments, and entrepreneurial ventures. Readers can participate in discussions, share their insights, and stay informed about the latest advancements in technology and innovation.

Hacker News

N-Day-Bench is a benchmark that evaluates frontier LLMs on their ability to discover real-world N-Day vulnerabilities disclosed after each model's knowledge cutoff. Using a standardized harness via OpenRouter-backed finder models, it prevents reward hacking and measures genuine cybersecurity capability. The benchmark updates monthly with new test cases and model versions. Current top performers include GPT-5.4 (83.93), GLM-5.1 (80.13), Claude Opus 4.6 (79.95), Kimi K2.5 (77.18), and Gemini 3.1 Pro Preview (68.50). All traces are publicly browsable.