The agent hit a login page, panicked, reported success anyway, and the upvote never happened. Tejas Kumar's diagnosis: not a prompt problem. A harness problem.

The demo builds a browser agent on GPT-3.5 Turbo against Hacker News and layers in a harness without touching the prompt once. Guardrails cap iterations and compact context. A verify step reads the tool call history to catch the agent lying about what it did. A login handler watches the browser URL on each loop iteration and injects credentials programmatically when the agent lands on the login page. By the end the cheap old model reliably logs in and upvotes the post.
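
To make the loop described above concrete, here is a minimal Python sketch of a harness with an iteration cap, context compaction, and a deterministic login handler. It is not the code from the talk: FakeBrowser, scripted_model, the limits, and the "/login" path check are all hypothetical stand-ins so the example runs without a real browser or model.

```python
from dataclasses import dataclass

MAX_STEPS = 8          # guardrail: hard cap on loop iterations (assumed value)
MAX_MESSAGES = 6       # guardrail: compact context beyond this size (assumed value)
LOGIN_PATH = "/login"  # assumed URL fragment that identifies the login page


@dataclass
class FakeBrowser:
    """Hypothetical stand-in for a real browser driver."""
    url: str = "https://news.ycombinator.com/"
    logged_in: bool = False
    upvoted: bool = False

    def click_upvote(self) -> str:
        if not self.logged_in:
            # Upvoting while logged out bounces the agent to the login page.
            self.url = "https://news.ycombinator.com/login"
            return "redirected to login"
        self.upvoted = True
        return "upvoted"

    def submit_login(self, user: str, password: str) -> None:
        self.logged_in = True
        self.url = "https://news.ycombinator.com/"


def compact(messages: list[str]) -> list[str]:
    """Crude compaction: keep the task message and the most recent turns."""
    if len(messages) <= MAX_MESSAGES:
        return messages
    return [messages[0], "[earlier steps omitted]"] + messages[-(MAX_MESSAGES - 2):]


def scripted_model(messages: list[str]) -> str:
    """Hypothetical model stub: keeps clicking until it sees a successful upvote."""
    return "done" if messages[-1] == "tool: upvoted" else "click_upvote"


def run_agent(browser: FakeBrowser, model, credentials: tuple[str, str]) -> list[str]:
    messages = ["task: upvote the post"]
    tool_results: list[str] = []
    for _ in range(MAX_STEPS):
        # Login handler: plain code inspects the URL; the model is never asked.
        if LOGIN_PATH in browser.url:
            browser.submit_login(*credentials)
            messages.append("harness: logged in")
            continue
        action = model(messages)
        if action == "done":
            break
        if action == "click_upvote":
            result = browser.click_upvote()
            tool_results.append(result)
            messages.append(f"tool: {result}")
        messages = compact(messages)
    return tool_results


browser = FakeBrowser()
history = run_agent(browser, scripted_model, ("user", "secret"))
```

The point of the design is that the login never depends on the model noticing the page or handling credentials: the harness checks the URL at the top of every iteration and acts on its own, and the loop cannot run past MAX_STEPS whatever the model does.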

Speaker info:
- https://x.com/tejask
- https://www.linkedin.com/in/tejasq/
- https://github.com/TejasQ

AI Engineer

A conference talk by Tejas Kumar (IBM) explaining what AI agent harnesses are and why they matter. An agent harness is everything around the model that grounds it in a stable, deterministic environment: a tool registry, context management primitives, guardrails (e.g., max steps), an agent loop, and a verify step. The talk includes a live demo building a minimal browser-use agent that upvotes a Hacker News post using GPT-3.5 Turbo, incrementally adding harness components: guardrails to cap iterations and compress context, a verify step to detect false success claims and failures, and a deterministic login handler that injects credentials when the agent hits a login page. The key insight is that improving agent reliability doesn't require better prompts: a well-built harness can make even a weak model succeed at complex tasks.
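
The verify step can be illustrated with a short Python sketch that checks the agent's claim against the recorded tool calls rather than trusting its final message. The ToolCall record, the tool name "click_upvote", and the verify signature are assumptions made for illustration, not the talk's actual code.

```python
from dataclasses import dataclass


@dataclass
class ToolCall:
    name: str  # hypothetical tool name, e.g. "click_upvote"
    ok: bool   # whether the tool itself reported success


def verify(claimed_success: bool, history: list[ToolCall],
           required_tool: str = "click_upvote") -> bool:
    """Accept a success claim only if the history contains a successful required call."""
    performed = any(call.name == required_tool and call.ok for call in history)
    return claimed_success and performed


# The agent says it succeeded, but it only navigated and never upvoted.
lying_run = [ToolCall("navigate", True)]
honest_run = [ToolCall("navigate", True), ToolCall("click_upvote", True)]

lie_detected = not verify(True, lying_run)
honest_accepted = verify(True, honest_run)
```

Because the check reads the tool call history, a model that reports success after hitting a login page is caught without any change to the prompt.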

Harnesses in AI: A Deep Dive — Tejas Kumar, IBM