A conference talk covering how to build responsible autonomous AI agents, focusing on safety, fairness, accountability, and security. Key topics include the OWASP Top 10 for agentic applications, agent failure modes (memory poisoning, prompt injection, cross-agent attacks), and a practical demo of a health-tracking agentic system called BioTracker. The talk covers defense-in-depth patterns: tool whitelisting and budgeting via middleware, Microsoft Entra Agent ID for agent identity and zero-trust between agents, independent reviewer agents for output validation, Azure AI Content Safety for groundedness and prompt shields, and OpenTelemetry-based observability. Red teaming with PyRIT and continuous adversarial evaluation in production are also discussed, along with emerging regulations like the EU AI Act.
Sort: