Please subscribe to our YouTube channel @ https://www.youtube.com/@DevoxxForever
Subscribe to LinkedIn @ https://www.linkedin.com/company/voxxed-days-amsterdam
Follow us on Twitter @ https://twitter.com/voxxedamsterdam

Prompt injection is one of the fastest-growing risks in AI applications. In this lightning talk, we’ll cover some practical techniques attackers use to manipulate LLMs, from simple instruction overrides to more advanced escalation tricks.

You’ll get a quick, no-nonsense overview of how these attacks work and why they matter, plus a set of defenses you can start applying right away.

Devoxx

A conference talk covering 5.5 prompt injection techniques against LLMs, demonstrated live. Techniques include direct instruction override, structured output attacks (requesting JSON to pressure compliance), roleplaying exploits, multi-turn manipulation (gradually extracting data across chat turns), payload splitting (combining fragments in one prompt), and delimiter confusion (hiding instructions in text). The speaker shows these attacks against GPT-3.5, GPT-4, and GPT-4.1, noting newer models are stronger but not immune, and that combining techniques can bypass even hardened models. Mitigation strategies covered include scanning CLAUDE.md/MCP files for injections, enforcing structured output, using input/output guardrails (LLM-as-a-judge), limiting user input scope, and keeping strict system messages. The speaker warns that blindly downloading skills files or MCP servers is a major attack vector.

5,5 prompt injection techniques in 15 minutes by Brian Vermeer