OpenAI published a paper detailing URL-based data exfiltration mitigations in language model agents. The mitigations use a web crawler to create an allow-list of safe URLs, blocking dynamically created ones. While this increases attack complexity, bypasses remain possible through techniques like mapping individual letters to

5m read time From embracethered.com
Post cover image
Table of contents
What Is the New MitigationAdditional Mitigation IdeasFinal Risk - Thorough Adoption of the MitigationReferences

Sort: