red-teaming
Fine-tuning AdvPrompter: A Novel AI Method to Generate Human-Readable Adversarial PromptNavigating the Threat Landscape: Understanding Exposure Management, Pentesting, Red Teaming and RBVMImport AI 368: 500% faster local LLMs; 38X more efficient red teaming; AI21's FrankenmodelA faster, better way to prevent an AI chatbot from giving toxic responsesEvaluating AI Model Security Using Red Teaming Approach: A Comprehensive Study on LLM and MLLM Robustness Against Jailbreak Attacks and Future ImprovementsWhy Are Large AI Models Being Red Teamed?Microsoft Releases PyRIT - A Red Teaming Tool for Generative AIWhen is ART useful? When it’s IBM’s Adversarial Robustness Toolbox for AIThis AI Paper from China Sheds Light on the Vulnerabilities of Vision-Language Models: Unveiling RTVLM, the First Red Teaming Dataset for Multimodal AI SecurityRust for Cyber Security and Red Teaming 🦀
All posts about red-teaming