Researchers have introduced a novel AI method called AdvPrompter to generate human-readable adversarial prompts quickly. It enhances human readability, demonstrates excellent attack success rates, and can generate adversarial suffixes using next-token prediction.
Sort: