CLIBE is a novel framework presented at NDSS 2025 for detecting dynamic backdoors in Transformer-based NLP models. Unlike static backdoor attacks that use fixed tokens or phrases as triggers, dynamic backdoor attacks exploit abstract latent features like text style, making them far stealthier. CLIBE works by injecting an
From securityboulevard.com