PromptBenchFramework43/100
via “multi-level adversarial prompt attack generation”
Microsoft's unified LLM evaluation and prompt robustness benchmark.
Unique: Organizes attacks into a four-level hierarchy (character, word, sentence, semantic) with distinct perturbation strategies at each level, rather than treating all attacks uniformly. Uses attack-specific algorithms (DeepWordBug for character-level, BertAttack for word-level semantic similarity) that preserve semantic meaning while degrading performance.
vs others: More comprehensive than TextAttack because it combines multiple attack granularities in a single framework and includes semantic-level attacks, enabling evaluation of robustness across different perturbation types rather than just word-level substitutions.