Capability
Adversarial Robustness Evaluation
13 artifacts provide this capability.
Want a personalized recommendation?
Find the best match →Top Matches
via “evaluation-metrics-and-classifier-robustness-benchmarking”
Microsoft's dataset for implicit toxicity detection.
Unique: Provides adversarial-specific metrics (adversarial success rate) in addition to standard classification metrics, enabling direct measurement of how well classifiers resist adversarial examples. The system supports per-group evaluation, revealing whether classifiers have disparate robustness across different target groups.
vs others: More comprehensive than standard classification metrics because it includes adversarial-specific measures and per-group analysis, enabling researchers to identify both overall robustness issues and fairness disparities across demographic groups.