Capability
2 artifacts provide this capability.
Want a personalized recommendation?
Find the best match →via “contamination-evidence-analysis-and-reporting”
Continuously updated coding benchmark — new competitive programming problems, prevents contamination.
Unique: Provides concrete, evidence-based contamination detection by analyzing performance degradation at model training cutoffs, rather than relying on external audits or data provenance tracking. DeepSeek models' 'stark drop in performance on LeetCode problems released since September 2023' provides clear evidence of contamination that would be missed by static benchmarks.
vs others: More practical and automated than manual data audits because it uses temporal analysis to detect contamination automatically; more reliable than relying on model developers' claims about training data because it provides empirical evidence.
via “contamination detection and reporting”
Building an AI tool with “Contamination Evidence Analysis And Reporting”?
Submit your artifact →curl unfragile.ai/agents.md | sh© 2026 Unfragile. The platform for software for agents.