Capability
2 artifacts provide this capability.
Want a personalized recommendation?
Find the best match →via “expert-authored frontier mathematics problem curation”
Expert-level math problems created by mathematicians.
Unique: Uses unpublished, expert-authored problems across four mathematical subdisciplines with explicit tiering from undergraduate to research level, plus a separate collection of genuinely unsolved problems — avoiding contamination from public datasets and testing on problems that have resisted professional mathematician attempts
vs others: Differs from MATH and other public benchmarks by using original, unpublished problems authored by expert mathematicians with peer review, providing frontier-level difficulty calibration that public datasets cannot offer
via “competition-mathematics problem corpus construction and curation”
12.5K competition math problems across 7 subjects and 5 difficulty levels.
Unique: Curated from actual mathematics competitions (AMC/AIME) rather than synthetic or textbook problems, ensuring problems require genuine multi-step reasoning and cannot be solved by pattern matching alone. Includes difficulty stratification (1-5) and subject taxonomy across 7 mathematical domains, enabling fine-grained capability analysis. Verified solutions provided by domain experts, not generated by models.
vs others: More rigorous than general math benchmarks (e.g., SVAMP, MathQA) because it uses authentic competition problems with higher reasoning complexity; more comprehensive than single-domain datasets because it spans 7 mathematical subjects with 12,500 problems; more reliable than synthetic benchmarks because problems are human-authored and competition-tested.
Building an AI tool with “Expert Authored Frontier Mathematics Problem Curation”?
Submit your artifact →curl unfragile.ai/agents.md | sh© 2026 Unfragile. The platform for software for agents.