Capability
Text Classification Dataset Sampling And Filtering
4 artifacts provide this capability.
Top Matches
Dataset by m-a-p. 555,725 downloads.
Unique: Uses the Hugging Face datasets library's native .filter() and .select() methods to extract subsets in memory or in streaming mode, without downloading the full corpus; seeded sampling gives deterministic, reproducible splits across experiments.
vs others: More flexible than static benchmark datasets (ImageNet, MNIST) because filtering is dynamic and user-defined; allows faster iteration than manual annotation while staying reproducible through versioned dataset snapshots.
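Example: the filter-then-sample workflow described for this dataset can be illustrated with a small, library-independent sketch. It is not the dataset's own code and does not call the Hugging Face API; the record shape, the helper names filter_stream and seeded_sample, and the toy corpus are all assumptions made for illustration. It shows a lazy filter over a stream of records followed by a seeded single-pass (reservoir) sample, so the full corpus never has to be held in memory and the same seed always yields the same subset.

```python
import random
from typing import Any, Callable, Dict, Iterable, Iterator, List

# Hypothetical row shape for a text classification record: {"text": str, "label": int}
Record = Dict[str, Any]


def filter_stream(records: Iterable[Record],
                  keep: Callable[[Record], bool]) -> Iterator[Record]:
    """Lazily yield only the records that satisfy the predicate."""
    return (rec for rec in records if keep(rec))


def seeded_sample(records: Iterable[Record], k: int, seed: int) -> List[Record]:
    """Reservoir-sample up to k records in one pass.

    A private random.Random(seed) makes the result deterministic
    for a given seed and input order.
    """
    rng = random.Random(seed)
    reservoir: List[Record] = []
    for i, rec in enumerate(records):
        if i < k:
            # Fill the reservoir with the first k records.
            reservoir.append(rec)
        else:
            # Replace an existing slot with probability k / (i + 1).
            j = rng.randint(0, i)  # both endpoints inclusive
            if j < k:
                reservoir[j] = rec
    return reservoir


# Toy corpus standing in for a real dataset (assumption, not real data).
corpus = [{"text": f"example {i}", "label": i % 3} for i in range(100)]


def is_label_one(rec: Record) -> bool:
    return rec["label"] == 1


# Two independent runs with the same seed produce the same subset.
subset_a = seeded_sample(filter_stream(corpus, is_label_one), k=5, seed=42)
subset_b = seeded_sample(filter_stream(corpus, is_label_one), k=5, seed=42)
```

With the Hugging Face datasets library, the rough counterparts would be .filter() with a predicate, .shuffle(seed=...) for seeded ordering, and .select() with a range of indices on an in-memory dataset; check the library documentation for the streaming equivalents before relying on them.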