Capability
Ocr Based Pii Detection And Redaction In Images And Dicom Medical Images
10 artifacts provide this capability.
Want a personalized recommendation?
Find the best match →Top Matches
via “personally identifiable information redaction with multi-pattern detection”
783 GB curated code dataset from 86 languages with PII redaction.
Unique: Multi-pattern PII detection combining regex (emails, IPs, common key formats) with entropy-based heuristics for unknown credential types, applied at scale across 783 GB — most code datasets lack systematic PII redaction
vs others: More comprehensive PII redaction than CodeSearchNet (which has minimal redaction) and more transparent than GitHub-Code (which does not publish redaction methodology)