Capability
Image Extraction And Embedded Image Handling
5 artifacts provide this capability.
Want a personalized recommendation?
Find the best match →Top Matches
Document preprocessing for RAG — parse PDFs, DOCX, images into clean structured elements.
Unique: Extracts images as first-class Element types with metadata preservation, and optionally applies OCR to make image content searchable. Integrates image handling across multiple document formats.
vs others: More integrated than separate image extraction tools; preserves image metadata and position. Less specialized than dedicated image processing libraries but sufficient for document-embedded images.