Creates an endpoint that extracts text content, images, and documents from different file formats. Can be used in PySpark UDFs for big data retrieval.
Updated Jun 16, 2025 - Python