Here are
28 public repositories
matching this topic...
Conversion of PDF documents to structured Markdown, optimized for Retrieval Augmented Generation (RAG) and other NLP tasks. Extract text, tables, and images with preserved formatting for enhanced information retrieval and processing.
Updated
Nov 22, 2024
Python
Headless document conversion and printing using LibreOffice or Microsoft Office
Updated
Sep 2, 2025
Python
Everything related to Bookalope and its REST API.
Updated
May 16, 2021
Python
Convert PowerPoint or LibreOffice Impress files to Beamer-friendly, Pandoc-style markdown
Updated
Feb 10, 2020
Python
To generate tufte-book style document for Stanford Encyclopedia of Philosophy (SEP) entries.
Updated
May 11, 2018
Python
Cairo-inspired dependency-free replacement for casting SVG to PNG or PDF format
Updated
Jun 14, 2025
Python
Convert any document format into LLM-ready data format (markdown) with advanced intelligent document processing capabilities powered by pre-trained models.
Updated
Aug 14, 2025
Python
A set of utility classes and functions to process documents with Python
Updated
Oct 31, 2025
Python
Self-hosted document conversion service with REST API
Updated
Jan 29, 2023
Python
Convert Notion pages to beautifully formatted Word documents with custom styling and templates
Updated
Sep 21, 2025
Python
📄 Professional MCP server for converting 29+ file formats to Markdown - Perfect for Claude Desktop and AI workflows!
Updated
Nov 17, 2025
Python
📄 Convert 29+ file formats to clean Markdown using the Model Context Protocol for seamless integration with AI workflows.
Updated
Nov 19, 2025
Python
Convert your documents in pdf format and extract information from them. Supports many extension like docs, docx, rtf etc
Updated
Oct 23, 2023
Python
Extract text from PDFs, PPTs, & URLs (with OCR support). Converts PPT to PDF & handles files or folders. 🦍
Updated
Jul 1, 2025
Python
Docker container with Pandoc, PdfTex, XeLaTeX and LuaTeX installed. Available for amd64 and arm64
Updated
Jul 24, 2025
Python
✨ Convert Notion pages into beautifully formatted Word documents quickly and reliably, with options for custom styling and templates.
Updated
Nov 19, 2025
Python
Lightweight Python script to convert directory of mdx files to pdf or docx
Updated
Mar 15, 2025
Python
Simple tool to convert a markdown file to a PDF.
Updated
May 22, 2024
Python
Created a python app that combines keyword-specific images to PDF document.
Updated
Sep 7, 2024
Python
Utility to add scripts from a repository to a markdown file.
Updated
May 22, 2024
Python
Improve this page
Add a description, image, and links to the
document-conversion
topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo
To associate your repository with the
document-conversion
topic, visit your repo's landing page and select "manage topics."
Learn more
You can’t perform that action at this time.