Here are 19 public repositories matching this topic...
Conversion of PDF documents to structured Markdown, optimized for Retrieval Augmented Generation (RAG) and other NLP tasks. Extract text, tables, and images with preserved formatting for enhanced information retrieval and processing.
Updated
Nov 22, 2024
Python
Headless document conversion and printing using LibreOffice or Microsoft Office
Updated
Dec 19, 2024
Python
Everything related to Bookalope and its REST API.
Updated
May 16, 2021
Python
Convert PowerPoint or LibreOffice Impress files to Beamer-friendly, Pandoc-style markdown
Updated
Feb 10, 2020
Python
Generates tufte-book style documents for Stanford Encyclopedia of Philosophy (SEP) entries.
Updated
May 11, 2018
Python
Cairo-inspired dependency-free replacement for casting SVG to PNG or PDF format
Updated
May 23, 2023
Python
A set of utility classes and functions to process documents with Python
Updated
Dec 26, 2022
Python
Self-hosted document conversion service with REST API
Updated
Jan 29, 2023
Python
Convert your documents to PDF format and extract information from them. Supports many extensions, such as docs, docx, and rtf.
Updated
Oct 23, 2023
Python
Extract text from PDFs, PPTs, & URLs (with OCR support). Converts PPT to PDF & handles files or folders. 🦍
Updated
Apr 14, 2025
Python
Lightweight Python script to convert a directory of MDX files to PDF or DOCX
Updated
Mar 15, 2025
Python
Convert PDF/Excel/HTML to text maintaining layout
Updated
Nov 27, 2024
Python
Report generation from templates, from JSON to PDF, XLSX, DOCX, HTML, etc.
Updated
Aug 26, 2024
Python
Utility to generate a PDF from a GitHub repository.
Updated
May 23, 2024
Python
Utility to add scripts from a repository to a markdown file.
Updated
May 22, 2024
Python
Extract content from PDFs and convert or create new documents from the content in multiple output formats.
Updated
Mar 17, 2022
Python
Processing of advertisements sent to us in docx format for the WordPress Betheme theme.
Updated
Jan 7, 2025
Python
Simple tool to convert a markdown file to a PDF.
Updated
May 22, 2024
Python
A Python app that combines keyword-specific images into a PDF document.
Updated
Sep 7, 2024
Python