Open-source platform for extracting structured data from documents using AI.
-
Updated
May 15, 2025 - JavaScript
Open-source platform for extracting structured data from documents using AI.
Receipt scanner extracts information from your PDF or image receipts - built in NodeJS
Allows extracting data from DOM
Atomic Web Service (AWS, REST API) for converting DOC/DOCX files to plain/text, powered by catdoc, docx2txt and Node.js
A simple tool to parse and extract data from a resume.
Example project demonstrating how to use PDFix SDK WebAssembly build in Node.js. Make PDF Files Accessible, Extract Data from PDF, Convert PDF to HTML, Fill-in PDF Form, Stamp PDF and more...
A Mardown parser for extracting hierarchical content.
🕵️♂️ | A Chrome extension that collects all JavaScript (.js) links, form endpoints, and all other links from a webpage with a single click!
Automatically extracts packages root name for monorepos
Extract html snippets getting the minimal css rules from source or computing the css values
Javascript Query(Jquery) that can pull out the given data from any Skillshare profile teaching page.
A project to select only part of a PDF file. It's usefull when you want to extract informations with some python library like fitz.
Chrome extension to extract a select portion / section of a webpage into a PDF file
Designed for processing and cleaning HTML content from a JSON Lines input file, extracting meaningful text, and writing it to a text output file.
Extract certain data from github repositories using the v4 API offered by github itself.
Extracts sentences from txt files.
Full-Stack Developer (MERN) Assignment Jobsforce.ai LLC. To build a Job Recommendation System
Add a description, image, and links to the extract-data topic page so that developers can more easily learn about it.
To associate your repository with the extract-data topic, visit your repo's landing page and select "manage topics."