Organizations

@SimFin @parsee-ai @boring-legal

A modern Python REST client for Apache Tika server

Python 17 5 Updated Mar 27, 2025

A modern Python REST client for Apache Tika server

Python 1 Updated Oct 24, 2024

Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS ev…

Python 2,585 282 Updated Jun 24, 2024

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Python 4,577 499 Updated Apr 21, 2025

Parsee's PDF reader, specialized in the extraction of tables with numeric values and the accurate extraction and preservation of text paragraphs. Full support for scans and images.

Python 58 6 Updated Feb 4, 2025

Datasets, case studies and benchmarks for extracting structured information from PDFs, HTML files or images, created by the Parsee.ai team. Datasets also on Hugging Face: https://huggingface.co/par…

Jupyter Notebook 20 1 Updated May 15, 2024

Retrieval of fully structured data made easy. Use LLMs or custom models. Specialized in PDFs and HTML files. Extensive support for tabular data extraction and multimodal queries.

Python 68 1 Updated Mar 21, 2025
VBA 4 Updated Sep 29, 2023

Spring Batch examples in Kotlin (from simple to advanced)

Kotlin 57 8 Updated Jul 3, 2019

Vue 3 compatible drag-and-drop component based on Sortable.js

JavaScript 4,186 547 Updated Sep 27, 2023

AngularJS fixed header scrollable table directive

JavaScript 67 43 Updated Mar 19, 2018

Makes 'SimFin' data (https://simfin.com/) easily accessible in R.

R 20 4 Updated Apr 17, 2024

Repository for R package simfinR

R 8 3 Updated Jun 16, 2021

Simple financial data for Python

Python 305 41 Updated Apr 3, 2024

Tutorials for SimFin - Simple financial data for Python

Jupyter Notebook 276 71 Updated Jul 30, 2024

Convert SimFin data set into quarterly table format with respect to daily data

Python 5 1 Updated Aug 20, 2019

Search engine implementing a web crawler, fuzzy search and a simple GUI. 1st semester project

Java 2 Updated Oct 6, 2018

Exchange Rates API

Python 1,929 311 Updated Dec 8, 2022

Community maintained fork of pdfminer - we fathom PDF

Python 6,394 963 Updated Apr 22, 2025

Some examples of how the web-API can be used to retrieve data from SimFin.

Python 15 18 Updated Aug 4, 2020

Python PDF Parser (Not actively maintained). Check out pdfminer.six.

Python 5,289 1,125 Updated Dec 7, 2022

SimFin's open source PDF crawler

Python 125 43 Updated Aug 22, 2019

Headless chrome/chromium automation library (unofficial port of puppeteer)

Python 3,573 370 Updated Aug 5, 2021

JavaScript API for Chrome and Firefox

TypeScript 90,455 9,194 Updated Apr 22, 2025

Simple PDF text extraction

Python 926 102 Updated Feb 10, 2025

Module that provides AngularJS-directives for formatting, validating and working with payments

HTML 558 265 Updated Sep 23, 2020

From my YouTube channel

Jupyter Notebook 53 29 Updated Aug 5, 2018