Skip to content
View WHaverals's full-sized avatar

Highlights

  • Pro

Block or report WHaverals

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This repository allows to perform the evaluation of author embedding on a writing style axis.

Python 5 1 Updated Oct 30, 2021

A PyTorch implementation of DTrOCR: Decoder-only Transformer for Optical Character Recognition

Python 54 5 Updated Aug 18, 2024

🎉🌩️ Dynamic DNS (DDNS) service based on Cloudflare! Access your home network remotely via a custom domain name without a static IP!

Python 3,020 316 Updated Aug 23, 2024

🇧🇪 BelGPT-2: the 1st GPT model pretrained in French.

Python 32 3 Updated Feb 24, 2021

Lord of Large Language Models Web User Interface

Vue 4,261 536 Updated Sep 29, 2024

BookNLP, a natural language processing pipeline for books

Python 783 92 Updated Jul 31, 2024

Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)

Python 117 4 Updated Nov 13, 2023

The LLM Evaluation Framework

Python 3,077 239 Updated Sep 27, 2024

Implementation of KatKit as presented at DH2024

Jupyter Notebook 5 Updated Sep 16, 2024

Responsible Datasets in Context

TeX 64 2 Updated Aug 7, 2024

On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)

Python 444 29 Updated Sep 22, 2024

Early English Books Demo for TEI Publisher 6

XQuery 2 Updated Feb 27, 2024

Custom recipe and utilities for document processing

Python 198 20 Updated Jun 19, 2022

🧬 A VS Code extension for annotating data with Prodigy

TypeScript 30 2 Updated Nov 25, 2021

Code and prompt templates for the "Post-OCR Correction with OpenAI’s GPT Models on Challenging English Prosody Texts" short-paper submission to DocEng 2024.

Python 1 Updated Jun 10, 2024

Automatic Collation for Diversifying Corpora

Python 9 2 Updated Sep 26, 2024

Easy to use test framework for Jupyter Notebooks

Python 304 28 Updated Aug 4, 2022

Distribute and run LLMs with a single file.

C++ 19,358 981 Updated Sep 28, 2024

A software to detect text reuse with BLAST.

Python 14 5 Updated Oct 8, 2019

Instruction Tuning with GPT-4

HTML 4,174 300 Updated Jun 11, 2023

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

Go 91,382 7,193 Updated Sep 29, 2024

Infinite Photorealistic Worlds using Procedural Generation

Python 5,323 455 Updated Sep 27, 2024
Jupyter Notebook 317 47 Updated Jan 7, 2024

Catalog of abusive language data (PLoS 2020)

Python 300 74 Updated Jun 14, 2024

The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors (COLING 2016)

Jupyter Notebook 64 6 Updated May 12, 2022

LLM vulnerability scanner

Python 1,315 150 Updated Sep 26, 2024

The Go programming language

Go 123,170 17,564 Updated Sep 29, 2024

🍏 Make Thinc faster on macOS by calling into Apple's native Accelerate library

Cython 91 8 Updated Apr 18, 2024

Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.

Python 6,465 657 Updated Aug 29, 2024

🍳 Recipes for the Prodigy, our fully scriptable annotation tool

Jupyter Notebook 477 115 Updated Aug 4, 2024
Next