Stars
This pipeline can be used to create and evaluate variations of a probabilistic linkage model, using the Splink package, for linking and deduplicating record-level data at NHS England.
Concatenate a directory full of files into a single prompt for use with LLMs
A simple implementation of duckdb-wasm using only html and js
Performant confetti animation in the browser
Scriptable interface to a powerful, multi-lingual language server built on top of Tree-sitter
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching
Limbo is a project to build the modern evolution of SQLite.
A Python client for the Bus Open Data Service API
Every bus stop, route and timetable, using (Geo)Django and things
A fast Rust-based tool to serialize text-based files in a repository or directory for LLM consumption
Convert UK transport data (TransXchange / ATOC CIF) to GTFS format in R
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
Python tool for converting files and office documents to Markdown.
OpenHands: Code Less, Make More
DuckDB Pyroscope Extension for Continuous Profiling
Open, Multi-modal Catalog for Data & AI
Talk contents for my presentation on "Realtime Time Series Anomaly Detection in Production" at PyData Global 2024
A reactive notebook for Python: run reproducible experiments, execute as a script, deploy as an app, and version with git.
Blazing-fast system monitoring for your desktop (built with Rust, Tauri & Svelte)
Cross-platform application for easy encrypted file, folder, and text sharing between devices.
A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning
Polars extension for general data science use cases
Spark functions to run popular phonetic and string matching algorithms