Dermatology ddx dataset, Jax implementations of Monte Carlo conformal prediction, plausibility regions and statistical annotation aggregation from our recent work on uncertain ground truth (TMLR'23…

Python 639 44 Updated Mar 28, 2024

Exca - Execution and caching tool for python

Python 60 1 Updated Jan 9, 2025

Limbo is a work-in-progress, in-process OLTP database management system, compatible with SQLite.

Rust 8,469 279 Updated Jan 10, 2025

Agent Framework / shim to use Pydantic with LLMs

Python 5,125 347 Updated Jan 11, 2025

Get your documents ready for gen AI

Python 17,861 947 Updated Jan 10, 2025

A system for agentic LLM-powered data processing and ETL

Python 1,413 127 Updated Jan 10, 2025
Python 135 8 Updated Dec 5, 2024

Anthropic's educational courses

Jupyter Notebook 8,617 693 Updated Nov 26, 2024

The framework for building scalable agentic applications.

TypeScript 1,435 147 Updated Jan 10, 2025

An extensible, state-of-the-art columnar file format

Rust 1,066 32 Updated Jan 11, 2025

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 17,579 1,813 Updated Oct 15, 2024

This repository offers a comprehensive collection of tutorials and implementations for Prompt Engineering techniques, ranging from fundamental concepts to advanced strategies. It serves as an essen…

Jupyter Notebook 2,494 246 Updated Jan 2, 2025

Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

Python 4,174 389 Updated Jan 8, 2025

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Python 9,873 754 Updated Dec 30, 2024

LLM abstractions that aren't obstructions

Python 858 59 Updated Jan 10, 2025

Python superprompt Claude Sonnet 3.5

13 Updated Oct 3, 2024

The Multi-Agent Reasoning framework creates an interactive chatbot where AI agents collaborate via structured reasoning and Swarm Integration for optimal answers. Simulating a team that discusses, …

Python 141 22 Updated Oct 18, 2024

Entropy Based Sampling and Parallel CoT Decoding

Python 3,188 316 Updated Nov 13, 2024

Automated Design of Agentic Systems

Python 1,114 170 Updated Jan 7, 2025

Official repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"

Python 208 15 Updated Oct 16, 2024

Code for "WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models"

Python 499 57 Updated Mar 4, 2024

A curated list of ontology things

314 22 Updated Oct 4, 2024

A library for prompt engineering and optimization (SAMMO = Structure-aware Multi-Objective Metaprompt Optimization)

Python 622 28 Updated Dec 20, 2024

A language model programming library.

Python 5,536 327 Updated Dec 18, 2024

📄 A curated list of awesome .cursorrules files

5,584 360 Updated Jan 6, 2025

Prefect is a workflow orchestration framework for building resilient data pipelines in Python.

Python 17,982 1,683 Updated Jan 11, 2025

🦾 Take control of your AI agents

Python 1,055 79 Updated Jan 1, 2025

High-Performance Symbolic Regression in Python and Julia

Python 2,547 224 Updated Jan 10, 2025

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 6,532 573 Updated Jan 11, 2025

RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry

Python 3,503 289 Updated Jan 3, 2025