View haim-barad's full-sized avatar

A framework for serving and evaluating LLM routers - save LLM costs without compromising quality

Python 3,676 277 Updated Aug 10, 2024

The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System

Python 107 20 Updated Jun 13, 2024

SigNoz is an open-source observability platform native to OpenTelemetry with logs, traces and metrics in a single application. An open-source alternative to DataDog, NewRelic, etc. 🔥 🖥. 👉 Open sour…

TypeScript 20,845 1,428 Updated Mar 4, 2025

🔥The Web-scale GUI for MongoDB

TypeScript 1,341 90 Updated Aug 2, 2024

The fastest off-the-shelf inference algorithm for LLMs (ICLR’25)

Python 4 1 Updated Feb 17, 2025

Building AI agents, atomically

Python 2,911 232 Updated Feb 25, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 16,557 2,172 Updated Feb 1, 2025

Asynchronous Python ODM for MongoDB

Python 2,226 234 Updated Feb 15, 2025

Cache-Augmented Generation: A Simple, Efficient Alternative to RAG

Python 1,056 156 Updated Feb 16, 2025

This project implements a demonstrator agent that compares the Cache-Augmented Generation (CAG) Framework with traditional Retrieval-Augmented Generation (RAG) using various LLMs.

Python 25 4 Updated Dec 30, 2024

Official inference framework for 1-bit LLMs

C++ 12,778 899 Updated Feb 18, 2025

🚀 The easiest way to automate building and releasing your iOS and Android apps

Ruby 39,956 5,766 Updated Feb 24, 2025

Model Context Protocol Servers

JavaScript 12,306 1,333 Updated Mar 3, 2025

Huly — All-in-One Project Management Platform (alternative to Linear, Jira, Slack, Notion, Motion)

TypeScript 19,465 1,243 Updated Mar 4, 2025

This repo contains documents of the OPEA project

Python 30 67 Updated Feb 27, 2025

Python 81 7 Updated Dec 31, 2024

Chat-first code editor. To download the packaged app:

TypeScript 5,365 369 Updated Nov 14, 2024

Tutorial for building LLM router

Python 185 18 Updated Jul 19, 2024

Framework for enhancing LLMs for RAG tasks using fine-tuning.

Python 722 56 Updated Feb 20, 2025

Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

Python 18,349 2,264 Updated Mar 4, 2025

Efficient Retrieval Augmentation and Generation Framework

Python 1,474 135 Updated Jan 9, 2025

Generative AI Examples is a collection of GenAI examples, such as ChatQnA and Copilot, that illustrate the pipeline capabilities of the Open Platform for Enterprise AI (OPEA) project.

Shell 378 248 Updated Mar 4, 2025

An awesome repository of local AI tools

1,437 110 Updated Nov 13, 2024

Implementation of the paper: "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"

Python 83 7 Updated Mar 3, 2025

Daux.io is a documentation generator that uses a simple folder structure and Markdown files to create custom documentation on the fly. It helps you create great looking documentation in a develope…

JavaScript 805 195 Updated Mar 4, 2025

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.

Go 130,936 10,734 Updated Mar 4, 2025

Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective…

Python 19,000 2,499 Updated Sep 19, 2024

Grok open release

Python 50,201 8,366 Updated Aug 30, 2024