- Haifa, Israel
- https://www.linkedin.com/in/haimbarad/
Stars
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality
The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System
SigNoz is an open-source observability platform native to OpenTelemetry with logs, traces and metrics in a single application. An open-source alternative to DataDog, NewRelic, etc. 🔥 🖥. 👉 Open sour…
The fastest off-the-shelf inference algorithm for LLMs (ICLR’25)
Janus-Series: Unified Multimodal Understanding and Generation Models
Cache-Augmented Generation: A Simple, Efficient Alternative to RAG
This project implements a demonstrator agent that compares the Cache-Augmented Generation (CAG) Framework with traditional Retrieval-Augmented Generation (RAG) using various LLMs.
🚀 The easiest way to automate building and releasing your iOS and Android apps
Model Context Protocol Servers
Huly — All-in-One Project Management Platform (alternative to Linear, Jira, Slack, Notion, Motion)
This repo contains documents of the OPEA project
Chat-first code editor.
Framework for enhancing LLMs for RAG tasks using fine-tuning.
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
Efficient Retrieval Augmentation and Generation Framework
Generative AI Examples is a collection of GenAI examples, such as ChatQnA and Copilot, which illustrate the pipeline capabilities of the Open Platform for Enterprise AI (OPEA) project.
Implementation of the paper: "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"
Daux.io is a documentation generator that uses a simple folder structure and Markdown files to create custom documentation on the fly. It helps you create great looking documentation in a develope…
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective…