Skip to content
View logan-markewich's full-sized avatar

Block or report logan-markewich

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

Python 18,879 2,336 Updated Mar 14, 2025

A minimal web-UI for talking to Ollama (and OpenAI) servers

TypeScript 748 56 Updated Feb 22, 2025

The python library for real-time communication

Python 2,832 234 Updated Mar 13, 2025

CUDA on non-NVIDIA GPUs

Rust 10,918 699 Updated Mar 13, 2025

LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. It allows attaching metadata to prompts to ease their managem…

Python 81 10 Updated Mar 13, 2025

A Collection of BM25 Algorithms in Python

Python 1,118 93 Updated Oct 8, 2024

Chat UI components for LLM apps

TypeScript 226 20 Updated Mar 11, 2025

Markdown for the AI era

TypeScript 116 4 Updated Mar 13, 2025

Framework for building, orchestrating and deploying multi-agent systems. Managed by OpenAI Solutions team. Experimental framework.

Python 89 8 Updated Oct 12, 2024

A simple client and utils for interacting with OpenAI's Realtime API in Python

Python 226 47 Updated Nov 12, 2024

Lightweight Non-Parametric Embedding Fine-Tuning

Python 23 Updated Sep 26, 2024

Efficient BM25 indexing using rust

Rust 15 Updated Sep 17, 2024

Deploy your agentic worfklows to production

Python 1,979 218 Updated Mar 7, 2025

Use late-interaction multi-modal models such as ColPali in just a few lines of code.

Python 750 77 Updated Jan 28, 2025

The easiest way to get started with LlamaIndex

TypeScript 1,219 161 Updated Mar 12, 2025

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 11,792 1,186 Updated Mar 12, 2025

Lightning-fast serving engine for any AI model of any size. Flexible. Easy. Enterprise-scale.

Python 2,898 189 Updated Mar 12, 2025

🤖 Headless IDE for AI agents

Go 170 6 Updated Feb 27, 2025

this is a repository that gives the power of mixture of workflows a concept inspired by the mixture of agents.

Python 13 Updated Aug 19, 2024

GenAI components at micro-service level; GenAI service composer to create mega-service

Python 123 177 Updated Mar 13, 2025

A non-official CLI for Llama Index Parser

TypeScript 212 17 Updated Jul 11, 2024

Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy

Python 1,059 57 Updated Mar 11, 2025

The easiest way to use Agentic RAG in any enterprise

TypeScript 4,141 470 Updated Jan 22, 2025

AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark

Python 131 10 Updated Dec 18, 2024

Kolmogorov Arnold Networks

Jupyter Notebook 15,491 1,460 Updated Jan 19, 2025

Create-tsi is a generative AI RAG toolkit which generates AI Applications with low code.

TypeScript 231 26 Updated Nov 4, 2024

Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥

Python 34,590 2,599 Updated Mar 14, 2025

proof of concept prototype for generating and querying against an ever-expanding knowledge graph with ai

Python 882 106 Updated Apr 8, 2024
Next