A high-throughput and memory-efficient inference and serving engine for LLMs
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
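What "OpenAI format" means here is the chat-completions request schema that gateways like LiteLLM standardize across providers. A minimal sketch of that payload (the model name and prompts are illustrative, not from the source):

```python
# Sketch of the OpenAI-style chat payload that gateways such as LiteLLM
# accept for every backend (Bedrock, Azure, Anthropic, ...).
# The model string and prompt text below are illustrative assumptions.
import json


def build_chat_request(model: str, user_prompt: str, system_prompt: str = "") -> dict:
    """Assemble an OpenAI chat-completions request body."""
    messages = []
    if system_prompt:
        messages.append({"role": "system", "content": system_prompt})
    messages.append({"role": "user", "content": user_prompt})
    return {"model": model, "messages": messages}


payload = build_chat_request("gpt-4o-mini", "Hello!", system_prompt="Be brief.")
print(json.dumps(payload))
```

Because every provider is addressed through this one schema, swapping backends reduces to changing the model string.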
The open source developer platform to build AI/LLM applications and models with confidence. Enhance your AI applications with end-to-end tracking, observability, and evaluations, all in one integrated platform.
Build multimodal AI applications with cloud-native stack
SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.
Python SDK for agent AI observability, monitoring, and evaluation. Includes agent, LLM, and tool tracing, multi-agent system debugging, a self-hosted dashboard, and advanced analytics with timeline and execution-graph views.
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
Run any open-source LLM, such as DeepSeek or Llama, as an OpenAI-compatible API endpoint in the cloud.
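An "OpenAI-compatible API endpoint" is simply an HTTP server that answers POST `/v1/chat/completions` with the OpenAI schema, so any OpenAI client can talk to it. A stdlib-only sketch that builds (but does not send) such a request; the host, port, and model name are illustrative assumptions:

```python
# Sketch: an OpenAI-compatible server answers POST /v1/chat/completions.
# The base URL and model name below are illustrative, not from the source.
import json
import urllib.request


def chat_completion_request(base_url: str, model: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) a chat-completions request for any
    OpenAI-compatible server, e.g. one exposed by a hosted LLM runtime."""
    body = json.dumps({
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }).encode()
    return urllib.request.Request(
        f"{base_url}/v1/chat/completions",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


req = chat_completion_request("http://localhost:3000", "deepseek-r1", "Hi")
print(req.full_url)
```

Sending the request with `urllib.request.urlopen(req)` would return the familiar OpenAI JSON response from whichever model is serving behind the URL.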
Build, Manage and Deploy AI/ML Systems
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
Open-source observability for your LLM application, based on OpenTelemetry
Superduper: End-to-end framework for building custom AI applications and agents.
ZenML: MLOps for Reliable AI, from Classical AI to Agents. https://zenml.io.
Open-Source Evaluation & Testing library for LLM Agents
cube studio: an open-source, cloud-native, one-stop machine learning / deep learning / large-model AI platform covering the full MLOps pipeline. Features include a multi-tenant algorithm platform, online notebook development, drag-and-drop pipeline orchestration, multi-node multi-GPU distributed training, hyperparameter search, VGPU virtualization for inference serving, edge computing, automated data labeling, SFT fine-tuning / reward-model / reinforcement-learning training for large models such as DeepSeek, multi-node large-model inference with vllm/ollama/mindie, private knowledge bases, and an AI model marketplace. Supports domestic Chinese CPU/GPU/NPU hardware (Ascend ecosystem), RDMA, and distributed frameworks including pytorch/tf/mxnet/deepspeed/paddle/colossalai/horovod/ray/volcano.
RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry
Learn for free how to build an end-to-end production-ready LLM & RAG system using LLMOps best practices: source code + 12 hands-on lessons
The LLM's practical guide: From the fundamentals to deploying advanced LLM and RAG apps to AWS using LLMOps best practices
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs