Skip to content

A curated list of tools, frameworks, platforms, and resources for Large Language Model Operations (LLMOps).

Notifications You must be signed in to change notification settings

awesomelistsio/awesome-llmops

Repository files navigation

Awesome LLMOps Awesome Lists

Ko-Fi   PayPal   Stripe   X   Facebook

A curated list of tools, frameworks, platforms, and resources for Large Language Model Operations (LLMOps) — enabling production-ready, scalable, and reliable LLM applications.

LLMOps is the emerging practice of managing the lifecycle of large language models, including fine-tuning, deployment, monitoring, evaluation, versioning, and observability — similar to MLOps but optimized for LLMs and generative AI systems.

Contents

Overview & Learning

Model Training & Fine-Tuning

  • Hugging Face Transformers – Leading library for pre-trained and fine-tunable LLMs.
  • PEFT – Parameter-Efficient Fine-Tuning methods for LLMs.
  • LoRA – Lightweight fine-tuning strategy for large models.
  • Colossal-AI – Framework for efficient distributed LLM training.

Evaluation & Benchmarking

Serving & Inference

Monitoring & Observability

  • PromptLayer – Log, monitor, and manage prompts across LLM providers.
  • Arize AI – LLM monitoring, evaluation, and prompt tracing.
  • WhyLabs – Observability for ML and LLM deployments.
  • TruLens – Feedback loop framework for evaluating and improving LLM apps.

Prompt Engineering & Management

  • LangChain – Modular framework for chaining LLM calls and prompts.
  • Prompt Engineering Guide – Structured guide to writing effective prompts.
  • PromptFoo – Compare, test, and evaluate LLM prompts easily.
  • Guidance – Prompt programming with structured control over model output.

Data Management

  • Label Studio – Open-source data labeling for fine-tuning and RAG pipelines.
  • Weaviate – Vector database for semantic search and hybrid retrieval.
  • Pinecone – Managed vector DB for similarity search and retrieval-augmented generation.
  • ChromaDB – Open-source embeddings DB built for LLMs.

Security & Safety

  • Guardrails AI – Validating and controlling LLM outputs.
  • Rebuff – Open-source framework for prompt injection defense.
  • Giskard – Testing, debugging, and securing LLM applications.
  • OpenAI Moderation API – API for detecting harmful or unsafe content.

Platforms & Frameworks

  • LangChain – Infrastructure to build end-to-end LLM-powered apps.
  • LLamaIndex – Connect data sources to LLMs via indexing.
  • RAGStack (Haystack) – Retrieval-augmented generation framework.
  • FastChat – Open platform for serving and fine-tuning chat LLMs.

Tooling Ecosystem

  • Weights & Biases – Track and visualize model training and performance.
  • MLflow – Platform for managing the ML lifecycle.
  • PromptLayer – Middleware for logging and versioning prompt inputs and outputs.
  • OpenLLM – Open-source platform to deploy and manage LLMs in production.

Related Awesome Lists

Contribute

Contributions are welcome!

License

CC0

About

A curated list of tools, frameworks, platforms, and resources for Large Language Model Operations (LLMOps).

Topics

Resources

Code of conduct

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages