LLM
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transf…
Code and documentation to train Stanford's Alpaca models, and generate the data.
Port of Facebook's LLaMA (Large Language Model Meta AI) in Golang with embedded C/C++
Locally run an Instruction-Tuned Chat-Style LLM
Port of OpenAI's Whisper model in C/C++
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Believe in AI democratization. llama for nodejs backed by llama-rs, llama.cpp and rwkv.cpp, work locally on your laptop CPU. support llama/alpaca/gpt4all/vicuna/rwkv model.
Interactive bot demo using LLMs and MLRun
AI Studio is an independent app for utilizing LLM.
An NVIDIA AI Workbench example project for Retrieval Augmented Generation (RAG)
ChatPPT is powered by chatgpt/ollama, it could help you to generate PPT/slide. It supports output in English and Chinese
👩🏻🍳 A collection of example notebooks
🚀 A list of Haystack Integrations, maintained by the community or deepset.
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / DeepSeek / Qwen), Knowledge Base (file upload / knowledge managemen…
Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
wandbot is a technical support bot for Weights & Biases' AI developer tools that can run in Discord, Slack, ChatGPT and Zendesk
Educational materials on deep learning by Weights & Biases
StockBot powered by Groq: Lightning Fast AI Chatbot that Responds With Live Interactive Stock Charts, Financials, News, Screeners, and More. Powered by Llama3-70b on Groq, Vercel AI SDK, and Tradin…
A full-featured, hackable Next.js AI chatbot built by Vercel
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.