🚀🚀 [Large Language Model] Train a small 26M-parameter GPT entirely from scratch in just 2 hours!
Updated Apr 30, 2025 - Python
A large language model (LLM) is a type of machine learning model designed for understanding, generating, and interacting with human language. These models are trained on extensive datasets containing text from books, articles, websites, and other sources to learn patterns, context, and semantics in language. LLMs are widely used in applications like chatbots, code generation, translation, summarization, and more. They are often built using transformer architectures and are central to the field of generative AI.
Pocket Flow Tutorial Project: Turns GitHub repo into Easy Tutorial with AI
Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).
KAG is a logical form-guided reasoning and retrieval framework based on the OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge bases. It can effectively overcome the shortcomings of the traditional RAG vector similarity calculation model.
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.
Pocket Flow: 100-line LLM framework. Let Agents build Agents!
Building AI agents, atomically
Implementation for MatMul-free LM.
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
DATAGEN: AI-driven multi-agent research assistant automating hypothesis generation, data analysis, and report writing. Now expanding into crypto market intelligence. Learn more: https://datagen.digital/.
A Home Assistant integration and model to control your smart home using a local LLM
Speech, Language, Audio, Music Processing with Large Language Model
An easy-to-use Python framework to generate adversarial jailbreak prompts.
[CVPR 2025] Video Narration as Vocabulary & Video as Long Document
Medical Graph RAG: Graph RAG for Medical Data
LLMs and Machine Learning done easily
This repository collects an extensive list of awesome papers about Story Generation / Storytelling, exclusively focusing on the era of Large Language Models (LLMs).