-
MedicalGPT Public
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO、GRPO。
-
github-hot Public
Tracking the hot Github repos and update daily 每天自动追踪Github热门项目
-
AIDailyNews Public
Forked from zhangferry/AIDailyNewsauto push daily news with ai
-
agentica Public
Agentica: Effortlessly Build Intelligent, Reflective, and Collaborative Multimodal AI Agents! 构建智能的多模态AI Agent。
-
weather-forecast-server Public
weather-forecast-server, MCP server, without api key, free. Get weather for cities around the world. 免费天气预报MCP工具。
-
mcp-run-python-code Public
Python interpreter, MCP server, no API key, free. Get results from running Python code. Python代码解释器MCP,功能包括:执行python代码,运行python库安装,运行python脚本。
-
-
pycorrector Public
pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,Qwen2.5等模型应用在纠错场景,开箱即用。
-
imgocr Public
Python3 package for Chinese/English OCR,use paddleocr-v5 onnx model(~20MB), with ultra-fast inference speed. 基于ppocr-v5-onnx模型推理,中英文OCR开源SOTA,推理速度超快。
-
similarity Public
similarity: Text similarity calculation Toolkit for Java. 文本相似度计算工具包,java编写,可用于文本相似度计算、情感分析等任务,开箱即用。
-
ChatPilot Public
ChatPilot: Chat Agent Web UI,实现Chat对话前端,支持Google搜索、文件网址对话(RAG)、代码解释器功能,复现了Kimi Chat(文件,拖进来;网址,发出来)。
-
text2vec Public
text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。
-
-
thinking-intervention Public
Used for thinking process intervention of reasoning models such as DeepSeek-R1, effectively controlling the reasoning thinking process. 用于DeepSeek-R1等推理模型的思维过程干预,有效控制推理思考过程
-
zh-normalization Public
Chinese(zh) sentence NSW(Non-Standard-Word) Normalization
-
ChatPDF Public
RAG for Local LLM, chat with PDF/doc/txt files, ChatPDF. 纯原生实现RAG功能,基于本地LLM、embedding模型、reranker模型实现,支持GraphRAG,无须安装任何第三方agent库。
-
deep-research Public
Python implementation of AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models.
-
cvnet Public
have fun with image AI
-
open-webui Public
Forked from open-webui/open-webuiUser-friendly AI Interface (Supports Ollama, OpenAI API, ...)
-
pinyin-tokenizer Public
pinyintokenizer, 拼音分词器,将连续的拼音切分为单字拼音列表。
-
SearchGPT Public
Forked from leptonai/search_with_leptonSearchGPT: Building a quick conversation-based search engine with LLMs.
-
open-o1 Public
open-o1: Using GPT-4o with CoT to Create o1-like Reasoning Chains
-
-
parrots Public
Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine. 中英语音识别、多角色语音合成,支持多语言,准确率高
-
addressparser Public
Forked from DQinYuan/chinese_province_city_area_mapper中文地址提取工具,支持中国三级区划地址(省、市、区)提取和映射,支持地址热力图绘制。
-
similarities Public
Similarities: a toolkit for similarity calculation and semantic search. 相似度计算、匹配搜索工具包,支持亿级数据文搜文、文搜图、图搜图,python3开发,开箱即用。
-
pytextclassifier Public
pytextclassifier is a toolkit for text classification. 文本分类,LR,Xgboost,TextCNN,FastText,TextRNN,BERT等分类模型实现,开箱即用。
-
textgen Public
TextGen: Implementation of Text Generation models, include LLaMA, BLOOM, GPT2, BART, T5, SongNet and so on. 文本生成模型,实现了包括LLaMA,ChatGLM,BLOOM,GPT2,Seq2Seq,BART,T5,UDA等模型的训练和预测,开箱即用。
-
chatgpt-webui Public
ChatGPT WebUI using gradio. 给 LLM 对话和检索知识问答RAG提供一个简单好用的Web UI界面
-
dialogbot Public
dialogbot, provide search-based dialogue, task-based dialogue and generative dialogue model. 对话机器人,基于问答型对话、任务型对话、聊天型对话等模型实现,支持网络检索问答,领域知识问答,任务引导问答,闲聊问答,开箱即用。