-
open-r1 Public
Forked from huggingface/open-r1Fully open reproduction of DeepSeek-R1
Python Apache License 2.0 UpdatedFeb 11, 2025 -
ragflow Public
Forked from infiniflow/ragflowRAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Python Apache License 2.0 UpdatedJan 17, 2025 -
LLaMA-Factory Public
Forked from hiyouga/LLaMA-FactoryUnified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Python Apache License 2.0 UpdatedJan 3, 2025 -
deep-learning-pytorch-huggingface Public
Forked from philschmid/deep-learning-pytorch-huggingfaceJupyter Notebook MIT License UpdatedDec 25, 2024 -
MedicalGPT Public
Forked from shibing624/MedicalGPTMedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO。
Python Apache License 2.0 UpdatedDec 22, 2024 -
modelscope-agent-gadget Public
Forked from modelscope/modelscope-agentModelscope-agent-Gadgets: A series of gadgets developed based on the modelscope-agent framework for testing the capabilities of large model-based agents.
Python Apache License 2.0 UpdatedApr 29, 2024 -
-
Reinforcement-Learning Public
Forked from LiSir-HIT/Reinforcement-Learningkinds of reinforcement learning model by Pytorch
Python UpdatedMar 19, 2023