gaoqunshu

Follow

gaoqunshu

Follow

0 followers · 1 following

Stars

zhanshijinwat / Steel-LLM

Train a 1B LLM with 1T tokens from scratch by personal

Python 458 54 Updated Dec 18, 2024

Chinese-Tiny-LLM / Chinese-Tiny-LLM

Python 219 16 Updated May 10, 2024

OpenBMB / MiniCPM

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook 7,258 463 Updated Nov 6, 2024

DLLXW / baby-llama2-chinese

用于从头预训练+SFT一个小参数量的中文LLaMa2的仓库；24G单卡即可运行得到一个具备简单中文问答能力的chat-llama2.

Python 2,615 321 Updated May 21, 2024

charent / ChatLM-mini-Chinese

中文对话0.2B小模型（ChatLM-Chinese-0.2B），开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调，给出三元组信息抽取微调示例。

Python 1,337 159 Updated Apr 20, 2024

charent / Phi2-mini-Chinese

Phi2-Chinese-0.2B 从0开始训练自己的Phi2中文小模型，支持接入langchain加载本地知识库做检索增强生成RAG。Training your own Phi2 small chat model from scratch.

Jupyter Notebook 514 56 Updated Jul 11, 2024

jiahe7ay / MINI_LLM

This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.

Python 380 58 Updated Apr 24, 2024

wdndev / tiny-llm-zh

从零实现一个小参数量中文大语言模型。

Python 398 47 Updated Aug 22, 2024

lansinuote / Simple_RLHF

Jupyter Notebook 61 11 Updated Nov 18, 2024

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 37,592 4,635 Updated Jan 8, 2025

iscyy / External-Attention-pytorch

Forked from xmu-xiaoma666/External-Attention-pytorch

🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐

Python 16 3 Updated Feb 12, 2023

xing61 / zzz-api

优质稳定的OpenAI的API接口-For企业和开发者。OpenAI的api proxy，支持ChatGPT的API调用，支持openai的API接口，支持：gpt-4，gpt-3.5。不需要openai Key, 不需要买openai的账号，不需要美元的银行卡，通通不用的，直接调用就行，稳定好用！！智增增

PHP 690 58 Updated Nov 6, 2024

chatanywhere / GPT_API_free

Free ChatGPT API Key，免费ChatGPT API，支持GPT4 API（免费），ChatGPT国内可用免费转发API，直连无需代理。可以搭配ChatBox等软件/插件使用，极大降低接口使用成本。国内即可无限制畅快聊天。

Python 26,638 1,981 Updated Dec 8, 2024

ymcui / Chinese-LLaMA-Alpaca

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,625 1,881 Updated Apr 30, 2024

meta-llama / llama

Inference code for Llama models

Python 57,147 9,649 Updated Aug 18, 2024

THUDM / ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Python 40,958 5,237 Updated Jun 27, 2024

tensorflow / models

Models and examples built with TensorFlow

Python 77,290 45,726 Updated Jan 9, 2025

sophgo / tpu-mlir

Machine learning compiler based on MLIR for Sophgo TPU.

C++ 642 162 Updated Dec 31, 2024

sophon-ai-algo / tpu_op_contest_s1

C 4 1 Updated Aug 1, 2022

yuanas / tpucontest

C 1 6 Updated Jan 22, 2022

Diego999 / pyGAT

Pytorch implementation of the Graph Attention Network model by Veličković et. al (2017, https://arxiv.org/abs/1710.10903)

Python 2,949 692 Updated Jul 6, 2023

tkipf / pygcn

Graph Convolutional Networks in PyTorch

Python 5,225 1,229 Updated Sep 20, 2020

wmathor / nlp-tutorial

Forked from graykode/nlp-tutorial

Natural Language Processing Tutorial for Deep Learning Researchers

Jupyter Notebook 1,087 363 Updated Mar 20, 2022

openai / CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 26,870 3,396 Updated Jul 23, 2024

DIG-Beihang / RobustART

The first comprehensive Robustness investigation benchmark on large-scale dataset ImageNet regarding ARchitecture design and Training techniques towards diverse noises.

Python 146 15 Updated Feb 19, 2022

MUGE-2021 / image-retrieval-baseline

Python 57 11 Updated Nov 17, 2022

xmu-xiaoma666 / External-Attention-pytorch

🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐

Python 11,665 1,946 Updated Dec 6, 2024

CCIIPLab / BiSyn_GAT_plus

The source code for "BiSyn-GAT+: Bi-Syntax Aware Graph Attention Network for Aspect-based Sentiment Analysis"

Python 40 8 Updated Oct 18, 2022

yangheng95 / PyABSA

Sentiment Analysis, Text Classification, Text Augmentation, Text Adversarial defense, etc.;

Jupyter Notebook 966 164 Updated Jan 9, 2025

MANLP-suda / JML

Python 35 3 Updated Dec 16, 2021