Skip to content
View gaoqunshu's full-sized avatar

Block or report gaoqunshu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Train a 1B LLM with 1T tokens from scratch by personal

Python 458 54 Updated Dec 18, 2024

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook 7,258 463 Updated Nov 6, 2024

用于从头预训练+SFT一个小参数量的中文LLaMa2的仓库;24G单卡即可运行得到一个具备简单中文问答能力的chat-llama2.

Python 2,615 321 Updated May 21, 2024

中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调,给出三元组信息抽取微调示例。

Python 1,337 159 Updated Apr 20, 2024

Phi2-Chinese-0.2B 从0开始训练自己的Phi2中文小模型,支持接入langchain加载本地知识库做检索增强生成RAG。Training your own Phi2 small chat model from scratch.

Jupyter Notebook 514 56 Updated Jul 11, 2024

This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.

Python 380 58 Updated Apr 24, 2024

从零实现一个小参数量中文大语言模型。

Python 398 47 Updated Aug 22, 2024
Jupyter Notebook 61 11 Updated Nov 18, 2024

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 37,592 4,635 Updated Jan 8, 2025

🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐

Python 16 3 Updated Feb 12, 2023

优质稳定的OpenAI的API接口-For企业和开发者。OpenAI的api proxy,支持ChatGPT的API调用,支持openai的API接口,支持:gpt-4,gpt-3.5。不需要openai Key, 不需要买openai的账号,不需要美元的银行卡,通通不用的,直接调用就行,稳定好用!!智增增

PHP 690 58 Updated Nov 6, 2024

Free ChatGPT API Key,免费ChatGPT API,支持GPT4 API(免费),ChatGPT国内可用免费转发API,直连无需代理。可以搭配ChatBox等软件/插件使用,极大降低接口使用成本。国内即可无限制畅快聊天。

Python 26,638 1,981 Updated Dec 8, 2024

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,625 1,881 Updated Apr 30, 2024

Inference code for Llama models

Python 57,147 9,649 Updated Aug 18, 2024

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Python 40,958 5,237 Updated Jun 27, 2024

Models and examples built with TensorFlow

Python 77,290 45,726 Updated Jan 9, 2025

Machine learning compiler based on MLIR for Sophgo TPU.

C++ 642 162 Updated Dec 31, 2024
C 1 6 Updated Jan 22, 2022

Pytorch implementation of the Graph Attention Network model by Veličković et. al (2017, https://arxiv.org/abs/1710.10903)

Python 2,949 692 Updated Jul 6, 2023

Graph Convolutional Networks in PyTorch

Python 5,225 1,229 Updated Sep 20, 2020

Natural Language Processing Tutorial for Deep Learning Researchers

Jupyter Notebook 1,087 363 Updated Mar 20, 2022

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 26,870 3,396 Updated Jul 23, 2024

The first comprehensive Robustness investigation benchmark on large-scale dataset ImageNet regarding ARchitecture design and Training techniques towards diverse noises.

Python 146 15 Updated Feb 19, 2022

🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐

Python 11,665 1,946 Updated Dec 6, 2024

The source code for "BiSyn-GAT+: Bi-Syntax Aware Graph Attention Network for Aspect-based Sentiment Analysis"

Python 40 8 Updated Oct 18, 2022

Sentiment Analysis, Text Classification, Text Augmentation, Text Adversarial defense, etc.;

Jupyter Notebook 966 164 Updated Jan 9, 2025
Python 35 3 Updated Dec 16, 2021
Next