Stars
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A curated list of different papers and datasets in various areas of audio-visual processing
MoVQGAN - model for the image encoding and reconstruction
HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models
A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''
[NeurIPS'24 Spotlight] Observational Scaling Laws
[IJCAI 2024] Generate different roles for GPTs to form a collaborative entity for complex tasks.
Can large language models provide useful feedback on research papers? A large-scale empirical analysis.
RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models
An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents
FacTool: Factuality Detection in Generative AI
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
Resources for the NAACL 2018 paper "A Discourse-Aware Attention Model for Abstractive Summarization of Long Documents"
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
CLI platform to experiment with codegen. Precursor to: https://lovable.dev
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.
A quick guide (especially) for trending instruction finetuning datasets
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model —— 一个中文低资源的llama+lora方案,结构参考alpaca
骆驼(Luotuo): Open Sourced Chinese Language Models. Developed by 陈启源 @ 华中师范大学 & 李鲁鲁 @ 商汤科技 & 冷子昂 @ 商汤科技
Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
TigerBot: A multi-language multi-task LLM