Skip to content
View Quehry's full-sized avatar

Highlights

  • Pro

Block or report Quehry

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

1,321 45 Updated Mar 12, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 43,986 5,378 Updated Mar 13, 2025

A curated list of different papers and datasets in various areas of audio-visual processing

693 69 Updated Jan 30, 2024

MoVQGAN - model for the image encoding and reconstruction

Jupyter Notebook 221 15 Updated Oct 31, 2023

HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models

Python 38 1 Updated Nov 26, 2024

A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''

1,262 58 Updated Mar 14, 2024

[NeurIPS'24 Spotlight] Observational Scaling Laws

Jupyter Notebook 53 3 Updated Oct 2, 2024

[IJCAI 2024] Generate different roles for GPTs to form a collaborative entity for complex tasks.

Python 1,322 158 Updated Apr 16, 2024

Can large language models provide useful feedback on research papers? A large-scale empirical analysis.

Python 518 50 Updated Jan 11, 2024

RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models

487 16 Updated Oct 11, 2024

An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents

Python 5,519 438 Updated Sep 26, 2024

FacTool: Factuality Detection in Generative AI

Python 857 66 Updated Aug 19, 2024

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 4,913 523 Updated Mar 13, 2025

Resources for the NAACL 2018 paper "A Discourse-Aware Attention Model for Abstractive Summarization of Long Documents"

Python 365 60 Updated Mar 24, 2023

Long Document Summarization Papers

145 11 Updated Aug 3, 2023

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 37,370 4,294 Updated Mar 12, 2025

CLI platform to experiment with codegen. Precursor to: https://lovable.dev

Python 53,354 6,980 Updated Nov 17, 2024

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 52,077 6,161 Updated Mar 13, 2025

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 59,173 6,001 Updated Aug 24, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,892 2,605 Updated Mar 4, 2025

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath

Python 9,353 731 Updated Aug 5, 2024

This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.

HTML 121,475 16,326 Updated Mar 12, 2025

A quick guide (especially) for trending instruction finetuning datasets

2,932 191 Updated Nov 28, 2023

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,747 1,891 Updated Apr 30, 2024

Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model —— 一个中文低资源的llama+lora方案,结构参考alpaca

C 4,152 419 Updated Nov 14, 2024

骆驼(Luotuo): Open Sourced Chinese Language Models. Developed by 陈启源 @ 华中师范大学 & 李鲁鲁 @ 商汤科技 & 冷子昂 @ 商汤科技

Jupyter Notebook 3,637 248 Updated Sep 3, 2023

Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 6,246 561 Updated Oct 24, 2024

BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

HTML 8,071 766 Updated Oct 16, 2024

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

Python 5,692 509 Updated Jul 18, 2024

TigerBot: A multi-language multi-task LLM

Python 2,258 192 Updated Dec 28, 2024
Next