Skip to content
View moyi-qwq's full-sized avatar
🌴
On vacation
🌴
On vacation

Block or report moyi-qwq

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
13 stars written in Python
Clear filter

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 17,414 1,441 Updated Feb 25, 2025

Python 开源项目之「自学编程之路」,保姆级教程:AI实验室、宝藏视频、数据结构、学习指南、机器学习实战、深度学习实战、网络爬虫、大厂面经、程序人生、资源分享。

Python 9,962 1,630 Updated Nov 26, 2024

A framework for few-shot evaluation of language models.

Python 8,238 2,192 Updated Mar 14, 2025

The LLM Evaluation Framework

Python 5,530 460 Updated Mar 14, 2025

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 4,920 523 Updated Mar 13, 2025

[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

Python 2,618 254 Updated Jan 14, 2025

TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.

Python 2,132 184 Updated Mar 13, 2025

Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024

Python 1,397 141 Updated Jan 6, 2025

[ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.

Python 233 21 Updated Oct 30, 2024

[ICLR 2024 Spotlight] FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets

Python 214 18 Updated Dec 24, 2023

We present the first systematic study on the scaling property of raw agents instantiated by LLMs. We find that performance scales with the increase in the number of agents, using the simple(st) way…

Python 112 13 Updated Oct 8, 2024

CycleQD is a framework for parameter space model merging.

Python 34 3 Updated Feb 1, 2025