Skip to content
View moyi-qwq's full-sized avatar
🌴
On vacation
🌴
On vacation

Block or report moyi-qwq

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.

Python 2,131 184 Updated Mar 11, 2025

CycleQD is a framework for parameter space model merging.

Python 34 3 Updated Feb 1, 2025

The LLM Evaluation Framework

Python 5,523 458 Updated Mar 12, 2025

A framework for few-shot evaluation of language models.

Python 8,229 2,191 Updated Mar 13, 2025

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 4,915 523 Updated Mar 13, 2025

Awesome-LLM: a curated list of Large Language Model

22,058 1,809 Updated Mar 4, 2025

GPQA: A Graduate-Level Google-Proof Q&A Benchmark

Jupyter Notebook 306 23 Updated Sep 30, 2024

[ICLR 2024 Spotlight] FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets

Python 214 18 Updated Dec 24, 2023

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Jupyter Notebook 1,681 261 Updated Dec 27, 2024

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

7,315 430 Updated Jul 28, 2024

ai副业赚钱大集合,教你如何利用ai做一些副业项目,赚取更多额外收益。The Ultimate Guide to Making Money with AI Side Hustles: Learn how to leverage AI for some cool side gigs and rake in some extra cash. Check out the English versi…

14,401 1,319 Updated Dec 21, 2024

Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024

Python 1,395 141 Updated Jan 6, 2025

[ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.

Python 233 21 Updated Oct 30, 2024

Python 开源项目之「自学编程之路」,保姆级教程:AI实验室、宝藏视频、数据结构、学习指南、机器学习实战、深度学习实战、网络爬虫、大厂面经、程序人生、资源分享。

Python 9,960 1,630 Updated Nov 26, 2024

We present the first systematic study on the scaling property of raw agents instantiated by LLMs. We find that performance scales with the increase in the number of agents, using the simple(st) way…

Python 111 13 Updated Oct 8, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 17,400 1,442 Updated Feb 25, 2025

A platform for building proxies to bypass network restrictions.

Go 45,825 8,943 Updated Jan 21, 2025

算法导论第三版答案(从其他git摘取得, 供自己学习对照使用)

HTML 42 20 Updated Jan 3, 2019