Skip to content
View 18140663659's full-sized avatar
  • ccnu
  • china

Block or report 18140663659

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

Codebase for Instruction Following without Instruction Tuning

Python 15 1 Updated Sep 24, 2024

Making LLaVA Tiny via MoE-Knowledge Distillation

25 Updated Aug 26, 2024

Model components of the Llama Stack APIs

Python 2,432 245 Updated Sep 29, 2024
Jupyter Notebook 71 1 Updated Dec 29, 2023

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

Python 153 3 Updated Sep 29, 2024

An acceleration library that supports arbitrary bit-width combinatorial quantization operations

C++ 89 4 Updated Sep 20, 2024

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 4,657 379 Updated Sep 29, 2024
14 Updated Sep 5, 2024

AI for all: Build the large graph of the language models

Python 226 20 Updated Jun 3, 2024
Python 329 34 Updated Sep 23, 2024

[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

Python 2,325 224 Updated Nov 26, 2023

Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)

141 9 Updated Sep 22, 2024
Python 16 1 Updated Sep 14, 2024

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 1,496 97 Updated Jun 1, 2023

This project demonstrates a basic chain-of-thought interaction with any LLM (Large Language Model)

HTML 285 50 Updated Sep 24, 2024

g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains

Python 3,333 306 Updated Sep 27, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

3,903 213 Updated Sep 27, 2024

Repo for for paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".

Python 42 3 Updated Jul 24, 2024

This project aims to implements quiet_star algoithm

Python 3 2 Updated Apr 8, 2024

A comprehensive survey on Internal Consistency and Self-Feedback in Large Language Models.

Jupyter Notebook 148 3 Updated Sep 19, 2024

aider is AI pair programming in your terminal

Python 19,411 1,789 Updated Sep 29, 2024

The official repository of our survey paper: "Towards a Unified View of Preference Learning for Large Language Models: A Survey"

123 1 Updated Sep 12, 2024

[EMNLP Findings 2024] MobileQuant: Mobile-friendly Quantization for On-device Language Models

Python 33 3 Updated Sep 22, 2024

Implementation of the Quiet-STAR paper (https://arxiv.org/pdf/2403.09629.pdf)

Python 33 2 Updated Aug 8, 2024

Code for Quiet-STaR

Python 546 78 Updated Aug 21, 2024

【大模型】3小时完全从0训练一个仅有26M的小参数GPT,最低仅需2G显卡即可推理训练!

Python 2,046 243 Updated Sep 29, 2024
Next