Skip to content
View aJupyter's full-sized avatar
🤗
Focusing
🤗
Focusing

Block or report aJupyter

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ACL'25] SocialEval: Evaluating Social Intelligence of Large Language Models

Python 5 Updated Aug 9, 2025
Python 401 24 Updated Sep 8, 2025

Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (LLM).

Python 335 34 Updated Sep 12, 2025

LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)

Python 1 Updated Oct 11, 2023

LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)

Python 434 21 Updated Oct 11, 2023

Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.

HTML 1 Updated Apr 27, 2024

Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.

HTML 1,014 101 Updated Apr 27, 2024
5 Updated Jun 5, 2025

Recent Advances on MLLM's Reasoning Ability

25 Updated Apr 11, 2025

My learning notes/codes for ML SYS.

Python 3,585 221 Updated Sep 12, 2025

Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey

805 23 Updated Aug 26, 2025

A curated list of cutting-edge research papers and resources on Long Chain-of-Thought (CoT) Reasoning with Tools.

37 3 Updated Jul 15, 2025

slime is a LLM post-training framework for RL Scaling.

Python 1,741 156 Updated Sep 12, 2025

以rag_黄帝内经项目多次迭代不断优化rag技术

Python 7 Updated Aug 2, 2025

Official Repo for Open-Reasoner-Zero

Python 2,038 111 Updated Jun 2, 2025

Open-source unified multimodal model

Python 4,993 439 Updated Aug 22, 2025

这是一个简单的技术科普教程项目,主要聚焦于解释一些有趣的,前沿的技术概念和原理。每篇文章都力求在 5 分钟内阅读完成。

5,949 546 Updated Aug 27, 2025

[ICLR 2025] The First Multimodal Seach Engine Pipeline and Benchmark for LMMs

Python 472 30 Updated Jan 23, 2025

MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search tools.

Python 310 17 Updated Aug 26, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 15,040 1,080 Updated Sep 12, 2025

从零到一实现了一个多模态大模型,并命名为Reyes(睿视),R:睿,eyes:眼。Reyes的参数量为8B,视觉编码器使用的是InternViT-300M-448px-V2_5,语言模型侧使用的是Qwen2.5-7B-Instruct,Reyes也通过一个两层MLP投影层连接视觉编码器与语言模型。

Python 25 2 Updated Feb 15, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 12,181 1,501 Updated Apr 24, 2025

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 4,023 380 Updated Sep 10, 2025

This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR 2025]

Python 401 32 Updated Sep 12, 2025

The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.

Python 2,205 198 Updated Sep 2, 2025

A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, llama-3.2-vision, qwen-vl, qwen2-vl, phi3-v etc.

Python 333 38 Updated Feb 25, 2025

《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程

Jupyter Notebook 24,262 2,441 Updated Sep 4, 2025

Automatically crawl arXiv papers daily and summarize them using AI. Illustrating them using GitHub Pages.

JavaScript 1,647 460 Updated Sep 12, 2025
HTML 1 Updated Jun 7, 2025
Next