- Nankai University
- Tianjin
- http://ajupytetr.blog.csdn.net
- https://www.yuque.com/ajupyter
- https://www.zhihu.com/people/grit-35-86/posts
Stars
[ACL'25] SocialEval: Evaluating Social Intelligence of Large Language Models
Trinity-RFT is a general-purpose, flexible, and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (LLMs).
aJupyter / LLM-RLHF-Tuning
Forked from Joyce94/LLM-RLHF-Tuning
LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)
aJupyter / LLM-Tuning
Forked from beyondguo/LLM-Tuning
Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.
Recent Advances on MLLM's Reasoning Ability
My learning notes/code for ML systems (ML SYS).
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey
A curated list of cutting-edge research papers and resources on Long Chain-of-Thought (CoT) Reasoning with Tools.
slime is an LLM post-training framework for RL scaling.
Official Repo for Open-Reasoner-Zero
A simple tech-explainer tutorial project, focused on explaining interesting, cutting-edge technical concepts and principles. Each article aims to be readable within 5 minutes.
[ICLR 2025] The First Multimodal Search Engine Pipeline and Benchmark for LMMs
MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search tools.
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
A multimodal large model built from scratch, named Reyes (睿视): R for 睿 ("insight"), eyes for 眼 ("eyes"). Reyes has 8B parameters, uses InternViT-300M-448px-V2_5 as its vision encoder and Qwen2.5-7B-Instruct on the language-model side, and connects the vision encoder to the language model through a two-layer MLP projection layer (see the sketch after this list).
Minimal reproduction of DeepSeek R1-Zero
The simplest, fastest repository for training/finetuning small-sized VLMs.
This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR 2025]
The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.
A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, llama-3.2-vision, qwen-vl, qwen2-vl, phi3-v etc.
"A Cookbook for Open-Source LLMs" (开源大模型食用指南): a tutorial tailored for Chinese beginners on quickly fine-tuning (full-parameter/LoRA) and deploying domestic and international open-source large language models (LLMs) / multimodal large models (MLLMs) in a Linux environment.
Automatically crawls arXiv papers daily, summarizes them using AI, and presents them via GitHub Pages.
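
The Reyes entry above describes the common LLaVA-style recipe: a vision encoder joined to an LLM through a small projector. Below is a minimal PyTorch sketch of such a two-layer MLP projector. The feature dimensions are assumptions for illustration (the description does not give the exact InternViT-300M or Qwen2.5-7B hidden sizes), not values taken from the Reyes codebase.

```python
import torch
import torch.nn as nn

VIT_DIM = 1024   # assumed vision-encoder output dim (InternViT-300M-class)
LLM_DIM = 3584   # assumed LLM hidden dim (Qwen2.5-7B-class)

class MLPProjector(nn.Module):
    """Two-layer MLP mapping vision patch features into the LLM's
    embedding space, in the style the Reyes description outlines."""
    def __init__(self, vit_dim: int = VIT_DIM, llm_dim: int = LLM_DIM):
        super().__init__()
        self.proj = nn.Sequential(
            nn.Linear(vit_dim, llm_dim),
            nn.GELU(),
            nn.Linear(llm_dim, llm_dim),
        )

    def forward(self, vision_feats: torch.Tensor) -> torch.Tensor:
        # vision_feats: (batch, num_patches, vit_dim)
        # returns:      (batch, num_patches, llm_dim), ready to be
        # concatenated with text token embeddings before the LLM.
        return self.proj(vision_feats)

# Usage: project dummy patch features and check the output shape.
feats = torch.randn(1, 256, VIT_DIM)
print(MLPProjector()(feats).shape)  # torch.Size([1, 256, 3584])
```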