feature(sunjx): add rejection sampling in grm_training by Jiaxuan-Sun · Pull Request #38 · opendilab/LightRFT

Jiaxuan-Sun · 2026-02-06T07:06:34Z

Rejection Sampling for GRM Training

This directory contains scripts and tools for preparing rejection sampling training data and training GRM (Generative Reward Model) models on both text-to-image (T2I) and text-to-video (T2V) tasks.

Overview

Rejection sampling is a technique to filter high-quality training samples by:

Running inference on a dataset using a trained GRM model
Filtering correctly predicted samples (where model prediction matches ground truth)
Converting filtered samples into training format with Chain-of-Thought (CoT) reasoning
Training the model on these high-quality filtered samples

feature(sunjx): add rejection sampling in grm_training

1658142

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

feature(sunjx): add rejection sampling in grm_training#38

feature(sunjx): add rejection sampling in grm_training#38
Jiaxuan-Sun wants to merge 1 commit intoopendilab:mainfrom
Jiaxuan-Sun:feature/t2i-rejective-sampling-0206

Jiaxuan-Sun commented Feb 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Comments

Conversation

Jiaxuan-Sun commented Feb 6, 2026

Rejection Sampling for GRM Training

Overview

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant